Patent attributes
This application provides a video classification method, including: obtaining a video comprising a plurality of video frames; obtaining a visual signal feature sequence corresponding to the video using a first submodel in a video classification prediction model, each visual signal feature corresponding to a respective video frame in the video; obtaining an audio signal feature sequence corresponding to the visual signal feature sequence of the video using a second submodel in the video classification prediction model, each audio signal feature corresponding to a respective visual signal feature in the visual signal feature sequence; generating a target signal feature sequence according to the visual signal feature sequence and the audio signal feature sequence; and predicting a video type of the video based on a classification prediction result obtained from applying the target signal feature sequence to a third submodel in the video classification prediction model.
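The data flow described above can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify the architecture of any submodel, so random linear projections stand in for trained submodels, concatenation is assumed as the way the target signal feature sequence is generated, and mean pooling followed by a softmax is assumed for the third submodel. All names and dimensions (`first_submodel`, `second_submodel`, `combine`, `third_submodel`, `FRAME_DIM`, and so on) are hypothetical.

```python
import math
import random

random.seed(0)

# Hypothetical sizes, chosen only for illustration.
NUM_FRAMES, FRAME_DIM = 8, 32
VISUAL_DIM, AUDIO_DIM, NUM_CLASSES = 16, 12, 4


def random_matrix(rows, cols):
    """Random weights standing in for trained submodel parameters."""
    return [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def project(vec, weights):
    """Multiply a row vector by a (len(vec) x out_dim) weight matrix."""
    return [sum(v * w for v, w in zip(vec, col)) for col in zip(*weights)]


W_VISUAL = random_matrix(FRAME_DIM, VISUAL_DIM)
W_AUDIO = random_matrix(VISUAL_DIM, AUDIO_DIM)
W_CLS = random_matrix(VISUAL_DIM + AUDIO_DIM, NUM_CLASSES)


def first_submodel(frames):
    """Produce one visual signal feature per video frame."""
    return [[math.tanh(x) for x in project(f, W_VISUAL)] for f in frames]


def second_submodel(visual_seq):
    """Produce one audio signal feature per visual signal feature."""
    return [[math.tanh(x) for x in project(v, W_AUDIO)] for v in visual_seq]


def combine(visual_seq, audio_seq):
    """Build the target signal feature sequence (assumed: concatenation)."""
    return [v + a for v, a in zip(visual_seq, audio_seq)]


def third_submodel(target_seq):
    """Assumed classifier: mean-pool over time, project, then softmax."""
    pooled = [sum(col) / len(target_seq) for col in zip(*target_seq)]
    logits = project(pooled, W_CLS)
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_video(frames):
    """Run the three submodels in order and return (video_type, probabilities)."""
    visual_seq = first_submodel(frames)
    audio_seq = second_submodel(visual_seq)
    target_seq = combine(visual_seq, audio_seq)
    probs = third_submodel(target_seq)
    return probs.index(max(probs)), probs


# Stand-in "frames": each frame is a flattened random vector.
frames = [[random.gauss(0.0, 1.0) for _ in range(FRAME_DIM)] for _ in range(NUM_FRAMES)]
video_type, probs = classify_video(frames)
```

Here `video_type` is the index of the highest-probability class. The point of the sketch is the ordering of the steps: the audio signal feature sequence is derived from the visual signal feature sequence rather than from a separate audio input, so the two sequences have the same length and can be combined element by element.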