Patent attributes
A method is provided to identify whether video content, which includes a plurality of image frames, is likely to include an advertisement. The video content is split into a plurality of segments, each segment having a pre-specified duration. Subtitle text information is extracted from each segment and is passed through a natural language processing (NLP) language model to extract an embedding representing the subtitle text information for each of the segments, wherein the NLP language model is previously trained to differentiate between subtitle text information from video content items that were each previously identified as being an advertisement in comparison to subtitle text information from video content items that were each previously identified as not being an advertisement. The embedding representing the subtitle text information for each of the segments is passed through a classifier to obtain a probability regarding whether each segment is an advertisement or not.