US Patent 12080067 Classifying a video stream using a self-attention-based machine-learning model

Patent 12080067 was granted by the United States Patent and Trademark Office and assigned to Meta Platforms, Inc. in September 2024.

Article

Patent abstract

In one embodiment, a method includes accessing a stream of F video frames, where each of the F video frames includes N patches that are non-overlapping, generating an initial embedding vector for each of the N×F patches in the F video frames, generating a classification embedding by processing the generated N×F initial embedding vectors using a self-attention-based machine-learning model that computes a temporal attention and a spatial attention for each of the N×F patches, and determining a class of the stream of video frames based on the generated classification embedding.
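The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: every function name here (split_into_patches, linear, attend, classify_stream) is hypothetical, and several choices are assumptions made only to keep the example short. Those assumptions are a single attention layer with one head, identity query/key/value projections, temporal attention applied before spatial attention, residual additions, and a classification embedding obtained by averaging all N×F patch vectors.

```python
import math


def split_into_patches(frame, patch):
    """Split an H x W frame (list of rows) into non-overlapping
    patch x patch tiles, each flattened to a list of pixel values."""
    height, width = len(frame), len(frame[0])
    assert height % patch == 0 and width % patch == 0
    tiles = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            tiles.append([frame[top + i][left + j]
                          for i in range(patch) for j in range(patch)])
    return tiles


def linear(vec, weights):
    """Apply a weight matrix (list of output rows) to a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def add(a, b):
    """Element-wise sum, used for the residual connections."""
    return [x + y for x, y in zip(a, b)]


def attend(query, keys):
    """Scaled dot-product attention of one vector over a group of vectors.
    Assumption: queries, keys, and values are the raw vectors themselves."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [sum(e / total * key[d] for e, key in zip(exps, keys))
            for d in range(len(query))]


def classify_stream(frames, patch, embed_w, head_w):
    """Return (class index, classification embedding) for F frames."""
    # Initial embedding for each of the N x F patches: emb[t][p].
    emb = [[linear(tile, embed_w) for tile in split_into_patches(f, patch)]
           for f in frames]
    num_frames, num_patches = len(emb), len(emb[0])

    # Temporal attention: patch p in frame t attends to patch p in all frames.
    temporal = [[add(emb[t][p],
                     attend(emb[t][p], [emb[s][p] for s in range(num_frames)]))
                 for p in range(num_patches)] for t in range(num_frames)]

    # Spatial attention: each patch attends to all patches of its own frame.
    spatial = [[add(temporal[t][p], attend(temporal[t][p], temporal[t]))
                for p in range(num_patches)] for t in range(num_frames)]

    # Assumed pooling: mean of all N x F vectors as classification embedding.
    dim = len(embed_w)
    count = num_frames * num_patches
    cls = [sum(spatial[t][p][d] for t in range(num_frames)
               for p in range(num_patches)) / count for d in range(dim)]

    logits = linear(cls, head_w)
    return logits.index(max(logits)), cls
```

With F frames of size 4×4 and patch size 2, each frame yields N = 4 patches, so the sketch embeds 4×F vectors. The weight matrices embed_w (embedding dimension by patch size squared) and head_w (number of classes by embedding dimension) would be learned in practice; here the caller supplies them.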

Infobox

Is a