Patent attributes
In one embodiment, a method includes receiving a video file. The video file includes a corresponding audio stream. The method further includes accessing the audio stream, and generating, based on the audio stream, a representative vector. The vector has a particular number of dimensions. The method further includes accessing a label-embedding space, which has the same particular number of dimensions, and includes a number of regions that each correspond to a respective label. The method further includes determining a region of the label-embedding space that corresponds to the vector, the determined region corresponding to a particular label. The method further includes associating the particular label with the video file.