Patent attributes
People can be tracked across multiple segments of video data, which can correspond to different scenes in a single video file or to multiple video streams or feeds. An instance of video data can be broken up into segments, each of which can be analyzed to determine the faces and bodies represented therein. The bodies can be analyzed across the frames of a segment to determine body tracklets that are consistent across that segment. Associations of faces and bodies can be determined using relative distances and/or spatial relationships. These associations are then clustered to attempt to determine consistent associations that correspond to unique individuals. A unique identifier is determined for each person represented in one or more segments of an instance of video data. Such an approach enables representations of an individual to be correlated across multiple instances of video data.
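The face-to-body association and the subsequent clustering into unique identifiers can be illustrated with a minimal Python sketch. It is not the claimed method: the box format, the "head at the top-center of the body box" heuristic, the thresholds, the use of per-association embedding vectors, the greedy matching, the single-pass clustering, and all function names are assumptions made for illustration only.

```python
from math import hypot, sqrt
from typing import Dict, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), image origin at top-left


def associate_faces_to_bodies(
    faces: Sequence[Box], bodies: Sequence[Box], max_rel_dist: float = 0.5
) -> Dict[int, int]:
    """Greedily pair face indices with body indices, closest pairs first."""
    candidates = []
    for fi, f in enumerate(faces):
        face_cx, face_cy = (f[0] + f[2]) / 2, (f[1] + f[3]) / 2
        for bi, b in enumerate(bodies):
            body_w = b[2] - b[0]
            if body_w <= 0:
                continue  # skip degenerate body boxes
            # Assumed heuristic: a head sits at the top-center of its body box.
            head_x, head_y = (b[0] + b[2]) / 2, b[1]
            # Normalize by body width so the threshold does not depend on scale.
            rel_dist = hypot(face_cx - head_x, face_cy - head_y) / body_w
            if rel_dist <= max_rel_dist:
                candidates.append((rel_dist, fi, bi))

    pairs: Dict[int, int] = {}
    used_bodies = set()
    for _, fi, bi in sorted(candidates):
        # Each face and each body takes part in at most one association.
        if fi not in pairs and bi not in used_bodies:
            pairs[fi] = bi
            used_bodies.add(bi)
    return pairs


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(x * x for x in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def assign_person_ids(
    embeddings: Sequence[Sequence[float]], min_similarity: float = 0.8
) -> List[int]:
    """Single-pass clustering: join the most similar cluster or open a new one."""
    representatives: List[Sequence[float]] = []  # first member of each cluster
    ids: List[int] = []
    for emb in embeddings:
        best_id, best_sim = -1, min_similarity
        for cluster_id, rep in enumerate(representatives):
            sim = cosine_similarity(emb, rep)
            if sim >= best_sim:
                best_id, best_sim = cluster_id, sim
        if best_id == -1:
            # No cluster is similar enough: treat this as a new individual.
            representatives.append(emb)
            best_id = len(representatives) - 1
        ids.append(best_id)
    return ids
```

In this sketch, `associate_faces_to_bodies` would run per frame or per tracklet, and `assign_person_ids` would run over one embedding per face-body association gathered from all segments, so that the same identifier can recur across scenes or feeds. A production system would likely replace the greedy matching with an optimal assignment and the single-pass clustering with a more robust algorithm.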