Patent attributes
In some implementations, a computer system identifies multiple sub-sessions of a network-based communication session in which multiple remote endpoint devices each provide media streams over a communication network. For each of the sub-sessions, the computer system can identify the endpoint devices included in the sub-session. The computer system can obtain user state data for each of the endpoint devices, the user state data for each endpoint device being generated based on analysis of face images of the user of the endpoint device. The computer system aggregates the user state data to determine a sub-session state for each of the sub-sessions. During the communication session, the computer system communicates over the communication network with a remote device associated with the communication session to cause a user interface of the remote device to indicate the sub-session states determined for one or more of the multiple sub-sessions.