Patent attributes
A computer-implemented process is programmed to receive, using a first computer system, electronic digital data representing input time-correlated speech data and video data; determine a first text sequence corresponding to the input time-correlated speech data, the first text sequence comprising unstructured natural language text; determine syntactic structure data associated with the first text sequence; input the time-correlated video data and the syntactic structure data associated with the first text sequence into one or more machine learning models, the machine learning models producing an output of one or more scores for at least a portion of the time-correlated video data and the first text sequence; transform the output of one or more scores to yield an output set of summary points and suggestions; and transmit a graphical element of the output set of summary points and suggestions for display.
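The sequence of steps recited above can be illustrated with a minimal sketch. It is not the claimed implementation: every name here (`Segment`, `syntactic_structure`, `score_segment`, `summarize`, `render_html`, `run_pipeline`) is hypothetical, the transcript is assumed to have already been produced by a speech-to-text step, the per-segment `video_feature` is an assumed placeholder for whatever the video analysis would supply, and a hand-written formula stands in for the one or more machine learning models.

```python
import html
import re
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Segment:
    """One time-correlated window of the input (hypothetical structure)."""
    start: float          # seconds
    end: float            # seconds
    text: str             # transcript for this window (assumed given)
    video_feature: float  # assumed placeholder video signal in [0, 1]


def syntactic_structure(text: str) -> Dict[str, float]:
    """Toy stand-in for a syntactic parser: counts tokens and sentences."""
    tokens = text.split()
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return {
        "n_tokens": len(tokens),
        "n_sentences": len(sentences),
        "avg_sentence_len": len(tokens) / max(len(sentences), 1),
    }


def score_segment(video_feature: float, syntax: Dict[str, float]) -> Dict[str, float]:
    """Placeholder for the machine learning models: a fixed rule, not a trained model."""
    clarity = 1.0 if syntax["avg_sentence_len"] <= 20 else 0.5
    return {"clarity": clarity, "engagement": video_feature}


def summarize(segments: List[Segment], scores: List[Dict[str, float]],
              threshold: float = 0.6) -> Dict[str, List[str]]:
    """Transform scores into summary points and suggestions."""
    summary_points, suggestions = [], []
    for seg, sc in zip(segments, scores):
        span = f"{seg.start:.0f}-{seg.end:.0f}s"
        summary_points.append(
            f"{span}: clarity {sc['clarity']:.1f}, engagement {sc['engagement']:.1f}"
        )
        if sc["clarity"] < threshold:
            suggestions.append(f"{span}: consider shorter sentences")
        if sc["engagement"] < threshold:
            suggestions.append(f"{span}: consider more visual engagement")
    return {"summary_points": summary_points, "suggestions": suggestions}


def render_html(result: Dict[str, List[str]]) -> str:
    """Build a simple graphical element (an HTML fragment) for display."""
    points = "".join(f"<li>{html.escape(p)}</li>" for p in result["summary_points"])
    tips = "".join(f"<li>{html.escape(s)}</li>" for s in result["suggestions"])
    return (f"<section><h2>Summary</h2><ul>{points}</ul>"
            f"<h2>Suggestions</h2><ul>{tips}</ul></section>")


def run_pipeline(segments: List[Segment]) -> Dict[str, List[str]]:
    """Chain the steps: syntax -> scores -> summary points and suggestions."""
    scores = [score_segment(s.video_feature, syntactic_structure(s.text))
              for s in segments]
    return summarize(segments, scores)
```

In this sketch, calling `render_html(run_pipeline(segments))` on a list of `Segment` objects yields an HTML fragment that a server could transmit for display. The threshold of 0.6 and the 20-token sentence limit are arbitrary illustrative values.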