Patent attributes
A method, system and computer-usable medium are disclosed for using a contextual graph to summarize a corpus of content. Natural Language Processing (NLP) preprocessing operations are performed on text within an input corpus to form a grammatical analysis. The grammatical analysis is used to generate semantic associations between phrases in the input corpus. These semantic associations are then used to determine the thematic relevance of the individual sentences in the input corpus, forming a context-based ranking. The context-based ranking is in turn used to construct a context graph, whose vertices represent phrases and whose edges carry an aggregate score computed from the semantic similarity of the phrases they connect. The resulting context graph is then used to generate a content summarization for the input corpus.
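
The pipeline described above can be illustrated with a minimal sketch. It is not the disclosed implementation, and everything in it is an assumption made for illustration: the function names (`split_sentences`, `extract_phrases`, `build_context_graph`, `summarize`), the stopword list, single content words standing in for phrases, and cosine similarity over co-occurrence counts standing in for the aggregate semantic-similarity score. It also collapses the ranking and graph-construction steps by building the graph first and ranking sentences by the weight of the phrases they contain.

```python
import re
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical stopword list; a real system would use a full NLP toolkit.
STOPWORDS = {"a", "an", "and", "are", "as", "between", "by", "for", "in",
             "is", "of", "on", "the", "to", "was", "with"}


def split_sentences(text):
    """Split text into sentences at ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract_phrases(sentence):
    """Crude stand-in for NLP preprocessing: lowercase content words."""
    words = re.findall(r"[a-z]+", sentence.lower())
    return [w for w in words if w not in STOPWORDS]


def cosine(a, b):
    """Cosine similarity between two Counter-based context vectors."""
    dot = sum(a[k] * b[k] for k in set(a) & set(b))
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def build_context_graph(sentences):
    """Return {(phrase_a, phrase_b): score}; vertices are phrases and each
    edge carries a similarity score between the two phrases' contexts."""
    contexts = {}
    for sentence in sentences:
        phrases = set(extract_phrases(sentence))
        for p in phrases:
            # A phrase's context is the set of phrases it co-occurs with.
            contexts.setdefault(p, Counter()).update(phrases - {p})
    graph = {}
    for p, q in combinations(sorted(contexts), 2):
        score = cosine(contexts[p], contexts[q])
        if score > 0:
            graph[(p, q)] = score
    return graph


def summarize(text, max_sentences=1):
    """Pick the sentences whose phrases are most strongly connected in the graph."""
    sentences = split_sentences(text)
    graph = build_context_graph(sentences)
    weight = Counter()  # vertex weight = sum of incident edge scores
    for (p, q), score in graph.items():
        weight[p] += score
        weight[q] += score

    def sentence_score(sentence):
        phrases = set(extract_phrases(sentence))
        return sum(weight[p] for p in phrases) / len(phrases) if phrases else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: sentence_score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore original sentence order
    return [sentences[i] for i in keep]
```

In this sketch, a sentence whose phrases share no context with the rest of the corpus receives a score of zero, so off-topic sentences fall to the bottom of the ranking. Substituting a dependency parser for `extract_phrases` and an embedding-based measure for `cosine` would bring the sketch closer to the grammatical analysis and semantic associations the abstract describes.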