Patent attributes
A method for creating a patent document summary from a patent document text is disclosed. The method includes creating a data repository of stop-words based on analysis of a plurality of patent documents, and generating an array including a plurality of tuples from the patent document text based on the stop-words in the data repository. The method further includes identifying at least one word-sequence from the array, such that each of the at least one word-sequence occurs at least twice within the patent document text, and that each of the at least one word-sequence includes a unique last word. The method further includes replacing, for each of the at least one word-sequence, second and subsequent occurrences of each of the at least one word-sequence within the patent document text with an associated substitute word-sequence, and generating the patent document summary for the patent document text.