Patent attributes
Provided are a system and a method for processing contract documents. The method includes: parsing a first contract document to identify a plurality of clauses in the first contract document, each clause of the plurality of clauses including a sequence of words; generating a plurality of representation vectors based on the first contract document and at least one embedding model, wherein each representation vector of the plurality of representation vectors is generated based on a separate clause of at least a subset of the plurality of clauses; comparing each representation vector of the plurality of representation vectors with a second plurality of representation vectors stored in a vector database; and generating output data based on the representation vectors and the first contract document.
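The four steps recited above (parse, embed, compare, output) can be illustrated with a minimal Python sketch. It is not the claimed implementation, and everything concrete in it is an assumption made for illustration: clauses are split on blank lines, the "embedding model" is a toy hashed bag-of-words vector, the "vector database" is an in-memory list of labeled vectors, and the comparison is cosine similarity. The names `parse_clauses`, `embed`, `process_contract`, and `DIM` are hypothetical.

```python
import hashlib
import math
import re

DIM = 64  # assumed dimensionality of the toy embedding


def parse_clauses(document: str) -> list[str]:
    """Split a contract into clauses. Assumption: clauses are separated by blank lines."""
    return [c.strip() for c in re.split(r"\n\s*\n", document) if c.strip()]


def embed(clause: str) -> list[float]:
    """Toy stand-in for an embedding model: hashed bag of words, L2-normalized."""
    vec = [0.0] * DIM
    for word in re.findall(r"[a-z0-9]+", clause.lower()):
        idx = int(hashlib.sha256(word.encode("utf-8")).hexdigest(), 16) % DIM
        vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two vectors that are already L2-normalized."""
    return sum(x * y for x, y in zip(a, b))


def process_contract(document: str, vector_db: list[tuple[str, list[float]]]) -> list[dict]:
    """Embed each clause, compare it with every stored vector, and report the closest match.

    vector_db is a plain list of (label, vector) pairs standing in for a vector database.
    """
    output = []
    for clause in parse_clauses(document):
        vec = embed(clause)
        label, score = max(
            ((lbl, cosine(vec, stored)) for lbl, stored in vector_db),
            key=lambda pair: pair[1],
            default=(None, 0.0),  # empty database: no match to report
        )
        output.append({"clause": clause, "closest": label, "similarity": score})
    return output


# Usage: a tiny reference store and a two-clause contract (both invented examples).
reference_clauses = {
    "termination": "Either party may terminate this agreement with thirty days written notice.",
    "confidentiality": "Each party shall keep the other party's information confidential.",
}
vector_db = [(label, embed(text)) for label, text in reference_clauses.items()]

contract = (
    "Either party may terminate this agreement with thirty days written notice.\n\n"
    "Payment is due within sixty days of invoice."
)
for row in process_contract(contract, vector_db):
    print(row["closest"], round(row["similarity"], 3), "|", row["clause"])
```

In a realistic system the toy `embed` function would be replaced by a trained embedding model and the list by a dedicated vector store with approximate nearest-neighbor search; the output data could then flag clauses that deviate from stored reference clauses.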