Patent attributes
A method is provided for clause analysis in a legal domain. The method builds a coherence graph from a set of labeled training documents by (a) creating entity nodes from and of a same type as entities extracted from the set of labeled training documents, (b) creating clause nodes from labeled clauses in the set of labeled training documents, (c) forming bi-directional edges (i) between each of the clause nodes and the entity nodes belonging thereto, (ii) among parent-child clause nodes from among the clause nodes, and (iii) among same-level sibling clause nodes from among the clause nodes. The method merges nodes, from among the entity and clause nodes, that have a same semantic meaning. The method weights the bi-directional edges using a coherence metric. The method identifies a clause structure of a new document by matching the new document against the coherence graph using a node-covering algorithm.