Patent attributes
Systems and methods for extracting data from unstructured data sources based on a proximity co-reference resolution model. The method includes receiving an electronic document from an unstructured data source and extracting entities from the electronic document. The method also includes receiving fields to be extracted from the electronic document and generating keywords based on the fields. Each of the entities is associated with at least one of the fields. The method further includes identifying keywords in the electronic document based on the generated keywords and calculating, for each of the fields, proximity scores based on the proximity co-reference resolution model. The method also includes, for each of the fields, identifying a field-entity pair based on the calculated proximity scores and generating the field-entity pair for display on a user device.
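
The steps of the method can be illustrated with a minimal Python sketch. The abstract does not define the scoring function, so the inverse token-distance score used here is an assumption, not the claimed model. The keyword table `FIELD_KEYWORDS`, the regex-based `extract_entities` helper, and the field names are likewise hypothetical and chosen only for illustration.

```python
import re

# Hypothetical keyword table: each requested field maps to generated keywords.
FIELD_KEYWORDS = {
    "invoice_date": ["issued", "dated"],
    "due_date": ["due"],
    "total_amount": ["total", "amount"],
}


def extract_entities(tokens):
    """Toy entity extractor (an assumption): tag dates and decimal amounts."""
    entities = []
    for pos, tok in enumerate(tokens):
        if re.fullmatch(r"\d{4}-\d{2}-\d{2}", tok):
            # A date entity may belong to more than one field.
            entities.append({"text": tok, "pos": pos,
                             "fields": {"invoice_date", "due_date"}})
        elif re.fullmatch(r"\d+\.\d{2}", tok):
            entities.append({"text": tok, "pos": pos,
                             "fields": {"total_amount"}})
    return entities


def proximity_score(entity_pos, keyword_positions):
    """Assumed score: inverse of the token distance to the nearest keyword."""
    if not keyword_positions:
        return 0.0
    nearest = min(abs(entity_pos - k) for k in keyword_positions)
    return 1.0 / (1.0 + nearest)


def resolve_fields(text, fields):
    """Return one field-entity pair per field, chosen by highest score."""
    tokens = text.lower().split()
    entities = extract_entities(tokens)
    pairs = {}
    for field in fields:
        keywords = FIELD_KEYWORDS.get(field, [])
        # Identify keyword occurrences in the document.
        kw_positions = [i for i, t in enumerate(tokens) if t in keywords]
        # Only entities associated with this field are candidates.
        scored = [(proximity_score(e["pos"], kw_positions), e["text"])
                  for e in entities if field in e["fields"]]
        if scored:
            best_score, best_text = max(scored, key=lambda s: s[0])
            if best_score > 0:
                pairs[field] = best_text
    return pairs


document = "invoice issued 2021-03-04 payment due 2021-04-03 total 450.00"
pairs = resolve_fields(document, ["invoice_date", "due_date", "total_amount"])
print(pairs)
```

In this sketch both dates are candidates for both date fields, and the proximity score alone decides which date is paired with which field. A production system would likely replace the regex extractor with a named-entity recognizer and could weight distance differently.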