Patent attributes
A system for template invariant information extraction. The system comprises of processor, a first neural network model and a second neural network model. The processor is configured to recognize and extract entities and location of the entities in the input document using the first neural network model. The processor is further configured to classify whether the input document belongs to at least a template of the documents of the first training dataset using the second neural network model. The second neural network model comprises a linear classifier configured to generate a plurality of confidence scores for the input document corresponding to a unique template of the documents of the first training dataset. A threshold value to classify the input document belonging to template of the documents of the first training dataset is determined, Classification is done by comparing the confidence score with the threshold value.