Patent attributes
A computing device for extracting target data from a source document includes: a memory storing target data extraction rules; a processor connected with the memory, the processor configured to: obtain text recognition data extracted from an image of the source document, the text recognition data indicating locations of text structures in the source document; define text lines based on the text recognition data; identify a reference string from the text recognition data; select a subset of the text lines based on a location of the reference string and the target data extraction rules; and output the subset of the text lines as the target data.