US Patent 11893818 Optimization and use of codebooks for document analysis

Golden AI created this topic on 9 Feb, 2024 (created via: Patent importer).

Article

Patent abstract

A method of generating and optimizing a codebook for document analysis comprises: receiving a first set of document images; extracting a plurality of keypoint regions from each document image of the first set of document images; calculating local descriptors for each keypoint region of the extracted keypoint regions; clustering the local descriptors such that each center of a cluster of local descriptors corresponds to a respective visual word; generating a codebook containing a set of visual words; and optimizing the codebook by maximizing mutual information (MI) between a target field of a second set of document images and at least one visual word of the set of visual words.
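The last two steps of the abstract (assigning descriptors to visual words and scoring words by mutual information with a target) can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how MI is maximized, so ranking words by the MI between their per-document presence and a binary target label is an assumption made here for illustration. The function names (nearest_word, mutual_information, rank_words), the 2-D toy descriptors, and the pre-computed cluster centers are all hypothetical; keypoint extraction and clustering are assumed to have happened already.

```python
import math
from collections import Counter

def nearest_word(descriptor, codebook):
    """Index of the visual word (cluster center) closest to a descriptor."""
    return min(range(len(codebook)),
               key=lambda i: math.dist(descriptor, codebook[i]))

def mutual_information(xs, ys):
    """MI in bits between two equally long sequences of discrete values."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # Sum of p(x,y) * log2(p(x,y) / (p(x) * p(y))) over observed pairs.
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())

def rank_words(doc_descriptors, targets, codebook):
    """Rank visual words by MI between word presence and the target label."""
    # Set of visual-word indices that occur in each document.
    doc_words = [{nearest_word(d, codebook) for d in descs}
                 for descs in doc_descriptors]
    scores = []
    for w in range(len(codebook)):
        presence = [w in words for words in doc_words]
        scores.append((mutual_information(presence, targets), w))
    return sorted(scores, reverse=True)

# Hypothetical toy data: three visual words and four documents.
codebook = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
doc_descriptors = [
    [(0.1, 0.1), (9.9, 0.2), (0.2, 9.8)],   # contains words 0, 1, 2
    [(0.2, 0.0), (0.1, 10.1)],              # contains words 0, 2
    [(10.1, 0.1), (0.0, 9.9)],              # contains words 1, 2
    [(0.3, 9.7)],                           # contains word 2 only
]
targets = [1, 1, 0, 0]  # assumed binary label derived from the target field

ranking = rank_words(doc_descriptors, targets, codebook)
best_mi, best_word = ranking[0]
print(best_word, best_mi)
```

In this toy setup only word 0 co-occurs perfectly with the target label, so it ranks first; words whose presence is independent of the label, or that appear in every document, carry no information about the target and score zero. A pruned codebook could keep only the top-ranked words, although the selection rule actually claimed by the patent may differ.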

Infobox

Is a