Patent attributes
We have discovered a system and method for improving the quality of information extraction applications consisting of an ensemble of per-user, adaptive, on-line machine-learning classifiers that adapt to document content and judgments of users by continuously incorporating feedback from information extraction results and corrections that users apply to these results. The satellite classifier ensemble uses only the immediately available features for classifier improvement and it is independent of the complex cascade of earlier decisions leading to the final information extraction result. The machine-learning classifiers may also provide explanations or justifications for classification decisions in the form of rules, other machine-learning classifiers may provide feedback in the form of supporting instances or patterns. Developers control the number and type of classifiers in the satellite classifier ensemble, the pre-deployment training of these classifiers, and the features monitored by the implemented satellite classifier ensemble.