US Patent 8655803 Method of feature extraction from noisy documents

Is a

Patent

Patent attributes

Current Assignee

Xerox

Patent Jurisdiction

United States Patent and Trademark Office

Patent Number

8655803

Date of Patent

February 18, 2014

Patent Application Number

12336872

Date Filed

December 17, 2008

Patent Citations Received

‌

US Patent 11941497 System and method of operationalizing automated feature engineering

Patent Primary Examiner

‌

Kakali Chaki

Patent abstract

Aspect of the exemplary embodiment relate to a method and apparatus for automatically identifying features that are suitable for use by a classifier in assigning class labels to text sequences extracted from noisy documents. The exemplary method includes receiving a dataset of text sequences, automatically identifying a set of patterns in the text sequences, and filtering the patterns to generate a set of features. The filtering includes at least one of filtering out redundant patterns and filtering out irrelevant patterns. The method further includes outputting at least some of the features in the set of features, optionally after fusing features which are determined not to affect the classifiers accuracy if they are merged.

Timeline

No Timeline data yet.

Further Resources

Title

Author

Link

Type

Date

No Further Resources data yet.

US Patent 8655803 Method of feature extraction from noisy documents

Contents

Patent attributes

Timeline

Further Resources

References

Find more entities like US Patent 8655803 Method of feature extraction from noisy documents