US Patent 7711673 Automatic charset detection using SIM algorithm with charset grouping

Is a

Patent

Patent attributes

Current Assignee

Trend Micro

Patent Jurisdiction

United States Patent and Trademark Office

Patent Number

7711673

Date of Patent

May 4, 2010

Patent Application Number

11238349

Date Filed

September 28, 2005

Patent Citations Received

US Patent 11675862 Online activity identification using artificial intelligence

US Patent 11941497 System and method of operationalizing automated feature engineering

Patent Primary Examiner

David R Vincent

Patent abstract

The invention relates, in an embodiment, to a computer-implemented method for automatic charset detection, which includes detecting an encoding scheme of a target document. The method includes training, using a plurality of text document samples, to obtain a set of machine learning models. Training includes using SIM (Similarity Algorithm) to generate the set of machine learning models from feature vectors obtained from the plurality of text document samples. The method also includes applying the set of machine learning models against a set of target document feature vectors converted from the target document to detect the encoding scheme.
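The train-then-apply flow described in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the features or the internals of SIM, so everything below is an assumption made for illustration: byte-bigram counts stand in for the feature vectors, one summed vector per charset stands in for a "machine learning model", and cosine similarity stands in for the similarity measure. The function names (byte_bigram_vector, cosine, train, detect) are hypothetical and do not come from the patent.

```python
import math
from collections import Counter


def byte_bigram_vector(data: bytes) -> Counter:
    """Assumed feature choice: counts of adjacent byte pairs in the raw bytes."""
    return Counter(zip(data, data[1:]))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * b[key] for key, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train(samples: dict[str, list[bytes]]) -> dict[str, Counter]:
    """Build one model per charset by summing the feature vectors of its samples."""
    models = {}
    for charset, docs in samples.items():
        total = Counter()
        for doc in docs:
            total.update(byte_bigram_vector(doc))
        models[charset] = total
    return models


def detect(models: dict[str, Counter], target: bytes) -> str:
    """Return the charset whose model is most similar to the target's feature vector."""
    vec = byte_bigram_vector(target)
    return max(models, key=lambda charset: cosine(models[charset], vec))


# Hypothetical training data: the same sentence encoded two ways.
text = "the quick brown fox jumps over the lazy dog"
models = train({
    "utf-8": [text.encode("utf-8")],
    "utf-16-le": [text.encode("utf-16-le")],
})
print(detect(models, "hello world".encode("utf-16-le")))
```

A realistic detector would need far more training text per charset and a feature set that separates closely related encodings; this sketch only shows the shape of the pipeline (samples to feature vectors to models, then target to feature vector to best-matching model).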
