US Patent 8010541 Systems and methods for condensation-based privacy in strings

Is a

Patent

Patent attributes

Patent Jurisdiction

United States Patent and Trademark Office

Patent Number

8010541

Patent Inventor Names

Philip S. Yu0

Charu C. Aggarwal0

Date of Patent

August 30, 2011

Patent Application Number

11540406

Date Filed

September 30, 2006

Patent Primary Examiner

Charles Kim

Patent abstract

Novel methods and systems for the privacy preserving mining of string data with the use of simple template based models. Such template based models are effective in practice, and preserve important statistical characteristics of the strings such as intra-record distances. Discussed herein is the condensation model for anonymization of string data. Summary statistics are created for groups of strings, and use these statistics are used to generate pseudo-strings. It will be seen that the aggregate behavior of a new set of strings maintains key characteristics such as composition, the order of the intra-string distances, and the accuracy of data mining algorithms such as classification. The preservation of intra-string distances is a key goal in many string and biological applications which are deeply dependent upon the computation of such distances, while it can be shown that the accuracy of applications such as classification are not affected by the anonymization process.

Timeline

No Timeline data yet.

Further Resources

Title

Author

Link

Type

Date

No Further Resources data yet.

US Patent 8010541 Systems and methods for condensation-based privacy in strings

Contents

Patent attributes

Timeline

Further Resources

References

Find more entities like US Patent 8010541 Systems and methods for condensation-based privacy in strings