Patent attributes
Techniques are described relating to the detection of personal information that may be sent to parties outside of an organization. Techniques may include comparing portions of emails to several file templates to calculate a document exposure score. The document exposure score may indicate an overall similarity based upon the presence of a number of common items such as graphics, words, form fields, etc. When the document exposure score for a particular sent email is greater than a threshold value, the sent email may be re-routed and quarantined instead of being transmitted outside of the organization's local network. A secondary determination may also be performed that identifies personal information when a matching file template is not initially found and, if so, adds a new file template to a template database to improve the performance and accuracy of the system over time.