Patent attributes
Additional background information is used with a trained neural network based model to help classify whether text is a subtly and/or ambiguously offensive. This additional background information can come from different sources such as the article on which the comment was made, world knowledge about the external entities (e.g., Wikipedia, Urban Dictionary), phrases referenced in the text being classified, and, the context of the previous comments/text in the thread. The background information is retrieved based on key entities (e.g., people, places things) and/or key phrases in the comment. Sentence matrix encodings are built for both the comment and the background information. The background information encoding is used to condition the comment encoding. The background information encoding, and the conditioned comment encoding are fed to a trained multi-level perceptron to classify the comment as hate speech or non-hate speech.