Patent attributes
Systems and methods are disclosed for determining whether a short amount of text is irrelevant. Initially, an article is selected having one or more comments of varying length. Depending on the number of comments available, a native context may be constructed based on a given comment and other neighboring comments. In other embodiments, a transferred context may be constructed from the given comment and topically similar comments extracted from other, topically similar articles. A native context-aware feature may be determined from the constructed native context and a transferred context-aware feature may be determined from the constructed transferred context. These features may be leveraged by a language classifier to determine whether a given comment is irrelevant.