Patent attributes
Data characterizing a document including a target word and a plurality of potential meanings for the target word is received. A first set of context words is determined using a language model. The first set of context words is for the target word. A second set of context words is determined using a knowledge base and the language model. The second set of context words is for the plurality of potential meanings of the target word. A score is determined for each of the plurality of potential meanings by at least comparing the first set of context words and the second set of context words. A potential meaning selected from the plurality of potential meanings that has a highest score is selected as a disambiguation of the first word. Related apparatus, systems, techniques and articles are also described.