Patent 7328404 was granted by the United States Patent and Trademark Office and assigned to Microsoft in February 2008.
Systems and methods for effective and reliable reading prediction for Japanese ideographs are provided. In an illustrative implementation, a reading prediction system operates in two modes, “learning” and “execution/run-time”. In the “learning” mode, the system operates on a number of input sources to produce a decision tree, which is then used in the “execution/run-time” mode to return reading predictions for input Japanese sentences containing ideographs. The inputs used in the “learning” mode include base Japanese script readings, a training corpus, and quasi-phonological rules; from these inputs, underlying readings and a decision tree are created. In the “execution/run-time” mode, the system employs a morphological analyzer to perform a morphological analysis of each input sentence. Reading predictions are then produced from the morphological analysis, the quasi-phonological rules, the underlying readings, and the decision tree.
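The two-mode flow described above can be illustrated with a toy sketch. This is not the patented method: the corpus, the "standalone"/"compound" context feature, and the function names `learn`, `analyze`, and `predict` are all hypothetical, the "decision tree" is reduced to a two-level lookup, and the quasi-phonological rules and underlying readings are omitted entirely. It assumes the input sentence has already been split into tokens, standing in for a real morphological analyzer.

```python
from collections import Counter, defaultdict

# Hypothetical training corpus: (ideograph, context feature, observed reading).
# The context feature stands in for the output of a morphological analysis.
TRAINING_CORPUS = [
    ("山", "standalone", "やま"),
    ("山", "compound", "さん"),
    ("山", "compound", "さん"),
    ("水", "standalone", "みず"),
    ("水", "compound", "すい"),
]


def learn(corpus):
    """Learning mode: build a two-level tree, ideograph -> context -> reading.

    Each leaf holds the reading seen most often for that ideograph and context.
    """
    counts = defaultdict(lambda: defaultdict(Counter))
    for ideograph, context, reading in corpus:
        counts[ideograph][context][reading] += 1
    return {
        ideograph: {
            context: readings.most_common(1)[0][0]
            for context, readings in by_context.items()
        }
        for ideograph, by_context in counts.items()
    }


def is_ideograph(ch):
    """True for characters in the basic CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def analyze(tokens):
    """Toy stand-in for a morphological analyzer.

    Takes pre-segmented tokens and labels each ideograph as "standalone"
    (single-character token) or "compound" (part of a longer token).
    Non-ideograph characters such as kana are skipped.
    """
    features = []
    for token in tokens:
        context = "standalone" if len(token) == 1 else "compound"
        for ch in token:
            if is_ideograph(ch):
                features.append((ch, context))
    return features


def predict(tree, tokens):
    """Run-time mode: walk the tree for each ideograph; None if unseen."""
    return [
        (ch, tree.get(ch, {}).get(context))
        for ch, context in analyze(tokens)
    ]
```

With this toy corpus, `predict(learn(TRAINING_CORPUS), ["山水"])` walks the "compound" branch for both characters, while an ideograph absent from the corpus yields `None` rather than a guess.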