Patent attributes
Methods, systems, and apparatus for extended training data sets for neural network-based IT solutions. In one aspect, a method includes collecting, support data generated during multiple support events including a corpus of articles referenced during the support event. Within the corpus of articles, restricted and unrestricted articles referenced in the support data and restricted and unrestricted articles not referenced in the support data are identified. Embedded vectors are generated for each article that is referenced in the support data from the article and a subset of the support data that references the article, and for each article that is not referenced by the support data, an embedded vector from only the article. A dimensionality of the embedded vectors is reduced and a neural network is trained using the embedded vectors to select a particular article of the corpus of articles responsive to a new support event.