Patent attributes
The present technology pertains to a method and system for assessing risks associated with facilities using natural language processing. For example, a method can include receiving a natural language input comprising at least one raw text document associated with a facility and generating a plurality of segmented sentences from the at least one raw text document. The plurality of segmented sentences can be provided as inputs to a machine learning model trained to classify an input segmented sentence over a pre-defined lexicon of pharmaceutical terminology. Each segmented sentence can be classified into one or more classes given by the pre-defined lexicon. A secondary classification can then be performed on each classified segmented sentence to generate a production issue label based on an analysis of that sentence. From the secondary classifications, at least one production category score for the facility can be generated.
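The flow described above (segment, classify over a lexicon, assign a production issue label, aggregate into a category score) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `LEXICON` classes and terms, the `ISSUE_CUES` words, the function names, and the scoring rule (fraction of sentences in a class that are labeled as issues) do not come from the source. In particular, simple keyword matching stands in for the trained machine learning model.

```python
import re
from collections import Counter

# Hypothetical lexicon: class name -> trigger terms (illustrative only).
LEXICON = {
    "sterility": {"sterile", "contamination", "aseptic"},
    "equipment": {"equipment", "calibration", "cleaning"},
}
# Hypothetical cue words used by the secondary classification.
ISSUE_CUES = {"failed", "inadequate", "deficient", "lacked"}


def tokenize(sentence):
    """Lowercase the sentence and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", sentence.lower()))


def segment_sentences(raw_text):
    """Split raw text after '.', '!' or '?' when followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", raw_text.strip())
    return [p for p in parts if p]


def classify_sentence(sentence):
    """Keyword stand-in for the trained model: return matching lexicon classes."""
    tokens = tokenize(sentence)
    return sorted(name for name, terms in LEXICON.items() if tokens & terms)


def production_issue_label(sentence):
    """Secondary classification: 'issue' if any cue word appears."""
    return "issue" if tokenize(sentence) & ISSUE_CUES else "no_issue"


def category_scores(documents):
    """Per class, the fraction of classified sentences labeled 'issue'."""
    totals, issues = Counter(), Counter()
    for doc in documents:
        for sentence in segment_sentences(doc):
            label = production_issue_label(sentence)
            for cls in classify_sentence(sentence):
                totals[cls] += 1
                if label == "issue":
                    issues[cls] += 1
    return {cls: issues[cls] / totals[cls] for cls in totals}


docs = [
    "The firm failed to maintain aseptic conditions. "
    "Equipment calibration records were complete. "
    "Cleaning of equipment was inadequate."
]
scores = category_scores(docs)
```

For the sample text, the single sterility sentence is labeled an issue and one of the two equipment sentences is, so the sketch yields a score of 1.0 for `sterility` and 0.5 for `equipment`. A real system would replace both classification steps with trained models and could weight or normalize the scores differently.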