A method, a computer program product, and an information handling system is provided for labeling unlabeled utterances given a taxonomy of labels utilizing topic word semi-supervised learning.