In automatic speech recognition, parsing information such as rules and tags may be embedded into a finite state transducer (FST), so that the FST output includes the speech recognition results together with codes indicating how the recognized speech was parsed. The codes in the FST output may be formatted using a structured format, such as XML or JSON, for processing by a downstream application. The FST may be constructed according to a grammar that defines the parsing information. The codes may be embedded into arcs of the FST, so that they are added to the FST output as the speech recognition engine traverses those arcs.
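
The idea of embedding output codes into arcs can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not taken from the source: the toy grammar, the state numbers, the `<command>` and `<genre>` tag names, and the helper names `build_toy_fst` and `transduce`. A real speech recognition engine would use weighted arcs and search over many competing paths; this sketch follows a single deterministic path to show only how markup carried on arcs ends up in the output.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical arc table: (current state, input word) -> (next state, output).
# The output string on an arc may carry markup codes in addition to the word.
ArcTable = Dict[Tuple[int, str], Tuple[int, str]]


def build_toy_fst() -> ArcTable:
    """Build a tiny FST for the illustrative phrase 'play some jazz'."""
    return {
        (0, "play"): (1, "<command>play</command>"),  # arc tagged as a command
        (1, "some"): (2, "some"),                     # arc with no parsing code
        (2, "jazz"): (3, "<genre>jazz</genre>"),      # arc tagged as a genre
    }


def transduce(words: List[str], arcs: ArcTable,
              start: int = 0, finals: Set[int] = frozenset({3})) -> str:
    """Traverse the FST over the recognized words and collect arc outputs."""
    state = start
    emitted: List[str] = []
    for word in words:
        key = (state, word)
        if key not in arcs:
            raise ValueError(f"no arc from state {state} on input {word!r}")
        state, output = arcs[key]
        emitted.append(output)  # markup is emitted as the arc is traversed
    if state not in finals:
        raise ValueError(f"input ended in non-final state {state}")
    return " ".join(emitted)


if __name__ == "__main__":
    print(transduce(["play", "some", "jazz"], build_toy_fst()))
```

In this sketch the tagged output is produced purely as a side effect of traversal, so no separate parsing pass over the recognized text is needed; a later application could read the tags directly.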