A system and method simulates conversation with a human user. The system and method receive media, convert the media into a system-specific format, and compare the converted media to a vocabulary. The system and method generate a plurality of intents and a plurality of sub-entities and transform them into a pre-defined format. The system and method route intents and the sub-entities to a first selected knowledge engine and a second knowledge engine. The first selected knowledge engine selects the second knowledge engine and each active grammar in the vocabulary uniquely identifies each of the knowledge engines.