Patent attributes
Methods and systems for similarity-based value-to-column classification are disclosed. A method includes: receiving, by a computing device, a natural language search query; determining, by the computing device, a filtering phrase in the natural language search query using a natural language understanding model; encoding, by the computing device, the filtering phrase; retrieving, by the computing device, a plurality of encoded columns; for each of the plurality of encoded columns, determining, by the computing device, a similarity score based on a similarity between the encoded filtering phrase and the encoded column; and outputting, by the computing device, a column corresponding to the encoded column of the plurality of encoded columns having the highest similarity score.
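
The encoding, scoring, and selection steps of the method can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the abstract does not specify the encoder or the similarity measure, so the character-trigram encoder, the cosine similarity, the function names, and the sample columns below are all assumptions chosen for illustration. The sketch also assumes the filtering phrase has already been extracted from the query by a natural language understanding model, which is not modeled here.

```python
import math
from collections import Counter


def encode(text: str) -> Counter:
    """Toy encoder (an assumption): a bag of character trigrams.

    A real system would more likely use a learned embedding model.
    """
    padded = f"  {text.lower()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors (an assumed metric)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify_value_to_column(filtering_phrase: str, encoded_columns: dict):
    """Score every encoded column against the encoded filtering phrase
    and return the best-scoring column name together with all scores."""
    encoded_phrase = encode(filtering_phrase)
    scores = {
        name: cosine_similarity(encoded_phrase, encoded_column)
        for name, encoded_column in encoded_columns.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical columns, each encoded ahead of time from sample values.
sample_values = {
    "city": ["New York", "Boston", "Chicago"],
    "status": ["open", "closed", "pending"],
}
encoded_columns = {
    name: encode(" ".join(values)) for name, values in sample_values.items()
}

# Filtering phrase assumed to come from a query such as
# "show me all closed tickets".
best_column, scores = classify_value_to_column("closed", encoded_columns)
print(best_column, scores)
```

In this sketch the columns are encoded once, ahead of time, and only the filtering phrase is encoded per query, which matches the abstract's separation between retrieving encoded columns and encoding the phrase. Swapping in a different encoder or similarity function would not change the overall flow.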