Patent attributes
A method and apparatus for automatic schema detection and migration is disclosed. In embodiments, a file including NoSQL data is received and one or more data types are detected in a hierarchical data table description. Within a record of the NoSQL data, which may be stored in a JSON format, a field name and its data type, are stored in a schema describing the data table. As additional records are parsed, the schema is updated to include additional field names and data types, and may include designations such as repeated and optional, for some fields. In embodiments, the schema is a serialized data format, such as Google Protocol Buffers (Protobuf).