Patent 11604777 was granted by the United States Patent and Trademark Office and assigned to Amazon in March 2023.
Techniques for indexing large-scale datasets are described. A method for indexing large-scale datasets can include: receiving, by an indexing service, a request to generate an index for a dataset stored in a data storage service, the request including indexing information for the dataset; determining, by the indexing service, an index type based at least on the dataset; generating, by the indexing service, the index based at least on the indexing information and the index type; and receiving, by the indexing service, a request from a query service to identify a subset of the dataset using the index.
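The four steps in the abstract can be illustrated with a small in-memory sketch. This is not the patented implementation or any Amazon API. The class and method names (IndexingService, create_index, lookup), the "field" key in the indexing information, and the rule for choosing between a "sorted" and a "hash" index are all assumptions made for illustration, and a plain dictionary stands in for the data storage service.

```python
import bisect
from collections import defaultdict
from dataclasses import dataclass
from typing import Any


@dataclass
class Index:
    index_type: str   # "sorted" or "hash" (hypothetical type names)
    field_name: str   # field the index was built on
    keys: list        # sorted key values; only used by the "sorted" type
    positions: Any    # row positions: a list ("sorted") or a dict ("hash")


class IndexingService:
    """Hypothetical indexing service; `storage` stands in for a data storage service."""

    def __init__(self, storage):
        self.storage = storage   # dataset name -> list of records (dicts)
        self.indexes = {}        # (dataset name, field name) -> Index

    def _choose_index_type(self, dataset, field_name):
        # Step 2: determine the index type from the dataset itself.
        # Assumed rule: all-numeric fields get a sorted index, others a hash index.
        numeric = all(isinstance(r[field_name], (int, float)) for r in dataset)
        return "sorted" if numeric else "hash"

    def create_index(self, dataset_name, indexing_info):
        # Step 1: receive a request that carries indexing information.
        dataset = self.storage[dataset_name]
        field_name = indexing_info["field"]
        index_type = self._choose_index_type(dataset, field_name)

        # Step 3: generate the index from the indexing information and the type.
        if index_type == "sorted":
            pairs = sorted((r[field_name], i) for i, r in enumerate(dataset))
            index = Index(index_type, field_name,
                          [k for k, _ in pairs], [i for _, i in pairs])
        else:
            buckets = defaultdict(list)
            for i, r in enumerate(dataset):
                buckets[r[field_name]].append(i)
            index = Index(index_type, field_name, [], dict(buckets))

        self.indexes[(dataset_name, field_name)] = index
        return index

    def lookup(self, dataset_name, field_name, value):
        # Step 4: a query service asks which rows match, using the index
        # instead of scanning the whole dataset.
        index = self.indexes[(dataset_name, field_name)]
        if index.index_type == "sorted":
            lo = bisect.bisect_left(index.keys, value)
            hi = bisect.bisect_right(index.keys, value)
            return index.positions[lo:hi]
        return index.positions.get(value, [])


# Usage: a toy dataset with one numeric and one string field.
storage = {"orders": [
    {"id": 3, "region": "eu"},
    {"id": 1, "region": "us"},
    {"id": 2, "region": "eu"},
]}
service = IndexingService(storage)
id_index = service.create_index("orders", {"field": "id"})
region_index = service.create_index("orders", {"field": "region"})
eu_rows = service.lookup("orders", "region", "eu")   # row positions of "eu" orders
id_rows = service.lookup("orders", "id", 2)          # row position of the order with id 2
```

In this sketch the lookup returns row positions, so the query service reads only the matching subset of the dataset. A real system would presumably persist the index and select among richer index types, which the abstract does not specify.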