A system reduces the time and resources needed to search a large dataset by preprocessing the dataset prior to receiving a query. Generally, the system evaluates and combines rules that govern the validity of the data. The system combines rules to determine conditions under which data objects in the dataset are invalid. After a query is issued, the system quickly eliminates data objects from the response by applying the conditions rather than the individual rules to the data objects.