Patent attributes
A system and method for transforming input data in a data graph is structured in such a way that it does not destroy embedded contextual data yet also keeps the number of edges in the data graph sufficiently small in number that computation with respect to the data in the data graph is feasible with existing computational resources on extremely large graph sets. Incoming data is represented as a collection of “cliques” rather than placing each data object into its own node in the graph database. Maintaining the clique structure though the graph build pipeline dramatically reduces the exponential increase in the number of edges in the graph, while also maintaining all of the contextual data presented on the input record.