The present invention relates to data processing system and method in supply chain application. The data processing system includes clustering of received supply chain data after normalization, tokenization and vectorization through graph-based analysis.