Patent attributes
Techniques are disclosed for testing applications and systems used to provide a high performance computing (HPC) system, such as a cluster or grid. A grid computing system is tested using a “shadow mode” approach, where information regarding a parallel computing job launched on a first instance of a computing grid is recorded, stored, and used to replay the parallel computing job on a second instance of the computing grid. For example, the parallel computing job may be executed using a production build of grid applications and replayed on a candidate build of grid applications. Output from executing the parallel computing job is also captured in order to provide a reference result which may be compared with candidate results generated by replaying the parallel computing job. The parallel computing job may specify a set of inputs and execution logic along with any other requirements or dependencies needed to accurately replay the parallel computing job.
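The record-and-replay flow described above can be illustrated with a minimal sketch. It is not the disclosed implementation: the `JobRecord` schema, the function names, and the use of a thread pool as a stand-in for a computing grid are all assumptions made for illustration. A "build" is modeled here as a plain mapping from a logic name to a task function.

```python
import json
from concurrent.futures import ThreadPoolExecutor
from dataclasses import asdict, dataclass, field
from typing import Callable, Dict, List

# Hypothetical stand-in for a build of grid applications:
# a mapping from a logic name to the function each task runs.
Build = Dict[str, Callable[[int], int]]


@dataclass
class JobRecord:
    """Information recorded about one parallel job (illustrative schema)."""
    job_id: str
    logic: str                                   # execution logic to run
    inputs: List[int]                            # one input per task
    requirements: Dict[str, str] = field(default_factory=dict)
    reference_output: List[int] = field(default_factory=list)


def run_on_grid(build: Build, logic: str, inputs: List[int]) -> List[int]:
    """Run one task per input; a thread pool stands in for grid nodes."""
    task = build[logic]
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(task, inputs))      # map() preserves input order


def record_job(build: Build, job_id: str, logic: str,
               inputs: List[int], requirements: Dict[str, str]) -> str:
    """Execute on the first grid instance and store the job plus its output."""
    output = run_on_grid(build, logic, inputs)
    record = JobRecord(job_id, logic, list(inputs), dict(requirements), output)
    return json.dumps(asdict(record))            # stored form of the record


def replay_job(build: Build, stored: str) -> dict:
    """Replay a stored job on the second grid instance and compare results."""
    record = JobRecord(**json.loads(stored))
    candidate = run_on_grid(build, record.logic, record.inputs)
    reference = record.reference_output
    mismatched = [i for i, (ref, cand) in enumerate(zip(reference, candidate))
                  if ref != cand]
    matches = len(reference) == len(candidate) and not mismatched
    return {"job_id": record.job_id, "matches": matches,
            "mismatched_tasks": mismatched}


# Production build and two candidate builds (one with a deliberate defect).
production_build: Build = {"square": lambda x: x * x}
good_candidate: Build = {"square": lambda x: x ** 2}
bad_candidate: Build = {"square": lambda x: x * x + (1 if x == 3 else 0)}

stored = record_job(production_build, "job-001", "square",
                    [1, 2, 3, 4], {"os": "linux"})
good_report = replay_job(good_candidate, stored)
bad_report = replay_job(bad_candidate, stored)
```

In this sketch the stored record carries the inputs, the name of the execution logic, and any stated requirements alongside the reference output, so a replay needs nothing from the original run beyond the record itself. The comparison reports which task indices diverged, which is one simple way to localize a regression in a candidate build.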