A method for diagnosing software crashes includes calculating, using vector space modeling, angles between vectors representing stack-traces from reports in a repository on one or more server computer systems, to define similarities in the reports, wherein reports are deemed similar where angles between their respective vectors are less than a threshold value. The method further includes grouping the reports into similar sets using a maximal cliques process and automatically diagnosing, in response to a received software crash, a first stack-trace extracted from a first report of the received software crash, to determine a potential solution to the received software crash.