A hash code-based search apparatus includes: a token set extracting unit configured to extract, from a document, a token set having at least one token; a hash code generating unit configured to generate N hash codes (where N is a natural number) by applying N hash functions to the at least one token; and an index generating unit configured to generate a search index by indexing the document with the N hash codes.
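
The three units described above can be illustrated with a minimal Python sketch. It is not the claimed implementation, and every name in it (`extract_token_set`, `hash_token`, `generate_hash_codes`, `build_index`) is hypothetical. It assumes that tokens are lowercase whitespace-separated words, that the N hash functions are SHA-256 salted with a seed from 0 to N-1, and that each function yields one code per document by taking the minimum hash value over the token set (a MinHash-style reduction). The source specifies none of these details.

```python
import hashlib
from collections import defaultdict
from typing import Dict, List, Set, Tuple


def extract_token_set(document: str) -> Set[str]:
    """Token set extracting unit (assumption: lowercase whitespace tokens)."""
    return set(document.lower().split())


def hash_token(token: str, seed: int) -> int:
    """One member of a hypothetical hash family: SHA-256 salted with a seed."""
    digest = hashlib.sha256(f"{seed}:{token}".encode("utf-8")).digest()
    # Keep the first 8 bytes as a 64-bit integer hash value.
    return int.from_bytes(digest[:8], "big")


def generate_hash_codes(tokens: Set[str], n: int) -> List[int]:
    """Hash code generating unit: N codes from N hash functions.

    Assumption: the i-th code is the minimum of the i-th hash function
    over all tokens, so the result does not depend on token order.
    """
    if not tokens:
        raise ValueError("token set must contain at least one token")
    return [min(hash_token(t, seed) for t in tokens) for seed in range(n)]


def build_index(documents: Dict[str, str], n: int) -> Dict[Tuple[int, int], Set[str]]:
    """Index generating unit: map (function number, hash code) to document ids."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        codes = generate_hash_codes(extract_token_set(text), n)
        for i, code in enumerate(codes):
            index[(i, code)].add(doc_id)
    return dict(index)


if __name__ == "__main__":
    docs = {
        "doc1": "hash code based search",
        "doc2": "search based hash code",
        "doc3": "an unrelated document",
    }
    index = build_index(docs, n=4)
    query_codes = generate_hash_codes(extract_token_set("hash code based search"), 4)
    # Look up the documents that share the first hash code with the query.
    print(index.get((0, query_codes[0]), set()))
```

In this sketch a query is hashed the same way as a document, and candidate documents are found by looking up each (function number, hash code) pair in the index. Documents with identical token sets always share all N codes under these assumptions.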