Patent attributes
The method of indexing multimedia documents comprises the following steps: a) for each document, identifying and extracting terms ti constituted by vectors characterizing properties of the document; b) storing the terms ti in a term base comprising P terms; c) determining a maximum number N of desired concepts that group together the most pertinent terms ti; d) calculating the matrix T of distances between the terms ti of the term base; e) decomposing the set P of terms ti of the term base into N portions Pj (1 ≤ j ≤ N) such that P = P1 ∪ P2 ∪ ... ∪ Pj ∪ ... ∪ PN, each portion Pj comprising a set of terms tij and being represented by a concept cj, the terms ti being distributed in such a manner that terms that are farther apart are found in distinct portions Pl, Pm, and terms that are closer together are found in the same portion Pl; f) structuring the concept dictionary; and g) constructing a fingerprint base made up of the set of concepts ci representing the terms ti of the documents, each document being associated with a fingerprint that is specific thereto.
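Steps d), e) and g) can be illustrated with a minimal sketch. It is not the decomposition procedure of the patent itself, which the abstract does not detail. It assumes that terms are plain numeric vectors compared by Euclidean distance, and it substitutes a simple farthest-point seeding heuristic followed by nearest-seed assignment for step e). The function names `distance_matrix`, `partition_terms` and `fingerprint`, the sample vectors, and the document names are all hypothetical.

```python
import math


def distance_matrix(terms):
    """Step d): matrix T of pairwise distances (Euclidean is an assumption)."""
    return [[math.dist(a, b) for b in terms] for a in terms]


def partition_terms(T, n_concepts):
    """Step e), approximated with a stand-in heuristic (not the patent's method).

    Seeds are chosen by farthest-point selection so that distant terms end up
    in distinct portions; every term then joins the portion of its closest
    seed, so nearby terms share a portion. Returns labels[i] = portion index.
    """
    p = len(T)
    n = min(n_concepts, p)
    seeds = [0]  # arbitrary first seed (assumption)
    while len(seeds) < n:
        # Next seed: the term whose distance to its closest seed is largest.
        nxt = max(range(p), key=lambda i: min(T[i][s] for s in seeds))
        seeds.append(nxt)
    return [min(range(n), key=lambda j: T[i][seeds[j]]) for i in range(p)]


def fingerprint(doc_term_ids, labels):
    """Step g): a document's fingerprint as the sorted concepts of its terms."""
    return sorted({labels[i] for i in doc_term_ids})


# Hypothetical term base with P = 5 two-dimensional terms and N = 3 concepts.
terms = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (0.0, 9.0)]
T = distance_matrix(terms)
labels = partition_terms(T, n_concepts=3)

# Each hypothetical document is listed by the indices of its terms.
docs = {"doc_a": [0, 1, 2], "doc_b": [3, 4]}
fingerprints = {name: fingerprint(ids, labels) for name, ids in docs.items()}
print(labels, fingerprints)
```

In this toy term base the two terms near the origin share one concept, the two terms near (5, 5) share another, and the isolated term gets its own, so each document's fingerprint is simply the list of concepts its terms fall into. Step f), structuring the concept dictionary, is left out of the sketch.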