A method for building a classification model for classifying unclassified documents
based on the classification of a plurality of documents which respectively have
been classified as belonging to one of a plurality of classes, said documents being
digitally represented in a computer, said documents respectively comprising a plurality
of terms which respectively comprise one or more symbols of a finite set of symbols,
and said method comprising the following steps: representing each of said plurality
of documents by a vector of n dimensions, said n dimensions forming a vector space,
whereas the value of each dimension of said vector corresponds to the frequency
of occurrence of a certain term in the document corresponding to said vector, so
that said n dimensions span up a vector space; representing the classification
of said already classified documents into classes by separating said vector space
into a plurality of subspaces by one or more hyperplanes, such that each subspace
comprises one or more documents as represented by their corresponding vectors in
said vector space, so that said each subspace corresponds to a class.