The goal of text categorization is to classify a collection of documents and assign each document to the applicable categories. However, building text classification ”by hand” is time-consuming, thus it is more practical to use machine learning approach to classify from examples - to perform the category assignments automatically.
Many research in text classification focuses on the binary classification using Support Vector Machine (usually abbreviated as SVM for short) learning approach. However, SVM is typically used to perform binary classification, and binary classification is always not enough in practice. Hence, many scientist has started working on finding the best combination of SVM with other means to perform multi-class classification. One approach is to develop a alternative implementation of Support Vector Machine with information gain as evaluation method.
There are numerous tasks that can be automated with text categorization methods. Here are some of them [1]: