Class Imbalance

Class imbalance occurs when you have a data set in which there are only a few responsive documents (positives) and a large number of not responsive documents (negatives).

Prevalence values as small as 0,1% are commonly encountered for responsive documents. This class imbalance negatively affects the performance of the classifier.