ATTENTION/WARNING - NE PAS DÉPOSER ICI/DO NOT SUBMIT HERE

Ceci est la version de TEST de DIAL.mem. Veuillez ne pas soumettre votre mémoire sur ce site mais bien à l'URL suivante: 'https://thesis.dial.uclouvain.be'.
This is the TEST version of DIAL.mem. Please use the following URL to submit your master thesis: 'https://thesis.dial.uclouvain.be'.
 

A comparison of classification models for imbalanced datasets

(2017)

Files

Kurin_11251501_2017.pdf
  • Open access
  • Adobe PDF
  • 1.2 MB

Kurin_11251501_2017_Annexe1.pdf
  • Open access
  • Adobe PDF
  • 24.81 KB

Details

Supervisors
Faculty
Degree label
Abstract
Many practical classification problems are imbalanced; i.e., at least one of the classes constitutes only a very small minority of the data. For such problems, the interest usually leans towards correct classification of the minor class. Examples of such problems include fraud detection, rare disease diagnosing, etc. However, the most commonly used classification algorithms do not work well for such problems because they aim to minimize the overall error rate, rather than paying special attention to the minor class. In the master's thesis, a number of models are evaluated with the objective to find those that better address the classification problem of imbalanced datasets. A special focus is given to the investigation of some ways of dealing with imbalanced datasets based on a Logistic Regression.