ATTENTION/WARNING - NE PAS DÉPOSER ICI/DO NOT SUBMIT HERE

Ceci est la version de TEST de DIAL.mem. Veuillez ne pas soumettre votre mémoire sur ce site mais bien à l'URL suivante: 'https://thesis.dial.uclouvain.be'.
This is the TEST version of DIAL.mem. Please use the following URL to submit your master thesis: 'https://thesis.dial.uclouvain.be'.
 

Partially supervised feature selection using tree ensemble approaches

(2016)

Files

Saleh_30931400_2016.pdf
  • Closed access
  • Adobe PDF
  • 941.59 KB

Details

Supervisors
Faculty
Degree label
Abstract
This work offers a mechanism to perform feature selection using the benefits of having a priori partial knowledge about the relevance of some variables in a given context. It extends the embedded feature selection method of random forests by increasing the probability of the relevant features to be evaluated by Gini index at each node. We show that using the a priori knowledge in this way will deviate the embedded mechanism of variables selection towards the favored ones in a way that the overall selected variables will be indeed relevant. This kind of favoring has some drawbacks like overusing the favored variables, but it can be avoided by increasing the number of variables from which the selection is performed.