ATTENTION/WARNING - NE PAS DÉPOSER ICI/DO NOT SUBMIT HERE

Ceci est la version de TEST de DIAL.mem. Veuillez ne pas soumettre votre mémoire sur ce site mais bien à l'URL suivante: 'https://thesis.dial.uclouvain.be'.
This is the TEST version of DIAL.mem. Please use the following URL to submit your master thesis: 'https://thesis.dial.uclouvain.be'.
 

A Conditional Random Fields model for Drug Name Recognition in the cryptomarket forum of Silk Road 2

(2022)

Files

WERNER_07991100_2022.pdf
  • Open access
  • Adobe PDF
  • 967.92 KB

Details

Supervisors
Faculty
Degree label
Abstract
This thesis aimed at developing a Drug Name Recognition system to extract drug terms from the cryptomarket forum of Silk Road 2 thanks to a Conditional Random Fields model to operate a classification between the terms that are considered as completely new to a database of well-known drugs, those that are variants of already-known drugs and those that are variants of new drug terms. This thesis aimed at fulfilling two particular objectives. First, we wanted to analyze whether or not the use of a CRF model could improve the performance of the model. Second, we aimed at investigating whether forum posts could provide useful information for national agencies as regards the early appearance of drug names. Our model enabled us to discover the presence of 232 new drug names as well as to acknowledge that our model outperforms the results of the pre-annotation phase as well as of other studies.