This project is an extension of the FastContext project https://github.com/jianlins/FastContext, created in 2019 for the extension of the FastContext algorithm [1] to French.
This repository doesn't fork any code from FastContext, but provides the versioned list of rules in the FastContext format generated for the French language, as well as detailed instructions to use FastContext with our French adaptation.
The list of French lexical is generated by a semi-automatic process, which is documented comprehensively on a dedicated repository: https://github.com/practikpharma/FrenchFastContext-CuesGeneration
Andon Tchechmedjiev (@twktheainur), Mehdi Mirzapour (@mehdi-mirzapour) with supervision of Clement Jonquet (@jonquet)
LIRMM
The French Lexcical Cues Fast Context list stored in this repository is versionned:
- v0: This list was produced by taking the list compiled by @amineabdaoui for French ConText (https://github.com/practikpharma/FrenchConText/blob/master/src/main/java/org/context/implementation/ConTextFrench.java) and automatically transforming it into the FastContext's format. The process of creation of the list and integration of French Context in the SIFR Annotator is described here [2].
- v1 : This list was produced by @mehdi-mirzapour using the scripts in https://github.com/practikpharma/FrenchFastContext-CuesGeneration. Description of the process is under submission for publication. This list was used for the evaluation reported in the paper under review.
- v2: This list is a corrected version of v1 updated after evalauiton but using insights comming from evaluation error analysis.
The latest version of the list shall be copied here for use within the SIFR Annotator: https://github.com/sifrproject/annotators/blob/master/sifr-bioportal-annotation-postannotations/src/main/resources/french_context.tsv
[1] https://dx.doi.org/10.1016%2Fj.jbi.2018.08.002 [2] https://hal.archives-ouvertes.fr/lirmm-01656834