Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for information about models and annotation formats #203

Open
balachander1964 opened this issue Dec 25, 2020 · 4 comments
Open

Request for information about models and annotation formats #203

balachander1964 opened this issue Dec 25, 2020 · 4 comments

Comments

@balachander1964
Copy link

Hi, I will appreciate if you share the links to download the models and data annotation format details.

@swfarnsworth
Copy link
Member

MedaCy reads files in the BRAT format.

There's only one model online currently, namely clinical notes. However we do have other datasets available with different entity types. What types of entities did you want to be able to identify?

@balachander1964
Copy link
Author

balachander1964 commented Dec 28, 2020 via email

@swfarnsworth
Copy link
Member

Bala,

Thank you for the well wishes. I wish you and those close to you the same.

We have a dataset with most of the entity types you specified, however I am not sure that we have one that can identify severity and affected areas. A model trained upon it is given here, though the API for it was designed before we transitioned medaCy to a primarily command line application. This file is the actual model, and if you download it, you should be able to use it with medaCy's command line interface's predict functionality, using ClinicalPipeline as the pipeline option.

This particular model uses a conditional random field. We have had more success with BiLSTM and BERT models, though these have better performance when a GPU is available.

Please let us know if we can be of further assistance.

Steele

@balachander1964
Copy link
Author

balachander1964 commented Dec 29, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants