git clone --recursive https://github.com/amalic/bern-docker.git
- Lot's of disk space (~60 GB disk space)
- 32 GB Ram (see BERN project README)
- NVidia Docker
- Those files:
- GNormPlus.zip (~4GB)
- tmVarJava.zip
ftp://ftp.ncbi.nlm.nih.gov/pub/lu/Suppl/tmVar2/tmVarJava.zip
(~40GB) - CRF++-0.58.tar.gz (~4 MB)
- biobert_ner_models.zip (~7GB)
- data.zip (~750 MB)
- resources.zip (~3 GB)
Download and extract all archives into externalData folder within this repository by running.
sh prerequisites.sh
You might want to get a coffee or learn a new programming language, depending on your internet connection.
docker build -t bern-docker .
docker run --name bern-docker --log-opt max-size=10m -d --rm --gpus all -p 80:8888 \
-v $PWD/externalData/GNormPlusJava/Dictionary/:/app/GNormPlusJava/Dictionary/ \
-v $PWD/externalData/tmVarJava/Database:/app/tmVarJava/Database \
-v $PWD/externalData/biobert_ner_models/pretrainedBERT:/app/biobert_ner/pretrainedBERT/ \
-v $PWD/externalData/biobert_ner_models/conf/:/app/biobert_ner/conf/ \
-v $PWD/externalData/biobert_ner_models/tmp/:/app/biobert_ner/tmp/ \
-v $PWD/externalData/biobert_ner_models/result/:/app/biobert_ner/result/ \
-v $PWD/externalData/data/:/app/normalization/data/ \
-v $PWD/externalData/resources/:/app/normalization/resources/ \
bern-docker
Usage PMID(s) (HTTP GET)
- http://<YOUR_SERVER_ADDRESS>:<PORT>/?pmid=<a PMID or comma seperate PMIDs>&format=<json or pubtator>
- Example: http://localhost/?pmid=30429607&format=json&indent=true
- Example: http://localhost/?pmid=30429607&format=pubtator
- Example: http://localhost/?pmid=30429607,29446767&format=json&indent=true
@article{kim2019neural,
title={A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text Mining},
author={Kim, Donghyeon and Lee, Jinhyuk and So, Chan Ho and Jeon, Hwisang and Jeong, Minbyul and Choi, Yonghwa and Yoon, Wonjin and Sung, Mujeen and and Kang, Jaewoo},
journal={IEEE Access},
volume={7},
pages={73729--73740},
year={2019},
publisher={IEEE}
}
@article{10.1093/bioinformatics/btz682,
author = {Lee, Jinhyuk and Yoon, Wonjin and Kim, Sungdong and Kim, Donghyeon and Kim, Sunkyu and So, Chan Ho and Kang, Jaewoo},
title = "{BioBERT: a pre-trained biomedical language representation model for biomedical text mining}",
journal = {Bioinformatics},
year = {2019},
month = {09},
issn = {1367-4803},
doi = {10.1093/bioinformatics/btz682},
url = {https://doi.org/10.1093/bioinformatics/btz682},
}