This is an official project repository of STHAL: Location mention Identification in Tweets of Indian context application built on the concept of Named Entity Recognition. As a result of our work, we prepare a publicly-available annotated dataset of Indian geolocations that can be used by the research community.
The project is maintained by Shobhit Sinha, Kartik Verma.
The dataset for can be found out at Dataset. Please note that this is an implementation only for research purpose use only.
The Code consist of three models mainly Conditional Random Fields (CRF), Bi-LSTM-CNN and BERT. If your are interested in the concept of Fine-grained location identification, I have written on STHAL (to be released soon).