Inference pipeline to deploy IndicTrans NMT models on top of Flask server.
- Python 3.6 +
- Ubuntu 16.04 +
Install various python libraries as mentioned in requirements.txt
file
pip install -r src/requirements.txt
- Put all the models in
src/nmt_models
folder as per the specs inconfig/fetch_models.json
- Run
app.py
to start the service with all the packages installed
python src/app.py
IndicTrans-v1 suports 11 major Indic languages:
Assamese (as) | Hindi (hi) | Marathi (mr) | Tamil (ta) |
Bangla (bn) | Kannada (kn) | Oriya (or) | Telugu (te) |
Gujarati (gu) | Malayalam (ml) | Panjabi (pa) |
IndicTrans-v2 supports all 22 scheduled langauges of India, which includes English, 20 Indic languages (4 Dravidian, 15 Indo-Aryan, 1 Munda) and 2 Tibeto-Burman languages (Bodo & Manipuri).
ISO 639 code | Language |
---|---|
as | Assamese - অসমীয়া |
bn | Bangla - বাংলা |
brx | Boro - बड़ो |
doi | Dogri - डोगरी |
gom | Goan-Konkani - कोंकणी |
gu | Gujarati - ગુજરાતી |
hi | Hindi - हिंदी |
kn | Kannada - ಕನ್ನಡ |
ks | Kashmiri - كٲشُر |
ks_Deva | Kashmiri - कॉशुर |
gom | Konkani Goan - कोंकणी |
mai | Maithili - मैथिली |
ml | Malayalam - മലയാളം |
mni | Manipuri - ꯃꯤꯇꯩꯂꯣꯟ |
mni_Beng | Manipuri - মিতৈলোন |
mr | Marathi - मराठी |
ne | Nepali - नेपाली |
or | Oriya - ଓଡ଼ିଆ |
pa | Panjabi - ਪੰਜਾਬੀ |
sa | Sanskrit - संस्कृतम् |
sat | Santali - ᱥᱟᱱᱛᱟᱲᱤ |
sd | Sindhi - سنڌي |
sd_Deva | Sindhi - सिंधी |
ta | Tamil - தமிழ் |
te | Telugu - తెలుగు |
ur | Urdu - اُردُو |
https://github.com/AI4Bharat/indicTrans
The indictrans inference service code (and models) are released under the MIT License.