A pre-training and fine-tuning framework for text generation.

Backbone code for the paper "An Empirical Investigation of Pre-Trained Transformer Language Models for Open-Domain Dialogue Generation": https://arxiv.org/abs/2003.04195
@article{DBLP:journals/corr/abs-2003-04195,
  author        = {Piji Li},
  title         = {An Empirical Investigation of Pre-Trained Transformer Language Models
                   for Open-Domain Dialogue Generation},
  journal       = {CoRR},
  volume        = {abs/2003.04195},
  year          = {2020},
  url           = {https://arxiv.org/abs/2003.04195},
  archivePrefix = {arXiv},
  eprint        = {2003.04195},
  timestamp     = {Tue, 10 Mar 2020 13:33:48 +0100}
}
Dependencies: torch>=1.0

Pre-training:
./prepare_data.sh
./train.sh
./inference.sh
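For context, ./inference.sh decodes from the trained model, and generation for an autoregressive LM like this is a token-by-token sampling loop. Below is a minimal top-k sampling sketch in plain PyTorch; the `model`, `bos_id`, and `eos_id` names are hypothetical stand-ins, not Guyu's actual API.

```python
import torch

# Hypothetical interface: `model(ids)` returns logits of shape
# (batch, seq_len, vocab_size); `bos_id`/`eos_id` are special token ids.
@torch.no_grad()
def top_k_sample(model, bos_id, eos_id, max_len=50, k=40, device="cpu"):
    ids = torch.tensor([[bos_id]], device=device)   # (1, t) tokens so far
    for _ in range(max_len):
        logits = model(ids)[:, -1, :]               # next-token logits
        topk_vals, topk_idx = logits.topk(k, dim=-1)
        probs = torch.softmax(topk_vals, dim=-1)    # renormalize over top-k
        next_id = topk_idx.gather(-1, torch.multinomial(probs, 1))
        ids = torch.cat([ids, next_id], dim=-1)
        if next_id.item() == eos_id:                # stop at end-of-sequence
            break
    return ids[0].tolist()
```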
Fine-tuning example: chat-bot
cd chat_bot
./prepare_data.sh
./fine_tune.sh
./inference.sh
./deploy.sh
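./deploy.sh presumably exposes the fine-tuned chat-bot over HTTP; since the exact interface is not documented here, the sketch below is an illustrative stand-in built only on the Python standard library, with a placeholder `generate_reply` in place of the real decoding call.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def generate_reply(query: str) -> str:
    # Placeholder: call the fine-tuned model's decoding routine here.
    return "echo: " + query

class ChatHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the JSON request body, e.g. {"query": "hello"}.
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length) or b"{}")
        reply = generate_reply(payload.get("query", ""))
        body = json.dumps({"reply": reply}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("0.0.0.0", 8080), ChatHandler).serve_forever()
```

A client would then POST a JSON body such as {"query": "hello"} and receive {"reply": "..."} back.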
Pre-trained models:
- 12-layer, 768-hidden, 12-heads, Chinese (News + zhwiki, 200G) and English (Gigawords + Bookscorpus + enwiki, 60G)
- 24-layer, 1024-hidden, 16-heads, Chinese (News + zhwiki, 200G) and English (Gigawords + Bookscorpus + enwiki, 60G). Note: please use transformer_preln as the main model (https://github.com/lipiji/Guyu/blob/master/biglm.py#L8); see the pre-LN sketch after this list.
- Download them: https://github.com/lipiji/Guyu/tree/master/model
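The transformer_preln note above refers to where layer normalization sits inside each transformer block: pre-LN normalizes before each sublayer and adds the residual afterwards, which tends to train more stably at depth than post-LN (which normalizes after the residual addition). Below is a generic pre-LN block sketch in PyTorch, independent of Guyu's actual biglm.py implementation.

```python
import torch.nn as nn

class PreLNBlock(nn.Module):
    """One pre-LN transformer layer: x + sublayer(LayerNorm(x))."""
    def __init__(self, d_model=768, n_heads=12, d_ff=3072, dropout=0.1):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout)
        self.ln2 = nn.LayerNorm(d_model)
        self.ff = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model)
        )

    def forward(self, x, attn_mask=None):
        # x has shape (seq_len, batch, d_model), MultiheadAttention's default.
        # Normalize first, then attend, then add the residual (pre-LN order).
        h = self.ln1(x)
        x = x + self.attn(h, h, h, attn_mask=attn_mask)[0]
        # Same pattern for the feed-forward sublayer.
        return x + self.ff(self.ln2(x))
```

Pre-LN's training stability matters most for the deeper 24-layer model, which is presumably why the note singles it out.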