A usable PyTorch implementation of the Transformer for learning purposes, adapted from the Harvard NLP Annotated Transformer.
- `transformer.ipynb`: Notebook version of the original Encoder-Decoder structure described in *Attention Is All You Need*.
- `transformer.py`: The original Encoder-Decoder structure described in *Attention Is All You Need* (its core attention operation is sketched below).
- `T4Tmodel.py`: Transformer for Translation model, used in Example 1.
- `BERTmodel.py`: Implementation of BERT, used in Examples 2, 3, and 4.
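For orientation, here is a minimal sketch of the scaled dot-product attention at the heart of the Encoder-Decoder structure, written directly from the formula in *Attention Is All You Need*. The function name and tensor shapes are illustrative, not taken from `transformer.py`.

```python
import math
import torch

def scaled_dot_product_attention(query, key, value, mask=None):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V.

    query, key, value: tensors of shape (batch, heads, seq_len, d_k)
    mask: optional tensor broadcastable to (batch, heads, q_len, k_len),
          with 0 marking positions to hide (e.g. padding or future tokens).
    """
    d_k = query.size(-1)
    # Similarity scores, scaled to keep softmax gradients well-behaved.
    scores = torch.matmul(query, key.transpose(-2, -1)) / math.sqrt(d_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = scores.softmax(dim=-1)
    return torch.matmul(weights, value), weights
```

Multi-head attention applies this operation h times in parallel on learned projections of size d_model / h per head, then concatenates and re-projects the results.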
Check out `./examples`.
- Example 1: A Chinese-to-English translation model, trained on the WMT 2018 en-zh dataset.
- Example 2: Pretrain a mini BERT and fine-tune it on several downstream tasks.
- Example 3: Fine-tune our BERT model on SQuAD, a question-answering dataset.
- Example 4 (TODO): Fine-tune our BERT model on IMDb, a sentiment classification dataset; a classification-head sketch follows this list.
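The fine-tuning pattern shared by Examples 2 to 4 is to put a small task head on top of the pretrained encoder. Below is a hedged sketch for the classification case; `encoder`, its call signature, and `hidden_size` are placeholders, since `BERTmodel.py` may expose a different interface.

```python
import torch.nn as nn

class BertClassifier(nn.Module):
    """A BERT-style encoder with a linear head on the [CLS] token (sketch)."""

    def __init__(self, encoder, hidden_size=768, num_labels=2):
        super().__init__()
        self.encoder = encoder  # placeholder: a pretrained BERT-style encoder
        self.dropout = nn.Dropout(0.1)
        self.classifier = nn.Linear(hidden_size, num_labels)

    def forward(self, input_ids, attention_mask):
        # Assumption: the encoder returns per-token hidden states of shape
        # (batch, seq_len, hidden_size); the interface in this repo may differ.
        hidden = self.encoder(input_ids, attention_mask)
        cls = hidden[:, 0]  # representation of the leading [CLS] token
        return self.classifier(self.dropout(cls))
```

Training then minimizes `nn.CrossEntropyLoss` over the logits. The same head-swap idea covers the other tasks; SQuAD replaces the single classifier with start- and end-position heads over every token.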
- Attention Is All You Need: https://arxiv.org/abs/1706.03762
- The Annotated Transformer, Harvard NLP: http://nlp.seas.harvard.edu/annotated-transformer
- PyTorch docs: https://pytorch.org/docs
- BERT: https://arxiv.org/abs/1810.04805
- Dive into Deep Learning (d2l): https://d2l.ai/