A PyTorch implementation of generative pre-trained transformers (GPT)

This is a personal exercise to build, train, and use a GPT, inspired by minGPT.

Prerequisites

Installation

git clone https://github.com/rygx/simple-gpt.git && cd simple-gpt
python -m venv .venv
source .venv/bin/activate
pip install pip-tools
./update_deps.sh

Usage

Data

Any text-format data file should be usable, provided its size fits the training and inference environment.
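One reason any plain-text file works is that a character-level pipeline (the approach minGPT's tinyshakespeare sample takes) needs no external vocabulary. The sketch below is illustrative only; the tokenizer this repo actually uses may differ, and all names here are assumptions.

```python
# Hypothetical sketch: character-level encoding of raw text.
# Names (stoi, itos, encode, decode) are illustrative, not this repo's API.
text = "QUEEN: good morrow."  # stands in for open("data.txt").read()

vocab = sorted(set(text))  # one token id per unique character
stoi = {ch: i for i, ch in enumerate(vocab)}
itos = {i: ch for ch, i in stoi.items()}

def encode(s: str) -> list[int]:
    return [stoi[c] for c in s]

def decode(ids: list[int]) -> str:
    return "".join(itos[i] for i in ids)

ids = encode(text)
assert decode(ids) == text  # encoding round-trips losslessly
```

Because the vocabulary is built from the training file itself, the only constraint is that the file fits in memory alongside the model.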

Train

Use train/train_gpt.py. After training, the model's state dictionary and hyperparameters are stored in the models directory.
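The persistence step described above might look roughly like the sketch below. The file names, keys, and use of JSON are assumptions for illustration (the actual script likely uses torch.save for the state dictionary); a stand-in dict replaces the real state dictionary so the sketch stays self-contained.

```python
# Hypothetical sketch: persist model state and hyperparameters under
# models/, keyed by a generated run ID. Layout and keys are assumptions.
import json
import os
import tempfile
import uuid

state = {"wte.weight": [[0.1, 0.2], [0.3, 0.4]]}  # stand-in for a state dict
hparams = {"n_layer": 4, "n_head": 4, "n_embd": 128}  # illustrative values

run_id = str(uuid.uuid4())
out_dir = os.path.join(tempfile.mkdtemp(), "models")
os.makedirs(out_dir, exist_ok=True)

out_path = os.path.join(out_dir, f"{run_id}.json")
with open(out_path, "w") as f:
    json.dump({"state": state, "hparams": hparams}, f)
```

Keying each run by a UUID is what lets generate.py later select a specific trained model with the -u flag.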

Generate

Use train/generate.py. The models/ directory already contains a coarsely pre-trained model (trained on minGPT's tinyshakespeare sample) with ID 9cdb42ed-0b16-4a3a-88e2-fffa61fa4f50. Generate text with this model using the following command:

python train/generate.py --dir "models" -u "9cdb42ed-0b16-4a3a-88e2-fffa61fa4f50" --prompt "QUEEN: "

The generation length (in tokens) and the sampling temperature can also be tuned with the --length/-l and --temp/-t options.
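To make the temperature knob concrete, the sketch below shows standard temperature-scaled sampling: logits are divided by the temperature before the softmax, so values below 1 sharpen the distribution toward the most likely token and values above 1 flatten it. This is the usual technique behind a --temp option; generate.py's actual implementation may differ.

```python
# Hypothetical sketch of temperature-scaled sampling (pure Python).
import math
import random

def sample(logits: list[float], temperature: float = 1.0, rng=random) -> int:
    """Sample a token index from logits after temperature scaling."""
    # Dividing by temperature: <1 sharpens, >1 flattens the distribution.
    scaled = [l / temperature for l in logits]
    # Numerically stable softmax: subtract the max before exponentiating.
    m = max(scaled)
    exps = [math.exp(l - m) for l in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Inverse-CDF draw over the categorical distribution.
    r = rng.random()
    acc = 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1
```

At a very low temperature the draw is effectively argmax, e.g. sample([1.0, 5.0, 2.0], temperature=0.01) returns 1 almost surely, while a high temperature spreads probability mass across all tokens.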

TODOs/plans

  • Easier setup (e.g., using setuptools)
  • Unit tests :P
  • Implement own tokenizer
