This repository provides code for experimenting with training large models on Moreh's MoAI Platform. With the MoAI Platform, you can scale to thousands of GPUs/NPUs through automatic parallelization and optimization, without any code changes.
We currently provide six LLMs: Llama3, Llama2, Qwen1.5, Mistral, OPT, and Baichuan2, as well as SDXL.
This repository contains examples of PyTorch training code that can be executed on the MoAI Platform. Users of PyTorch on the MoAI Platform can train large models without extensive effort. For more information about the MoAI Platform and detailed tutorials, please visit the Moreh Docs.
First, clone this repository and navigate to the repo directory.
```bash
git clone https://github.com/moreh-dev/quickstart
cd quickstart
```
Once you are in the `quickstart` directory, install the dependency packages for the model you want to fine-tune. The requirements files for each model are located in the `requirements` directory. For example, to install the dependencies for the Llama3 model, use the following command:
```bash
pip install -r requirements/requirements_llama3.txt
```
If you want to fine-tune the Llama2, Llama3, or Mistral models, you need access to their respective Hugging Face repositories. Please ensure you have the necessary access before starting model training.
- Llama3 : https://huggingface.co/meta-llama/Meta-Llama-3-8B or https://huggingface.co/meta-llama/Meta-Llama-3-70B
- Llama2 : https://huggingface.co/meta-llama/Llama-2-7b-hf
- Mistral : https://huggingface.co/mistralai/Mistral-7B-v0.1
After obtaining access, authenticate your token with the following command:
```bash
huggingface-cli login
```
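If you prefer to authenticate from Python rather than the CLI, the `huggingface_hub` library provides an equivalent `login` helper (the token below is a placeholder):

```python
from huggingface_hub import login

# Authenticate with your Hugging Face access token (placeholder value).
login(token="hf_xxxxxxxxxxxxxxxxxxxx")
```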
The following line is added to each training script to enable Advanced Parallelization (AP) on the MoAI Platform.
```python
...
torch.moreh.option.enable_advanced_parallelization()
...
```
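In a typical training script the call sits near the top, before the model and optimizer are created, so that the platform can parallelize everything that follows. A minimal sketch (the model choice and surrounding code are illustrative, not the tutorial code; `torch.moreh` is only available with the MoAI Platform's PyTorch build):

```python
import torch
from transformers import AutoModelForCausalLM

# Enable Advanced Parallelization before building the model and optimizer.
# This call is specific to the MoAI Platform's PyTorch build.
torch.moreh.option.enable_advanced_parallelization()

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
# ... dataset preparation and the training loop follow ...
```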
Information about the models currently supported by this repository, along with their target tasks and training scripts, is as follows:
Baseline Model | Task | Training Script | Dataset |
---|---|---|---|
Llama3 8B | Text Summarization | tutorial/train_llama3.py | cnn_dailymail |
Llama3 70B | Text Summarization | tutorial/train_llama3_70b.py | cnn_dailymail |
Llama2 7B | Text Summarization | tutorial/train_llama2.py | cnn_dailymail |
Qwen1.5 7B | Code Generation | tutorial/train_qwen.py | iamtarun/python_code_instructions_18k_alpaca |
Mistral v0.1 7B | Code Generation | tutorial/train_mistral.py | iamtarun/python_code_instructions_18k_alpaca |
OPT 13B | Code Generation | tutorial/train_opt.py | mlabonne/Evol-Instruct-Python-26k |
Baichuan2 13B | Chatbot | tutorial/train_baichuan2_13b.py | bitext/Bitext-customer-support-llm-chatbot-training-dataset |
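The datasets listed above are hosted on the Hugging Face Hub. As an illustration, the CNN/DailyMail summarization data used by the Llama models can be loaded as follows (the `"3.0.0"` configuration is an assumption; the training scripts handle data loading internally and may differ):

```python
from datasets import load_dataset

# Load the CNN/DailyMail summarization dataset from the Hugging Face Hub.
dataset = load_dataset("cnn_dailymail", "3.0.0")
print(dataset["train"][0]["article"][:200])  # raw news article
print(dataset["train"][0]["highlights"])     # reference summary
```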
Run the training script to fully fine-tune the model. For example, to fine-tune the Llama3 8B model:
```bash
python tutorial/train_llama3.py \
  --epochs 1 \
  --batch-size 256 \
  --block-size 1024 \
  --lr 0.00001 \
  --save-dir ${SAVE_DIR_PATH} \
  --ignore-index -100 \
  --log-interval 1
```
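For context, `--block-size` sets the tokenized sequence length and `--ignore-index -100` marks label positions that should not contribute to the loss (typically prompt and padding tokens). A rough illustration of the usual pattern, not the exact code in `tutorial/train_llama3.py`:

```python
import torch
import torch.nn.functional as F

block_size = 1024     # --block-size
ignore_index = -100   # --ignore-index

# Positions labeled with the ignore index are excluded from the loss,
# which is how prompt and padding tokens are usually masked out.
logits = torch.randn(2, block_size, 32000)           # (batch, seq_len, vocab)
labels = torch.full((2, block_size), ignore_index)   # start fully masked
labels[:, 512:] = torch.randint(0, 32000, (2, 512))  # only response tokens count

loss = F.cross_entropy(
    logits.view(-1, logits.size(-1)),
    labels.view(-1),
    ignore_index=ignore_index,
)
```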
To train only the LoRA adapter, pass the `--use-lora` argument along with the LoRA configuration parameters, as in the example below (a PEFT-style sketch of the mapping follows the command):
```bash
python tutorial/train_llama3.py \
  --epochs 1 \
  --batch-size 256 \
  --block-size 1024 \
  --lr 0.00001 \
  --save-dir ${SAVE_DIR_PATH} \
  --log-interval 1 \
  --use-lora \
  --lora-r 64 \
  --lora-alpha 16 \
  --lora-dropout 0.1
```
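The `--lora-r`, `--lora-alpha`, and `--lora-dropout` values map naturally onto a LoRA configuration such as PEFT's `LoraConfig`. A sketch under the assumption that the tutorial scripts use the `peft` library (they may build the adapter differently):

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Assumed mapping of the CLI flags onto a PEFT LoRA configuration.
lora_config = LoraConfig(
    r=64,              # --lora-r
    lora_alpha=16,     # --lora-alpha
    lora_dropout=0.1,  # --lora-dropout
    task_type="CAUSAL_LM",
)

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```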
Baseline Model | Inference Script |
---|---|
Llama3 8B | tutorial/inference_llama3.py |
Llama2 7B | tutorial/inference_llama2.py |
Qwen1.5 7B | tutorial/inference_qwen.py |
Mistral 7B | tutorial/inference_mistral.py |
OPT 13B | tutorial/inference_opt.py |
Baichuan2 13B | tutorial/inference_baichuan.py |
Perform inference by running the inference script for each model. For example, to run inference with the fine-tuned Llama3 model:
```bash
python tutorial/inference_llama3.py \
  --model-name-or-path ${SAVE_DIR_PATH}
```
If you want to perform inference with LoRA weights, add the `--use-lora` argument to the inference script:
```bash
python tutorial/inference_llama3.py \
  --model-name-or-path ${SAVE_DIR_PATH} \
  --use-lora
```
```
# output example
Llama3: [SUMMARIZE] (CNN)Arsenal kept their slim hopes of winning this season's English Premier League title alive by beating relegation threatened Burnley 1-0 at Turf Moor. A first half goal from Welsh international Aaron Ramsey was enough to separate the two sides and secure Arsenal's ... [/SUMMARIZE]
Arsenal beat Burnley 1-0 in the English Premier League.
Aaron Ramsey scores the only goal of the game.
Arsenal remain in second place.
Chelsea can extend their lead to seven points.
```
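Under the hood, the `--use-lora` flag presumably attaches the saved adapter to the base model at load time. A minimal sketch using the `peft` library (an assumption; the tutorial scripts may load the weights differently, and the path below is a placeholder for `${SAVE_DIR_PATH}`):

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

save_dir = "path/to/save_dir"  # placeholder for ${SAVE_DIR_PATH}

# Load the frozen base model, then attach the saved LoRA adapter on top of it.
base = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
model = PeftModel.from_pretrained(base, save_dir)
model = model.merge_and_unload()  # optionally fold the adapter into the base weights
tokenizer = AutoTokenizer.from_pretrained(save_dir)
```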
We also provide fine-tuning example code for the Stable Diffusion XL model.
Baseline Model | Task | Training Script | Dataset |
---|---|---|---|
Stable Diffusion XL | Text-to-Image Generation | tutorial/train_sdxl.py | lambdalabs/naruto-blip-captions |
Run the training script for Stable Diffusion XL:
```bash
python tutorial/train_sdxl.py \
  --epochs 20 \
  --batch-size 16 \
  --num-workers 8 \
  --lr=1e-05 \
  --save-dir=${SAVE_DIR_PATH} \
  --log-interval 1 \
  --lr-scheduler linear
```
After training, you can run inference with your fine-tuned model using the following command:
```bash
python tutorial/inference_sdxl.py \
  --model-name-or-path=${SAVE_DIR_PATH}
```
Adjust the prompt by editing the `PROMPT` variable in the inference script:

```python
...
PROMPT = "Bill Gates with a hoodie"
...
```
The resulting image will be saved as `sdxl_result.jpg`.
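Roughly speaking, the inference script loads the fine-tuned weights into a diffusers SDXL pipeline, generates an image from `PROMPT`, and writes it to disk. A simplified sketch (an assumption about the script's internals; the path below is a placeholder for `${SAVE_DIR_PATH}`):

```python
import torch
from diffusers import StableDiffusionXLPipeline

save_dir = "path/to/save_dir"  # placeholder for ${SAVE_DIR_PATH}

# Load the fine-tuned SDXL weights and generate an image from the prompt.
pipe = StableDiffusionXLPipeline.from_pretrained(save_dir, torch_dtype=torch.float16)
pipe = pipe.to("cuda")  # device placement; the MoAI Platform dispatches this to its accelerators
image = pipe(prompt="Bill Gates with a hoodie").images[0]
image.save("sdxl_result.jpg")
```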
The image on the left shows the inference results of the model before fine-tuning, while the image on the right shows the inference results of the fine-tuned model.