Training with FastChat, inference with vLLM
Create a new Python environment. We install everything with CUDA 11.8; other CUDA versions and library versions likely won't work for now.
conda create -n llm python=3.10 -y
conda deactivate && conda activate llm
bash setup.sh
Play with vllm_test.py
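As a sketch of what vllm_test.py likely exercises, the following shows vLLM's basic offline-inference API. The model name and sampling settings below are illustrative assumptions, not values taken from this repo.

```python
def generate(prompts, model="lmsys/vicuna-7b-v1.5", temperature=0.8, max_tokens=256):
    """Batched text generation with vLLM's offline API.

    The default model name, temperature, and max_tokens are placeholder
    assumptions for illustration, not settings confirmed from this repo.
    """
    # Deferred import so this sketch can be read without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model)                   # loads the weights onto the GPU
    params = SamplingParams(temperature=temperature, max_tokens=max_tokens)
    outputs = llm.generate(prompts, params)  # one RequestOutput per prompt
    return [out.outputs[0].text for out in outputs]
```

Calling `generate(["Hello"])` on a machine with a supported GPU returns one completion string per prompt.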
Modify data_module.py to support your data format, then run finetune.sh or finetune_lora.sh.
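FastChat's training scripts conventionally consume ShareGPT-style JSON. If data_module.py follows that convention, a minimal training file can be written like this; the field names are the common ShareGPT ones, not confirmed against this repo's data_module.py.

```python
import json

# One ShareGPT-style record: an id plus alternating human/gpt turns.
# These field names are the conventional FastChat/ShareGPT ones and are
# an assumption here, not verified against data_module.py.
record = {
    "id": "example-0",
    "conversations": [
        {"from": "human", "value": "What is vLLM?"},
        {"from": "gpt", "value": "vLLM is a high-throughput LLM inference engine."},
    ],
}

# The training file is a JSON list of such records.
with open("train.json", "w") as f:
    json.dump([record], f, indent=2)
```

Point finetune.sh (or finetune_lora.sh) at the resulting train.json once data_module.py accepts this layout.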