This repository stores scripts for ChemPile:
- `token_counter_cli.py`: script used to count tokens across the ChemPile subset.
- Submodule `ddp_trainer`:
  - Contains LoRA adapter merging scripts
  - ChemBench evaluation scripts
- Folder `chempile-instruct` contains the scripts used to generate the instruction dataset.
- Folder `review-app` contains resources to reproduce the review app.