Anni is a high-performance code assistant built upon the Qwen3 14B architecture. Fine-tuned on the OpenCodeReasoning-2 dataset, Anni is engineered to excel in deep algorithmic reasoning, competitive programming logic, and the implementation of complex, high-efficiency data structures.
| Property | Value |
|---|---|
| Base Model | Qwen3 14B |
| Model Type | Language Model for Code |
| Context Length | 32,000 tokens |
| Precision | BF16 / safetensors (merged) |
| Inference Framework | vLLM compatible |
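For quick local testing outside the provided scripts, offline batch inference with vLLM looks roughly like the sketch below. The model identifier is a placeholder, not the published repo ID; substitute the actual Hugging Face path or a local directory containing the merged BF16 weights.

```python
# Minimal vLLM offline-inference sketch; the model ID is a placeholder, not the published repo.
from vllm import LLM, SamplingParams

llm = LLM(
    model="your-namespace/anni-qwen3-14b",  # assumption: replace with the real repo ID or local path
    dtype="bfloat16",                       # matches the merged BF16 safetensors weights
    max_model_len=32000,                    # context length listed above
)

params = SamplingParams(temperature=0.6, top_p=0.95, max_tokens=1024)
prompt = "Write a Python function that returns the n-th Fibonacci number using fast doubling."

outputs = llm.generate([prompt], params)
print(outputs[0].outputs[0].text)
```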
Get started immediately using the provided Google Colab notebooks:

- GGUF Inference (Recommended): Open the Colab Notebook to run standard inference (a local alternative is sketched after this list).
- vLLM Serving: Open the Colab Notebook to run inference using the vLLM server.
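If you prefer to run the GGUF build locally rather than in Colab, a minimal llama-cpp-python sketch is shown below; the file name is an assumption and depends on which quantization you download.

```python
# Minimal local GGUF inference sketch using llama-cpp-python (pip install llama-cpp-python).
# The model path is a placeholder; point it at the GGUF file you downloaded.
from llama_cpp import Llama

llm = Llama(
    model_path="./anni-qwen3-14b.Q4_K_M.gguf",  # assumption: file name depends on the release/quantization
    n_ctx=32000,                                # context window from the model card
    n_gpu_layers=-1,                            # offload all layers to the GPU if one is available
)

result = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Implement binary search in Python."}],
    max_tokens=512,
    temperature=0.6,
)
print(result["choices"][0]["message"]["content"])
```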
- Python Dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- System Tools: Ensure `tmux` is installed on your system (required for training scripts).
- Environment Variables: Rename the example environment file and add your API tokens (WandB, HuggingFace, ModelScope); a minimal login sketch follows this list.

  ```bash
  mv config/example.env config/.env
  # Edit config/.env with your keys
  ```

- Training Config: Edit `config/config.yaml` to adjust hyperparameters.
  - Note: Specify the `LOCAL_STORAGE_PATH` in `src/train.py` before starting training.
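As a rough illustration of what the tokens in `config/.env` are used for, the sketch below loads them and logs in to WandB and Hugging Face. The variable names (`WANDB_API_KEY`, `HF_TOKEN`) and the `python-dotenv` dependency are assumptions; check `config/example.env` and the training code for the names this repository actually uses.

```python
# Hypothetical sketch: load tokens from config/.env and authenticate.
# Variable names and the python-dotenv dependency are assumptions, not taken from this repository.
import os

import wandb
from dotenv import load_dotenv          # pip install python-dotenv
from huggingface_hub import login

load_dotenv("config/.env")

wandb.login(key=os.environ["WANDB_API_KEY"])  # experiment tracking
login(token=os.environ["HF_TOKEN"])           # Hugging Face downloads/uploads
```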
To start the training process, run the shell script:

```bash
./scripts/train.sh
```

| File | Description |
|---|---|
| `preprocess.py` | Downloads the OpenCodeReasoning-2 dataset and preprocesses it for training. |
| `train.py` | Downloads the base model and fine-tunes it on the preprocessed dataset. |
| `save.py` | Loads the fine-tuned LoRA adapters and saves the model as merged 16-bit and GGUF formats. |
| `upload.py` | Uploads the merged model to Hugging Face and ModelScope. |
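For context on what `save.py` does, merging LoRA adapters into 16-bit weights with PEFT typically looks like the sketch below. This is not the repository's actual script; the base model ID and the adapter/output paths are placeholders.

```python
# Illustrative LoRA-merge sketch (not the repository's save.py; paths and IDs are placeholders).
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-14B", torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-14B")

# Attach the trained adapters, then fold them into the base weights.
merged = PeftModel.from_pretrained(base, "outputs/lora_adapters").merge_and_unload()

merged.save_pretrained("outputs/anni-merged-16bit", safe_serialization=True)
tokenizer.save_pretrained("outputs/anni-merged-16bit")
```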
| File | Description |
|---|---|
| `train.sh` | Runs the training script with specified parameters. |
| `eval.sh` | Evaluates the model on the LiveCodeBench dataset. |
| `serve.sh` | Serves the model using the vLLM server. |
| `terminate_train.sh` | Terminates the training process. |
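Once `serve.sh` has the vLLM server running, you can query it through the OpenAI-compatible endpoint. The port and served model name below are assumptions; match them to whatever `serve.sh` actually configures.

```python
# Query the running vLLM server via its OpenAI-compatible API (pip install openai).
# Base URL and model name are assumptions; adjust them to match serve.sh.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="anni",  # placeholder: use the name the server reports under /v1/models
    messages=[{"role": "user", "content": "Write a C++ segment tree with lazy propagation."}],
    max_tokens=1024,
)
print(response.choices[0].message.content)
```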
The frontend code for Anni is available in the web directory.
👉 View Frontend Documentation
This repository’s model and its training code are released under the MIT License.
All other elements, such as the frontend code, project name, and logo, are trademarks of the developer and owner of this repository (Hans) and may not be used without explicit permission.
The training dataset includes openly licensed sources under CC-BY-4.0, which permits commercial use with attribution.
Attribution:
- OpenCodeReasoning-2 (CC-BY-4.0)
Note: The dataset itself is not included in this model release.
This model may generate incorrect or unsafe code. Evaluate and verify outputs before using them in production.
