This repository is dedicated to evaluating Indic Large Language Models (LLMs) for multilingual Indian languages based on predefined questions and their generated outputs. The evaluation includes:
- Pre-generated model outputs stored in JSON format.
- Scoring and analysis of model responses.
- Multilingual model evaluation using
sglang
andTransformers
libraries.
- Contains JSON files with model-generated outputs for various prompts.
- These outputs are used for scoring and analysis.
- Stores Scores.txt, which includes:
- Model-generated response scores.
- Analysis of responses.
- Comparisons across different models.
- The main evaluation script.
- Implements:
- Question generation and submission to models.
- Evaluation using both
sglang
andTransformers
.
- Both frameworks are coded separately within the same notebook.
For sglang
installation, refer to the installation guide.
- Run the Multilingual_eval.ipynb notebook.
- Review scores in Pre-Evaluated-LLMs/Scores.txt.
- Analyze JSON outputs from PreGenerations.
indic-llm-eval is open-source under the Apache 2.0. Use it freely and contribute to make it better! 🚀