Evaluating Indic LLMs: Multilingual Performance Benchmarking

📌 Overview

This repository evaluates Indic Large Language Models (LLMs) across multiple Indian languages, using a predefined set of questions and the outputs the models generate for them. The evaluation includes:

- Pre-generated model outputs stored in JSON format.
- Scoring and analysis of model responses.
- Multilingual model evaluation using the sglang and Transformers libraries.

📂 Repository Structure

1️⃣ PreGenerations

Contains JSON files with model-generated outputs for various prompts.
These outputs are used for scoring and analysis.
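As a starting point for working with these files, the following minimal sketch loads every JSON file in the folder. The internal schema of the files is not documented here, so the code makes no assumption about it and only reports the number of top-level entries. The helper name `load_generations` is hypothetical and not part of this repository.

```python
import json
from pathlib import Path


def load_generations(directory):
    """Return {file stem: parsed JSON} for every .json file in `directory`."""
    generations = {}
    for path in sorted(Path(directory).glob("*.json")):
        # Decode explicitly as UTF-8 so Indic scripts load correctly everywhere.
        with path.open(encoding="utf-8") as handle:
            generations[path.stem] = json.load(handle)
    return generations


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    for name, payload in load_generations("PreGenerations").items():
        size = len(payload) if isinstance(payload, (list, dict)) else 1
        print(f"{name}: {size} top-level entries")
```

Keying the result by file stem keeps the outputs of each file separate, which is convenient if each file holds the generations of one model.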

2️⃣ Pre-Evaluated-LLMs

Stores Scores.txt, which includes:
- Model-generated response scores.
- Analysis of responses.
- Comparisons across different models.
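The layout of Scores.txt is not specified here, so the following is only a sketch of how a cross-model comparison could be computed once the scores have been parsed. It assumes each record is a `(model, language, score)` tuple; that record shape, the function names, and the sample values are all hypothetical.

```python
from collections import defaultdict
from statistics import mean


def mean_score_by_model(records):
    """Average the scores of (model, language, score) records per model."""
    grouped = defaultdict(list)
    for model, _language, score in records:
        grouped[model].append(float(score))
    return {model: mean(scores) for model, scores in grouped.items()}


def rank_models(records):
    """Return (model, mean score) pairs, highest mean first."""
    averages = mean_score_by_model(records)
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Made-up placeholder values, not results from this repository.
    sample = [
        ("model_a", "hi", 4.0),
        ("model_a", "ta", 3.0),
        ("model_b", "hi", 2.0),
        ("model_b", "ta", 4.0),
    ]
    for model, average in rank_models(sample):
        print(f"{model}: {average:.2f}")
```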

3️⃣ Multilingual_eval.ipynb

The main evaluation notebook.
It implements:
- Question generation and submission to the models.
- Evaluation using both sglang and Transformers.
The sglang and Transformers pipelines are implemented separately within the same notebook.
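The question-submission flow described above can be sketched roughly as follows. This is not the notebook's actual code: the function names, the record keys `question` and `response`, and the `max_new_tokens` default are assumptions made for illustration. The backend is passed in as a plain callable, so a Transformers-based generator (shown here) or an sglang-based one could be used with the same loop.

```python
import json


def make_transformers_generator(model_name, max_new_tokens=128):
    """Build a prompt -> text callable backed by Hugging Face Transformers."""
    # Imported lazily so the rest of the sketch works without the library.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)

    def generate(prompt):
        inputs = tokenizer(prompt, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        # Drop the prompt tokens so only the continuation is returned.
        new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
        return tokenizer.decode(new_tokens, skip_special_tokens=True)

    return generate


def run_questions(questions, generate):
    """Send each question to `generate` and collect question/response pairs."""
    return [{"question": q, "response": generate(q)} for q in questions]


def save_generations(records, path):
    """Write the collected records to a JSON file."""
    # ensure_ascii=False keeps Indic scripts readable in the JSON file.
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(records, handle, ensure_ascii=False, indent=2)
```

A run would then look like `save_generations(run_questions(questions, make_transformers_generator(name)), "out.json")`, where `name` is whichever model identifier is being evaluated.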

🛠 Installation Guide

For sglang installation, refer to the installation guide.

🚀 Usage Instructions

- Run the Multilingual_eval.ipynb notebook.
- Review scores in Pre-Evaluated-LLMs/Scores.txt.
- Analyze JSON outputs from PreGenerations.

🤝 Contributors

Govind-AIML
ashutoshqp

📜 License

indic-llm-eval is open source under the Apache 2.0 license. Use it freely and contribute to make it better! 🚀

