
build-ai-applications/indic-llm-eval


Evaluating Indic LLMs: Multilingual Performance Benchmarking

Model Scores Graph

📌 Overview

This repository evaluates Indic Large Language Models (LLMs) across multiple Indian languages, using a set of predefined questions and the outputs each model generates for them. The evaluation includes:

  • Pre-generated model outputs stored in JSON format.
  • Scoring and analysis of model responses.
  • Multilingual model evaluation using the sglang and Transformers libraries.

📂 Repository Structure

1️⃣ PreGenerations

  • Contains JSON files with model-generated outputs for various prompts.
  • These outputs are used for scoring and analysis.
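
As an illustration of how these files might be read for scoring, here is a minimal sketch. It assumes only that PreGenerations holds `.json` files; the schema inside each file is not documented here, so the parsed content is returned untouched. The function name `load_pregenerations` is hypothetical and not part of this repository.

```python
import json
from pathlib import Path


def load_pregenerations(directory):
    """Return {file stem: parsed JSON} for every .json file in `directory`.

    Hypothetical helper: no assumption is made about the structure
    inside each file, so the parsed objects are returned as-is.
    """
    outputs = {}
    for path in sorted(Path(directory).glob("*.json")):
        # UTF-8 is specified explicitly because Indic-language text is non-ASCII.
        with path.open(encoding="utf-8") as handle:
            outputs[path.stem] = json.load(handle)
    return outputs
```

Calling `load_pregenerations("PreGenerations")` from the repository root would then give one entry per JSON file, keyed by file name without the extension.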

2️⃣ Pre-Evaluated-LLMs

  • Stores Scores.txt, which includes:
    • Model-generated response scores.
    • Analysis of responses.
    • Comparisons across different models.
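
A cross-model comparison of this kind can be sketched as a per-model mean followed by a ranking. The layout of Scores.txt is not described here, so this example does not parse that file; it assumes the scores have already been read into `(model_name, score)` pairs, and both function names are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def mean_score_per_model(records):
    """Map each model name to the mean of its scores.

    `records` is an iterable of (model_name, score) pairs; this input
    shape is an assumption, not the format of Scores.txt.
    """
    grouped = defaultdict(list)
    for model_name, score in records:
        grouped[model_name].append(float(score))
    return {name: mean(scores) for name, scores in grouped.items()}


def rank_models(records):
    """Return (model_name, mean_score) pairs, highest mean first."""
    means = mean_score_per_model(records)
    return sorted(means.items(), key=lambda item: item[1], reverse=True)
```

A plain mean is only one possible aggregate; a per-language breakdown would need the language attached to each record as well.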

3️⃣ Multilingual_eval.ipynb

  • The main evaluation notebook.
  • Implements:
    • Question generation and submission to models.
    • Evaluation using both sglang and Transformers.
  • Both frameworks are coded separately within the same notebook.
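
The question-submission step can be pictured roughly as follows. This is a sketch under stated assumptions, not the notebook's actual code: `collect_outputs` and `make_transformers_generator` are hypothetical names, and the backend is passed in as a plain callable so that a Transformers pipeline or an sglang-based function could be swapped in without changing the loop.

```python
def make_transformers_generator(model_name, max_new_tokens=128):
    """Wrap a Hugging Face text-generation pipeline as prompt -> text.

    Hypothetical helper. `transformers` is imported lazily so the rest
    of this sketch runs without it installed.
    """
    from transformers import pipeline

    pipe = pipeline("text-generation", model=model_name)

    def generate(prompt):
        # The pipeline returns a list of dicts; by default
        # "generated_text" contains the prompt followed by the completion.
        result = pipe(prompt, max_new_tokens=max_new_tokens)
        return result[0]["generated_text"]

    return generate


def collect_outputs(questions, generate):
    """Send each question to `generate` and pair it with the response."""
    return [{"question": q, "output": generate(q)} for q in questions]
```

Keeping the backend behind a single callable mirrors the idea that the two frameworks are coded separately while serving the same evaluation; an sglang backend would only need to expose the same prompt-in, text-out interface.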

🛠 Installation Guide

For sglang installation, refer to the installation guide.


🚀 Usage Instructions

  1. Run the Multilingual_eval.ipynb notebook.
  2. Review scores in Pre-Evaluated-LLMs/Scores.txt.
  3. Analyze JSON outputs from PreGenerations.

🤝 Contributors


📜 License

indic-llm-eval is open source under the Apache 2.0 license. Use it freely and contribute to make it better! 🚀

