Eval-STT is an open-source evaluation framework for benchmarking Speech-to-Text (STT) models. It provides a structured approach to compare transcription models based on key performance metrics, ensuring fair and transparent evaluations.
- 🎯 Word Error Rate (WER) – Measures transcription accuracy.
- ⚡ Real-Time Factor (RTF) – Evaluates processing speed.
- ⏳ Latency – Assesses transcription delay.
- 💻 CPU Utilization – Tracks computational efficiency on CPU.
- 🎮 GPU Utilization – Measures GPU resource usage.
- 🚀 Speed – Compares performance across different hardware configurations.
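The two headline metrics are straightforward to compute. A minimal sketch in plain Python (assuming whitespace tokenization for WER; the framework may use a library such as `jiwer` instead):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: (substitutions + deletions + insertions) / reference word count."""
    ref, hyp = reference.split(), hypothesis.split()
    # Word-level Levenshtein distance via dynamic programming.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution
    return d[len(ref)][len(hyp)] / len(ref)


def rtf(processing_seconds: float, audio_seconds: float) -> float:
    """Real-Time Factor: processing time / audio duration (< 1.0 means faster than real time)."""
    return processing_seconds / audio_seconds
```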
Eval-STT supports a range of open-source STT models across different architectures:
- `whisper-tiny`
- `whisper-base`
- `whisper-small`
- `whisper-medium`
- `whisper-large-v2`
- `wav2vec2-base`
- `wav2vec2-large`
- `wav2vec2-english`
- `wav2vec2-xlsr-en`
- `hubert-large`
```bash
git clone https://github.com/build-ai-applications/Eval-STT
cd Eval-STT
```
Ensure you have the required dependencies installed:
```bash
pip install -r requirements.txt
```
The evaluation is automated via a Jupyter Notebook:
```bash
jupyter notebook evaluate_stt.ipynb
```
Modify the `STTEvaluator` class inside the notebook to include additional models. Example:
```python
import torch

class STTEvaluator:
    def __init__(self, device='cuda' if torch.cuda.is_available() else 'cpu'):
        self.device = device
        self.models = {
            'whisper-large-v2': ('openai/whisper-large-v2', self._load_whisper, self._transcribe_whisper),
            'wav2vec2-base': ('facebook/wav2vec2-base-960h', self._load_wav2vec2, self._transcribe_wav2vec2)
        }
        self.results = {}
```
Extend the dictionary with more models as needed! 🚀
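The pattern behind that dictionary can be illustrated with a self-contained sketch that needs no model downloads: each entry maps a model name to a `(checkpoint, loader, transcriber)` triple, and evaluation times the transcription call. The `register` helper and `STTEvaluatorSketch` name are hypothetical; in the notebook you extend `self.models` directly with real loader/transcriber methods.

```python
import time

class STTEvaluatorSketch:
    """Minimal registry sketch: model name -> (checkpoint, loader, transcriber)."""
    def __init__(self):
        self.models = {}
        self.results = {}

    def register(self, name, checkpoint, loader, transcriber):
        # Hypothetical helper; the notebook uses a literal dict instead.
        self.models[name] = (checkpoint, loader, transcriber)

    def evaluate(self, name, audio):
        checkpoint, loader, transcriber = self.models[name]
        model = loader(checkpoint)
        start = time.perf_counter()
        text = transcriber(model, audio)
        self.results[name] = {
            'transcript': text,
            'latency_s': time.perf_counter() - start,
        }
        return self.results[name]

# Dummy loader/transcriber standing in for e.g. _load_whisper / _transcribe_whisper.
evaluator = STTEvaluatorSketch()
evaluator.register('dummy', 'fake/checkpoint',
                   loader=lambda ckpt: ckpt.upper(),
                   transcriber=lambda model, audio: f'transcribed {audio}')
result = evaluator.evaluate('dummy', 'clip.wav')
```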
Our framework provides detailed performance insights:
- ✅ Accuracy (WER) – Lower is better.
- ⚡ Speed (RTF, Latency) – Lower is better.
- 💡 Resource Efficiency (CPU/GPU Utilization) – Lower means more cost-effective deployment.
Results are logged and visualized automatically in the notebook! 📊🚀
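CPU utilization over a timed region can be approximated with the standard library alone, as the ratio of process CPU time to wall-clock time (1.0 is roughly one fully busy core). This is a sketch, not necessarily how the notebook measures it; tools like `psutil` or `nvidia-smi` are common alternatives.

```python
import time

class CPUUtilizationTimer:
    """Context manager: approximate CPU utilization of this process
    as process CPU time / wall-clock time over the `with` block."""
    def __enter__(self):
        self.wall0 = time.perf_counter()
        self.cpu0 = time.process_time()
        return self

    def __exit__(self, *exc):
        wall = time.perf_counter() - self.wall0
        cpu = time.process_time() - self.cpu0
        self.utilization = cpu / wall if wall > 0 else 0.0
        return False

# Usage: a sleeping process burns almost no CPU, so utilization stays near zero.
with CPUUtilizationTimer() as timer:
    time.sleep(0.2)
```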
Eval-STT is open-source, and we welcome contributions from the community! 🎉
- Fork the repository.
- Implement model loading and transcription functions.
- Update the evaluation notebook.
- Submit a pull request (PR) with a description of changes.
Your contributions help make Eval-STT the best open-source Speech-to-Text evaluation framework! 🚀💡
- Artificial Analysis: Speech-to-Text
- NVIDIA NeMo ASR Metrics
- IEEE STT Evaluation Paper
- German STT Evaluation GitHub
- Coqui-AI STT Models
- ESPnet STT Recipe
Eval-STT is open-source under the Apache 2.0 License. Use it freely and contribute to make it better! 🚀
For any questions, suggestions, or feature requests, open an issue or reach out to the maintainers! 💡