This repository provides open-source implementations of several Indic Text-to-Speech (TTS) models that cover multiple Indian languages. The models convert text input into natural-sounding speech. The project aims to improve accessibility and support voice synthesis research.
Why This Repository?
- Supports multiple Indian languages
- Offers easy-to-use Colab notebooks
- Provides open-source solutions for research and development
- Encourages community contributions to improve Indic TTS
- Size: 3.8 GB
- Supported Languages: 20 Indic languages
- Inputs: Text, Voice description
- Output: Audio
- License: Apache 2.0 (permissive; commercial use allowed)
- Colab Notebook: Try Here
Results:
✔️ Very natural and clear audio
✔️ Good accent
❌ Large model size
❌ Some inconsistencies
🎧 Sample Output:
- Size: 150-160 MB
- Supported Languages: Hindi (Separate models for other languages)
- Inputs: Text
- Output: Audio
- License: CC-BY-NC 4.0 (Non-commercial use only)
- Colab Notebook: Try Here
- Other Languages: Find More Models
Results:
✔️ Natural and clear audio
✔️ Small model size
❌ Requires switching models for different languages
❌ Some voice inconsistencies
🎧 Sample Output:
Listen Here
- Size: 170 MB
- Supported Languages: Assamese, Bengali, Bodo, Dogri, Kannada, Maithili, Malayalam, Marathi, Nepali, Punjabi, Sanskrit, Tamil, Telugu
- Inputs: Text
- Output: Audio
- License: Apache 2.0 (permissive; commercial use allowed)
- Colab Notebook: Try Here
Results:
✔️ Clear audio
✔️ Mixed accent support
❌ Slight tone inconsistencies
🎧 Sample Output:
Listen Sample 1
Listen Sample 2
1️⃣ Open the Colab notebook for the model you want to use.
2️⃣ Run the provided cells to install dependencies and set up the environment.
3️⃣ Enter your text (and a voice description, if the model accepts one).
4️⃣ Generate speech and listen to the output!
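Outside Colab, the same steps can be sketched in plain Python. This is a minimal sketch, not the notebooks' actual code. It assumes the MMS-TTS models are loaded from Hugging Face checkpoints named `facebook/mms-tts-<ISO 639-3 code>` (for example `facebook/mms-tts-hin` for Hindi) through the `transformers` `VitsModel` class, and that `torch`, `transformers`, and `scipy` are installed. The helper names `mms_checkpoint` and `synthesize` are hypothetical.

```python
def mms_checkpoint(lang_code: str) -> str:
    """Build the (assumed) Hugging Face checkpoint ID for an ISO 639-3 code."""
    code = lang_code.strip().lower()
    if len(code) != 3 or not (code.isascii() and code.isalpha()):
        raise ValueError(f"expected a 3-letter ISO 639-3 code, got {lang_code!r}")
    return f"facebook/mms-tts-{code}"


def synthesize(text: str, lang_code: str = "hin", out_path: str = "output.wav") -> str:
    """Generate speech for `text` and write it to a WAV file; returns the path."""
    # Heavy imports live inside the function so mms_checkpoint() works without them.
    import torch
    import scipy.io.wavfile
    from transformers import AutoTokenizer, VitsModel

    # Each language has its own model, so the checkpoint changes with lang_code.
    checkpoint = mms_checkpoint(lang_code)
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = VitsModel.from_pretrained(checkpoint)

    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        waveform = model(**inputs).waveform  # shape: (batch, samples)

    scipy.io.wavfile.write(
        out_path,
        rate=model.config.sampling_rate,
        data=waveform[0].numpy(),
    )
    return out_path
```

Calling `synthesize("नमस्ते, आप कैसे हैं?", "hin")` would download the Hindi checkpoint on first use and write `output.wav`. Switching languages means passing a different code, which loads a different model.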
We welcome contributions from the community! If you'd like to contribute:
Bug Reports & Issues: Open an issue if you find any bugs.
Feature Requests: Suggest new features via discussions or pull requests.
Pull Requests: Fork the repo, make changes, and submit a pull request!
- Ensure your changes follow best practices.
- Keep the repository structured and well-documented.
- Respect the licenses of individual models.
This repository includes models with different licenses:
- ai4bharat models: Apache 2.0 (Permissive, allows commercial use)
- Facebook MMS-TTS models: CC-BY-NC 4.0 (For non-commercial use only)
Note: If you plan to use any model commercially, verify that its license permits commercial use.
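A project that bundles several of these models could encode the license check as a small lookup. The sketch below only restates the license facts listed above. The family keys (`ai4bharat`, `facebook-mms-tts`) and the function name `can_use_commercially` are hypothetical, and the lookup is no substitute for reading each model's own license file.

```python
# License facts as stated in this README; verify against each model's license file.
COMMERCIAL_USE_ALLOWED = {
    "Apache-2.0": True,     # permissive, allows commercial use
    "CC-BY-NC-4.0": False,  # non-commercial use only
}

# Hypothetical family keys for the two model sources named in this README.
MODEL_FAMILY_LICENSE = {
    "ai4bharat": "Apache-2.0",
    "facebook-mms-tts": "CC-BY-NC-4.0",
}


def can_use_commercially(family: str) -> bool:
    """Return True if the model family's license permits commercial use.

    Raises KeyError for an unknown family, so a new model cannot be
    silently treated as commercially usable.
    """
    license_id = MODEL_FAMILY_LICENSE[family]
    return COMMERCIAL_USE_ALLOWED[license_id]
```

Raising on unknown families is deliberate: a missing entry should block a commercial deployment until someone has checked the license.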
Special thanks to AI4Bharat and Facebook Research for open-sourcing these Indic TTS models and contributing to speech synthesis research.
If you're working on Indic TTS, we'd love to collaborate!
💬 Join the Discussion: GitHub Discussions
🐦 Follow us on Twitter: @YourTwitterHandle
📢 Stay Updated: Watch the repo for new models and updates!