Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
-
Updated
Aug 12, 2024 - Python
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
Persian/Farsi text to speech(TTS) training using coqui tts
Open-source, fully private and local alternative to NotebookLM. Chat with your documents, generate audio summaries, and ground AI in your own sources—built with Supabase, N8N on a React frontend using Ollama for local inference
Text to Speech using Coqui TTS + RVC
The world’s first game framework that lets you talk to AI in real time — locally. Supports any custom voice.
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the WhatsApp Desktop App.
SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and has capabilities for extending functionalities through a modular tool system.
Automatically generate faceless YouTube Shorts from trending topics using AI scripts, TTS, and FFmpeg. Fully containerized and one-click deployable
Open source Speechify alternative. Read PDFs and EPUBs with local models.
Rust bindings to the https://github.com/coqui-ai TTS library
Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS
The TTS Platform leverages the power of Coqui TTS, an advanced open-source framework, to deliver a high-quality text-to-speech (TTS) experience. It caters to diverse user needs, offering natural-sounding voice generation with extensive customization options.
DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The system utilizes Coqui TTS for text-to-speech generation, along with various face rendering and animation techniques to create a video where the given avatar articulates the speech.
Lira is a voice-first AI companion that provides real-time conversations, context-aware responses, and on-demand image generation. It listens, understands, and interacts naturally to assist users with daily tasks, emotional check-ins, and creative prompts.
Voice cloning using coqui-TTS
Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
A lightweight voice companion, optimized for macOS.
Add a description, image, and links to the coqui-tts topic page so that developers can more easily learn about it.
To associate your repository with the coqui-tts topic, visit your repo's landing page and select "manage topics."