Skip to content

pranjalisr/Voice-AI-agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Voice Conversational Agent

An intelligent voice-based conversational AI agent that listens, understands, and responds in real time — powered by speech recognition, natural language understanding (NLU), and text-to-speech (TTS) technologies.

🚀 Features

🎧 Speech-to-Text (STT): Converts spoken input into text using models like OpenAI Whisper, Google Speech API, or Vosk.

🧠 Conversational Intelligence: Uses LLMs (e.g., GPT, Gemini, LLaMA, etc.) for contextual and memory-based conversation.

🔊 Text-to-Speech (TTS): Responds naturally with human-like voice using tools like ElevenLabs, gTTS, or Coqui TTS.

🗣️ Real-Time Interaction: Supports streaming input and output for natural back-and-forth conversation.

🧩 Extensible Architecture: Modular design — plug in different STT, LLM, or TTS providers.

🌐 Multi-Platform: Works on desktop, web, or embedded systems (e.g., Raspberry Pi).

🧱 Architecture [User Voice] ↓ [Speech-to-Text Engine] ↓ [Conversational Model (LLM)] ↓ [Text-to-Speech Engine] ↓ [Spoken Response]

Each layer can be configured independently to use different backends.

🛠️ Tech Stack

Component Technology Speech-to-Text OpenAI Whisper / Google Speech / Vosk LLM Backend GPT-4 / GPT-3.5 / Llama / Claude Text-to-Speech ElevenLabs / gTTS / Azure Speech Framework Python / Node.js Optional UI React + Tailwind / Streamlit Deployment Docker / FastAPI / Flask

⚙️ Setup & Installation

1️⃣ Clone Repository git clone

cd voice-conversational-agent

2️⃣ Install Dependencies pip install -r requirements.txt

or

npm install

3️⃣ Set Environment Variables

Create a .env file and add:

OPENAI_API_KEY=your_key_here ELEVENLABS_API_KEY=your_key_here

4️⃣ Run the Application python app.py

or

npm run dev

Then open the interface (if available) or start speaking through your mic!

About

Voice conversational AI Agent

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages