5D2: BD-2-X5 - Current
A project focused on building an advanced robotic platform incorporating multimodal AI capabilities (Vision, Speech, Language) and low-level hardware control.
- Speech Processing: Utilizes Speech-to-Text (e.g., Whisper) and Text-to-Speech (e.g., Piper, Orpheus). Includes sound synthesis capabilities (ggwave).
- Language Understanding: Leverages Large Language Models (e.g., Ollama + DeepSeek R1 7B) for reasoning, conversation, and potentially task execution (a minimal speech-to-language pipeline is sketched after this list).
- Computer Vision: Integrates vision capabilities using sensors such as the Kinect v2 (via libfreenect2), employing techniques such as SLAM, Object Detection (YOLO), and Face Recognition (OpenFace/MediaPipe); a detection sketch follows the directory tree below.
- Motor Control: Implements advanced motor control strategies like Field-Oriented Control (FOC).
- Sensing: Incorporates various sensors, including Ultrasound and Radar.
- Communication: Utilizes protocols such as Bluetooth and LoRa.
- Actuation: Controls hardware components such as relays and fans.
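A minimal sketch of how the Speech and Language components could be wired together, assuming the openai-whisper and ollama Python packages and a locally running Ollama server with a DeepSeek R1 7B model pulled; the Whisper model size, the model tag, and the WAV path are placeholders:

```python
# Speech -> Language pipeline sketch: transcribe a WAV with Whisper,
# then hand the text to a local Ollama model for a reply.
import whisper  # openai-whisper package
import ollama   # ollama Python client (talks to a running Ollama server)

def listen_and_respond(wav_path: str) -> str:
    stt = whisper.load_model("base")                 # model size is illustrative
    text = stt.transcribe(wav_path)["text"].strip()  # speech-to-text
    reply = ollama.chat(
        model="deepseek-r1:7b",                      # assumed local model tag
        messages=[{"role": "user", "content": text}],
    )
    return reply["message"]["content"]               # LLM response text

if __name__ == "__main__":
    print(listen_and_respond("question.wav"))        # "question.wav" is a placeholder recording
```

The returned text could then be handed to Piper for TTS, closing the voice loop.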
The system follows a layered architecture:
- HighLvl: Manages complex AI tasks, perception, decision-making, and human interaction; integrates the Vision, Language, Speech, and Sound processing modules. Potential use of middleware such as ROS.
- LowLvl: Handles direct hardware interfacing, real-time control, and sensor data acquisition; includes drivers and control logic for motors (FOC), sensors (Radar), communication hardware (Bluetooth), and other peripherals (a minimal sketch of this command path follows).
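The inter-layer interface is still open (ROS is only a candidate), but the basic idea of HighLvl decision-making emitting serialized setpoints for the LowLvl firmware can be sketched as follows; the MotorCommand fields and the JSON-over-Bluetooth/serial wire format are purely illustrative assumptions:

```python
# Hypothetical HighLvl -> LowLvl command path: a setpoint produced by the
# decision-making layer is serialized and handed to a transport (Bluetooth/serial).
from dataclasses import dataclass
import json

@dataclass
class MotorCommand:
    """Setpoint passed from HighLvl decision-making to LowLvl motor control (FOC)."""
    left_rpm: float
    right_rpm: float

def encode_command(cmd: MotorCommand) -> bytes:
    """Serialize a command for the link to the LowLvl firmware; JSON is a placeholder wire format."""
    return (json.dumps({"left_rpm": cmd.left_rpm, "right_rpm": cmd.right_rpm}) + "\n").encode("utf-8")

if __name__ == "__main__":
    frame = encode_command(MotorCommand(left_rpm=120.0, right_rpm=118.5))
    print(frame)  # in the real system this frame would be written to the Bluetooth or serial transport
```

The directory layout below mirrors the two-layer split: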
├── HighLvl/
│ ├── Vision/ # Vision processing, SLAM, Object Detection, Face Recognition
│ ├── Language/ # Language models, Reasoning, Text Generation
│ ├── Speech/ # Speech-to-Text (STT) and Text-to-Speech (TTS)
│ └── Sound/ # Sound synthesis (ggwave), potentially audio processing
├── LowLvl/
│ ├── Fans/ # Control for cooling or other fan systems
│ ├── Radar/ # Radar sensor interface and data processing
│ ├── Motor/ # Motor drivers and control logic
│ ├── FOC/ # Field-Oriented Control specific implementations
│ ├── Build/ # Build system files/scripts for low-level firmware
│ ├── Bluetooth/ # Bluetooth communication stack/interface
│ ├── Base/ # Core low-level functionalities, drivers, or base platform code
│ └── BleFF/ # Potentially Bluetooth Low Energy related functionality
└── README.md # This file
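A minimal object-detection sketch for the HighLvl/Vision/ module, assuming the Ultralytics YOLO package and an OpenCV-readable frame; the weights file and image path are placeholders, and the Kinect v2 / libfreenect2 capture path is not shown:

```python
# Object detection sketch for HighLvl/Vision: run a YOLO model on a single frame
# and return the detected class names with confidences.
import cv2                    # opencv-python
from ultralytics import YOLO  # Ultralytics YOLO package

def detect_objects(image_path: str, weights: str = "yolov8n.pt") -> list[tuple[str, float]]:
    frame = cv2.imread(image_path)   # placeholder for a Kinect v2 frame
    model = YOLO(weights)            # pretrained weights are downloaded if missing
    result = model(frame)[0]         # single-image inference
    return [(result.names[int(box.cls)], float(box.conf)) for box in result.boxes]

if __name__ == "__main__":
    for label, conf in detect_objects("frame.jpg"):
        print(f"{label}: {conf:.2f}")
```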
- AI/ML: Ollama, DeepSeek, OpenAI Whisper, Piper TTS, YOLO, OpenFace, MediaPipe
- Vision: libfreenect2 (Kinect v2)
- Middleware: ROS (planned)
- Hardware Control: Field-Oriented Control (FOC) (see the transform sketch after this list)
- Communication: ggwave, Bluetooth, LoRa
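The actual FOC code belongs in the LowLvl firmware; the Python sketch below only illustrates the underlying math (amplitude-invariant Clarke transform followed by the Park transform) that maps three-phase motor currents onto the rotating d/q frame used by FOC current controllers. The test values are illustrative:

```python
# Field-Oriented Control building blocks: Clarke and Park transforms.
import math

def clarke(ia: float, ib: float, ic: float) -> tuple[float, float]:
    """Amplitude-invariant Clarke transform: three-phase currents -> stationary alpha/beta frame."""
    alpha = (2.0 / 3.0) * (ia - 0.5 * ib - 0.5 * ic)
    beta = (ib - ic) / math.sqrt(3.0)
    return alpha, beta

def park(alpha: float, beta: float, theta: float) -> tuple[float, float]:
    """Park transform: rotate alpha/beta into the rotor-aligned d/q frame (theta = electrical angle, rad)."""
    d = alpha * math.cos(theta) + beta * math.sin(theta)
    q = -alpha * math.sin(theta) + beta * math.cos(theta)
    return d, q

if __name__ == "__main__":
    # Balanced sinusoidal phase currents should map to a constant d/q vector (~1.0, ~0.0).
    theta = math.radians(30.0)
    ia = math.cos(theta)
    ib = math.cos(theta - 2.0 * math.pi / 3.0)
    ic = math.cos(theta + 2.0 * math.pi / 3.0)
    print(park(*clarke(ia, ib, ic), theta))
```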
Key areas for future development include:
- Implementing high-speed vs. low-speed thinking models.
- Developing a voice interrupt system.
- Refining model selection and context management.
- Implementing parallel TTS and ggwave transmission (see the sketch after this list).
- Exploring alternative technologies (e.g., faster-whisper, alternative TTS engines, simulation environments such as Omniverse Isaac Sim, and Dora-rs).
- Integrating long-term memory and personality development.
- CAD design for physical components (Head, BasePlate, Gimbal).
- Finalizing hardware choices (Stereocams, Sensing Array, Comms).
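For the parallel TTS/ggwave item above, one possible shape of the concurrency, assuming the ggwave Python bindings and the Piper CLI are available; the Piper model name and flags, the payload, and the filenames are placeholder assumptions:

```python
# Sketch: synthesize speech and encode a ggwave data chirp concurrently,
# so both are ready to play together.
from concurrent.futures import ThreadPoolExecutor
import subprocess
import ggwave  # ggwave Python bindings (data-over-sound)

def synthesize_speech(text: str, out_wav: str) -> str:
    """Run Piper TTS via its CLI; the model name and flags are assumptions, adjust to the local install."""
    subprocess.run(
        ["piper", "--model", "en_US-lessac-medium.onnx", "--output_file", out_wav],
        input=text.encode("utf-8"),
        check=True,
    )
    return out_wav

if __name__ == "__main__":
    with ThreadPoolExecutor() as pool:
        tts_job = pool.submit(synthesize_speech, "Hello, I am BD-2-X5.", "speech.wav")
        chirp_job = pool.submit(ggwave.encode, "status:ok")  # raw waveform for the data chirp
        wav_path, chirp_waveform = tts_job.result(), chirp_job.result()
    # wav_path and chirp_waveform can now be played back-to-back or mixed into one output stream.
```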
(This README is generated based on initial project notes and structure. It will be updated as the project evolves.)