Windows:
- ONNX Runtime (DirectML backend) for BiCodec/Wav2Vec etc.
- llama.cpp (Vulkan backend) for Qwen2.5-0.5B
macOS:
- CoreML for BiCodec/Wav2Vec etc.
- llama.cpp (Metal backend) for Qwen2.5-0.5B
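The build steps below run inside `third_party/llama.cpp` and `third_party/onnxruntime`. Assuming those directories are git submodules of this repository (an inference from the build steps, not stated explicitly), make sure they are checked out before building:

```shell
# Assumes third_party/llama.cpp and third_party/onnxruntime are git
# submodules of this repo (layout inferred from the build commands below).
git submodule update --init --recursive
```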
With a Q4_K-quantized transformer, it achieves a Real-Time Factor (RTF) of approximately 0.15 and roughly 300 ms latency to the first audio sample on an NVIDIA RTX 4070 GPU.
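For context (using the figures from the claim above, not re-measured): RTF is synthesis time divided by the duration of audio produced, so an RTF of 0.15 means 10 seconds of speech takes about 1.5 seconds to generate, while the 300 ms number is the separate time-to-first-sample:

```shell
# RTF = time to synthesize / duration of audio produced.
# At RTF 0.15, 10 s of audio takes rtf * audio_s = 1.5 s of compute.
awk 'BEGIN { rtf = 0.15; audio_s = 10.0; printf "%.2f s\n", rtf * audio_s }'
```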
- Make sure you are using the x64 Native Tools Command Prompt for VS 2022.
- Set up the Vulkan dependencies (see the llama.cpp build doc).
Build and install with CMake.

Windows (llama.cpp, Vulkan backend):

```bat
cd third_party\llama.cpp
cmake -B build -G Ninja -DGGML_VULKAN=ON -DLLAMA_CURL=OFF -DCMAKE_INSTALL_PREFIX=..\..\lib\llama -DCMAKE_BUILD_TYPE=Release
cmake --build build --config Release
cmake --install build --config Release
cd ..\..
```

macOS (llama.cpp, Metal backend is enabled by default):

```sh
pushd third_party/llama.cpp
cmake -B build -G Ninja -DLLAMA_CURL=OFF -DCMAKE_INSTALL_PREFIX=../../lib/llama -DCMAKE_BUILD_TYPE=Release
cmake --build build --config Release
cmake --install build --config Release
popd
```

Windows (ONNX Runtime, DirectML backend):

```bat
cd third_party\onnxruntime
python .\tools\ci_build\build.py ^
    --update ^
    --build ^
    --config Release ^
    --build_shared_lib ^
    --parallel ^
    --build_dir ./build ^
    --cmake_extra_defines "CMAKE_POLICY_VERSION_MINIMUM=3.5" ^
    --skip_tests ^
    --enable_lto ^
    --use_dml
cmake --install build\Release --config Release --prefix ..\..\lib\onnxruntime
cd ..\..
```

Windows (main project):

```bat
cmake --preset=vcpkg -DCMAKE_BUILD_TYPE=Release
cmake --build build --config Release
cmake --install build --config Release && copy /Y build\src\*.dll install\tools\bin
```

macOS (main project):

```sh
cmake --preset=vcpkg -DCMAKE_BUILD_TYPE=Release
cmake --build build --config Release
cmake --install build --config Release
```

A C API is provided for C++ and other languages.
An example command-line tool is provided for performance tuning.
Models used in this project are from SparkAudio/Spark-TTS
Inspired by arghyasur1991/Spark-TTS-Unity
Third-party libraries used in this project: