Skip to content

Piloting the Soniox 2nd time with their new(w15) release #2224

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
beastoin opened this issue Apr 19, 2025 · 0 comments
Open

Piloting the Soniox 2nd time with their new(w15) release #2224

beastoin opened this issue Apr 19, 2025 · 0 comments
Assignees

Comments

@beastoin
Copy link
Collaborator

Objective: add support the multi-language to the omi system

Key results:

  • adapt to the new api spec
  • test the hallucination which is the cause of long transcripts leading to high OpenAI billing.

--

BBernard — 16/04/2025, 03:25
Important ---- new multilingual Speech-to-Text was just released in production. Both Real Time and Async models were updated and so is API and docs. Most changes relates to real time API - please check the documentation (https://soniox.com/docs/speech-to-text/get-started) and update with new APIs. We are here to support you with any questions you might have. For maximum accuracy use language hints (several could be selected simultaneously), speaker separation in real time model is now greatly improved, wer accuracy was already very high and is now even better, both in async and real time and across all languages.

@beastoin beastoin self-assigned this Apr 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant