Updates to speaker diarization documentation #126

chrisw77 · 2025-10-07T10:26:26Z

Corrected some casing issues in the title and added more information for some of the parameters (prefer current speaker, max speakers). Also added an example of what the punctuation-related correction can look like.

…ng further details.

vercel · 2025-10-07T10:26:30Z

@chrisw77 is attempting to deploy a commit to the Speechmatics Team on Vercel.

A member of the Team first needs to authorize it.

chrisw77 · 2025-10-07T10:35:31Z

docs/speech-to-text/batch/batch_diarization.mdx

+
+In this case, the above would be corrected to move the speaker change point to match with the end of sentence:
+
+> <span style={{ color: "red" }}>Hello my name is John.</span> <span style={{ color: "blue" }}> And my name is Alice.</span>


Not sure if this is the best way of showing an example, so feedback welcome!

chrisw77 · 2025-10-07T10:38:01Z

Looking to update diarization docs (title casing -> sentence style, some extra detail and context).
@anjz @mnemitz
@stuartw843 @yaiir-a

vercel · 2025-11-11T15:33:57Z

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| docs | Ready | Preview | Comment | Nov 11, 2025 3:35pm |

mnemitz · 2025-11-11T15:39:49Z

docs/speech-to-text/batch/batch_diarization.mdx

+
+In this case, the above would be corrected to move the speaker change point to match with the end of sentence:
+
+> <span style={{ color: "red" }}>Hello my name is John.</span> <span style={{ color: "blue" }}> And my name is Alice.</span>


I'd recommend using the `Text` element, which should already be available on that page. You can add the following import at the top (line 18):

import { Blockquote, Card, DataList, Text } from '@radix-ui/themes';

Suggested change

> <span style={{ color: "red" }}>Hello my name is John.</span> <span style={{ color: "blue" }}> And my name is Alice.</span>

<Blockquote>

<Text color="red">Hello my name is John.</Text> <Text color="blue"> And my name is Alice.</Text>

</Blockquote>

chrisw77 · 2025-11-11

Thanks, makes sense - I'll update here and elsewhere.

mnemitz · 2025-11-11T15:40:39Z

docs/speech-to-text/realtime/realtime_diarization.mdx

+
+For example, consider a case where the diarization marks a speaker change one word after a full stop:
+
+> <span style={{ color: "red" }}>Hello my name is John. And</span> <span style={{ color: "blue" }}> my name is Alice.</span>


Same as the other suggestion here

mnemitz · 2025-11-11T15:40:45Z

docs/speech-to-text/realtime/realtime_diarization.mdx

+In this case, the above would be corrected to move the speaker change point to match with the end of sentence:

-Speaker diarization uses punctuation to improve accuracy. Small corrections are applied to speaker labels based on sentence boundaries.  
+> <span style={{ color: "red" }}>Hello my name is John.</span> <span style={{ color: "blue" }}> And my name is Alice.</span>


Same as the other suggestion here
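The correction described in the two snippets above (a speaker change that lands one word after a full stop is moved back to the sentence boundary) can be sketched in code. This is only an illustration of the idea, not the actual Speechmatics implementation: the function name `snap_speaker_changes`, the `(word, speaker)` tuple layout, and the `max_shift` tolerance are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical layout: (token text, speaker label). Not an API format.
Word = Tuple[str, str]

SENTENCE_END = (".", "?", "!")


def snap_speaker_changes(words: List[Word], max_shift: int = 1) -> List[Word]:
    """Move a speaker change back to a sentence boundary that lies at most
    `max_shift` words before it. Illustrative heuristic only; it handles
    changes that arrive late, as in the example in this thread."""
    result = list(words)
    for i in range(1, len(words)):
        prev_speaker, new_speaker = words[i - 1][1], words[i][1]
        if prev_speaker == new_speaker:
            continue  # no speaker change at position i
        if words[i - 1][0].endswith(SENTENCE_END):
            continue  # change already coincides with a sentence boundary
        # Look back for a sentence-final word shortly before the change.
        for shift in range(1, max_shift + 1):
            j = i - 1 - shift  # candidate index of the sentence-final word
            if j < 0:
                break
            if words[j][0].endswith(SENTENCE_END):
                # Relabel the words between the boundary and the change.
                for k in range(j + 1, i):
                    result[k] = (words[k][0], new_speaker)
                break
    return result


# The example from the docs change: "And" is initially attributed to S1.
before = (
    [(w, "S1") for w in ["Hello", "my", "name", "is", "John.", "And"]]
    + [(w, "S2") for w in ["my", "name", "is", "Alice."]]
)
after = snap_speaker_changes(before)
print(after[5])  # "And" is now labelled with the second speaker
```

The `max_shift` default of one word mirrors the "one word after a full stop" wording in the docs text; a real system would presumably use timing and confidence information rather than a fixed word count.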

mnemitz · 2025-11-11T15:41:28Z

docs/speech-to-text/realtime/realtime_diarization.mdx


- Use a [GPU Speech-to-Text container](../../deployments/container/gpu-speech-to-text.mdx). Handling multiple audio streams is computationally intensive and benefits from GPU acceleration.  
- Set the `SM_MAX_CONCURRENT_CONNECTIONS` environment variable to match the number of channels you want to process.  
+- Use a [GPU Speech-to-Text container](../../deployments/container/gpu-speech-to-text.mdx). Handling multiple audio streams is computationally intensive and benefits from GPU acceleration.


Suggested change

- Use a [GPU Speech-to-Text container](../../deployments/container/gpu-speech-to-text.mdx). Handling multiple audio streams is computationally intensive and benefits from GPU acceleration.

- Use a [GPU Speech-to-Text container](/deployments/container/gpu-speech-to-text). Handling multiple audio streams is computationally intensive and benefits from GPU acceleration.

Updates to speaker diarization documentation, correcting casing, adding further details.

4b7f1cd

Merge branch 'main' into speaker_dz_updates

4bbf06f
