Speaker Diarizer

Updated on 20 May 2025
1 Minute to read
Contributors

Article summary

Did you find this summary helpful?

Thank you for your feedback!

Takes a single channel(mono) audio in which more than one people talks and splits this into a multichannel audio where each channel only has a single person's speech.

Parameters

name	description	default
Address	Address of the speaker diarizer service	http://core-speaker-diarization
NumSpeakers	Expected number of different speakers in the input audio. The most common use is with records from call centers. So, the default value is 2.	2
MaxNumSpeakers	Expected number of different speakers in the input audio. -1 for no upper limit.	-1
IgnoreSslErrors	Ignore any certificate errors if speaker diarizer address contains https	false

Inputs

Audio

Accepts audio from a single channel.

Events

none

Outputs

Audio

Outputs audio to multiple channels

Events

name	description
Start Of Segment	Raised once before sending each audio fragment. This event is sent to the same channel with the corresponding audio.
End Of Segment	Raised once after sending each audio fragment. This event is sent to the same channel with the corresponding audio.

Remarks

Speaker Diarizer splits the provided audio into many smaller fragments and determines the speaker in each fragment. Sends each speaker's audio on a separate channel output.

Speaker Diarizer technology needs to analyze the whole audio file at once. So, this node can only work in batch mode and cannot be used for streaming.

Important Note

Speaker Diarizer splits the audio into separate segments. So, there is no need to use another Audio Splitter node in the project.

You can adjust the number of output channels from the menu that appears by right clicking the node.
By right clicking the node, we can add and remove channels from one channel to three channel outputs. Each output sends the audio of only one speaker.