Takes single-channel (mono) audio in which more than one person talks and splits it into multichannel audio where each channel contains only a single person's speech.

Parameters

name	description	default
NumSpeakers	Expected number of different speakers in the input audio. The most common use is with call center recordings, so the default value is 2.	2
MaxNumSpeakers	Maximum number of different speakers in the input audio. -1 for no upper limit.	-1

Inputs

Audio

Accepts audio from a single channel.

Events

none

Outputs

Audio

Outputs audio to multiple channels, one speaker per channel.

Events

name	description
Start Of Segment	Raised once before sending each audio fragment. This event is sent on the same channel as the corresponding audio.
End Of Segment	Raised once after sending each audio fragment. This event is sent on the same channel as the corresponding audio.
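
The ordering of events around each audio fragment can be illustrated with a small Python sketch. This is not the node's implementation or API: the function name `emit_fragments` and the tuple layout `(channel, kind, payload)` are hypothetical and chosen only for illustration.

```python
def emit_fragments(fragments):
    """Yield (channel, kind, payload) messages for each audio fragment.

    `fragments` is a list of (channel, audio_bytes) pairs. The message
    layout is an assumption made for this sketch, not the node's format.
    """
    for channel, audio in fragments:
        # Raised once before the fragment, on the fragment's own channel.
        yield (channel, "Start Of Segment", None)
        # The audio fragment itself.
        yield (channel, "Audio", audio)
        # Raised once after the fragment, on the same channel.
        yield (channel, "End Of Segment", None)


# Two fragments: one for the speaker on channel 0, one for channel 1.
messages = list(emit_fragments([(0, b"\x01\x02"), (1, b"\x03")]))
for message in messages:
    print(message)
```

A consumer on a given channel can therefore treat everything between a Start Of Segment and the next End Of Segment as one fragment from a single speaker.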

Remarks

Speaker Diarizer splits the provided audio into many smaller fragments, determines the speaker in each fragment, and sends each speaker's audio on a separate output channel.
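
As a rough illustration of the routing step only, the Python sketch below groups already-labeled fragments by speaker. It assumes the speaker labels have been produced elsewhere; the `Segment` type and the `route_to_channels` function are hypothetical names and do not reflect how the node determines speakers internally.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: int    # first sample index (inclusive)
    end: int      # last sample index (exclusive)
    speaker: int  # 0-based speaker label, assumed to be already known


def route_to_channels(samples, segments, num_channels):
    """Group each segment's samples under the channel of its speaker."""
    channels = [[] for _ in range(num_channels)]
    # Process fragments in time order so each channel keeps its order.
    for seg in sorted(segments, key=lambda s: s.start):
        # Ignore labels that have no matching output channel.
        if 0 <= seg.speaker < num_channels:
            channels[seg.speaker].append(samples[seg.start:seg.end])
    return channels


mono = list(range(10))  # stand-in for mono audio samples
segments = [Segment(0, 4, 0), Segment(4, 7, 1), Segment(7, 10, 0)]
channels = route_to_channels(mono, segments, num_channels=2)
print(channels[0])  # fragments attributed to speaker 0
print(channels[1])  # fragments attributed to speaker 1
```

In this sketch each channel holds a list of fragments rather than one continuous stream, mirroring the description of audio being sent fragment by fragment.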

Speaker Diarizer needs to analyze the whole audio file at once, so this node works only in batch mode and cannot be used for streaming.

Important Note

Speaker Diarizer already splits the audio into separate segments, so there is no need to add a separate Audio Splitter node to the project.

You can adjust the number of output channels, from one to three, from the menu that appears when you right-click the node. Each output sends the audio of only one speaker.

Project Structure

A minimal project that utilizes Speaker Diarizer can be created as such:

Note that in this project, the SR Http results will be tagged as Speaker No: 0 and Speaker No: 1.

Supported flow types

Batch

Release Notes

v4.5.0

Removed 'Address' and 'Ignore SSL Errors' from parameters.

v1.0.0

Introduced Node.