Speaker Diarizer
  • 19 Jul 2024
  • 1 Minute to read
  • Contributors
  • PDF

Speaker Diarizer

  • PDF

Article summary

Takes a single channel(mono) audio in which more than one people talks and splits this into a multichannel audio where each channel only has a single person's speech.

Parameters:

name
description
default
AddressAddress of the speaker diarizer servicehttp://core-speaker-diarization
NumSpeakersExpected number of different speakers in the input audio. The most common use is with records from call centers. So, the default value is 2.2
MaxNumSpeakersExpected number of different speakers in the input audio. -1 for no upper limit.-1
IgnoreSslErrorsIgnore any certificate errors if speaker diarizer address contains httpsfalse

Inputs

Audio:
Accepts audio from a single channel.

Events:
none

Outputs

Audio:
Outputs audio to multiple channels

Events:

namedescription
Start Of SegmentRaised once before sending each audio fragment. This event is sent to the same channel with the corresponding audio.
End Of SegmentRaised once after sending each audio fragment. This event is sent to the same channel with the corresponding audio.

Remarks :
Speaker Diarizer splits the provided audio into many smaller fragments and determines the speaker in each fragment. Sends each speaker's audio on a separate channel output.

Speaker Diarizer technology needs to analyze the whole audio file at once. So, this node can only work in batch mode and cannot be used for streaming.

Important Note

Speaker Diarizer splits the audio into separate segments. So, there is no need to use another Audio Splitter node in the project.

You can adjust the number of output channels from the menu that appears by right clicking the node.
By right clicking the node, we can add and remove channels from one channel to three channel outputs. Each output sends the audio of only one speaker.

Project Structure
A minimal project that utilizes Speaker Diarizer can be created as such:

image.png

Note that in this project, the SR Http results will be tagged as Speaker No: 0 and Speaker No: 1.

Supported flow types: Batch


Was this article helpful?

What's Next
Changing your password will log you out immediately. Use the new password to log back in.
First name must have atleast 2 characters. Numbers and special characters are not allowed.
Last name must have atleast 1 characters. Numbers and special characters are not allowed.
Enter a valid email
Enter a valid password
Your profile has been successfully updated.