- Print
- PDF
Takes a single channel(mono) audio in which more than one people talks and splits this into a multichannel audio where each channel only has a single person's speech.
Parameters:
name | description | default |
---|---|---|
Address | Address of the speaker diarizer service | http://core-speaker-diarization |
NumSpeakers | Expected number of different speakers in the input audio. The most common use is with records from call centers. So, the default value is 2. | 2 |
MaxNumSpeakers | Expected number of different speakers in the input audio. -1 for no upper limit. | -1 |
IgnoreSslErrors | Ignore any certificate errors if speaker diarizer address contains https | false |
Inputs
Audio:
Accepts audio from a single channel.
Events:
none
Outputs
Audio:
Outputs audio to multiple channels
Events:
name | description |
---|---|
Start Of Segment | Raised once before sending each audio fragment. This event is sent to the same channel with the corresponding audio. |
End Of Segment | Raised once after sending each audio fragment. This event is sent to the same channel with the corresponding audio. |
Remarks :
Speaker Diarizer splits the provided audio into many smaller fragments and determines the speaker in each fragment. Sends each speaker's audio on a separate channel output.
Speaker Diarizer technology needs to analyze the whole audio file at once. So, this node can only work in batch mode and cannot be used for streaming.
Speaker Diarizer splits the audio into separate segments. So, there is no need to use another Audio Splitter node in the project.
You can adjust the number of output channels from the menu that appears by right clicking the node.
By right clicking the node, we can add and remove channels from one channel to three channel outputs. Each output sends the audio of only one speaker.
Project Structure
A minimal project that utilizes Speaker Diarizer can be created as such:
Note that in this project, the SR Http results will be tagged as Speaker No: 0
and Speaker No: 1
.
Supported flow types: Batch