Emotion

Prev Next

Evaluates the emotional state in the input voice.

Parameters

 none

Inputs

Audio

 Accepts audio from a single channel.

Events

 none

Outputs

Audio

 none

Events

name description
Emotion Includes information about the emotional state in the voice (such as normal or angry) and the level of monotony in the speaker's tone.

Remarks

  • This node evaluates the speech in segments of 3 seconds. Silent periods are not included in this duration.

Project Structure

The Emotion Node only needs audio input. The audio doesn't have to be segmented. So, you can place the Emotion node wherever the desired audio is flowing through. A simple project can be built as such:

image.png

However, a better approach would be:

image.png

Why this approach is better?

In the second example, the only difference is that we pass the audio through a VAD node. This means the audio is filtered. The only audio info that passes through a VAD is speech info. This approach will result in better confidence levels in Emotion identification since random silences are cleared.

Supported flow types

Stream, Batch

Release Notes

v4.5.0
  • Added Knovvu ML Support, and removed obsolete parameters.
v3.1.0
  • Fixed a rarely observed crash while responding to Stop
v1.0.0
  • Introduced node.