Speech Quality
  • 19 Jul 2024

Measures the quality of the audio recording and assigns scores.

Parameters:

name             description                                                       default
Address          Address of the speech quality service                             http://core-speech-critique
IgnoreSslErrors  Ignore any certificate errors if the service address uses https   false
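
To illustrate what a setting like IgnoreSslErrors typically implies on the client side, here is a minimal Python sketch. It is not the node's implementation; the function name build_ssl_context is hypothetical, and the assumption is that the flag only matters when the address uses https.

```python
import ssl
from typing import Optional
from urllib.parse import urlparse


def build_ssl_context(address: str, ignore_ssl_errors: bool = False) -> Optional[ssl.SSLContext]:
    """Return a TLS context for `address`, or None when the address is plain http.

    Hypothetical helper: it only mirrors the documented meaning of the
    Address and IgnoreSslErrors parameters.
    """
    if urlparse(address).scheme.lower() != "https":
        # Assumption: the flag has no effect for non-https addresses.
        return None
    context = ssl.create_default_context()
    if ignore_ssl_errors:
        # check_hostname must be disabled before verify_mode can be CERT_NONE.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    return context
```

With the default address (plain http) the helper returns None, so the flag only changes behaviour once the address is switched to https. Disabling certificate checks is generally only appropriate for test environments.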

Inputs

Audio:
Accepts audio from a single channel.

Events:
none

Outputs

Audio:
none

Events:

name            description
Speech Quality  Raised repeatedly, each time the node has accumulated enough audio data to update the speech quality measurement. The quality value ranges from 1 to 5, where 5 is the highest quality.
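
Because the event is raised more than once, a consumer usually needs to keep track of the values it receives. The sketch below shows one way to do that in Python. The class and method names are hypothetical, and the assumption that the value arrives as a plain number is ours; the article only states the 1 to 5 range.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SpeechQualityTracker:
    """Collects the values carried by successive Speech Quality events."""
    scores: List[float] = field(default_factory=list)

    def on_speech_quality(self, value: float) -> None:
        # The documented range is 1 to 5; reject anything outside it.
        if not 1 <= value <= 5:
            raise ValueError(f"score out of range: {value}")
        self.scores.append(value)

    @property
    def latest(self) -> float:
        """Most recent value received."""
        return self.scores[-1]

    @property
    def lowest(self) -> float:
        """Worst value seen so far."""
        return min(self.scores)


tracker = SpeechQualityTracker()
for value in (3.5, 4.0, 4.5):  # made-up example values
    tracker.on_speech_quality(value)
print(tracker.latest, tracker.lowest)  # 4.5 3.5
```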

Remarks:

  • This node evaluates the speech in segments of 3 seconds. Silent periods are not included in this duration.
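
As a rough worked example of the remark above, the following Python sketch estimates how many 3-second segments a recording contains once silence is excluded. It assumes one evaluation per complete 3 seconds of accumulated speech, which is our reading of the remark rather than a documented guarantee; the function name and the interval format are hypothetical.

```python
from typing import Iterable, Tuple

SEGMENT_SECONDS = 3.0  # segment length stated in the remark above


def count_evaluated_segments(intervals: Iterable[Tuple[bool, float]]) -> int:
    """Count complete 3-second speech segments in (is_speech, seconds) intervals."""
    # Silent intervals do not count towards the segment duration.
    speech_seconds = sum(seconds for is_speech, seconds in intervals if is_speech)
    return int(speech_seconds // SEGMENT_SECONDS)


# 10 s of audio: 2 s speech, 3 s silence, 4 s speech, 1 s silence.
timeline = [(True, 2.0), (False, 3.0), (True, 4.0), (False, 1.0)]
print(count_evaluated_segments(timeline))  # 2
```

Although the recording lasts 10 seconds, only 6 seconds are speech, so under this assumption it yields two segments.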

Project Structure
The Speech Quality node only needs audio input, and the audio does not have to be segmented, so you can place the node anywhere the audio of interest flows through. A simple project can be built like this:

image.png

However, a better approach would be:

image.png

Why is this approach better?

The only difference in the second example is that the audio passes through a VAD (voice activity detection) node first. The VAD filters the audio so that only speech reaches the Speech Quality node. This yields better confidence levels in the speech quality measurement, since random silences are removed.
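
To make the filtering idea concrete, here is a toy energy-based voice activity filter in Python. It is only an illustration of dropping silent frames before measurement and is not the algorithm used by the VAD node; the function names, the frame layout, and the 0.01 threshold are all assumptions chosen for the example.

```python
from typing import List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame of samples."""
    return sum(sample * sample for sample in frame) / len(frame)


def keep_speech_frames(frames: List[Sequence[float]], threshold: float = 0.01) -> List[Sequence[float]]:
    """Drop frames whose energy is below `threshold` (treated as silence)."""
    return [frame for frame in frames if frame_energy(frame) >= threshold]


frames = [
    [0.0, 0.001, -0.001, 0.0],  # near silence
    [0.3, -0.4, 0.5, -0.2],     # speech-like energy
    [0.0, 0.0, 0.0, 0.0],       # silence
    [0.2, 0.1, -0.3, 0.4],      # speech-like energy
]
speech = keep_speech_frames(frames)
print(len(speech))  # 2
```

Only the two higher-energy frames survive, so a downstream quality measurement would not spend its evaluation window on silence.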

Supported flow types: Stream, Batch

