SR Http
  • 19 Jul 2024
  • 3 Minutes to read
  • Contributors
  • PDF

SR Http

  • PDF

Article summary

Performs speech recognition. Uses SestekSR with HTTP protocol.

Parameters:

default
description
default
AddressAddress of SestekSR that will be used for performing recognitionshttp://core-sr
IgnoreSslErrorsignore any certificate errors if SestekSR address contains httpsfalse
ConnectionTimeoutMsecSestekSR Connection timeout in terms of milliseconds5000
CredentialIdId of the credential that will be used for SestekSR recognitions. This parameter is only necessary if a cloud SestekSR deployment is used. You must leave this empty for on-premise deployments.None(On-Prem)
Modelslist of models that can be used in the recognitions
LanguageLanguage of the model that will be used for recognitionfalse
ProduceRecognizedWordsIf set to true recognition events will provide extra info for each recognized wordfalse

Inputs

Audio:
Accepts audio from a single channel.

Events:

name
descriptionknown nodes that generate this event
Language ChangeChanges the language parameter. The recognitions requested after this event will use the updated language (hence appropriate model)Language Identifier, Text Language Identifier
Start Of SegmentSignals the start of a speech segment. Useful in batch mode.Audio Segmenter
Speech StartedSignals the start of speech in realtime. Necessary for stream mode. Can be used in batch mode.Vad
End Of SegmentSignals the start of a speech segment. Useful in batch mode.Audio Segmenter
Speech EndedSignals the end of speech in realtime. Necessary for stream mode. Can be used in batch mode.Vad

Outputs

Audio:
none

Events:

name
description
SR MilestoneRaised for each piece of speech in the provided audio data. This event contains recognition results for a single piece of speech. "SR Milestone" results are cumulative. The total recognition result in the end can be prepared by concatenating all the sr-milestone events' texts.

Remarks :

  • If you want to split audio and perform your recognitions you may or may not want to add an AudioSegmentor node before this node for splitting the audio into pieces.
  • We recommend using end-to-end models for recognitions performed with this node.

License : Needs "sr-rest" option.

Supported flow types: Stream, Batch

Service Link Parametrization:
The default address for this node is http://core-sr . This default link is intended for on-premise installations.
When SestekSr is deployed in the cloud SestekSR uses a http address that depends on the name of the model it hosts. eg.

http://core-sr-turkish
http://core-sr-english
http://core-sr-arabic

In order to use a single link in the cloud that you won't need to change every time you switch your models you should prefer the link with the {model_title} placeholder. ie. your default link should be the following: http://core-sr-{model_title}

Project Structure

The simplest project utilizing SR Http can be built as such:

image.png

About this project

Note that the project above will not split the audio. All of the inputted audio will be sent to the SR service in bulk, and the output will be just one SR Milestone event including all of the recognized speech.

A better project that gives separate results for each speech fragment can be built like the examples below:

image.png

image.png

About Audio Splitters

We recommend using Audio Segmenter for Batch projects, however, it is incompatible with Stream flows. VAD node can be used in both Batch and Stream projects. So the project below can give almost real-time SR Outputs. To learn more about Audio Splitters you can check out this link.

One of the most complex examples utilizing the SR HTTP node can be built as such:

image.png

In the project above, every speech segment's language is identified first. Then, the speech will be recognized in the identified language. Then, the transcript will be normalized, punctuated, and filtered. At the same time, every SR Milestone event will be translated into a targeted language. All of these will be done in real-time.

At the end of the flow, all of the accumulated SR Milestone events will be used for Topic Detection, and Summarization.

Compatibility

Almost every node that utilizes textual data is compatible with the SR Http and SR WebSocket nodes' event outputs. Which is why we are able to create complex projects such as the example above utilizing these nodes.

Supported flow types: Stream, Batch


Was this article helpful?

What's Next
Changing your password will log you out immediately. Use the new password to log back in.
First name must have atleast 2 characters. Numbers and special characters are not allowed.
Last name must have atleast 1 characters. Numbers and special characters are not allowed.
Enter a valid email
Enter a valid password
Your profile has been successfully updated.