SR Http

Performs speech recognition. Uses SestekSR with HTTP protocol.

Parameters

name	description	default
Address	Address of SestekSR that will be used for performing recognitions	http://core-sr
IgnoreSslErrors	ignore any certificate errors if SestekSR address contains https	false
ConnectionTimeoutMsec	SestekSR Connection timeout in terms of milliseconds	5000
CredentialId	Id of the credential that will be used for SestekSR recognitions. This parameter is only necessary if a cloud SestekSR deployment is used. You must leave this empty for on-premise deployments.	None(On-Prem)
Models	list of models that can be used in the recognitions
Language	Language of the model that will be used for recognition	false
ProduceRecognizedWords	If set to true recognition events will provide extra info for each recognized word	false

Inputs

Audio

Accepts audio from a single channel.

Events

name	description	known nodes that generate this event
Language Change	Changes the language parameter. The recognitions requested after this event will use the updated language (hence appropriate model)	Language Identifier, Text Language Identifier
Start Of Segment	Signals the start of a speech segment. Useful in batch mode.	Audio Segmenter
Speech Started	Signals the start of speech in realtime. Necessary for stream mode. Can be used in batch mode.	Vad
End Of Segment	Signals the start of a speech segment. Useful in batch mode.	Audio Segmenter
Speech Ended	Signals the end of speech in realtime. Necessary for stream mode. Can be used in batch mode.	Vad

Outputs

Audio

none

Events

name	description
SR Milestone	Raised for each piece of speech in the provided audio data. This event contains recognition results for a single piece of speech. "SR Milestone" results are cumulative. The total recognition result in the end can be prepared by concatenating all the sr-milestone events' texts.

Remarks

If you want to split audio and perform your recognitions you may or may not want to add an AudioSegmentor node before this node for splitting the audio into pieces.
We recommend using end-to-end models for recognitions performed with this node.

License

Needs "sr-rest" option.

Service Link Parametrization

The default address for this node is http://core-sr . This default link is intended for on-premise installations.
When SestekSr is deployed in the cloud SestekSR uses a http address that depends on the name of the model it hosts. eg.

http://core-sr-turkish
http://core-sr-english
http://core-sr-arabic

In order to use a single link in the cloud that you won't need to change every time you switch your models you should prefer the link with the {model_title} placeholder. ie. your default link should be the following: http://core-sr-{model_title}

Project Structure

The simplest project utilizing SR Http can be built as such:

About this project

Note that the project above will not split the audio. All of the inputted audio will be sent to the SR service in bulk, and the output will be just one SR Milestone event including all of the recognized speech.

A better project that gives separate results for each speech fragment can be built like the examples below:

About Audio Splitters

We recommend using Audio Segmenter for Batch projects, however, it is incompatible with Stream flows. VAD node can be used in both Batch and Stream projects. So the project below can give almost real-time SR Outputs. To learn more about Audio Splitters you can check out this link.

One of the most complex examples utilizing the SR HTTP node can be built as such:

In the project above, every speech segment's language is identified first. Then, the speech will be recognized in the identified language. Then, the transcript will be normalized, punctuated, and filtered. At the same time, every SR Milestone event will be translated into a targeted language. All of these will be done in real-time.

At the end of the flow, all of the accumulated SR Milestone events will be used for Topic Detection, and Summarization.

Compatibility

Almost every node that utilizes textual data is compatible with the SR Http and SR WebSocket nodes' event outputs. Which is why we are able to create complex projects such as the example above utilizing these nodes.

Supported flow types

Stream, Batch

Release Notes

v3.6.0

Improved Observability for no speech information.

v3.0.0

Fixed a rarely observed crash

v1.0.0

Introduced node.