Microsoft Text To Speech Integration Node
Parameters
| name | description | default |
|---|---|---|
| Credential | API Key Credential for the Microsoft Azure Speech service | empty |
| Azure Region | Azure Speech resource region (e.g. eastus, westeurope). A full list of supported regions is available on Azure regions | eastus |
| Voice Name | Name of the voice to use (e.g. en-US-AriaNeural, en-US-GuyNeural, en-GB-RyanNeural) | en-US-AriaNeural |
| Speaking Style | Optional expressive speaking style (e.g. cheerful, sad, angry, excited) | empty |
| Speaking Rate | Controls the speed of the voice. Examples: default, x-slow, slow, medium, fast, x-fast, or percentage values like +20%, -15%. | 1.0 |
| Speaking Pitch | Optional pitch adjustment (e.g. +2st, -3st, default) | default |
| Speaking Volume | Optional volume setting (e.g. default, loud, soft) | default |
Inputs
Audio
none
Events
| name | description | known nodes that generate this event |
|---|---|---|
Every Event that Contains the Data.Text field |
A TTS synthesis audio can be generated from any event that is text based. Users can also generate their custom event that has Data.Text and send it through the Entry, or External Relay nodes. |
SR Http, SR WebSocket, Translate |
Outputs
Audio
Synthesized speech is sent to output in PCM16 Format with a Wave header.
Events
| name | description |
|---|---|
| Start of TTS Fragment | Sent just before a TTS operation writes the result to Audio output. Contains the text that was used for the synthesis under the Data.Text field. |
| End of TTS Fragment | Sent just after a TTS operation has finished writing to Audio output. |
Remarks
Requirements
The node requires a valid Microsoft Azure Subscription key and Region.
You can provision a Speech resource in the. Azure Portal
More information about authentication can be found in the. Azure Speech Service documentation
The list of available voices and styles can be found in Microsoft’s voices documentation
Custom TTS Event
A sample tts-request event would look as follows:
{
"Event": {
"Name": "Custom TTS Event",
"Data" : {
"Text": "hello world"
}
}
}
Data.Text: The text that will be synthesized.
To learn more about sending Events to projects, you can check the HTTP and External Relay pages.
Project Structure
The project structures defined in TTS Http apply to this node as well.
You can use multiple Microsoft TTS nodes with different voices or styles, and select between them dynamically using the Channel field in your custom events.
Supported flow types
Stream, Batch
