Features

Document Number: KN. GU.30.EN
Revision Number: Rev5
Revision Date: 07.04.2026

Custom Voice

SESTEK provides the ability to create and use your own branded voices. Rather than relying on generic voice profiles, businesses can work with SESTEK to develop a voice that reflects their brand identity and audience expectations.

Learn about the custom voice preparation process →


Human-Like Voices

Powered by neural network technologies, SESTEK TTS delivers natural-sounding, neutral voices that closely resemble human speech. These voices are designed to minimize the mechanical or robotic quality often associated with synthetic speech, resulting in a more engaging listener experience.


SSML Tag Support

SESTEK TTS supports the most widely used SSML (Speech Synthesis Markup Language) tags, giving developers and content creators fine-grained control over how text is spoken. Supported tags include:

  • <break> - insert pauses of specified duration
  • <say-as> - specify how content should be interpreted (e.g. date, number, currency)
  • <speak> - root element for SSML input
  • <audio> - embed external audio files
  • <voice> - switch between available voices within a single request

Explore full SSML tag support →
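
As a rough illustration of how the tags listed above fit together, the sketch below defines a small SSML document and checks that it is well-formed before it would be sent for synthesis. The attribute names (interpret-as, time, name) follow the W3C SSML specification; whether SESTEK TTS accepts exactly these attributes and values is an assumption, and EXAMPLE_VOICE is a placeholder rather than a real voice name. The helper tags_used is hypothetical and exists only for this example.

```python
import xml.etree.ElementTree as ET

# Sample SSML input. Attribute names follow the W3C SSML specification;
# "EXAMPLE_VOICE" is a placeholder, not a real SESTEK voice name.
SSML = """<speak>
  Your order will arrive on
  <say-as interpret-as="date">2026-04-07</say-as>.
  <break time="500ms"/>
  <voice name="EXAMPLE_VOICE">Thank you for shopping with us.</voice>
</speak>"""


def tags_used(ssml: str) -> list:
    """Parse the SSML and return its tag names in document order.

    Raises xml.etree.ElementTree.ParseError if the input is not
    well-formed XML, which catches unclosed or mismatched tags early.
    """
    root = ET.fromstring(ssml)
    return [element.tag for element in root.iter()]


if __name__ == "__main__":
    print(tags_used(SSML))
```

A well-formedness check of this kind only confirms that the markup is valid XML. It does not confirm that a given tag or attribute value is supported by the engine.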


Voice Tuning

Voice tuning allows businesses to adjust the tone, pace, and emphasis of synthesized audio without changing the underlying voice model. This includes:

  • Speed - increase or decrease the rate of speech to match your content format
  • Volume - control output loudness for different playback environments

Voice tuning helps keep synthesized audio consistent with your brand and messaging across all channels.


Supported Languages

SESTEK TTS currently supports 15 languages, covering a broad range of global markets. Each language is backed by dedicated voice models optimized for natural-sounding output.


Custom Abbreviations

Businesses can define their own abbreviations and pronunciations for specific words or terms. This is especially useful for brand names, product codes, and industry-specific terminology that standard pronunciation models may handle incorrectly.

For example, an abbreviation like "TTS" can be configured to be read as "Text-to-Speech" rather than spelled out letter by letter.
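
The idea can be pictured as a lookup table applied to the text before synthesis. The sketch below is a conceptual illustration only, not SESTEK's implementation or configuration format: ABBREVIATIONS and expand_abbreviations are hypothetical names, and the single table entry mirrors the "TTS" example above.

```python
import re

# Hypothetical abbreviation table; the entry mirrors the example in the text.
ABBREVIATIONS = {
    "TTS": "Text-to-Speech",
}


def expand_abbreviations(text: str, table: dict) -> str:
    """Replace each whole-word abbreviation in text with its spoken form.

    Matching is case-sensitive and only applies to standalone tokens,
    so "TTS" is expanded but "TTSX" or "tts" is left untouched.
    """
    if not table:
        return text
    # Try longer keys first so a long abbreviation is not shadowed by a
    # shorter one that happens to be its prefix.
    keys = sorted(table, key=len, reverse=True)
    alternatives = "|".join(re.escape(key) for key in keys)
    # The lookarounds require that no word character touches the match.
    pattern = re.compile(r"(?<!\w)(?:" + alternatives + r")(?!\w)")
    return pattern.sub(lambda match: table[match.group(0)], text)


if __name__ == "__main__":
    print(expand_abbreviations("SESTEK TTS reads text aloud.", ABBREVIATIONS))
```

Matching whole tokens only is a deliberate choice in this sketch: it keeps a short abbreviation from being expanded inside a longer product code that happens to contain the same letters.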


Easy Deployment

SESTEK TTS is available in both on-premises and cloud deployment models, giving businesses full flexibility to choose the environment that best fits their infrastructure and security requirements.

On-premises

With on-premises deployment, data stays within your own environment, so you retain complete control over security and privacy. The software runs on your own hardware, which avoids round trips to an external service and can reduce latency. It is designed to integrate with your existing systems and software.

Cloud

The cloud deployment option delivers all the capabilities of SESTEK TTS without the operational overhead of managing your own environment. Key advantages include:

  • Scalability - computing resources can be increased or decreased on demand
  • Reduced maintenance - no need to manage hardware or technical infrastructure
  • Accessibility - available from any device with an internet connection