| Document Number | Revision Number | Revision Date |
|---|---|---|
| KN. GU.30.EN | Rev5 | 07.04.2026 |
Custom Voice
SESTEK provides the ability to create and use your own branded voices. Rather than relying on generic voice profiles, businesses can work with SESTEK to develop a voice that reflects their brand identity and audience expectations.
Learn about the custom voice preparation process →
Human-Like Voices
Powered by neural network technologies, SESTEK TTS delivers natural-sounding, neutral voices that closely resemble human speech. These voices are designed to minimize the mechanical or robotic quality often associated with synthetic speech, resulting in a more engaging listener experience.
SSML Tag Support
SESTEK TTS supports the most widely used SSML (Speech Synthesis Markup Language) tags, giving developers and content creators fine-grained control over how text is spoken. Supported tags include:
<break>- insert pauses of specified duration<say-as>- specify how content should be interpreted (e.g. date, number, currency)<speak>- root element for SSML input<audio>- embed external audio files<voice>- switch between available voices within a single request
Explore full SSML tag support →
Voice Tuning
Voice tuning allows businesses to adjust the tone, pace, and emphasis of synthesized audio without changing the underlying voice model. This includes:
- Speed - increase or decrease the rate of speech to match your content format
- Volume - control output loudness for different playback environments
Voice tuning ensures that synthesized audio accurately represents your brand and messaging across all channels.
Supported Languages
SESTEK TTS currently supports 15 languages, covering a broad range of global markets. Each language is backed by dedicated voice models optimized for natural-sounding output.
Custom Abbreviations
Businesses can define their own abbreviations and pronunciations for specific words or terms. This is especially useful for brand names, product codes, and industry-specific terminology that standard pronunciation models may handle incorrectly.
For example, an abbreviation like "TTS" can be configured to be read as "Text-to-Speech" rather than spelled out letter by letter.
Easy Deployment
SESTEK TTS is available in both on-premises and cloud deployment models, giving businesses full flexibility to choose the environment that best fits their infrastructure and security requirements.
On-premises
With on-premises deployment, businesses retain complete control over their data - ensuring maximum security and privacy. The software runs on your own hardware, enabling faster processing and reduced latency. Integration with existing systems and software is straightforward, making it a seamless addition to your infrastructure.
Cloud
The cloud deployment option delivers all the capabilities of SESTEK TTS without the operational overhead of managing your own environment. Key advantages include:
- Scalability - computing resources can be increased or decreased on demand
- Reduced maintenance - no need to manage hardware or technical infrastructure
- Accessibility - available from any device with an internet connection
