Amazon Polly
Amazon Web Services’ Polly TTS engine includes several models to accommodate different use cases.
Models
SignalWire supports the following three Amazon models.
Polly Standard is a traditional, cost-effective, and reliable TTS model.
Example voice ID string: amazon.Emma:en-GB or amazon.Emma:standard:en-GB
Polly Neural produces more natural, human-like speech than Polly Standard.
Example voice ID string: amazon.Kendra:neural:en-US
Polly Generative is Amazon’s largest and most realistic model.
Example voice ID string: amazon.Danielle:generative:en-US
Languages
Consult AWS documentation for a comprehensive and up-to-date list of supported voices, as well as information on accented and fully bilingual voices.
All Amazon Polly voices support accented bilingual pronunciation through the use of the SSML lang tag.
Fully bilingual voices such as Aditi, Kajal, Hala, and Zayd are designed to speak two languages fluently; see the AWS documentation for details.
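As a rough illustration of accented bilingual pronunciation, the sketch below builds an SSML string that wraps a foreign phrase in the lang tag. The helper name with_lang is hypothetical, and the sketch only produces the markup; how the resulting string is passed to a voice depends on the SignalWire method you use.

```python
from xml.sax.saxutils import escape, quoteattr


def with_lang(text_before: str, foreign_text: str, lang_code: str,
              text_after: str = "") -> str:
    """Return an SSML document whose foreign_text is wrapped in a <lang> tag.

    with_lang is a hypothetical helper for illustration; it is not part of
    any SignalWire or AWS SDK.
    """
    return (
        "<speak>"
        f"{escape(text_before)}"
        f"<lang xml:lang={quoteattr(lang_code)}>{escape(foreign_text)}</lang>"
        f"{escape(text_after)}"
        "</speak>"
    )


ssml = with_lang("The French word for cat is ", "chat", "fr-FR", ".")
print(ssml)
```

Escaping the text keeps characters such as & and < from breaking the SSML document.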
Usage
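Amazon voice IDs conform to the following format:
amazon.<voice>:<model>:<language>
Examples:
amazon.Emma:standard:en-GB
amazon.Kendra:neural:en-US
amazon.Danielle:generative:en-US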
Simplified Format:
For voices that only support one model, you can omit the model parameter:
Example: amazon.Aditi:hi-IN (Standard model implied)
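The voice ID strings on this page can be assembled and checked programmatically. The following is a minimal sketch, assuming the pattern inferred from the examples here (the amazon prefix, then a voice name, an optional model, and a language code separated by colons). The names AmazonVoice, build_voice_id, and parse_voice_id are hypothetical and not part of any SignalWire SDK.

```python
from typing import NamedTuple, Optional

# Model names taken from the examples on this page.
MODELS = {"standard", "neural", "generative"}


class AmazonVoice(NamedTuple):
    voice: str
    model: Optional[str]  # None when the simplified format is used
    language: str


def build_voice_id(voice: str, language: str, model: Optional[str] = None) -> str:
    """Assemble a voice ID string; omit the model for the simplified format."""
    if model is not None and model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    parts = [voice] + ([model] if model else []) + [language]
    return "amazon." + ":".join(parts)


def parse_voice_id(voice_id: str) -> AmazonVoice:
    """Split a voice ID string into voice, optional model, and language."""
    engine, sep, rest = voice_id.partition(".")
    if engine != "amazon" or not sep:
        raise ValueError(f"not an amazon voice ID: {voice_id!r}")
    parts = rest.split(":")
    if len(parts) == 3:
        voice, model, language = parts
    elif len(parts) == 2:
        (voice, language), model = parts, None
    else:
        raise ValueError(f"unexpected voice ID shape: {voice_id!r}")
    if model is not None and model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    return AmazonVoice(voice, model, language)


print(build_voice_id("Kendra", "en-US", "neural"))
print(parse_voice_id("amazon.Aditi:hi-IN"))
```

The parser only validates the shape of the string; it does not check that a given voice actually exists or supports the requested model.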
The polly engine code is being deprecated. Use amazon instead.
Amazon Polly accepts at most 3000 chargeable characters in a single request. A TTS request longer than 3000 characters produces silence instead of speech.
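One way to stay under the limit is to split long text into several requests. The sketch below is one possible approach, not a SignalWire feature: it breaks plain text at sentence boundaries and falls back to a hard split for any single sentence that is itself too long. It treats len() of the text as a conservative stand-in for chargeable characters and does not handle SSML markup. The function name chunk_text is hypothetical.

```python
import re
from typing import List

POLLY_MAX_CHARS = 3000  # per-request limit stated on this page


def chunk_text(text: str, limit: int = POLLY_MAX_CHARS) -> List[str]:
    """Split plain text into pieces of at most `limit` characters.

    Sentences are kept whole where possible; a sentence longer than
    `limit` is cut at the limit as a last resort.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        # Hard-split any single sentence that exceeds the limit on its own.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


long_text = "Hello there. " * 500
pieces = chunk_text(long_text)
print(len(pieces), [len(p) for p in pieces])
```

Each resulting piece can then be sent as its own TTS request.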
Examples
See how to use Amazon Polly voices on the SignalWire platform.
SWML
Use the languages SWML method to set one or more voices for an AI agent.
Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.
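The two approaches above might look roughly like the following when a SWML document is generated as JSON from Python. This is a sketch under assumptions: only languages, play, and say_voice come from the text above, while the surrounding structure (version, sections, main, ai, prompt, the name and code fields, and the say: URL prefix) reflects a typical SWML layout and should be checked against the SWML reference. The helper names are hypothetical.

```python
import json


def ai_agent_swml(voice_id: str, language_name: str, language_code: str,
                  prompt_text: str) -> dict:
    """Build a SWML document that sets one voice for an AI agent.

    Field names other than `languages` are assumptions to verify
    against the SWML reference.
    """
    return {
        "version": "1.0.0",
        "sections": {
            "main": [
                {
                    "ai": {
                        "prompt": {"text": prompt_text},
                        "languages": [
                            {
                                "name": language_name,
                                "code": language_code,
                                "voice": voice_id,
                            }
                        ],
                    }
                }
            ]
        },
    }


def play_tts_swml(voice_id: str, text: str) -> dict:
    """Build a SWML document that speaks `text` with the given voice.

    The `say:` URL prefix is an assumption to verify against the
    SWML reference.
    """
    return {
        "version": "1.0.0",
        "sections": {
            "main": [{"play": {"url": f"say:{text}", "say_voice": voice_id}}]
        },
    }


agent_doc = ai_agent_swml("amazon.Kendra:neural:en-US", "English", "en-US",
                          "You are a helpful assistant.")
play_doc = play_tts_swml("amazon.Emma:standard:en-GB", "Welcome to SignalWire.")
print(json.dumps(agent_doc, indent=2))
print(json.dumps(play_doc, indent=2))
```

Building the document as a dictionary and serializing it with json.dumps avoids quoting mistakes that are easy to make when writing the JSON by hand.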