Cartesia

View as Markdown

Cartesia offers a wide selection of fully multilingual voices with very low latency. Create a Cartesia account to browse and test voices in the Cartesia Playground.

Models

Cartesia provides multiple generations of its Sonic TTS model:

ModelDescription
sonic-3Default. Latest model with enhanced naturalness
sonic-2Second-generation model with improved quality
sonic-turboOptimized for ultra-low latency
sonicThe first version of Sonic, optimized for accuracy and low latency.

All Cartesia voices can be used with any model.

Voices

Copy the voice ID from the table below:

Voice nameVoice ID
German Conversational Woman
Nonfiction Man
Friendly Sidekick
French Conversational Lady
French Narrator Lady
German Reporter Woman
Indian Lady
British Reading Lady
British Narration Lady
Japanese Children Book
Japanese Woman Conversational
Japanese Male Conversational
Reading Lady
Newsman
Child
Meditation Lady
Maria
1920’s Radioman
Newslady
Calm Lady
Helpful Woman
Mexican Woman
California Girl
Korean Narrator Woman
Russian Calm Lady
Russian Narrator Man 1
Russian Narrator Man 2
Russian Narrator Woman
Hinglish Speaking Lady
Italian Narrator Woman
Polish Narrator Woman
Chinese Female Conversational
Pilot over Intercom
Chinese Commercial Man
French Narrator Man
Spanish Narrator Man
Reading Man
New York Man
Friendly French Man
Barbershop Man
Indian Man
Australian Customer Support Man
Friendly Australian Man
Wise Man
Friendly Reading Man
Customer Support Man
Dutch Confident Man
Dutch Man
Hindi Reporter Man
Italian Calm Man
Italian Narrator Man
Swedish Narrator Man
Polish Confident Man
Spanish-speaking Storyteller Man
Kentucky Woman
Chinese Commercial Woman
Middle Eastern Woman
Hindi Narrator Woman
Sarah
Sarah Curious
Laidback Woman
Reflective Woman
Helpful French Lady
Pleasant Brazilian Lady
Customer Support Lady
British Lady
Wise Lady
Australian Narrator Lady
Indian Customer Support Lady
Swedish Calm Lady
Spanish Narrator Lady
Salesman
Yogaman
Movieman
Wizardman
Australian Woman
Korean Calm Woman
Friendly German Man
Announcer Man
Wise Guide Man
Midwestern Man
Kentucky Man
Brazilian Young Man
Chinese Call Center Man
German Reporter Man
Confident British Man
Southern Man
Classy British Man
Polite Man
Mexican Man
Korean Narrator Man
Turkish Narrator Man
Turkish Calm Man
Hindi Calm Man
Hindi Narrator Man
Polish Narrator Man
Polish Young Man
Alabama Male
Australian Male
Anime Girl
Japanese Man Book
Sweet Lady
Commercial Lady
Teacher Lady
Princess
Commercial Man
ASMR Lady
Professional Woman
Tutorial Man
Calm French Woman
New York Woman
Spanish-speaking Lady
Midwestern Woman
Sportsman
Storyteller Lady
Spanish-speaking Man
Doctor Mischief
Spanish-speaking Reporter Man
Young Spanish-speaking Woman
The Merchant
Stern French Man
Madame Mischief
German Storyteller Man
Female Nurse
German Conversation Man
Friendly Brazilian Man
German Woman
Southern Woman
British Customer Support Lady
Chinese Woman Narrator


For more information, refer to Cartesia’s guide to Choosing a Voice.

Usage

Cartesia voice IDs conform to the following format:

cartesia.<voice_id>:<model>

Parameters:

  • voice_id (required): The UUID voice identifier from the Voices table
  • model (optional): One of the Sonic models listed above (default: sonic-3)

Examples:

cartesia.a167e0f3-df7e-4d52-a9c3-f949145efdab
cartesia.694f9389-aac1-45b6-b726-9d9369183238:sonic-3
cartesia.829ccd10-f8b3-43cd-b8a0-4aeaa81f3b30:sonic-turbo

Languages

Cartesia voices are fully multilingual when used with sonic-multilingual, sonic-2, sonic-3, or sonic-3 models. The multilingual models automatically adapt to the input text language.

Supported languages include: English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Russian, Chinese, Japanese, Korean, Hindi, Turkish, Swedish, and many more.

For the complete list, refer to Cartesia’s Sonic 3 language support and Sonic 2 language support references.


Examples

See how to use Cartesia voices on the SignalWire platform.

Use the languages SWML method to set one or more voices for an AI agent.

1version: 1.0.0
2sections:
3 main:
4 - ai:
5 prompt:
6 text: Have an open-ended conversation about flowers.
7 languages:
8 - name: English
9 code: en-US
10 voice: cartesia.a167e0f3-df7e-4d52-a9c3-f949145efdab

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

1version: 1.0.0
2sections:
3 main:
4 - set:
5 say_voice: "cartesia.a167e0f3-df7e-4d52-a9c3-f949145efdab"
6 - play: "say:Greetings. This is the Customer Support Man voice from Cartesia's Sonic text-to-speech model."