MiniMax

Text-to-speech


MiniMax is a text-to-speech engine offering expressive voices across many languages, with controls for emotion, speed, pitch, and volume.

Models

SignalWire supports the following MiniMax models. Pick a turbo model for speed and cost, or an hd model for the highest quality. If you don’t specify a model, speech-2.6-turbo is used.

| Model | Description |
| --- | --- |
| speech-2.6-turbo | Default. Latest turbo model: faster and lower-cost |
| speech-2.6-hd | Latest HD model: higher quality |
| speech-02-turbo | Previous-generation turbo model |
| speech-02-hd | Previous-generation HD model |
| speech-01-turbo | First-generation turbo model |
| speech-01-hd | First-generation HD model |
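
The default-model rule can be sketched in a few lines of Python. This is a minimal illustration, not part of any SignalWire SDK: the names `MINIMAX_MODELS`, `DEFAULT_MODEL`, and `resolve_model` are hypothetical, and the tier labels are only a convenience for filtering.

```python
# Model names copied from the list above; tier labels are illustrative only.
MINIMAX_MODELS = {
    "speech-2.6-turbo": "turbo",
    "speech-2.6-hd": "hd",
    "speech-02-turbo": "turbo",
    "speech-02-hd": "hd",
    "speech-01-turbo": "turbo",
    "speech-01-hd": "hd",
}
DEFAULT_MODEL = "speech-2.6-turbo"


def resolve_model(model=None):
    """Return the model to use, falling back to the documented default."""
    if model is None:
        return DEFAULT_MODEL
    if model not in MINIMAX_MODELS:
        raise ValueError(f"unsupported MiniMax model: {model!r}")
    return model
```

Validating the name locally like this catches typos before a script is deployed; the platform's own handling of an unknown model name is not covered here.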

Voices


MiniMax provides a large library of system voices across many languages. A voice’s Voice ID is what you put in the voice string: for example, English_CalmWoman becomes minimax.English_CalmWoman.

Only MiniMax’s built-in system voices are supported. Cloned and AI-generated voices do not work — you must use one of the default voice IDs that MiniMax provides.

Some voice IDs contain spaces or parentheses (for example, Cantonese_ProfessionalHost (F)). Wrap the whole voice string in quotes when a voice ID isn’t a plain word, for example voice: "minimax.Cantonese_ProfessionalHost (F):speech-2.6-turbo".
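
If you generate SWML programmatically, the quoting rule above can be applied automatically. The following Python sketch is one possible approach, with assumptions: the helper name `yaml_voice_value` is hypothetical, and "plain word" is interpreted here as letters, digits, underscores, dots, colons, and hyphens only.

```python
import json
import re

# Assumption: anything outside these characters counts as "not a plain word".
_PLAIN = re.compile(r"[A-Za-z0-9_.:-]+")


def yaml_voice_value(voice_string):
    """Return the voice string, double-quoted when it is not a plain word."""
    if _PLAIN.fullmatch(voice_string):
        return voice_string
    # json.dumps produces a double-quoted string whose escapes YAML also accepts.
    return json.dumps(voice_string)


print("voice:", yaml_voice_value("minimax.Cantonese_ProfessionalHost (F):speech-2.6-turbo"))
```

Quoting every voice string unconditionally is also valid YAML and may be simpler if you don't need the output to look hand-written.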

Browse the full list of voice IDs in MiniMax’s documentation.

Languages

MiniMax supports the following languages and automatically detects the language of your text:

Arabic, Cantonese, Chinese — Mandarin, Czech, Dutch, English, Finnish, French, German, Greek, Hindi, Indonesian, Italian, Japanese, Korean, Polish, Portuguese, Romanian, Russian, Spanish, Thai, Turkish, Ukrainian, Vietnamese.

Refer to the MiniMax developer platform for the most up-to-date reference to supported languages and voices.

Usage

A voice identifier string has three parts: the minimax engine code, a voice ID, and an optional model.

Format: minimax.<voiceId>:<model>

  • voiceId (required): a MiniMax system voice ID (for example, English_CalmWoman); see Voices.
  • model (optional): one of the models above. Defaults to speech-2.6-turbo.

Examples:

minimax.English_CalmWoman:speech-2.6-turbo
minimax.English_Trustworth_Man:speech-2.6-hd
minimax.Japanese_KindLady:speech-2.6-turbo
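
As a rough sketch, the format above can be assembled with a small helper. The function name `build_voice_string` is hypothetical and not part of any SignalWire library; it only concatenates the parts described in this section.

```python
def build_voice_string(voice_id, model=None):
    """Assemble minimax.<voiceId>[:<model>] from its parts."""
    if not voice_id:
        raise ValueError("voice_id is required")
    voice = f"minimax.{voice_id}"
    # The model suffix is optional; without it the default model applies.
    return f"{voice}:{model}" if model else voice


print(build_voice_string("English_CalmWoman"))
print(build_voice_string("Japanese_KindLady", "speech-2.6-turbo"))
```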

You can also set the model with the separate model parameter of the languages SWML method instead of appending :model to the voice string:

languages:
  - name: English
    code: en-US
    voice: minimax.English_CalmWoman
    model: speech-2.6-turbo
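
To move existing entries from the inline `:model` form to the separate `model` parameter, something like the following Python sketch could be used. The helper `split_inline_model` is hypothetical. How the platform resolves an entry that specifies two different models is not documented here, so this sketch simply rejects that case as an assumption.

```python
KNOWN_MODELS = {
    "speech-2.6-turbo", "speech-2.6-hd",
    "speech-02-turbo", "speech-02-hd",
    "speech-01-turbo", "speech-01-hd",
}


def split_inline_model(entry):
    """Return a copy of a languages entry with any :model suffix moved to `model`."""
    result = dict(entry)
    voice, sep, suffix = result["voice"].rpartition(":")
    if sep and suffix in KNOWN_MODELS:
        # Hypothetical policy: refuse entries that name two different models.
        if result.get("model", suffix) != suffix:
            raise ValueError("voice suffix and model parameter disagree")
        result["voice"] = voice
        result["model"] = suffix
    return result


entry = {"name": "English", "code": "en-US",
         "voice": "minimax.English_CalmWoman:speech-2.6-turbo"}
print(split_inline_model(entry))
```

Only a suffix that matches a known model name is split off, so a voice string without a model passes through unchanged.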

Build with MiniMax on SignalWire

1. Create a Space and add credit

If you don’t have one yet, you’ll need to create a SignalWire Space. Be sure to add some credit to test with.

2. Add a new Resource

Navigate to the Resources tab in your SignalWire Dashboard and click + Add New to create a new Resource.

3. Create a SWML Script

From the Resources menu, select SWML Script. Name it something fun and recognizable. Ours is titled MiniMax Wizard.

Next, paste the following starter script into the text box, and hit Save:

version: 1.0.0
sections:
  main:
    - ai:
        prompt:
          text: |
            You're the Calm Woman, a voice from MiniMax's TTS engine!
            Introduce yourself, and have a conversation about programmable unified communications on the SignalWire platform.
        languages:
          - name: English
            code: en-US
            voice: minimax.English_CalmWoman:speech-2.6-turbo
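
SWML can also be written as JSON, so the same starter script can be generated from code. The Python sketch below mirrors the structure of the script above; the function name `starter_script` is hypothetical, and the output is meant to be pasted into the SWML Script text box rather than sent through any particular API.

```python
import json


def starter_script(voice="minimax.English_CalmWoman:speech-2.6-turbo"):
    """Build the starter SWML document as a plain dictionary."""
    prompt = (
        "You're the Calm Woman, a voice from MiniMax's TTS engine!\n"
        "Introduce yourself, and have a conversation about programmable "
        "unified communications on the SignalWire platform.\n"
    )
    return {
        "version": "1.0.0",
        "sections": {
            "main": [
                {
                    "ai": {
                        "prompt": {"text": prompt},
                        "languages": [
                            {"name": "English", "code": "en-US", "voice": voice}
                        ],
                    }
                }
            ]
        },
    }


# Serialize to JSON text suitable for pasting into a SWML Script.
print(json.dumps(starter_script(), indent=2))
```

Passing a different `voice` argument swaps the voice without touching the rest of the script.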

4. Buy and assign a phone number

Navigate to the Phone Numbers section of the Dashboard’s left sidebar menu.

Purchase a phone number and assign it to the desired SWML script.

[Figure: Assigning a phone number to the SWML Script. A purchased phone number showing assignment to a specified Resource.]

5. Give it a call!

Call the number you just assigned to chat with your new AI voice application on the phone.

Next steps with SWML

Now you’ve deployed your very first SignalWire voice AI application using MiniMax voices. Next, dive deeper into SWML to explore its capabilities!