THE NEXT ERA FOR GENERATIVE SPEECH

MARS8: The Multilingual Text-to-Speech Model Built for Production

MARS8 is a family of production-grade text-to-speech models built so every use case, language, and voice profile gets the same rock-solid reliability when millions are listening.

Build with MARS8

View benchmarks

launching natively ON ALL TOP COMPUTE PLATFORMS

Hear the Difference

Live-Ready Voice vs Everything Else

Most text-to-speech models are built for conversational demos. MARS8 is built for moments where timing, emotion, and clarity cannot fail.

Test in our TTS Battleground

Lorem ipsum dolor sit amet [Excited], consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua [Sad]. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat [Laugh].

Lorem ipsum dolor sit amet [Excited], consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua [Sad]. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat [Laugh].

Title 2

CAMB.AI and Broadcom deliver ultra-fast, ultra-private speech and multilingual AI to millions of next-gen devices—engineered for global scale.

5x

Metric Name

2x

Metric Name

Title 3

CAMB.AI and Broadcom deliver ultra-fast, ultra-private speech and multilingual AI to millions of next-gen devices—engineered for global scale.

5x

Metric Name

2x

Metric Name

Title 4

CAMB.AI and Broadcom deliver ultra-fast, ultra-private speech and multilingual AI to millions of next-gen devices—engineered for global scale.

5x

Metric Name

2x

Metric Name

Lorem ipsum dolor sit amet [Excited], consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua [Sad]. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat [Laugh].

Lorem ipsum dolor sit amet [Excited], consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua [Sad]. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat [Laugh].

Title 2

CAMB.AI and Broadcom deliver ultra-fast, ultra-private speech and multilingual AI to millions of next-gen devices—engineered for global scale.

5x

Metric Name

2x

Metric Name

Title 3

CAMB.AI and Broadcom deliver ultra-fast, ultra-private speech and multilingual AI to millions of next-gen devices—engineered for global scale.

5x

Metric Name

2x

Metric Name

Title 4

CAMB.AI and Broadcom deliver ultra-fast, ultra-private speech and multilingual AI to millions of next-gen devices—engineered for global scale.

5x

Metric Name

2x

Metric Name

THE WORLDS FIRST FAMILY OF TTS MODELS

The MARS8 family
Specialized models for each use-case

Read our full research blog post →

MARS8-Flash

Low-latency multilingual TTS for Conversational AI Agents

Parameters: 600M

Try it on our API

Use Cases:

Real-time voice agents
Contact centers
Live conversational AI

MARS8.1-Pro

Highest quality target. Improved pronunciation, expressiveness with high-pitch references, prosody, accent control / coverage.

Parameters: 600M

Try it on our API

Use Cases:

Expressive dubbing
Audiobooks
Digital media

MARS8-Instruct

Fine-grained control over emotion, timing, and style; independent of speaker identity.

Parameters: 1.2B

Try it on our API

Use Cases:

Film & TV dubbing
Precise prosody control
Creative editing workflows

MARS8-Nano

When memory and compute are constrained but production quality still matters.

Parameters: 50M

Use Cases:

Automotive systems
Embedded devices
Edge deployments

MARS8 Family

BENCHMARK RESULTS

Redefining the new baseline in TTS

Run the benchmarks yourself →

PQ ↑

Approximate mean opinion score on a 1–10 scale, predicted by Meta’s Audiobox‑Aesthetics model; higher PQ indicates better production quality.

WavLM SV cosine similarity↑

Speaker similarity metric measured as the mean cosine similarity between generated audio and reference audio, using the wavlm-base-sv embedding model.

CAM++ cosine similarity ↑

Speaker similarity metric measured as the mean cosine similarity between generated audio and reference audio, using the CAM++ embedding model.

CE ↑

Approximate mean opinion score on a 1–10 scale, predicted by Meta’s Audiobox‑Aesthetics model; higher CE reflects greater content enjoyment.

CER ↓

Percentage of characters that are incorrect in the generated output, as measured by Whisper ASR.

MARS8-Pro

MARS8-Flash

Sonic-3

Speech-2.6-hd

Multilingual_v2

Multilingual_v3

7.4498

7.4523

6.9471

6.9468

7.4516

7.1934

0.8676

0.8666

0.8420

0.8666

0.8109

0.8253

0.7097

0.7066

0.5134

0.5878

0.3912

0.336

5.4308

5.4299

5.0445

4.9877

5.4146

5.1816

5.77%

5.67%

8.54%

11.30%

4.39%

14.62%

Production Economics

Voice AI that moves you from demo‑ware to production realities.

Voice systems behave very differently at scale. Once latency budgets tighten, usage spikes, and compliance kicks in, architectural decisions start to dominate outcomes. MARS8 is built for these real‑world constraints, not for API convenience.

LANGUAGES SUPPORT FOR 99% OF THE WORLD

Global language coverage

MARS8 is the multilingual backbone that lets you cover 99% of the world while staying native to how your audiences speak and listen.

English

Hindi (India)

French (France)

Spanish (Spain)

German

Japanese

Modern Standard Arabic

Korean

Chinese (Simplified)

Italian

Spanish (Mexico)

Portuguese (Portugal)

Portuguese (Brazil)

Indonesian

Dutch

Russian

Arabic (Saudi Arabia)

Tamil

Telugu

Bengali (India)

Arabic (Egypt)

Arabic (Syria)

Arabic (Morocco)

Marathi

Kannada

Bengali (Bangladesh)

Assamese

Malayalam

French (Canada)

Polish

Turkish

Want the full technical breakdown?

For a detailed look at MARS8’s architecture, deployment patterns, and performance characteristics, read the full technical article on our blog.

Technical document

Build, scale, or partner
on your terms

MARS8 is designed to scale across startups, enterprises, and infrastructure providers. 
Whether you’re building a product or enabling others to build, there’s a direct path to get started.

Get started for free →