Spark-TTS

Free

LLM-based TTS with efficient single-stream tokens.

Open-Source, Voice Cloning, LLM-Based

Overview

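Spark-TTS pairs a Qwen2.5-based LLM with BiCodec, a codec that encodes speech as a single stream of tokens, for efficient, high-quality speech synthesis. It supports zero-shot voice cloning from a reference clip, and it can also create brand-new voices by adjusting attributes such as gender, pitch, and speed. Because the LLM predicts the codec tokens directly and audio is reconstructed from them, the pipeline needs no separate acoustic-feature generation model.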
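
The two modes described above (cloning from a reference clip versus creating a voice from attributes) can be pictured as two shapes of the same request. The following is a minimal sketch only: `VoiceRequest`, its fields, and the allowed attribute values are hypothetical names chosen for illustration and are not the Spark-TTS API.

```python
from dataclasses import dataclass
from typing import Optional

# Illustrative value sets; these are assumptions, not Spark-TTS constants.
GENDERS = ("female", "male")
LEVELS = ("very_low", "low", "moderate", "high", "very_high")


@dataclass(frozen=True)
class VoiceRequest:
    """Hypothetical request object covering both synthesis modes."""
    text: str
    reference_audio: Optional[str] = None  # path to a clip, used for cloning
    gender: Optional[str] = None           # used for voice creation
    pitch: Optional[str] = None            # used for voice creation
    speed: Optional[str] = None            # used for voice creation

    def mode(self) -> str:
        """Return 'clone' or 'create', rejecting ambiguous or empty requests."""
        has_ref = self.reference_audio is not None
        has_attrs = any(v is not None for v in (self.gender, self.pitch, self.speed))
        if has_ref and has_attrs:
            raise ValueError("give either a reference clip or attributes, not both")
        if has_ref:
            return "clone"
        if not has_attrs:
            raise ValueError("give a reference clip or at least one attribute")
        # Validate only the attributes that were actually provided.
        if self.gender is not None and self.gender not in GENDERS:
            raise ValueError(f"unknown gender: {self.gender!r}")
        for name, value in (("pitch", self.pitch), ("speed", self.speed)):
            if value is not None and value not in LEVELS:
                raise ValueError(f"unknown {name} level: {value!r}")
        return "create"
```

In this sketch, `VoiceRequest("Hello", reference_audio="ref.wav").mode()` selects cloning, while `VoiceRequest("Hello", gender="female", pitch="high")` selects voice creation. Treating the two modes as mutually exclusive is a design choice of the sketch, not a documented constraint of the model.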

What makes Spark-TTS special?

Focus: LLM token-based synthesis.
Availability: Free & Open-Source.
Use cases: Zero-shot cloning and controllable voice creation.


Related free models

Chatterbox

A lightweight, fast TTS model built on LLaMA.

Dia

A 1.6B parameter TTS model from Nari Labs.

Kokoro

An 82M parameter TTS model by Hexgrad.
