Natural language processing

Chatterbox is a free open-source voice cloning model with emotional tone control

Chatterbox is a free open-source voice cloning model with emotional tone control


Resemble AI has released Chatterbox, a free open-source voice cloning model that runs locally and supports emotional tone control like “dramatic” or “monotone.” It clones voices using just a few seconds of audio and responds in under 200 milliseconds. The tool works on Windows, Mac, and Linux with 5–6 GB of video memory. All generated speech includes a faint watermark, “PerTh,” to identify it as AI-made. According to Resemble AI, it performed better than ElevenLabs in blind tests. Currently, it only supports English.


Decoder EN demo (heightened emotional expression)

Chatterbox is licensed under MIT and targets developers. Check out the demo here.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Chatterbox is a free open-source voice cloning model with emotional tone control

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Chatterbox is a free open-source voice cloning model with emotional tone control

Source link