A unified web interface for text-to-speech, voice cloning, and audio generation with support for dozens of AI models.
TTS WebUI is an open-source web application that provides a unified interface for running and experimenting with dozens of text-to-speech, voice cloning, and audio generation AI models. It solves the problem of managing multiple disparate audio AI projects by consolidating them into a single, extensible platform with a modern web UI.
TTS WebUI is aimed at AI enthusiasts, developers, and researchers working with speech synthesis, voice cloning, or audio generation who want a free, self-hosted alternative to commercial TTS services with access to cutting-edge open-source models.
Developers choose TTS WebUI because it offers an unparalleled collection of audio AI models in one place, is completely free and open-source, supports easy extension via a marketplace, and provides self-hosting capabilities with Docker and OpenAI-compatible API integration.
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
Consolidates over 30 text-to-speech, voice cloning, and audio generation models—from Bark and Tortoise to MusicGen and RVC—in one interface, as detailed in the extensive supported models table.
Features a built-in extension marketplace and external catalog for adding new models and tools, enabling community-driven growth without modifying core code.
Offers both a modern React frontend and a classic Gradio UI, catering to different user preferences with separate ports, as shown in the installation and screenshots.
Provides an API that mimics OpenAI's TTS endpoint, allowing easy integration with AI chatbots like Silly Tavern and OpenWebUI, documented in the Integrations section.
Supports local or server deployment via Docker, manual installation, or a one-click installer, giving full control over data and model usage.
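As a minimal sketch of how an OpenAI-compatible speech endpoint is typically called: the port, model name, and voice below are placeholder assumptions for illustration, not confirmed defaults of TTS WebUI.

```python
import json
import urllib.request


def build_speech_request(base_url, model, voice, text):
    """Construct an OpenAI-style /v1/audio/speech request (URL + JSON payload)."""
    payload = {"model": model, "voice": voice, "input": text}
    return base_url.rstrip("/") + "/v1/audio/speech", payload


def synthesize(base_url, model, voice, text, out_path="speech.mp3"):
    """POST the request and save the returned audio bytes.

    Requires a running server that implements the OpenAI TTS API shape.
    """
    url, payload = build_speech_request(base_url, model, voice, text)
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # A local self-hosted server usually accepts any dummy key.
            "Authorization": "Bearer sk-local",
        },
    )
    with urllib.request.urlopen(req) as resp, open(out_path, "wb") as f:
        f.write(resp.read())


# Example with a hypothetical local port and model name:
url, payload = build_speech_request("http://localhost:7778", "kokoro", "alloy", "Hello!")
print(url)  # http://localhost:7778/v1/audio/speech
```

Because the request shape matches OpenAI's, clients such as SillyTavern or OpenWebUI only need their base URL pointed at the local server.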
The README acknowledges persistent "red messages in console" caused by conflicting package requirements across the bundled AI projects, leaving a fragile environment that may break with updates.
The base installation consumes 10.7 GB, and each model adds another 2-8 GB, which can be prohibitive on systems with limited disk space.
Extensions and core updates require manual intervention through the control panel followed by an app restart; there is no automated update management for unattended deployments.
Supports only Python 3.10 and 3.11 (3.12 is unsupported), forcing users to maintain an older Python environment and potentially complicating compatibility with newer tooling.