A flow-based generative network for fast, high-quality speech synthesis from mel-spectrograms.
WaveGlow is a flow-based generative neural network for speech synthesis that converts mel-spectrograms into high-quality audio. It solves the problem of slow autoregressive audio generation by providing fast, efficient synthesis while maintaining audio quality comparable to state-of-the-art WaveNet implementations. The model uses a single network architecture trained with maximum likelihood estimation for simplicity and stability.
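The key mechanism behind such a flow can be illustrated with a toy affine coupling layer in NumPy (the function names and random weights below are illustrative, not the repo's API): half the channels pass through unchanged and parameterize an affine transform of the other half, so the transform inverts exactly and its Jacobian log-determinant is cheap to compute.

```python
import numpy as np

rng = np.random.default_rng(0)

def coupling_forward(x, weight, bias):
    # Split channels: the first half conditions an affine transform of the second.
    xa, xb = np.split(x, 2)
    log_s = np.tanh(weight @ xa)  # stand-in for WaveGlow's conditioning network
    t = bias @ xa
    yb = np.exp(log_s) * xb + t
    # The Jacobian is triangular, so its log-determinant is just sum(log s).
    return np.concatenate([xa, yb]), log_s.sum()

def coupling_inverse(y, weight, bias):
    # Because ya == xa, the same conditioning values can be recomputed exactly.
    ya, yb = np.split(y, 2)
    log_s = np.tanh(weight @ ya)
    t = bias @ ya
    xb = (yb - t) * np.exp(-log_s)
    return np.concatenate([ya, xb])

x = rng.standard_normal(8)
W, B = rng.standard_normal((4, 4)), rng.standard_normal((4, 4))
y, log_det = coupling_forward(x, W, B)
x_rec = coupling_inverse(y, W, B)
print(np.allclose(x, x_rec))  # True: the transform inverts exactly
```

Exact invertibility is what lets all samples be generated in parallel: inference just runs the inverse flow on Gaussian noise, with no sample-by-sample autoregressive loop.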
AI researchers and engineers working on speech synthesis, text-to-speech systems, and generative audio models who need fast inference without sacrificing quality.
Developers choose WaveGlow for its combination of fast inference speeds (1200 kHz on V100 GPUs) and high audio quality, achieved through an elegant flow-based architecture that eliminates the complexity of autoregressive models while maintaining competitive Mean Opinion Scores.
A Flow-based Generative Network for Speech Synthesis
Generates audio at 1200 kHz on an NVIDIA V100 GPU, enabling real-time speech synthesis applications as highlighted in the README.
Achieves Mean Opinion Scores comparable to the best WaveNet implementations, ensuring professional audio quality without autoregressive complexity.
Uses a single network with one cost function for likelihood maximization, making training stable and straightforward, as described in the paper summary.
Flow-based architecture combines insights from Glow and WaveNet to eliminate autoregression, reducing computational overhead for faster generation.
Includes FP16 training and inference options, optimizing performance on compatible NVIDIA hardware, with configurable settings in config.json.
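A sketch of the relevant fragment, assuming the repo's config.json groups training options under a train_config block (the exact field names may differ between versions of the repo):

```json
{
  "train_config": {
    "fp16_run": true
  }
}
```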
Requires git submodule initialization and installation of NVIDIA Apex, which can be challenging and platform-specific, adding setup overhead.
Optimized for NVIDIA V100 GPUs with FP16 support, limiting portability and performance on other GPUs or CPU-only environments.
Inference depends on pre-computed mel-spectrograms, adding an extra step compared to end-to-end TTS systems, as seen in the generation instructions.
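Those mel-spectrograms are typically produced by a windowed STFT followed by a mel filterbank projection. A self-contained NumPy sketch of that step (the 22050 Hz / 1024-point FFT / 80-mel settings are common Tacotron 2-style defaults assumed here, not values read from the repo's config):

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    # Triangular filters spaced evenly on the mel scale from 0 Hz to Nyquist.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(1, n_mels + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):
            fb[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):
            fb[i - 1, k] = (right - k) / (right - center)
    return fb

def mel_spectrogram(audio, sr=22050, n_fft=1024, hop=256, n_mels=80):
    # Frame the signal, apply a Hann window, take the power spectrum,
    # then project each frame onto the mel filterbank.
    window = np.hanning(n_fft)
    n_frames = 1 + (len(audio) - n_fft) // hop
    frames = np.stack([audio[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2
    return mel_filterbank(sr, n_fft, n_mels) @ power.T  # (n_mels, n_frames)

# One second of a 440 Hz tone yields an (80, 83) mel-spectrogram at these settings.
audio = np.sin(2 * np.pi * 440.0 * np.arange(22050) / 22050.0)
mel = mel_spectrogram(audio)
print(mel.shape)  # (80, 83)
```

In practice a TTS front end such as Tacotron 2 produces these features directly; the sketch just shows why inference needs them as a separate input rather than raw text.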
README has a TODO for dataset download instructions and limited testing for multi-GPU training, indicating gaps that may hinder newcomers.