High-performance C/C++ port of OpenAI's Whisper for efficient, cross-platform speech recognition.
whisper.cpp is a high-performance C/C++ port of OpenAI's Whisper model for automatic speech recognition (ASR). It provides efficient, offline transcription and translation of audio files, optimized to run on a wide variety of hardware from CPUs to GPUs and specialized accelerators. The implementation is lightweight, dependency-free, and designed for cross-platform deployment.
It targets developers and researchers who need efficient, offline speech recognition on resource-constrained devices, embedded systems, mobile applications, or servers. It is ideal for projects requiring portable ASR without Python dependencies or cloud services.
It offers significantly faster inference and lower resource usage than the original Python implementation, with support for hardware acceleration across multiple platforms. The self-contained, portable design allows integration into diverse applications, from real-time voice assistants to batch transcription pipelines.
Port of OpenAI's Whisper model in C/C++
Supports Apple Silicon with Metal/Core ML, x86 with AVX, ARM NEON, and WebAssembly, enabling efficient deployment from servers down to mobile devices.
Offers GPU inference through NVIDIA CUDA, Vulkan, and OpenVINO backends, and also targets specialized NPUs.
Performs zero memory allocations at runtime, uses mixed F16/F32 precision, and supports optional integer quantization to reduce model size and speed up processing.
Implemented in plain C/C++ with no third-party dependencies, giving a minimal footprint and fully offline operation on edge devices.
The CLI tool only accepts 16-bit WAV files sampled at 16 kHz, so other formats such as MP3 or Opus must first be converted with an external tool like FFmpeg, which adds a step to the pipeline.
Enabling hardware acceleration such as Core ML or OpenVINO involves multiple steps, including installing Python dependencies and generating converted models, which increases initial setup time.
Focused solely on inference with no support for training or fine-tuning models, restricting use cases to pre-trained models only.