Whisperfile
Whisperfile is a high-performance speech-to-text tool built on whisper.cpp by Georgi Gerganov, et al., and OpenAI's Whisper model weights.
Whisperfile bundles the binary and model weights into a single self-contained executable that runs on Linux, macOS, and Windows without installation.
Quick Start
# transcribe a local audio file
whisperfile -m whisper-tiny.en-q5_1.bin audio.wav
# translate non-English speech to English
whisperfile -m ggml-medium-q5_0.bin -f audio.ogg --translate
# start the HTTP server
whisper-server -m whisper-tiny.en-q5_1.bin --port 8080
Features
- Transcribes WAV, MP3, FLAC, and Ogg Vorbis audio
- GPU acceleration via Apple Metal, NVIDIA CUDA, and AMD ROCm
- Translates speech from any language into English
- HTTP server with a REST API for remote transcription
- Pack the binary and model weights into a single portable executable