How Dikt Works

From voice to polished text in seconds. A simple four-step pipeline that runs locally or in the cloud.

The Pipeline

Press your global hotkey and start talking. NAudio captures high-quality audio from your microphone in the background, in any application.

Your audio is converted to text using Whisper.cpp locally on your machine, or via OpenAI's cloud API for maximum accuracy.

Optional AI post-processing with Claude or GPT fixes grammar, punctuation, filler words, and applies formatting to your text.

The polished text is automatically injected at your cursor position in whatever application you're using. No copy-paste needed.

Choose the approach that fits your needs. Use both with automatic failover.

Your voice data is yours. Dikt is designed from the ground up to keep your information private and secure.

Choose the model that balances speed, accuracy, and disk space for your needs.

Model	Size	Speed	Accuracy	Best For
tiny	~75 MB	Fastest	Basic	Quick drafts, low-resource machines
base	~142 MB	Fast	Good	General use with decent hardware
small	~466 MB	Moderate	Very Good	Balanced speed and accuracy
medium	~1.5 GB	Slower	Excellent	High-accuracy offline transcription
large	~2.9 GB	Slowest	Best	Maximum accuracy, multi-language

14-day free trial. No credit card required. All features included.