
2.57 MB
Android 7.0+
63
x86, x86_64, arm64-v8a, armeabi-v7a
Verified safeScanned with ClamAV, APKiD, and Quark-Engine. No threats detected.
What's New
- Detects when Whisper returns the vocabulary list as a "transcription" of empty recordings, and drops it
- Catches more silence artifacts: subtitle credits with "Корректор/Редактор", thanks lines in German/French
- Stops eating real one-word dictation: short "Привет" / "Спасибо" / "Хорошо" pass through correctly
- Three-name dictations like "Альфия, Алсу, Рустем" are preserved while seven-name hallucinations are not
Description
Android keyboard (IME) for speech-to-text. Sends audio to any OpenAI-compatible Whisper endpoint of your choice — by default Groq (free tier available), can be pointed at OpenAI or any other compatible provider. Optional LLM post-processing.
Features
Voice Input
- Real-time voice recording with amplitude visualization
- Processing queue — start a new recording immediately, previous ones transcribe in the background
- Works with any OpenAI-compatible Whisper API; ships with Groq as the default endpoint (whisper-large-v3-turbo)
- Configurable API endpoint, model, and language
- Auto-start recording when keyboard opens
- Custom vocabulary — add names and technical terms to bias recognition; helps with contacts and rare words
- Drop period for single-word output — handy for voice search where a trailing period gets in the way
Post-Processing
- Fix errors — corrects punctuation, spelling, removes filler words (um, uh)
- Shorten — makes text concise while keeping key points
- Emoji — adds relevant emoji to your messages
- Rhyme — rewrites dictated text as poetry
- Translate — translates to any of the supported languages
- Supports OpenAI and Claude as processing providers
- Customizable prompts and temperature for each mode
Keyboard
- Send button (paper plane) — sends Ctrl+Enter for quick message sending in messengers
- Accelerating backspace — hold to delete slowly at first, then faster
- Clipboard bar stays visible after paste for repeated pasting
- Graceful shutdown — if keyboard hides during recording, audio is finalized and transcribed to clipboard
General
- 17 interface and transcription languages
- Light, Dark, and Auto themes
- Long-press spacebar to switch keyboard
- Built-in test recording in settings
- App logs and crash reports
- Auto-update from GitHub Releases
Setup
1. Install the APK
2. Go to Settings → System → Languages & input → On-screen keyboard
3. Enable "Voice Keyboard"
4. Open the app and enter a Whisper API key. The default endpoint is Groq — get a free key at console.groq.com/keys. You can also point the app at OpenAI's Whisper endpoint or any other OpenAI-compatible provider in the same screen.
5. (Optional) Configure post-processing with your OpenAI or Claude API key
Privacy
No analytics, no telemetry, no advertising. Audio is sent only to the transcription provider you configure, using your own API key.
License
MIT
Rate this app