-
Notifications
You must be signed in to change notification settings - Fork 15.2k
[FEATURE]: Speech-to-Text Voice Input for Lazy People in OpenCode #4695
Copy link
Copy link
Labels
discussionUsed for feature requests, proposals, ideas, etc. Open discussionUsed for feature requests, proposals, ideas, etc. Open discussion
Description
Feature hasn't been suggested before.
- I have verified this feature I'm about to request hasn't been suggested before.
Describe the enhancement you want to request
Hi! First of all, congratulations on the amazing project.
I've been working on a Speech-to-Text voice input feature that integrates directly into the TUI. It allows users to start audio recording with a keybind, automatically transcribe speech using different providers, and insert the resulting text directly into the prompt.
I've built an initial working version, currently tested only on macOS, and the system includes:
- Real-time audio recording via FFmpeg;
- Support for Groq Whisper, OpenAI Whisper, and local whisper.cpp;
- Automatic microphone/device detection;
- Interactive menus for choosing provider, model, and audio device;
- Persistent configuration stored in ~/.opencode/state/speech.json;
- Customizable keybinds (Ctrl+X v, Ctrl+X P, Ctrl+X D);
- Smooth flow: record → transcribe → insert into prompt input;
Would this be something you'd be interested in integrating into the project?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
discussionUsed for feature requests, proposals, ideas, etc. Open discussionUsed for feature requests, proposals, ideas, etc. Open discussion