feat: add /add-telegram-voice skill for local whisper.cpp transcription by vweaver · Pull Request #718 · qwibitai/nanoclaw

vweaver · 2026-03-05T00:32:38Z

Summary

Adds a new skill (/add-telegram-voice) that upgrades the Telegram channel with local voice message transcription using whisper.cpp. Follows the project's "skills over features" philosophy — no changes to core code.

- Channel-agnostic transcription module: transcribeAudio(Buffer): Promise<string | null>, usable by any channel
- Local whisper.cpp: no cloud API, no API key, no cost. Uses ffmpeg + whisper-cli on-device
- Depends on the telegram skill; conflicts with voice-transcription, whose src/transcription.ts API is Baileys-coupled rather than channel-agnostic
- Voice notes arrive as [Voice: <transcript>] instead of [Voice message] placeholders
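For illustration, a channel-agnostic module with this signature could look roughly like the sketch below. It is not the PR's actual src/transcription.ts: the temp-file layout, the WHISPER_MODEL environment variable, the default model path, and the helper names ffmpegArgs and whisperArgs are all assumptions. It also assumes ffmpeg and whisper-cli are on the PATH.

```typescript
import { execFile } from "node:child_process";
import { mkdtemp, rm, writeFile } from "node:fs/promises";
import { tmpdir } from "node:os";
import { join } from "node:path";
import { promisify } from "node:util";

const execFileAsync = promisify(execFile);

// Assumption: hypothetical model location; the skill's SKILL.md defines the real one.
const MODEL_PATH = process.env.WHISPER_MODEL ?? "models/ggml-base.en.bin";

// ffmpeg arguments: decode the input to 16 kHz mono 16-bit PCM WAV,
// which is the input format whisper.cpp works with.
export function ffmpegArgs(input: string, output: string): string[] {
  return ["-y", "-i", input, "-ar", "16000", "-ac", "1", "-c:a", "pcm_s16le", output];
}

// whisper-cli arguments: -nt omits timestamps, -np suppresses non-result prints.
export function whisperArgs(model: string, wav: string): string[] {
  return ["-m", model, "-f", wav, "-nt", "-np"];
}

// Buffer in, text out. Returns null on any failure so callers can fall back
// to a placeholder instead of crashing the channel.
export async function transcribeAudio(audio: Buffer): Promise<string | null> {
  const dir = await mkdtemp(join(tmpdir(), "voice-"));
  const input = join(dir, "input.ogg");
  const wav = join(dir, "input.wav");
  try {
    await writeFile(input, audio);
    await execFileAsync("ffmpeg", ffmpegArgs(input, wav));
    const { stdout } = await execFileAsync("whisper-cli", whisperArgs(MODEL_PATH, wav));
    const text = stdout.trim();
    return text.length > 0 ? text : null;
  } catch {
    // Missing binary, unreadable audio, missing model, and so on.
    return null;
  } finally {
    await rm(dir, { recursive: true, force: true });
  }
}
```

Because the function only takes a Buffer, any channel that can download audio bytes can call it without knowing about Telegram or WhatsApp message types.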

Skill contents

| File | Purpose |
| --- | --- |
| `SKILL.md` | Setup: install deps, download model, apply, verify |
| `manifest.yaml` | Metadata, deps, conflicts |
| `add/src/transcription.ts` | Channel-agnostic whisper.cpp module |
| `modify/src/channels/telegram.ts` | Async voice handler with download + transcribe |
| `modify/src/channels/telegram.test.ts` | 5 new voice tests + fetch/transcription mocks |
| `tests/telegram-voice.test.ts` | 15 skill package validation tests |
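The download half of the voice handler might be sketched as follows. This is an assumption about how it could be done with the raw Telegram Bot API (getFile, then a fetch of the returned file_path), not the code in modify/src/channels/telegram.ts. The names downloadVoice, getFileUrl, fileDownloadUrl, and FetchLike are hypothetical. The fetch implementation is injected, which also makes the function easy to mock in tests.

```typescript
// Only the fields of the Bot API getFile response that this sketch reads.
interface GetFileResponse {
  ok: boolean;
  result?: { file_path?: string };
}

// Minimal structural type for a fetch-like function (hypothetical helper type).
type FetchLike = (url: string) => Promise<{
  ok: boolean;
  json(): Promise<unknown>;
  arrayBuffer(): Promise<ArrayBuffer>;
}>;

// URL of the Bot API getFile method for a given file_id.
export function getFileUrl(token: string, fileId: string): string {
  return `https://api.telegram.org/bot${token}/getFile?file_id=${encodeURIComponent(fileId)}`;
}

// URL from which the file contents can be downloaded once file_path is known.
export function fileDownloadUrl(token: string, filePath: string): string {
  return `https://api.telegram.org/file/bot${token}/${filePath}`;
}

// Resolves a voice note's file_id to its bytes, or null if any step fails.
export async function downloadVoice(
  token: string,
  fileId: string,
  fetchImpl: FetchLike,
): Promise<Buffer | null> {
  const metaRes = await fetchImpl(getFileUrl(token, fileId));
  if (!metaRes.ok) return null;

  const meta = (await metaRes.json()) as GetFileResponse;
  const filePath = meta.result?.file_path;
  if (!meta.ok || !filePath) return null;

  const fileRes = await fetchImpl(fileDownloadUrl(token, filePath));
  if (!fileRes.ok) return null;

  return Buffer.from(await fileRes.arrayBuffer());
}
```

In a running bot the global fetch could be passed as fetchImpl, and the resulting Buffer handed to transcribeAudio.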

Built and tested on Linux (Ubuntu) with whisper.cpp built from source (-DBUILD_SHARED_LIBS=OFF). SKILL.md includes Linux-specific build instructions and the static linking gotcha.

Test plan

- 15 skill package tests pass (manifest, file presence, intent files, API shape, no Baileys deps)
- 54 Telegram channel tests pass (including 5 new voice transcription tests)
- Full suite passes (374 tests)
- Verified end-to-end: voice note in Telegram -> [Voice: Can you hear me now?]
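The message format checked in the end-to-end step can be expressed as a small pure function. This is a sketch: the function name formatVoiceContent is hypothetical, and falling back to the [Voice message] placeholder when transcription returns null or empty text is an assumption based on the placeholder mentioned in the summary.

```typescript
// Wraps a transcript in the [Voice: ...] form described in this PR.
// Assumption: a null or blank transcript falls back to the old placeholder.
export function formatVoiceContent(transcript: string | null): string {
  const text = transcript?.trim();
  return text ? `[Voice: ${text}]` : "[Voice message]";
}
```

With this shape, a failed transcription degrades to the previous behavior instead of dropping the message.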

🤖 Generated with Claude Code

Adds a skill that upgrades the Telegram channel with local voice message transcription using whisper.cpp. Voice notes arrive as [Voice: <transcript>] instead of placeholders. No cloud API, no cost; runs entirely on-device.

- Channel-agnostic transcription module (Buffer in, text out)
- Depends on telegram skill, conflicts with voice-transcription (different API)
- 15 skill package tests, 5 new voice transcription integration tests

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

TomGranot · 2026-03-05T09:05:32Z

There's already a use-local-whisper skill that landed in #702. How does this differ? If it's the same thing, we should close this one.

vweaver · 2026-03-05T13:50:55Z

They solve different problems: use-local-whisper swaps the OpenAI backend for whisper.cpp on WhatsApp, while this PR adds voice transcription to Telegram, which has no voice support today.

That said, use-local-whisper already notes that Telegram just needs audio download logic added. Rather than a separate skill, I could rework this as an update to use-local-whisper that:

- Adds a channel-agnostic transcribeAudio(Buffer) export to src/transcription.ts
- Keeps the existing transcribeAudioMessage(WAMessage, WASocket) as a wrapper so WhatsApp isn't broken
- Adds the Telegram voice handler and tests
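One way the proposed wrapper could be structured is sketched below. WAMessageLike and WASocketLike are hypothetical stand-ins for the Baileys types, and the download step is injected rather than calling Baileys directly, so none of this should be read as the existing use-local-whisper code.

```typescript
// Hypothetical stand-ins for Baileys' WAMessage and WASocket types.
interface WAMessageLike {
  key: { id?: string | null };
}
type WASocketLike = unknown;

// The channel-agnostic core: audio bytes in, transcript (or null) out.
type Transcriber = (audio: Buffer) => Promise<string | null>;

// WhatsApp-specific step that turns a message into audio bytes (assumed helper).
type Downloader = (msg: WAMessageLike, sock: WASocketLike) => Promise<Buffer | null>;

// Builds a transcribeAudioMessage wrapper that keeps the WhatsApp-facing
// signature while delegating the actual work to transcribeAudio(Buffer).
export function makeTranscribeAudioMessage(download: Downloader, transcribeAudio: Transcriber) {
  return async function transcribeAudioMessage(
    msg: WAMessageLike,
    sock: WASocketLike,
  ): Promise<string | null> {
    const audio = await download(msg, sock);
    if (!audio || audio.length === 0) return null;
    return transcribeAudio(audio);
  };
}
```

The point of this split is that the WhatsApp call sites keep working unchanged, while Telegram (or any other channel) calls transcribeAudio directly with the bytes it downloaded.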

Would you prefer that approach, or is there a different way you'd like to see it structured?

vweaver · 2026-03-05T18:21:30Z

Closing in favor of a new PR that adds Telegram support to the existing use-local-whisper skill instead of creating a separate skill.

github-actions Bot mentioned this pull request Mar 5, 2026

🦞 OpenClaw Ecosystem Daily 2026-03-05 rollysys/agents-radar#38

Open

Andy-NanoClaw-AI added the "PR: Skill" (Skill package or skill-related changes) and "Status: Needs Review" (Ready for maintainer review) labels Mar 5, 2026

Merge branch 'main' into skill/add-telegram-voice

cc09d9c

vweaver closed this Mar 5, 2026

vweaver mentioned this pull request Mar 5, 2026

feat: add Telegram voice transcription to use-local-whisper skill #741

Open

4 tasks

This was referenced Mar 6, 2026

🦞 OpenClaw Ecosystem Daily 2026-03-06 duanyytop/agents-radar#83

Open

🦞 OpenClaw Ecosystem Daily 2026-03-06 rollysys/agents-radar#43

Open

vweaver wants to merge 2 commits into qwibitai:main from vweaver:skill/add-telegram-voice

3 participants
