Add Inworld Realtime Service by cshape · Pull Request #4140 · pipecat-ai/pipecat

cshape · 2026-03-25T15:36:02Z

Add Inworld's Realtime Speech to Speech API as a Pipecat service.

codecov · 2026-03-25T21:53:44Z

Codecov Report

❌ Patch coverage is 49.81504% with 407 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/pipecat/services/inworld/realtime/llm.py	30.67%	348 Missing ⚠️
...ecat/adapters/services/inworld_realtime_adapter.py	37.80%	51 Missing ⚠️
src/pipecat/services/inworld/realtime/events.py	96.47%	8 Missing ⚠️

Files with missing lines	Coverage Δ
src/pipecat/services/inworld/realtime/events.py	`96.47% <96.47%> (ø)`
...ecat/adapters/services/inworld_realtime_adapter.py	`37.80% <37.80%> (ø)`
src/pipecat/services/inworld/realtime/llm.py	`30.67% <30.67%> (ø)`

... and 5 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

filipi87 · 2026-04-08T13:27:14Z

Hi @cshape, when trying to run the example examples/foundational/19zb-inworld-realtime.py, I’m encountering the following error:

2026-04-08 10:22:58.909 | DEBUG    | pipecat.services.inworld.realtime.llm:_send_user_audio:1017 - Streaming audio to Inworld: 24000Hz, 1ch, 960B/frame
2026-04-08 10:22:59.123 | ERROR    | pipecat.processors.frame_processor:push_error_frame:744 - InworldRealtimeLLMService#0 error: Inworld Realtime Error: Failed to read content stream. INTERNAL: HTTP error: 400
2026-04-08 10:22:59.124 | WARNING  | pipecat.pipeline.task:_source_push_frame:903 - PipelineTask#0: Something went wrong: ErrorFrame#0(error: Inworld Realtime Error: Failed to read content stream. INTERNAL: HTTP error: 400, fatal: False)

Could you double check whether everything is working as expected ?

filipi87

Thanks for pushing this PR, @cshape.

Overall, it looks good. I’ve left a few comments that we should address.

I noticed that we’re reusing a lot of code from OpenAIRealtime. But the same pattern exists with XAI (Grok), so once this lands, I’ll work on a broader refactor to introduce a shared base class that all three can extend.

For now, it’s probably best to handle that in a follow-up so we don’t block this PR.

Let me know once you’ve addressed the comments so I can review it again 👍.

Adds a WebSocket-based realtime service for Inworld's cascade STT/LLM/TTS API with semantic VAD, function calling, and streaming transcription support. New files: - src/pipecat/services/inworld/realtime/ (service, events) - src/pipecat/adapters/services/inworld_realtime_adapter.py - examples/foundational/19zb-inworld-realtime.py Also includes: - websockets dependency for inworld extra in pyproject.toml - Adapter and settings tests matching OpenAI/Grok realtime patterns - Fix for double-response when server-side VAD is enabled

Adopt _resolve_system_instruction() from BaseLLMAdapter, matching the pattern applied to OpenAI Realtime, Grok Realtime, Gemini Live, and Nova Sonic in the pk/realtime-services-init-v-context-system-instructions-cleanup branch.

- Change default model from gpt-4.1-nano to gpt-4.1-mini - Add function calling demo to example - Remove demo-testing artifact from system instruction - Mention Router support in changelog

- Move example to examples/realtime/realtime-inworld.py - Change initial context role from "user" to "developer" - Remove explicit sample rates from example; sync them in _ensure_audio_config so Inworld gets the transport's actual rates - Add audio race condition guard in _handle_evt_audio_delta (matches OpenAI realtime pattern) - Convert remaining "system"/"developer" messages to "user" in adapter - Add clarifying comment for local-VAD vs server-VAD metrics paths

- Remove function calling from example, switch model to xai/grok-4-1-fast-non-reasoning - Add pipecat-realtime session key prefix and provider_data metadata for Inworld traffic attribution - Remove local VAD code path (Inworld only supports server-side VAD) - Use typed InputAudioBufferAppendEvent for audio sends

- Remove non-functional AdapterType.SHIM custom tools code from adapter - Default STT model to assemblyai/u3-rt-pro - Default VAD eagerness to low

filipi87

LGTM! Thank you for putting this together, @cshape 🚀

cshape force-pushed the cale/inworld-realtime branch from c1911c0 to 3ab7e18 Compare March 25, 2026 21:46

oleksa-source approved these changes Mar 27, 2026

View reviewed changes

cshape force-pushed the cale/inworld-realtime branch from 03d1054 to 3960c79 Compare March 27, 2026 19:19

filipi87 self-requested a review April 8, 2026 12:14