Add auto_mode support for inworld plugin#3593
Conversation
f8a1593 to
00f555e
Compare
Sounds good! |
|
Hi @markbackman - Chatted with @ianbbqzy and this feature (basically a smart auto-flushing mechanism to get optimal latency and prosody from our TTS with websockets) is good to merge now from our perspective. It's currently slated to be rolled out on our side later this week, but will be safely ignored if the Pipecat SDK includes it before then. Our preference is to merge it into the Pipecat SDK ASAP so users can benefit as soon as our release goes out. Thanks! |
markbackman
left a comment
There was a problem hiding this comment.
Thanks for confirming, @cshape!
Can you please add a changelog to this PR and then it'll be ready to go:
https://github.com/pipecat-ai/pipecat/blob/main/CONTRIBUTING.md#changelog-entries
Codecov Report❌ Patch coverage is
... and 21 files with indirect coverage changes 🚀 New features to boost your workflow:
|
|
|
@markbackman sorry, I realized that this auto_mode could exacerbate the timestamps resetting issue. I will work on a fix and report back |
c248b00 to
e086a02
Compare
|
Implemented timestamp cumulation using |
e086a02 to
02dbc95
Compare
02dbc95 to
22398e1
Compare
|
Thanks for patiently educating me on this. This does sound like a nice improvement. In running the example, I see really solid TTFB performance now. Kudos! Before this is ready, just two small things:
|
No problem at all and thanks for your team's and your personal insights into Voice AI orchestration in general. There might still be some rough edges with the Inworld API, but we will keep improving it to truly show the potentials of our model.
Thanks! but we also know it's not there yet, we have an even bigger latency improvement regarding timestamps coming soon next week, stay tuned! |
Inworld auto_mode to improve first token latency.