Fix a bug where long stretches of meaningless audio are sometimes generated #2653
Added logic to handle bad tokens and adjust logits based on repeated tokens during decoding.
Added logic to handle bad tokens and prevent repetition in token generation.
Nice suggestion, I'll test it when I have time!
Could you provide some cases that reproduce the same token being repeated over and over reasonably reliably?
It throws a lot of errors and still needs changes.
I'm all ears. Hmm, "拉布布" sounds a bit mysterious!
Fixed it.
@jsntcheng How were the bad tokens in bad_tokens_list = [809, 207,411,679,676,25,23,7] obtained?
How were the values 0.618, 1.414, and 35 chosen?
@RVC-Boss This patch has been running on our side ever since, and it is working well 😄
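In production we found that some models, possibly because of the training material or the reference audio, occasionally infer long stretches of meaningless audio. Investigation traced the problem to the AR decode stage, so I optimized it to reduce the cases where the same token keeps being emitted. For some "malicious" tokens (a clumsy name, admittedly), the previous change still does not fix things: patterns such as a,a,a,a,b,b,a,a,a,a show up, so those tokens are simply removed outright. In testing nothing looked wrong, quality did not drop, and stability improved visibly. For a human-computer interaction product, long stretches of noise like that are really unacceptable; this is not something you can solve by just re-rolling the generation.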
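For readers who want a concrete picture of the two ideas described above (banning known bad tokens, and discouraging a token that keeps repeating), here is a minimal pure-Python sketch. It is not the code from this PR: the function names, the `max_run` and `penalty` parameters, and the vocabulary size of 1024 are all hypothetical, and the real decoder works on tensors rather than lists. Only the token ids come from the thread, and how they were chosen is not stated there.

```python
import math
from typing import List, Sequence

# Token ids quoted in the review thread; their derivation is not stated there.
BAD_TOKENS = [809, 207, 411, 679, 676, 25, 23, 7]


def trailing_run_length(history: Sequence[int]) -> int:
    """Count how many times the last token repeats at the end of history."""
    if not history:
        return 0
    last = history[-1]
    run = 0
    for tok in reversed(history):
        if tok != last:
            break
        run += 1
    return run


def adjust_logits(
    logits: Sequence[float],
    history: Sequence[int],
    bad_tokens: Sequence[int] = BAD_TOKENS,
    max_run: int = 3,      # hypothetical threshold, not a value from the PR
    penalty: float = 2.0,  # hypothetical penalty step, not a value from the PR
) -> List[float]:
    """Return a copy of logits with bad tokens banned and long runs penalized."""
    out = list(logits)

    # 1) Ban bad tokens outright so they can never be sampled.
    for tok in bad_tokens:
        if 0 <= tok < len(out):
            out[tok] = -math.inf

    # 2) If the last token has repeated max_run times or more, subtract a
    #    penalty that grows with the length of the run.
    run = trailing_run_length(history)
    if run >= max_run:
        last = history[-1]
        if 0 <= last < len(out) and out[last] != -math.inf:
            out[last] -= penalty * (run - max_run + 1)

    return out


# Usage with a hypothetical vocabulary of 1024 tokens.
logits = [0.0] * 1024
logits[42] = 5.0  # the token the model keeps wanting to repeat
logits[7] = 9.0   # a bad token with a high raw score
adjusted = adjust_logits(logits, history=[42, 42, 42, 42])
```

The penalty is subtracted rather than multiplied so that it lowers the score regardless of the logit's sign. A hard ban is used for the bad-token list because, as described above, a soft penalty alone did not stop the a,a,a,a,b,b,a,a,a,a pattern.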