-
Notifications
You must be signed in to change notification settings - Fork 536
QNN: wav2letter doesn't export #7634
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Is there any error message? |
Updated with stacktrace. cc: @cccclai |
Hi @guangy10, |
@winskuo-quic If you think the model is working fine, should we just enable it in the CI, the setup there can be the source of truth for future reference. Here is the QNN models we are currently running on CI: https://github.com/pytorch/executorch/blob/main/.github/workflows/trunk.yml#L305-L329 Can you add wav2letter to it? The CI only need to test it in |
@guangy10 Could you please share where I can place my model's weight file? I can't use |
Can we use random weights instead, for compile_only, since the model is not calibrated nor evaluating the accuracy. In the CI the goal is to just ensure it's functioning end-to-end with low latency. Users can always rerun these models locally by disabling the compile_only so that they will get the high quality model easily. wdyt @winskuo-quic |
Random weights possibly have a different latency number than the actual weights. If it will be used on dashboard, it's probably better to use real weights |
@cccclai Curious how different the numbers are—can you share them? Or are you assuming a hypothetical scenario? If the weights are public, one solution is to cache the weights in the docker image for qnn. It can avoid redownloading the weights again and again. And we already have that docker. |
I've observed a 10-15% perf difference for a model we plan to ship to production. It took me a while to debug the perf gap and it turns out that the difference is random weight/real weight |
@guangy10, @cccclai, |
cc @kirklandsign and @huydhn for this question |
Save weight in s3 maybe? Docker is really for env set up right? |
🐛 Describe the bug
fails to export. See stacktrace:
where states_fused.pth is downloaded from https://github.com/nipponjo/wav2letter-ctc-pytorch/tree/main?tab=readme-ov-file#wav2letter-ctc-pytorch
cc: @cccclai @cbilgin
Versions
latest main
cc @cccclai @winskuo-quic @shewu-quic
The text was updated successfully, but these errors were encountered: