Commit f927192
fix(audio-encoder): apply upstream PR QwenLM#103 block-diagonal mask for non-FA2 backends
SDPA/eager attention ignored cu_seqlens and ran full global self-attention
over the whole batch, degrading transcription quality. Build the
block-diagonal chunk mask via _prepare_attention_mask and pass it to the
encoder layers; FA2 path is unchanged (mask is None).
Co-authored-by: Cursor <cursoragent@cursor.com>1 parent c17a131 commit f927192
1 file changed
Lines changed: 8 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
724 | 724 | | |
725 | 725 | | |
726 | 726 | | |
| 727 | + | |
| 728 | + | |
| 729 | + | |
| 730 | + | |
| 731 | + | |
| 732 | + | |
| 733 | + | |
727 | 734 | | |
728 | 735 | | |
729 | 736 | | |
730 | 737 | | |
731 | 738 | | |
| 739 | + | |
732 | 740 | | |
733 | 741 | | |
734 | 742 | | |
| |||
0 commit comments