You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add debug.md, apply EP hang fixes (1-3, 6), add pre/post-attention NaN trace with first-occurrence print-based ERROR output and padding/actual row distinction, fix attention output buffer init in both shared and model-specific layers, add DP+EP example...
#21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.