Skip to content

Stacked cache mixtral. #155

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 51 commits into from
Jul 20, 2024
Merged

Stacked cache mixtral. #155

merged 51 commits into from
Jul 20, 2024

Conversation

wang2yn84
Copy link
Collaborator

No description provided.

wang2yn84 added 30 commits July 20, 2024 02:52
… ring buffer support then fix the mask. Int8 updates also included but not tested.
…im for quantization from 1,3 to -3,-1 to make it more robust;
@wang2yn84 wang2yn84 requested review from qihqi, lsy323 and FanhaiLu1 July 20, 2024 03:00
@qihqi qihqi merged commit 2f77223 into mlperf-mixtral Jul 20, 2024
4 checks passed
@qihqi qihqi deleted the stacked-cache-mixtral branch July 20, 2024 03:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants