
Conversation

@dmitry-gorokhov commented May 22, 2024

Details:

  • Improved LLM latency for models with BF16 weights (by extending the compressed inner-product (IP) kernel to the BF16 data type)
  • Improved compilation time for models with BF16 weights on AVX2 systems (by enabling the JIT reorder for the BF16 data type)

OneDNN fork PR: openvinotoolkit/oneDNN#250

Tickets:

@dmitry-gorokhov self-assigned this May 22, 2024
@dmitry-gorokhov requested review from a team as code owners May 22, 2024 13:45
@github-actions bot added the category: CPU (OpenVINO CPU plugin) label May 22, 2024
@dmitry-gorokhov (Author) commented:

@usstq could you please review?

@dmitry-gorokhov added this to the 2024.3 milestone Jun 3, 2024
@usstq (Contributor) left a comment:


LGTM, except one comment here:
openvinotoolkit/oneDNN#250 (comment)

@slyalin (Contributor) commented Jun 19, 2024

Are we going to merge it for the next release?

@dmitry-gorokhov force-pushed the feature/bf16_weights_compression branch from 55908c4 to ea73407 on June 26, 2024 07:37
@maxnick added this pull request to the merge queue Jul 1, 2024
Merged via the queue into openvinotoolkit:master with commit 1d7daae Jul 1, 2024
@maxnick deleted the feature/bf16_weights_compression branch July 1, 2024 15:00
4 participants