Conversation

@mnoukhov mnoukhov commented Jun 1, 2023

Fixes #368, #386, #297 and partially #327.
Correctly merge weights with PEFT's merge_and_unload.
If merging a reward model, load the base model as a sequence classification model.

Note that trl-lib/llama-7b-se-rm-peft does not contain the score weight, so I will release my own pretrained reward model adapter weights.

@mnoukhov mnoukhov mentioned this pull request Jun 1, 2023
HuggingFaceDocBuilderDev commented Jun 2, 2023

The documentation is not available anymore as the PR was closed or merged.

@younesbelkada younesbelkada left a comment


Amazing work @mnoukhov!
Could you run the styling checks?

make style && make quality

After that we should be good for merging
Thanks!


mnoukhov commented Jun 4, 2023

Done!

@mnoukhov mnoukhov requested a review from younesbelkada June 4, 2023 03:22

@younesbelkada younesbelkada left a comment


Thanks a lot!

@younesbelkada younesbelkada requested a review from lvwerra June 5, 2023 08:16
@younesbelkada younesbelkada merged commit 0ddf9f6 into huggingface:main Jun 5, 2023
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
* correctly merge stackllama models

  correctly merge weights with peft's merge_and_unload;
  load sequence classification model for reward models

* style, black line length 119

* flake8


Development

Successfully merging this pull request may close these issues.

Llama Reward Model is incorrectly merged

4 participants