Add KaSA implementation to layer.py #2543
Open
cc @BenjaminBossan
I've added the KaSA implementation to `layer.py`, based on the existing KaSA code. This is a very early draft of the PR, and I may have misunderstood something, so a lot may need to be revised or added.
I'm not entirely sure what this sentence from #2516 means exactly: "For the PEFT integration, the new LoRA variant pattern introduced in #2443 should be leveraged (i.e. use a similar approach to extend vanilla LoRA as is used for DoRA)". If you could provide some guidance on this part, it would be really helpful.
I'm open to any kind of feedback!