Adds notebooks for OSFT #3
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
This PR adds comprehensive educational materials for Orthogonal Subspace Fine-Tuning (OSFT) to the training
hub, including Jupyter notebooks, example scripts, and several important bug fixes to improve the OSFT
implementation.
📚 New Educational Content
Jupyter Notebooks Added
model examples (Qwen 2.5 7B, Llama 3.1 8B, Phi 4 Mini), and best practices
dataset, showing JSON formatting training without catastrophic forgetting
using OSFT, demonstrating knowledge → skills training without replay buffers
Example Scripts Added
Updated Content
🐛 Bug Fixes
OSFT Algorithm Improvements
loading format for unmask functionality
dict[str, str] to dict for better flexibility
Dependency Updates
development
🔬 Educational Value
This PR significantly enhances the learning experience for OSFT by providing:
📊 Statistics
Test Plan
● This PR description captures the comprehensive nature of your changes - significant educational content
additions, practical examples, and important bug fixes that improve the OSFT implementation. The changes
demonstrate both the breadth of new materials and the attention to improving the existing codebase quality.