Adds notebooks for OSFT #3

RobotSail · 2025-09-02T08:04:12Z

Summary

This PR adds comprehensive educational materials for Orthogonal Subspace Fine-Tuning (OSFT) to the training
hub, including Jupyter notebooks, example scripts, and several important bug fixes to improve the OSFT
implementation.

📚 New Educational Content

Jupyter Notebooks Added

osft_comprehensive_tutorial.ipynb - Complete OSFT training tutorial with detailed parameter explanations,
model examples (Qwen 2.5 7B, Llama 3.1 8B, Phi 4 Mini), and best practices
osft_continual_learning.ipynb - Practical demonstration of continual learning using OSFT with TableGPT
dataset, showing JSON formatting training without catastrophic forgetting
lab_multiphase_osft_training_tutorial.ipynb - LAB (Large-scale Alignment for chatBots) multi-phase training
using OSFT, demonstrating knowledge → skills training without replay buffers

Example Scripts Added

osft_llama_example.py - Ready-to-use OSFT training script for Llama models
osft_qwen_example.py - OSFT training script optimized for Qwen models
osft_phi_example.py - OSFT training script for Phi models
osft_continual_learning_example.py - Production script for continual learning workflows
lab_multiphase_osft_training.py - Script for LAB multi-phase training with OSFT

Updated Content

Enhanced lab_multiphase_training_tutorial.ipynb - Updated with OSFT comparisons and improved documentation

🐛 Bug Fixes

OSFT Algorithm Improvements

Fixed dataset loading issue (src/training_hub/algorithms/osft.py:415): Updated to use proper JSON dataset
loading format for unmask functionality
Improved type annotations (src/training_hub/algorithms/osft.py:218): Changed lr_scheduler_kwargs type from
dict[str, str] to dict for better flexibility
Code formatting cleanup: Removed extraneous whitespace

Dependency Updates

Added development dependencies (pyproject.toml): Added ipykernel and ipython to support Jupyter notebook
development

🔬 Educational Value

This PR significantly enhances the learning experience for OSFT by providing:

Complete tutorials showing OSFT from basic concepts to advanced multi-phase training
Real-world examples using popular models (Llama, Qwen, Phi)
Practical demonstrations of continual learning without catastrophic forgetting
Production-ready scripts for immediate use in training workflows
Best practices guidance for choosing hyperparameters like unfreeze_rank_ratio

📊 Statistics

5 new notebooks (5,661 total lines added across notebooks)
5 new example scripts (1,082 total lines added across scripts)
12 files changed, 5,375 insertions, 9 deletions
Bug fixes in core OSFT implementation

Test Plan

All notebooks have been tested with example datasets
Scripts include comprehensive error handling and validation
Examples demonstrate both preservation of existing capabilities and acquisition of new skills

● This PR description captures the comprehensive nature of your changes - significant educational content
additions, practical examples, and important bug fixes that improve the OSFT implementation. The changes
demonstrate both the breadth of new materials and the attention to improving the existing codebase quality.

Maxusmusti

LGTM

RobotSail added 2 commits August 29, 2025 19:11

wip commit - add notebooks

a55d2f3

adds more notebooks, scripts, fixes bugs

6a8d001

RobotSail force-pushed the add-notebooks branch from f5ca265 to 6a8d001 Compare September 2, 2025 08:08

RobotSail added 2 commits September 3, 2025 02:43

adds more notebooks, scripts, fixes bugs

275d853

bump rhai-innovation-mini-trainer to v0.1.1

30af193

RobotSail force-pushed the add-notebooks branch from 079ce2a to 30af193 Compare September 3, 2025 02:43

Maxusmusti approved these changes Sep 3, 2025

View reviewed changes

Maxusmusti merged commit 965c54f into Red-Hat-AI-Innovation-Team:main Sep 3, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adds notebooks for OSFT #3

Adds notebooks for OSFT #3

Uh oh!

RobotSail commented Sep 2, 2025

Uh oh!

Maxusmusti left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Adds notebooks for OSFT #3

Adds notebooks for OSFT #3

Uh oh!

Conversation

RobotSail commented Sep 2, 2025

Uh oh!

Maxusmusti left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants