@casinca (Contributor) commented Jan 14, 2026

As discussed, this fixes #947.

Edit: I also fixed a typo that was spread across several files: Reusuable → Reusable.

@rasbt (Owner) left a comment
LGTM, thanks!

@rasbt merged commit 9c4be47 into rasbt:main on Jan 14, 2026
13 of 15 checks passed
@casinca deleted the optional-weight-tying branch on January 14, 2026, 15:54
Linked issue: Weight tying for pretraining with Qwen3 or Llama3.2
