Highlights:
- FSDP2 compatibility for Params4bit (#1719)
- Bugfix for 4bit quantization with large block sizes (#1721)
- Further removal of previously deprecated code (#1669)
- Improved CPU coverage (#1628)
- Include NVIDIA Volta support in CUDA 12.8 and 12.9 builds (#1715)
What's Changed
- Enable CPU/XPU native and ipex path by @jiqing-feng in #1628
- Fix CI regression by @matthewdouglas in #1666
- Add CPU + IPEX to nightly CI by @matthewdouglas in #1667
- Fix params4bit passing bnb quantized by @mklabunde in #1665
- Deprecation cleanup by @matthewdouglas in #1669
- CI workflow: bump torch 2.7.0 to 2.7.1 by @matthewdouglas in #1670
- Improvement for torch.compile support on Params4bit by @matthewdouglas in #1673
- Fixed a bug in test_fw_bit_quant testing on CPU by @Egor-Krivov in #1675
- doc fix signature for 8-bit optim by @ved1beta in #1660
- Apply clang-format rules by @matthewdouglas in #1678
- Add clang-format by @matthewdouglas in #1677
- HPU (Intel gaudi) support for bnb unit tests by @ckvermaAI in #1680
- CI: Setup HPU nightly tests by @matthewdouglas in #1681
- Update test_kbit_backprop unit test by @ckvermaAI in #1682
- Update README.md by @matthewdouglas in #1684
- Enable ROCm backend with custom ops integration by @pnunna93 in #1683
- Fix AdamW documentation by @agupta2304 in #1686
- Make minor improvements to optimizer.py by @agupta2304 in #1687
- Add CUDA 12.9 build by @matthewdouglas in #1689
- CI: Test with PyTorch 2.8.0 RC by @matthewdouglas in #1693
- Automatically call CMake as part of PEP 517 build by @mgorny in #1512
- fix log by @jiqing-feng in #1697
- [XPU] Add inference benchmark for XPU by @Egor-Krivov in #1696
- Add kernel registration for 8bit and 32bit optimizers by @Egor-Krivov in #1706
- Create FUNDING.yml by @matthewdouglas in #1714
- Add Volta support in cu128/cu129 builds by @matthewdouglas in #1715
- Fix Params4bit tensor subclass handling by @ved1beta in #1719
- [CUDA] Fixing quantization uint8 packing bug for NF4 and FP4 by @Mhmd-Hisham in #1721
New Contributors
- @mklabunde made their first contribution in #1665
- @agupta2304 made their first contribution in #1686
- @mgorny made their first contribution in #1512
- @Mhmd-Hisham made their first contribution in #1721
Full Changelog: 0.46.0...0.47.0