Added support for quantization in vLLM backend #690
+8
−0
The arguments `quantization` and `load_format` are missing from the vLLM configuration, which made applying quantization a headache, especially since the base class rejects any hyperparameters it does not define. With this tweak I got a bitsandbytes model up and running where it used to fail before. Both arguments default to `None`, so existing behaviour is unchanged; it is a drop-in addition.