Closed
Description
is it possible to load the awq qunatized models into CPU memory instead of GPU memory ?
similar issue : #999
Metadata
Metadata
Assignees
Labels
No labels
is it possible to load the awq qunatized models into CPU memory instead of GPU memory ?
similar issue : #999