Open
Description
Hi guys,
Awesome library, really underrated! Is there a way to download kernels via hf download? So far, I’ve been relying on the kernels toolkit by running some Python code, but I think this library would integrate nicely with hf download.
Also, besides attn_implementation in transformers, are you aware of other ways to monkey-patch pre-trained Transformers models (e.g., Llama) to use fast CUDA kernels (e.g., for activations) from kernels?
Thanks!