Add CUDA compute capability compile guard #636

gau-nernst · 2024-08-08T12:29:22Z

Fixes #632

Tested on Google Colab

pytorch-bot · 2024-08-08T12:29:25Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/636

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 2f0aef4 with merge base e11201a ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

gau-nernst · 2024-08-08T12:32:01Z

torchao/csrc/cuda/fp6_llm/fp6_linear.cu

@@ -14,6 +14,8 @@
 // 
 // This file is adapted from https://github.com/usyd-fsalab/fp6_llm/blob/5df6737cca32f604e957e3f63f03ccc2e4d1df0d/fp6_llm/csrc/fp6_linear.cu

+#if !defined(__CUDA_ARCH__) || __CUDA_ARCH__ >= 800  // at least Ampere


For anyone wondering why this instead of defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 800, I'm following this.

One reason I can think of is for vscode intellisense to work. If I use #if defined(__CUDA_ARCH__), vscode will not do syntax highlighting. There are workarounds, like this, but it is more cumbersome.

As titled. On some devices `python` and `python3` are pointing to different environments so good to unify them.

* executable README * fix title of CI workflow * markup commands in markdown * extend the markup-markdown language * Automatically identify cuda from nvidia-smi in install-requirements (pytorch#606) * Automatically identify cuda from nvidia-smi in install-requirements * Update README.md --------- Co-authored-by: Michael Gschwind <[email protected]> * Unbreak zero-temperature sampling (pytorch#599) Fixes pytorch#581. * Improve process README * [retake] Add sentencepiece tokenizer (pytorch#626) * Add sentencepiece tokenizer Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Add white space Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Handle white space: Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Handle control ids Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * More cleanup Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Lint Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Use unique_ptr Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Use a larger runner Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Debug Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Debug Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * Cleanup * Update install_utils.sh to use python3 instead of python (pytorch#636) As titled. On some devices `python` and `python3` are pointing to different environments so good to unify them. * Fix quantization doc to specify dytpe limitation on a8w4dq (pytorch#629) Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Co-authored-by: Kimish Patel <[email protected]> * add desktop.json (pytorch#622) * add desktop.json * add fast * remove embedding * improvements * update readme from doc branch * tab/spc * fix errors in updown language * fix errors in updown language, and [skip]: begin/end * fix errors in updown language, and [skip]: begin/end * a storied run * stories run on readme instructions does not need HF token * increase timeout * check for hang un hf_login * executable README improvements * typo * typo --------- Co-authored-by: Ian Barber <[email protected]> Co-authored-by: Scott Wolchok <[email protected]> Co-authored-by: Mengwei Liu <[email protected]> Co-authored-by: Kimish Patel <[email protected]> Co-authored-by: Scott Roy <[email protected]>

add compile guard

759895c

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 8, 2024

gau-nernst commented Aug 8, 2024

View reviewed changes

Merge branch 'pytorch:main' into compile_guard

2f0aef4

msaroufim approved these changes Aug 8, 2024

View reviewed changes

msaroufim merged commit 32f421b into pytorch:main Aug 8, 2024
14 checks passed

gau-nernst deleted the compile_guard branch August 9, 2024 01:02

gau-nernst mentioned this pull request Oct 30, 2024

BF16 support for Quant-LLM kernel #1147

Merged

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

Update install_utils.sh to use python3 instead of python (pytorch#636)

fe75b16

As titled. On some devices `python` and `python3` are pointing to different environments so good to unify them.

psinger mentioned this pull request Jan 31, 2025

CUDA compile guard problem for marlin_qqq #1648

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CUDA compute capability compile guard #636

Add CUDA compute capability compile guard #636

gau-nernst commented Aug 8, 2024

pytorch-bot bot commented Aug 8, 2024 •

edited

Loading

gau-nernst Aug 8, 2024

Add CUDA compute capability compile guard #636

Add CUDA compute capability compile guard #636

Conversation

gau-nernst commented Aug 8, 2024

pytorch-bot bot commented Aug 8, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/636

✅ No Failures

gau-nernst Aug 8, 2024

Choose a reason for hiding this comment

pytorch-bot bot commented Aug 8, 2024 •

edited

Loading