You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/backend/SYCL.md
+8Lines changed: 8 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -636,6 +636,14 @@ use 1 SYCL GPUs: [0] with Max compute units:512
636
636
637
637
It's same for other projects including llama.cpp SYCL backend.
638
638
639
+
- Meet issue: `Native API failed. Native API returns: -6 (PI_ERROR_OUT_OF_HOST_MEMORY) -6 (PI_ERROR_OUT_OF_HOST_MEMORY) -999 (UNKNOWN PI error)` or `failed to allocate SYCL0 buffer`
640
+
641
+
Device Memory is not enough.
642
+
643
+
|Reason|Solution|
644
+
|-|-|
645
+
|Default Context is too big. It leads to more memory usage.|Set `-c 8192` or smaller value.|
646
+
|Model is big and require more memory than device's.|Choose smaller quantized model, like Q5 -> Q4;<br>Use more than one devices to load model.|
639
647
640
648
### **GitHub contribution**:
641
649
Please add the **[SYCL]** prefix/tag in issues/PRs titles to help the SYCL-team check/address them without delay.
0 commit comments