-
Hello, 1st of all thanks for this great software which just works for most use cases. I bring an edge case in the hopes of helping extend the frontier further, namely, older GPUs. in summary, koboldcpp detects the card AND the available memory correctly (with a bit of a delay though, after it already output "no gpu detected" which is curious) but when I click launch, the compilation fails claiming the "type half can only be used directly When the cl_khr_fp16 extension is enabled (unsupported)" or something similar to that (I'm paraphrasing) A few potentially important details: -- I tried all the "older cpus" options that shows the correct GPU (Pitcairn) below it, basically the openblast ones, it can even autoset the gpulayers after i select the model; -- i tried --usecublas running from the cli directly, but that delay I mentioned doesn´t seem to allow enough time for detection when running this way, because it fails before getting to the compilation phase; -- I'm running Archlinux, and those libraries are installed from the AUR repo, that is, manually installed and this was quite some time ago, however, I did update the rest of the system and, most importantly, Blender works (a version that is known to work with this GPU), and I can see the cycles renderer outputting to the terminal it's progress on (several) kernels compilations and running them to produce a (faster than cpu) render. Which leads me to believe that it's just a matter of the opencl version not supporting that "half type", either by the hardware or by my installation of the libraries. Also, I can also send my own kernels to compile and run, although I forget now how to do it, I can potentially help dev/debug koboldcpp. Is there a distro more adequate, perhaps with the drivers pre-installed, to run koboldcpp on? which one is it developed in? Debian? I'll be happy to provide any further details to help solve this, thanks. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 24 replies
-
This looks like an error from clblast instead of cublas. Could you kindly share your terminal output? This will contain the launch parameters as well as any outputs or errors encountered. You can simply run it in terminal and copy/paste the output here. |
Beta Was this translation helpful? Give feedback.
Just use RADV, it is the only driver you can use and the best one available on Linux. Ignore the warning, it is only about being Vulkan-certified, for which some minor things may be missing, depending on the version, but which doesn't concern you at all.