Bug: can't finetune #7643
Comments
I'm facing a very similar problem. Here is what I'm trying to do; it's almost a copy-paste from the original README for finetune:
and then the process exits. The model I'm trying to fine-tune is this: https://huggingface.co/SanctumAI/Meta-Llama-3-8B-Instruct-GGUF (version q6_k). |
The same issue occurs with a few other models (phi, mistral).
|
Find the first commit that stops working |
It works with this commit: |
Does it work with |
I don't think the -nkvo parameter is present in finetune. However, I recompiled everything, forcing bool no_kv_offload = true; in common.h, but it still doesn't work. |
Any updates on this issue? I'm facing the same problem unfortunately. |
UPDATED: Actually, flash_attention is related to this issue. See below comments. |
I have the same issue on Linux. Llama3-finetuned models always get this error, but prediction( Only fine-tuning |
Find the first commit that causes |
I'm git-bisecting this. Quite hard to find.
Latest commits(
When I re-build e84b71c and run it, training works. But, I'm not sure if it works properly because FA-related commits were merged frequently.

cd llama.cpp
git checkout d48c88cbd563b6cf0ce972e2f56796896e240736^
rm -rf build
cmake -B build -DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS
cmake --build build --config Debug -j
build/bin/finetune \
    --model-base $model \
    --train-data shakespeare.txt \
    --lora-out lora.gguf \
    --seed 1

@ggerganov Check out d48c88c |
With
Without
|
Related to #7523 |
I think we should try with small base models and scale up to those that cause problems. |
Did you find anything? The problem is only with llama3 as it seems... |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
What happened?
GGML_ASSERT: D:\a\llama.cpp\llama.cpp\ggml.c:12853: ne2 == ne02
Name and Version
What operating system are you seeing the problem on?
Windows
Relevant log output