Bug: missing option --vocab-type bpe in convert-hf-to-gguf.py #7912

Closed
gakugaku opened this issue Jun 13, 2024 · 3 comments

Labels
bug-unconfirmed, low severity (used to report low severity bugs in llama.cpp, e.g. cosmetic issues, non-critical UI glitches)

Comments

@gakugaku

What happened?

README:

https://github.com/ggerganov/llama.cpp/blob/f578b86b2123d0f92afbaa98a031df4d4464e582/README.md?plain=1#L625-L626

Actual Output:

$ python convert-hf-to-gguf.py ./mymodels/ --vocab-type bpe
usage: convert-hf-to-gguf.py [-h] [--vocab-only] [--awq-path AWQ_PATH] [--outfile OUTFILE] [--outtype {f32,f16,bf16,q8_0,auto}] [--bigendian] [--use-temp-file] [--no-lazy] [--model-name MODEL_NAME] [--verbose] model
convert-hf-to-gguf.py: error: unrecognized arguments: --vocab-type bpe

Name and Version

$ ./llama-cli --version
version: 3143 (f578b86)
built with cc (Ubuntu 9.4.0-1ubuntu1~20.04.2) 9.4.0 for x86_64-linux-gnu

What operating system are you seeing the problem on?

Linux

Relevant log output

No response

gakugaku added the bug-unconfirmed and low severity labels on Jun 13, 2024
@Galunid
Collaborator

Galunid commented Jun 13, 2024

Hi, that's not a bug. convert-hf-to-gguf.py automatically detects which vocab should be used based on the model, so there's no need for --vocab-type anymore.

I'll fix the readme.
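For reference, the working invocation simply drops the flag; the remaining options come straight from the usage string printed above, and the output path here is only a placeholder:

$ python convert-hf-to-gguf.py ./mymodels/ --outfile ./mymodels/model-f16.gguf --outtype f16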

@cmp-nct
Contributor

cmp-nct commented Jun 14, 2024

> Hi, that's not a bug. convert-hf-to-gguf.py automatically detects which vocab should be used based on the model, so there's no need for --vocab-type anymore.
>
> I'll fix the readme.

I'm struggling with the same issue while trying to convert the minicpm-2.5 model, which is dynamically generated during the pre-conversion process (similar to llava-surgery). Forcing the tokenizer/model type should still be available as an option.

In the same process I also got an error asking me to trust remote code during the from_pretrained() call in get_vocab_base.
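That error appears to come from the Hugging Face AutoTokenizer.from_pretrained() call that the converter's get_vocab_base makes. As a rough sketch (the path is a placeholder and this assumes transformers is installed), the same failure can be reproduced and worked around outside the converter by loading the tokenizer directly with trust_remote_code=True:

$ python -c "from transformers import AutoTokenizer; AutoTokenizer.from_pretrained('./mymodels/', trust_remote_code=True)"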

@Galunid
Collaborator

Galunid commented Jun 15, 2024

@cmp-nct Yup, convert-hf-to-gguf-update.py is a bit of a pain for models with new tokenizers. I replied in more detail in #7599, but you can still use examples/convert-legacy-llama.py, which is the old convert.py script. We should add some sort of option for this though, I agree. I'll take a look at that tomorrow.
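For anyone who still needs to force the vocab type, the legacy invocation would look roughly like this, assuming examples/convert-legacy-llama.py kept the old convert.py interface (the flag values and output path are illustrative, not verified against the current tree):

$ python examples/convert-legacy-llama.py ./mymodels/ --vocab-type bpe --outfile ./mymodels/model-f16.gguf --outtype f16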
