-
Notifications
You must be signed in to change notification settings - Fork 5
Update overviews #18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update overviews #18
Conversation
Context window (context length) seems to be part of the |
ai/deepcoder-preview.md
Outdated
|------------------------------|------------|--------------|----------------|--------|--------| | ||
| `deepcoder-preview:14B-F16` | 14.77B | F16 | 131,072 | 24GB¹ | 29.5GB | | ||
| `deepcoder-preview:14B:latest` <br><br> `deepcoder-preview:14B-Q4_K_M` | 14.77B | Q4_K_M | 131,072 | 8GB¹ | 9GB | | ||
| Model Variant | Parameters | Quantization | Context window | VRAM | Size | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| Model Variant | Parameters | Quantization | Context window | VRAM | Size | | |
| Model variant | Parameters | Quantization | Context window | VRAM | Size | |
Picky nit. Can we have all them to be changed to sentence case like this please? TIA
* Renaming readme files for each model to the same name used in Hub * Fix smollm2 urls
@ilopezluna Context length is generally model specific and should be given by the model creators, I'm not sure if there's an easy way to automate that if the metadata is not included in the HF repo consistently. Also, we should be aware of the context length limitations we currently have in DMR (I'm not sure if any progress has been made there).. maybe we should specify that instead of just removing all values? if not, we could also remove the column in the table instead of just leaving it empty |
@krissetto I've just verified (thanks @jalonsogo for the hint) its included in the GGUF metadata as
|
- Parse gguf without downloading it
- Fixes update of the markdown
…k parameters metadata
@krissetto @jalonsogo I'm using this formula now: https://github.com/docker/model-cards/pull/18/files#diff-3ddaf77e1aeb6813c77ff54404fc4be8e4aa5bbff4bd6227bbea8d04155d4468R216 |
@jalonsogo I kept the previous scripts but I think it would be better to remove them once we confirm that current go approach works as expected |
@ilopezluna noice 🫶 nit: don't forget the footnote notation (the little "1") in the VRAM calc parts of the tables when we generate them |
or maybe lets put it in the table header itself? 🤔 |
good catch, thanks! (added) |
* Renaming readme files for each model to the same name used in Hub * Fix smollm2 urls * Update overviews (#18) * adds update script * adds build-model-table.sh script * Updates all models * force param is not needed anymore * Renaming model overviews to match with the model name in Hub (#17) * Renaming readme files for each model to the same name used in Hub * Fix smollm2 urls * Use sentence case * Adds initial go script to update table * - build-all tables script to Go - Parse gguf without downloading it * - Uses authenticated req (to avoid rate limit) - Fixes update of the markdown * Try to get labels from general.size_label first, if not found fallback parameters metadata * Format context length * VRAM estimation * Allow to update only the specified file * Removes unneeded scripts * Fix estimated VRAM for embedding model * Adds model inspect command * Rename to model-cards-cli * Updates model-cards * Rename header to VRAM¹ * Adds parsed gguf file into ModelVariant, and includes method to extract all metadata * Includes gguf metadata into inspect * No need to use interface for registry client for now. * A ModelVariant has multiple tags * Formats VRAM * Formats context length * Adds --all to include metadata * Removes formatter * Format size * Update models * Script not needed anymore * Updates README.md
Introducing Model Cards CLI Tool and Model Documentation Updates
This PR introduces a new Model Cards CLI tool and updates model documentation across the repository. Key changes include:
New Model Cards CLI Tool:
Model Documentation Updates: