Commit 255d365

🔴 [VLM] Add base model without head (huggingface#37033)
* i guessreverted all CdGen classes
* style
* llava onevision
* fix copies
* fix some tests
* some more tests
* dump
* skip these
* nevermind, i am dumb
* revert fix not needed
* fixup
* fixup
* another fixup
* more fixup to make ci finally happy
* fixup after rebasing
* fix qwen tests
* add internVL + typos here and there
* image token index -> id
* style
* fix init weights
* revert blip-2 not supported
* address comments
* fix copies
* revert blip2 test file as well
* as discussed internally, revert back CdGen models
* fix some tests
* fix more tests for compile
* CI red
* fix copies
* enumerate explicitly allowed models
* address comments
* fix tests
* fixup
* style again
* add tests for new model class
* another fixup ( x _ x )
* [fixup] unused attributes can be removed post-deprecation
1 parent 42dd05a commit 255d365

85 files changed: +7496 -2810 lines changed
docs/source/en/model_doc/aria.md

Lines changed: 4 additions & 0 deletions
@@ -102,6 +102,10 @@ response = processor.decode(output_ids, skip_special_tokens=True)
 
 [[autodoc]] AriaTextModel
 
+## AriaModel
+
+[[autodoc]] AriaModel
+
 ## AriaTextForCausalLM
 
 [[autodoc]] AriaTextForCausalLM

docs/source/en/model_doc/aya_vision.md

Lines changed: 4 additions & 0 deletions
@@ -237,6 +237,10 @@ for i, output in enumerate(batch_outputs):
 
 [[autodoc]] AyaVisionConfig
 
+## AyaVisionModel
+
+[[autodoc]] AyaVisionModel
+
 ## AyaVisionForConditionalGeneration
 
 [[autodoc]] AyaVisionForConditionalGeneration

docs/source/en/model_doc/emu3.md

Lines changed: 4 additions & 0 deletions
@@ -174,6 +174,10 @@ for i, image in enumerate(images['pixel_values']):
 [[autodoc]] Emu3TextModel
     - forward
 
+## Emu3Model
+
+[[autodoc]] Emu3Model
+
 ## Emu3ForCausalLM
 
 [[autodoc]] Emu3ForCausalLM

docs/source/en/model_doc/fuyu.md

Lines changed: 4 additions & 0 deletions
@@ -103,6 +103,10 @@ The `LlamaTokenizer` is used as it is a standard wrapper around sentencepiece.
 
 [[autodoc]] FuyuConfig
 
+## FuyuModel
+
+[[autodoc]] FuyuModel
+
 ## FuyuForCausalLM
 
 [[autodoc]] FuyuForCausalLM

docs/source/en/model_doc/gemma3.md

Lines changed: 4 additions & 0 deletions
@@ -254,6 +254,10 @@ visualizer("<img>What is shown in this image?")
 [[autodoc]] Gemma3TextModel
     - forward
 
+## Gemma3Model
+
+[[autodoc]] Gemma3Model
+
 ## Gemma3ForCausalLM
 
 [[autodoc]] Gemma3ForCausalLM

docs/source/en/model_doc/got_ocr2.md

Lines changed: 4 additions & 0 deletions
@@ -277,6 +277,10 @@ alt="drawing" width="600"/>
 
 [[autodoc]] GotOcr2Processor
 
+## GotOcr2Model
+
+[[autodoc]] GotOcr2Model
+
 ## GotOcr2ForConditionalGeneration
 
 [[autodoc]] GotOcr2ForConditionalGeneration

docs/source/en/model_doc/instructblip.md

Lines changed: 4 additions & 0 deletions
@@ -69,6 +69,10 @@ The attributes can be obtained from model config, as `model.config.num_query_tok
 [[autodoc]] InstructBlipQFormerModel
     - forward
 
+## InstructBlipModel
+
+[[autodoc]] InstructBlipModel
+
 ## InstructBlipForConditionalGeneration
 
 [[autodoc]] InstructBlipForConditionalGeneration

docs/source/en/model_doc/instructblipvideo.md

Lines changed: 4 additions & 0 deletions
@@ -73,6 +73,10 @@ The attributes can be obtained from model config, as `model.config.num_query_tok
 [[autodoc]] InstructBlipVideoQFormerModel
     - forward
 
+## InstructBlipVideoModel
+[[autodoc]] InstructBlipVideoModel
+    - forward
+
 ## InstructBlipVideoForConditionalGeneration
 
 [[autodoc]] InstructBlipVideoForConditionalGeneration

docs/source/en/model_doc/internvl.md

Lines changed: 5 additions & 0 deletions
@@ -340,6 +340,11 @@ This example showcases how to handle a batch of chat conversations with interlea
 [[autodoc]] InternVLVisionModel
     - forward
 
+## InternVLModel
+
+[[autodoc]] InternVLModel
+    - forward
+
 ## InternVLForConditionalGeneration
 
 [[autodoc]] InternVLForConditionalGeneration

docs/source/en/model_doc/llava.md

Lines changed: 4 additions & 0 deletions
@@ -256,6 +256,10 @@ A list of official Hugging Face and community (indicated by 🌎) resources to h
 
 [[autodoc]] LlavaProcessor
 
+## LlavaModel
+
+[[autodoc]] LlavaModel
+
 ## LlavaForConditionalGeneration
 
 [[autodoc]] LlavaForConditionalGeneration
