Skip to content

Commit 19df1ae

Browse files
xjkmfaxujing43
andauthored
[Docs] add Qwen25vl yaml (#4662)
* Add ci case for min token and max token * 【CI case】include total_tokens in the last packet of completion interface stream output * 【CE】add qwen25-vl * 【CE】add qwen25-vl --------- Co-authored-by: xujing43 <xujing43@baidu.com>
1 parent 8f40dfa commit 19df1ae

File tree

2 files changed

+14
-0
lines changed

2 files changed

+14
-0
lines changed
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
max_model_len: 32768
2+
max_num_seqs: 128
3+
gpu_memory_utilization: 0.85
4+
tensor_parallel_size: 1
5+
limit_mm_per_prompt: '{"image": 100, "video": 100}'
6+
enable_mm: True
Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
top_p: 0.8
2+
temperature: 0.7
3+
metadata:
4+
min_tokens: 1
5+
max_tokens: 32768
6+
repetition_penalty: 1.05
7+
frequency_penalty: 0
8+
presence_penalty: 0

0 commit comments

Comments
 (0)