Skip to content

Commit 0966834

Browse files
authored
Update README.md
1 parent 42ca8ad commit 0966834

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -26,16 +26,16 @@ At present, our core contributors are preparing the **33B** version and we expec
2626
### GPT-4 automatic evaluation
2727

2828
We adopt the automatic evaluation framework based on GPT-4 proposed by FastChat to assess the performance of chatbot models. As shown in the following figure, WizardLM-13B achieved better results than Vicuna-13b.
29-
<p align="center" width="96%">
30-
<a ><img src="imgs/WizarLM13b-GPT4.png" alt="WizardLM" style="width: 96%; min-width: 300px; display: block; margin: auto;"></a>
29+
<p align="center" width="100%">
30+
<a ><img src="imgs/WizarLM13b-GPT4.png" alt="WizardLM" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
3131
</p>
3232

3333
### WizardLM-13B performance on different skills.
3434

3535
The following figure compares WizardLM-13B and ChatGPT’s skill on Evol-Instruct testset. The result indicates that WizardLM-13B achieves 89.1% of ChatGPT’s performance on average, with almost 100% (or more than) capacity on 10 skills, and more than 90% capacity on 22 skills.
3636

37-
<p align="center" width="96%">
38-
<a ><img src="imgs/evol-testset_skills-13b.png" alt="WizardLM" style="width: 96%; min-width: 300px; display: block; margin: auto;"></a>
37+
<p align="center" width="100%">
38+
<a ><img src="imgs/evol-testset_skills-13b.png" alt="WizardLM" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
3939
</p>
4040

4141
## Call for Feedbacks

0 commit comments

Comments
 (0)