Skip to content

Commit 290de37

Browse files
committed
Update model usage
1 parent 6ca9ab5 commit 290de37

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

docs/index.html

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -253,23 +253,23 @@
253253
<td class="has-text-centered"><a href="https://files.sri.inf.ethz.ch/swt-bench/assertflip/">🔗</a></td>
254254
</tr>
255255
<tr data-mode="unittest">
256-
<td><a href="https://docs.all-hands.dev/">OpenHands</a> <small>GPT-4, CI setup</small></td>
256+
<td><a href="https://docs.all-hands.dev/">OpenHands</a> <small>Cl. Sonnet 3.5, CI setup</small></td>
257257
<td class="has-text-centered"><a href="https://all-hands.dev/"><img alt="All Hands AI" title="All Hands AI" src="./static/images/logos/allhands.svg" class="org-icon"></a></td>
258258
<td>28.3%</td>
259259
<td>52.4%</td>
260260
<td><time>2025-02-18</time></td>
261261
<td class="has-text-centered"><a href="https://github.com/logic-star-ai/swt-bench?tab=readme-ov-file#evaluation-results">🔗</a></td>
262262
</tr>
263263
<tr>
264-
<td><a href="https://docs.all-hands.dev/">OpenHands</a> <small>GPT-4, vanilla</small></td>
264+
<td><a href="https://docs.all-hands.dev/">OpenHands</a> <small>Cl. Sonnet 3.5, vanilla</small></td>
265265
<td class="has-text-centered"><a href="https://all-hands.dev/"><img alt="All Hands AI" title="All Hands AI" src="./static/images/logos/allhands.svg" class="org-icon"></a></td>
266266
<td>22.8%</td>
267267
<td>43.6%</td>
268268
<td><time>2025-02-18</time></td>
269269
<td class="has-text-centered"><a href="https://github.com/logic-star-ai/swt-bench?tab=readme-ov-file#evaluation-results">🔗</a></td>
270270
</tr>
271271
<tr>
272-
<td><a href="https://arxiv.org/abs/2406.12952">SWE-Agent+</a></td>
272+
<td><a href="https://arxiv.org/abs/2406.12952">SWE-Agent+</a><small>GPT-4</small></td>
273273
<td class="has-text-centered"><a href="https://logicstar.ai/"><img alt="LogicStar" title="LogicStar" src="./static/images/logos/logicstar.png" class="org-icon"></a></td>
274274
<td>18.5%</td>
275275
<td>27.6%</td>
@@ -285,7 +285,7 @@
285285
<td class="has-text-centered"><a href="https://github.com/logic-star-ai/swt-bench?tab=readme-ov-file#evaluation-results">🔗</a></td>
286286
</tr>
287287
<tr>
288-
<td><a href="https://swe-agent.com/latest/">SWE-Agent</a> <small>Claude 3.5 Sonnet</small></td>
288+
<td><a href="https://swe-agent.com/latest/">SWE-Agent</a> <small>Cl. 3.5 Sonnet</small></td>
289289
<td class="has-text-centered"><a href="https://swe-agent.com/"><img alt="SWE-agent" title="SWE-agent" src="./static/images/logos/swe-agent.svg" class="org-icon"></a></td>
290290
<td>12.3%</td>
291291
<td>30.3%</td>
@@ -341,7 +341,7 @@
341341
<td class="has-text-centered"><a href="https://github.com/logic-star-ai/swt-bench?tab=readme-ov-file#evaluation-results">🔗</a></td>
342342
</tr>
343343
<tr>
344-
<td><a href="https://arxiv.org/abs/2209.11515">LIBRO</a></td>
344+
<td><a href="https://arxiv.org/abs/2209.11515">LIBRO</a><small>GPT-4</small></td>
345345
<td class="has-text-centered"><a href="https://github.com/coinse/libro"><img alt="KAIST" title="KAIST" src="./static/images/logos/KAIST.svg" class="org-icon"></a></td>
346346
<td>14.1%</td>
347347
<td>23.8%</td>
@@ -437,7 +437,7 @@
437437
<td class="has-text-centered"><a href="https://files.sri.inf.ethz.ch/swt-bench/otter/">🔗</a></td>
438438
</tr>
439439
<tr>
440-
<td><a href="https://docs.all-hands.dev/">OpenHands</a> <small>GPT-4o</small></td>
440+
<td><a href="https://docs.all-hands.dev/">OpenHands</a> <small>Cl. Sonnet 3.5</small></td>
441441
<td class="has-text-centered"><a href="https://all-hands.dev/"><img alt="All Hands AI" title="All Hands AI" src="./static/images/logos/allhands.svg" class="org-icon"></a></td>
442442
<td>27.7%</td>
443443
<td>52.9%</td>

0 commit comments

Comments
 (0)