Skip to content

Conversation

@georgosgeorgos
Copy link

  • Expand benchmark datasets adding AIME 2025, OMNI-MATH, GSM8K
  • Update documentation
  • Update benchmark file

@coderabbitai
Copy link

coderabbitai bot commented Oct 9, 2025

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch feat/update_benchmark

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@georgosgeorgos georgosgeorgos changed the title feat: add new datasets and update docs feat/update_benchmark Oct 9, 2025
@codecov
Copy link

codecov bot commented Oct 9, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

@georgosgeorgos georgosgeorgos self-assigned this Oct 9, 2025
@georgosgeorgos georgosgeorgos added documentation Improvements or additions to documentation enhancement New feature or request labels Oct 9, 2025
Copy link
Contributor

@gx-ai-architect gx-ai-architect left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good to me

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants