Intern@veRL. PhD student in ML/NLP.
- Shanghai, China
-
18:05
(UTC +08:00) - https://yyding1.github.io
Pinned Loading
-
verl-project/uni-agent
verl-project/uni-agent PublicA unified framework for building, running, and training general agents at scale.
-
verl-project/verl
verl-project/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
-
ScaleQuest
ScaleQuest Public[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


