🚀 Feature
With #78 merged, we may want to consider training our own reward models within LLM Studio.
This should probably be a new task type, and it would require a different dataset type.
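To make the "different dataset type" concrete, here is a rough sketch of what a pairwise preference row and the usual pairwise ranking loss could look like. Everything here is an assumption for illustration: the `PreferencePair` class, its column names (`prompt`, `chosen`, `rejected`), and the `pairwise_loss` helper are hypothetical and are not part of LLM Studio or #78.

```python
import math
from dataclasses import dataclass


@dataclass
class PreferencePair:
    """One hypothetical row of a reward-model dataset (column names are assumptions)."""
    prompt: str
    chosen: str    # answer a rater preferred
    rejected: str  # answer a rater liked less


def pairwise_loss(chosen_score: float, rejected_score: float) -> float:
    """Pairwise ranking loss: -log(sigmoid(chosen_score - rejected_score))."""
    margin = chosen_score - rejected_score
    # softplus(-margin) equals -log(sigmoid(margin)); branch for numerical stability
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Illustrative example row, not real data
pair = PreferencePair(
    prompt="What is the capital of France?",
    chosen="Paris.",
    rejected="I am not sure.",
)
```

The loss shrinks as the model scores the chosen answer higher than the rejected one, which is why this kind of task would need paired answers per prompt rather than a single prompt/answer column.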
Motivation
With #78 merged, we may want to consider training our own reward models within LLM Studio.