LLM-as-a-Judge: Categorical and Boolean Scores #4965
jannikmaierhoefer
started this conversation in
Ideas
Replies: 1 comment 3 replies
-
|
Yes. As an example use case, in our legal information chatbot, we would like to use LLM-as-judge to categorize (or "score") a trace into one of these topics: [ |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the feature or potential improvement
Currently, LLM-as-a-Judge evaluations in Langfuse can only create numerical scores.
It would be helpful if this feature could also create categorical or boolean scores (e.g. for intent classification).
Additional information
No response
Beta Was this translation helpful? Give feedback.
All reactions