Extend the existing RBP metric to handle graded relevance judgements (non-binary r_i ∈ [0, 1]) as described in Rank-Biased Precision for Measurement of Retrieval Effectiveness . Graded RBP reflects fractional utility rather than binary success, but uses the same weighting and persistence model.
Proposed Implementation
- Add a new class
GradedRBP, inheriting from RBP
- Reads a relevance field (default:
grade) from the test ItemList
- Scale grades to a unit range
[0, 1]
- Default grade value for unseen items =
0.25
- If
grade field is not found, defaults to binary RBP
Questions
Should we assume the grade field is already scaled to [0, 1]? If not, we can apply scaling. So, should scaling be optional here? Should we divide by the max grade value for scaling or is there a different method?
Extend the existing
RBPmetric to handle graded relevance judgements (non-binaryr_i ∈ [0, 1])as described in Rank-Biased Precision for Measurement of Retrieval Effectiveness . Graded RBP reflects fractional utility rather than binary success, but uses the same weighting and persistence model.Proposed Implementation
GradedRBP, inheriting fromRBPgrade) from the testItemList[0, 1]0.25gradefield is not found, defaults to binary RBPQuestions
Should we assume the
gradefield is already scaled to [0, 1]? If not, we can apply scaling. So, should scaling be optional here? Should we divide by the max grade value for scaling or is there a different method?