The BERDS Benchmark aims to measure retrieval diversity for questions that are opinionated or invite diverse perspectives.
Hung-Ting Chen
timchen0618
AI & ML interests
NLP
Recent Activity
upvoted a paper 3 days ago
Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents
updated a dataset over 1 year ago
timchen0618/BERDS