Hacker News
MathDuels: Evaluating LLMs as Problem Posers and Solvers (arxiv.org)
1 point by matt_d 7 hours ago | 1 comment
matt_d 7 hours ago
Blog post: https://www.rabdos.ai/research/introducing-mathduels-ai
Leaderboard: https://mathduels.ai/