Hacker News
new
top
best
ask
show
job
Show HN: AA-Briefcase: a frontier knowledge work evaluation
(
artificialanalysis.ai
)
11 points
by
declanjackson
3 hours ago
2 comments
mrdbourke
3 hours ago
the example submissions are really good comparisons, comparing Fable 5's submission to Opus 4 is fairly stark
brenton_on_news
2 hours ago
GLM hanging with the frontier big dogs