14 points by saikatsg 10 months ago | 5 comments

f30e3dfed1c9 10 months ago
To the extent that this is true, it's encouraging. Altman in particular is really obviously not to be trusted.
innagadadavida 10 months ago
Meta Got Caught Gaming AI Benchmarks
Meta released two new Llama 4 models over the weekend -- Scout and Maverick -- with claims that Maverick outperforms GPT-4o and Gemini 2.0 Flash on benchmarks. Maverick quickly secured the number-two spot on LMArena, behind only Gemini 2.5 Pro.
Researchers have since discovered that Meta used an "experimental chat version" of Maverick for LMArena testing that was "optimized for conversationality" rather than the publicly available version.
In response, LMArena said "Meta's interpretation of our policy did not match what we expect from model providers" and announced policy updates to prevent similar issues.
saikatsg 10 months ago
https://news.ycombinator.com/item?id=43624417
ashoeafoot 10 months ago
It says a ton about the democracy when a 2/3 majority cannot oust the summoners and the circle.