Show HN: 1v1 coding game that LLMs struggle with(yare.io)

3 pointsby levmiseri11 hours ago1 comment

javadhu8 hours ago
Cool project, this is my first time seeing such project using LLMs. Took me a while to understand what's happening on the home page.
A question though, why such powerful bots like Gemini 3.1 failed against Clowder bot? Is it because of inefficient code or the LLMs did not handle edge cases? Or they are not as good as humans when it comes to strategy.
- levmiseri7 hours ago
  I’m not sure honestly. It could be some combination of bad spatial reasoning of the LLMs and lack of any training data for this specific challenge.
  You can see replays for all of the matches if you hover over the cells in the table.