2 pointsby sunandsurf11 hours ago2 comments
  • Eric_Xua5 hours ago
    Love the idea of turning agent benchmarks into a real-time Bomberman match between LLMs — super fun way to surface speed vs reasoning tradeoffs.
  • jamespeng7 hours ago
    hahha that's mad !