2 pointsby latexr6 hours ago3 comments
  • vunderba6 hours ago
    At the risk of stating the obvious, DUH.

    There was a relatively comprehensive article around benching LLMs to play chess that measured even the SOTA models at around a mediocre 1000 ELO as compared to Carlsen who is rated at ~2850.

    https://maxim-saplin.github.io/llm_chess

  • jqpabc1236 hours ago
    He went on to ask ChatGPT for feedback on his performance.

    A master asking a beginner for feedback? I guess he was just curious if the evaluation would be as inept as the play.

  • CamperBob23 hours ago
    OK, now let's see how Stockfish does at Python coding.