1 point by metadat 3 months ago | 1 comment
  • metadat 3 months ago
    Explanation:

    Given a task (e.g. "Build a Spotify but for movies"), it shows you the results from 2 unidentified LLMs side by side and lets you pick which one is better. Only after you vote does it reveal which model you preferred.

    Check out the global leaderboard to see which model is most beloved:

    https://web.lmarena.ai/leaderboard