8 pointsby minimaxir8 hours ago2 comments
  • CSMastermind7 hours ago
    It's interesting to me that they're extracting game state from memory instead of just passing the video into the LLM.
    • _--__--__7 hours ago
      Every agent step takes both a visual snapshot and a memory read, the memory gives more consistent tileset and location parsing and also has some stuff like party state/status conditions that isn't usually visible.
      • CSMastermind7 hours ago
        Shouldn't the goal be to compare it against a human player that would need to menu for that information?
        • _--__--__5 hours ago
          that's probably fair but I imagine that without memory access Claude would have been opening and searching the (completely unordered!) bag to check for progress critical items like the pokeflute every 5 minutes
  • minimaxir23 days ago
    This is released by the Anthropic engineer who developed Claude Plays Pokemon.