2 pointsby kenblum4 hours ago1 comment
  • vunderba3 hours ago
    Nice job - but it's definitely not abandonware having been re-released on Steam [1] (along with a sequel) back in 2015.

    Regarding the verifier that plays against the live engine, I’ve approached the problem from a similar angle by having LLM agents effectively borrow a page from the speedrunning community in the form of tool-assisted speedruns, allowing the LLM access only to a virtualized game controller.

    [1] - https://store.steampowered.com/app/346850/Chips_Challenge_1

    • kenblum3 hours ago
      Good catch on the Steam re-release. I'll add a note, or pull the site if it comes to that.

      Curious about your agent setup though. Any public repo?

      • vunderba3 hours ago
        Even if you do pull the game itself I would still definitely leave all the post mortem stuff up. I think it's just as interesting and worth keeping around - especially the YT vids demonstrating the harness.

        I don't have a GH repo up for the TAS system yet - it's a bespoke mess right now since it was built with the old game "Castle of the Winds" in mind but I'll definitely consider it in the future!

        https://en.wikipedia.org/wiki/Castle_of_the_Winds