64 pointsby randall7 hours ago4 comments
  • Agent_Builder9 minutes ago
    We ran into similar reliability issues while building GTWY. What surprised us was that most failures weren’t about model quality, but about agents being allowed to run too long without clear boundaries.

    What helped was treating agents less like “always-on brains” and more like short-lived executors. Each step had an explicit goal, explicit inputs, and a defined end. Once the step finished, the agent stopped and context was rebuilt deliberately.

    Harnesses like this feel important because they shift the problem from “make the model smarter” to “make the system more predictable.” In our experience, reliability came more from reducing degrees of freedom than from adding intelligence.

  • salesplayan hour ago
    This is an interesting direction for agent frameworks. What stood out to me is the shift from simple tool orchestration to agents that can reason, call other agents, and self-manage workflows. That’s something we’ve been thinking about a lot while building SalesPlay — especially around how autonomous sales agents need clear evaluation, guardrails, and accountability to actually be useful in real GTM teams. The built-in grading/evaluation angle here feels like a practical step toward making agents less brittle and more production-ready. Curious to see how this evolves in real-world use cases.
  • Trufa4 hours ago
    Is this an alternative to https://mastra.ai/docs

    How would it compare?

    • randall4 hours ago
      So I look at something like Mastra (or LangChain) as agent orchestration, where you do computing tasks to line up things for an LLM to execute against.

      I look at Gambit as more of an "agent harness", meaning you're building agents that can decide what to do more than you're orchestrating pipelines.

      Basically, if we're successful, you should be able to chain agents together to accomplish things extremely simply (using markdown). Mastra, as far as I'm aware, is focused on helping people use programming languages (typescript) to build pipelines and workflows.

      So yes it's an alternative, but more like an alternative approach rather than a direct competitor if that makes sense.

  • tomhow4 hours ago
    [under-the-rug stub]

    [see https://news.ycombinator.com/item?id=45988611 for explanation]

    • 5 hours ago
      undefined
    • franciscomello6 hours ago
      This looks quite interesting in terms of the architecture. Seems like a fresh take on stuff like Langchain, which at least last time I checked sucks.
    • alberson5 hours ago
      I’m excited to give this a spin at Agentive! Really interesting approach.
    • sofdao5 hours ago
      this is awesome

      are things like file system baked in?

      fan of the design of the system. looks great architecturally

      • randall5 hours ago
        omg thank you so much. We're working on the file system stuff, that's an easier lift for us than the initial work, so we wanted to start with the big stuff and work backward. Claude Code and Codex are obviously really great at that stuff, and we'd like to be able to support a lot of that out of the box.
    • pych5 hours ago
      wow this looks cool - been meaning to dig into harness stuff this looks like a good starting point
      • randall5 hours ago
        Thx! Happy to help if you need it. :)
    • randall4 hours ago
      thx, i appreciate it, believe it or not. :)
    • 5 hours ago
      undefined