23 pointsby marianebekker2 hours ago9 comments
  • marianebekkeran hour ago
    One thing that surprised us while putting this together was how uneven the stack still is. Planning and execution tooling feels fairly mature, but evaluation and long-term reliability lag far behind. Curious how people here are testing and validating agents in production today.
  • marianebekker2 hours ago
    Over the last 18 months, agentic AI has shifted from prompt-driven chatbots to system-level engineering. This post maps the open-source tools teams are actually using to build, run, and evaluate agents in production, and where they sit in the agent stack.
  • jhindle228 minutes ago
    Great article!
  • kenhoppean hour ago
    The agentic AI landscape is rapidly maturing, shifting from isolated experiments to end-to-end systems that can think, act, and adapt autonomously. Thanks for sharing how various frameworks are shaping the future of intelligent, real-world automation."
    • marianebekkeran hour ago
      Agreed on the shift — what stood out to us is how quickly teams hit system-level constraints once agents leave the demo phase. Things like evaluation drift, tool reliability, and recovery behavior start to matter more than model choice. Curious which of those has been the biggest bottleneck for you in practice?
  • phlikan hour ago
    Great Read.
  • alexfeinstein77an hour ago
    Cool article!
  • bgrief2 hours ago
    good stuff :)
  • idramirez14 minutes ago
    [dead]
  • abikuganesanan hour ago
    [dead]