34 pointsby mellosouls3 hours ago9 comments
  • eclipsetheworld9 minutes ago
    I have been working on my own "Digital Twins Universe" because 3rd-party SaaS tools often block the tight feedback loops required for long-horizon agentic coding. Unlike Stripe, which offers a full-featured environment usable in both development and staging, most B2B SaaS companies lack adequate fidelity (e.g., missing webhooks in local dev) or even a basic staging environment.

    Taking the time to point a coding agent towards the public (or even private) API of a B2B SaaS app to generate a working (partial) clone is effectively "unblocking" the agent. I wouldn't be surprised if a "DTU-hub" eventually gains traction for publishing and sharing these digital twins.

    I would love to hear more about your learnings from building these digital twins. How do you handle API drift? Also, how do you handle statefulness within the twins? Do you test for divergence? For example, do you compare responses from the live third-party service against the Digital Twin to check for parity?

  • simonwan hour ago
    This is the stealth team I hinted at in a comment on here last week about the "Dark Factory" pattern of AI-assisted software engineering: https://news.ycombinator.com/item?id=46739117#46801848

    I wrote a bunch more about that this morning: https://simonwillison.net/2026/Feb/7/software-factory/

    This one is worth paying attention to to. They're the most ambitious team I've see exploring the limits of what you can do with this stuff. It's eye-opening.

    • enderforthan hour ago
      This right here is where I feel most concerned

      > If you haven’t spent at least $1,000 on tokens today per human engineer, your software factory has room for improvement

      Seems to me like if this is true I'm screwed no matter if I want to "embrace" the "AI revolution" or not. No way my manager's going to approve me to blow $1000 a day on tokens, they budgeted $40,000 for our team to explore AI for the entire year.

      Let alone from a personal perspective I'm screwed because I don't have $1000 a month in the budget to blow on tokens because of pesky things that also demand financial resources like a mortgage and food.

      At this point it seems like damned if I do, damned if I don't. Feels bad man.

      • mgkimsal4 minutes ago
        I read that as combined, up to this point in time. You have 20 engineers? If you haven't spent at least $20k up to this point, you've not explored or experienced enough of the ins and outs to know how best to optimize the use of these tools.

        I didn't read that as you need to be spending $1k/day per engineer. That is an insane number.

        EDIT: re-reading... it's ambiguous to me. But perhaps they mean per day, every day. This will only hasten the elimination of human developers, which I presume is the point.

      • reilly300011 minutes ago
        My friend works at Shopify and they are 100% all in on AI coding. They let devs spend as much as they want on whatever tool they want. If someone ends up spending a lot of money, they ask them what is going well and please share with others. If you’re not spending they have a different talk with you.

        As for me, we get Cursor seats at work, and at home I have a GPU, a cheap Chinese coding plan, and a dream.

        • r0b056 minutes ago
          > I have a GPU, a cheap Chinese coding plan, and a dream

          Right in the feels

      • simonw40 minutes ago
        Yeah, that's one part of this that didn't sit right with me.

        I don't think you need to spend anything like that amount of money to get the majority of the value they're describing here.

        • noosphr17 minutes ago
          This is the part that feels right to me because agents are idiots.

          I built a tool that writes (non shit) reports from unstructured data to be used internally by analysts at a trading firm.

          It cost between $500 to $5000 per day per seat to run.

          It could have cost a lot more but latency matters in market reports in a way it doesn't for software. I imagine they are burning $1000 per day per seat because they can't afford more.

        • 30 minutes ago
          undefined
      • buster44 minutes ago
        May be the point is, that the one engineer replaces 10 engineers by using the dark factory which by definition doesn't need humans.
      • navanchauhanan hour ago
        I think corporate incentives vs personal incentives are slightly different here. As a company trying to experiment in this moment, you should be betting on token cost not being the bottleneck. If the tooling proves valuable, $1k/day per engineer is actually pretty cheap.

        At home on my personal setup, I haven't even had to move past the cheapest codex/claude code subscription because it fulfills my needs ¯\_(ツ)_/¯. You can also get a lot of mileage out of the higher tiers of these subscriptions before you need to start paying the APIs directly.

        • rune-dev12 minutes ago
          How is 1k/day cheap? Even for a large company?

          Takes like this are just baffling to me.

          For one engineer that is ~260k a year.

  • Herring15 minutes ago
    $100 says they're still doing leetcode interviews.

    If everyone can do this, there won't be any advantage (or profit) to be had from it very soon. Why not buy your own hardware and run local models, I wonder.

    • navanchauhan11 minutes ago
      I would spend those $100 on either API tokens or donate to a charity of your choice. My interview to join this team was whether I could build something of my choosing in under an hour with any coding agent of my choice.

      No local model out there is as good as the SOTA right now.

  • hnthrow028734512 minutes ago
    Yep, you definitely want to be in the business of selling shovels for the gold rush.
  • threecheese16 minutes ago
    So much of this resonated with me, and I realize I’ve arrived at a few of the techniques myself (and with my team) over the last several months.

    THIS FRIGHTENS ME. Many of us sweng are either going be FIRE millionaires, or living under a bridge, in two years.

    I’ve spent this week performing SemPort; found a ts app that does a needed thing, and was able to use a long chain of prompts to get it completely reimplemented in our stack, using Gene Transfer to ensure it uses some existing libraries and concrete techniques present in our existing apps.

    Now not only do I have an idiomatic Python port, which I can drop right into our stack, but I have an extremely detailed features/requirements statement for the origin typescript app along with the prompts for generating it. I can use this to continuously track this other product as it improves. I also have the “instructions infrastructure” to direct an agent to align new code to our stack. Two reusable skills, a new product, and it took a week.

    • beepbooptheory6 minutes ago
      Sorry if rude but truly feel like I am missing the joke. This is just LinkedIn copypasta or something right?
  • easeoutan hour ago
    > A problem repeatedly occurred on "https://factory.strongdm.ai/".
  • mellosouls3 hours ago
    Having submitted this I would also suggest the website admin revisit their testing; its very slow on my phone. Obviously fails on aesthetics and accessibility as well. Submitted for the essay.
    • pengaru44 minutes ago
      Sounds like you're experiencing an "agentic moment".
    • pityJukean hour ago
      Haha yeah if I scroll on my iPhone 15 Pro it literally doesn’t load until I stop.
    • navanchauhanan hour ago
      Anything special thing about your browser setup? Javascript disabled / browser extension?
    • foolserrandboyan hour ago
      I get the following on safari on iOs: A problem repeatedly occurred on (url)
      • throwaway0123_539 minutes ago
        On iOS Safari it loads and works decent for me, but w/ iOS Firefox and Firefox Focus doesn't even load.
  • beklein2 hours ago
  • navanchauhanan hour ago
    (I’m one of the people on this team). I joined fresh out of college, and it’s been a wild ride.

    I’m happy to answer any questions!

    • steveklabnik32 minutes ago
      More of a comment than a question:

      > Those of us building software factories must practice a deliberate naivete

      This is a great way to put it, I've been saying "I wonder which sacred cows are going to need slaughtered" but for those that didn't grow up on a farm, maybe that metaphor isn't the best. I might steal yours.

      This stuff is very interesting and I'm really interested to see how it goes for you, I'll eagerly read whatever you end up putting out about this. Good luck!

      EDIT: oh also the re-implemented SaaS apps really recontextualizes some other stuff I’ve been doing too…

      • axus11 minutes ago
        > "I wonder which sacred cows are going to need slaughtered"

        Or a vegan or Hindu. Which ethics are you willing to throw away to run the software factory?

        I eat hamburgers while aware of the moral issues.

    • simonwan hour ago
      I know you're not supposed to look at the code, but do you have things in place to measure and improve code quality anyway?

      Not just code review agents, but things like "find duplicated code and refactor it"?

      • navanchauhanan hour ago
        A few overnight “attractor” workflows serve distinct purposes:

        * DRYing/Refactoring if needed

        * Documentation compaction

        * Security reviews