I'm still at the bargaining phase, personally.
Many are still in denial that you can do work that is as good as before, quicker, using coding agents. A lot of people think there has to be some catch, but there really doesn’t have to be. If you continue to put effort in, reviewing results, caring about testing and architecture, working to understand your codebase, then you can do better work. You can think through more edge cases, run more experiments, and iterate faster to a better end result.
So I think LLMs have moved the effort that used to be spent on the fun part (coding) into the boring part (assessment and evaluation), which is also now a lot bigger.
You can also set up way more elaborate verification systems. Don't just do a static analysis of the code, but actually deploy it and let the LLM hammer at it with all kinds of creative paths. Then let it debug why it's broken. It's relentless at debugging - I've found issues in external tools I normally would've let go (maybe created an issue for), that I can now debug and even propose a fix for, without much effort on my side.
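As a minimal sketch of that "hammer at it" idea - every name here is made up: `parse_duration` stands in for whatever you just deployed, and a real setup would hand each failing input plus its traceback back to the agent to debug:

```python
import random
import string

random.seed(0)  # reproducible run for the example

def parse_duration(s: str) -> int:
    """Hypothetical unit under test: '5m' -> 300, '2h' -> 7200."""
    units = {"s": 1, "m": 60, "h": 3600}
    return int(s[:-1]) * units[s[-1]]

def random_input() -> str:
    """Mix plausible-looking inputs with outright hostile ones."""
    if random.random() < 0.5:
        return f"{random.randint(-5, 500)}{random.choice('smhxq')}"
    return "".join(random.choices(string.printable, k=random.randint(0, 8)))

def hammer(fn, trials: int = 1000) -> dict:
    """Throw random inputs at fn; keep one sample input per failure type."""
    failures = {}
    for _ in range(trials):
        s = random_input()
        try:
            fn(s)
        except Exception as exc:
            failures.setdefault(type(exc).__name__, s)
    return failures

failures = hammer(parse_duration)
print(sorted(failures))  # the distinct crash types found, for the agent to triage
```

A thousand trials is nothing for a machine, and each entry in `failures` is exactly the kind of concrete reproduction an agent is relentless at chasing down.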
So yeah, I agree that the boring part has become the more important part right now (speccing well and letting it build what you want is pretty much solved), but let's then automate that. Because if anything, that's what I love about this job: I get to automate work, so that my users (often myself) can be lazy and focus on stuff that's more valuable/enjoyable/satisfying.
But specifically to your examples, the latter: I think the "brute force the program" approach will be more common than doing things manually in many cases (not all! I'm still a believer in people!).
Edit: Well, I wrote a bad blog post on this some time ago, I might as well share it: I think that accepting means engaging with the change rather than ignoring it.
https://riffraff.info/2026/03/my-2c-on-the-ai-genai-llm-bubb...
Once the tools outperform humans at the tasks to which they were applied (and they will), you don't need to be involved at all, except to give direction and final acceptance. The tools will write, and verify, the code at each step.
I don't get why some people are so convinced that this is inevitable. It's possible, yes, but it very well might be the case that models cannot be stopped from randomly doing stupid things, cannot be made more trustworthy, cannot be made more verifiable, and will have to be relegated to the role of brainstorming aids.
Someone once said that it is hard to make a man understand things if his profit depends on him not understanding them...
We don’t have to accept things.
I think in the foreseeable future we'll have open models running on commonly available hardware, and that is not a change that can be stopped (and arguably it's the commons getting back their own value). What we can do is fight for proper taxation, for compensatory fees, for regulation that limits plagiarism, and for regulation of the most extreme externalities.
But it makes no sense, to me, to fight the technology tout court.
When I say AI, I mean specifically LLMs. There isn't a single future position where all the risks are suitably managed, there is a return on investment, and there is not a net loss to society. Faith, hope, lies, fraud and inflated expectations don't cut it, and that is what the whole shebang is built on. On top of that, we are entering a time of serious geopolitical instability. Creating more dependencies on large amounts of capital and regional control is totally unacceptable and puts us all at risk.
My integrity is worth more than sucking this teat.
Or is it limited to refusal to use LLMs, which is a strategy, but more like becoming a hobbyist programmer then.
— George Bernard Shaw
The antidote to runaway hype is for someone to push back, not to just relent and accept your fate. Who cares about affording to. We need more people with ideals stronger than the desire to make a lot of money.
I mean, at some point it was true.
I remember that around 2023, when I first encountered colleagues trying to use ChatGPT for coding, I thought "by the time you are done with your back-and-forth to correct all the errors, I would have already written this code manually".
That was true then, but not anymore.
But the interesting stuff where you don't understand the problem yet, it doesn't make it quicker. Because then the bottleneck is my understanding. Things take time. And sleep. They require hands-on experience. It doesn't matter how fast LLMs can churn out code. There's a limit to how fast I can understand things. Unless, of course, I'm happy shipping code I don't understand, which I'm not.
All the incentives nudge you towards less and less critical evaluation of the output. The results of careful evaluation are much harder to measure than "this guy is cranking out 10x the code compared to last year!" And while you were busy thoughtfully internalizing the output, the guy next to you has been letting Strega Nona's pasta pot go brrr and spew another thousand lines of spaghetti on top, ready for you to review. Eventually "lgtm" becomes the default, and "do you want me to go ahead and implement that for you" starts sounding like the only way to keep your head above water.
I'm not a proper software engineer, but I do a lot of scripting, and most of my attempts to let a model speed up a menial task (e.g. a small bash or python script for some data parsing or chaining together other tools) end up with me doing extensive rewrites because the model is completely inconsistent in naming conventions, pattern reuse, etc.
To all of you I can only say, you were utterly wrong and I hope you realize how unreliable your judgements all are. Remember I'm saying this to roughly 50% of HN, an internet community that's supposedly more rational and intelligent than other places on the internet. For this community to be so wrong about something so obvious... that's saying something.
You gonna give some predictable answer about next token prediction and probability or some useless exposition on transformers while completely avoiding the fact that we don’t understand the black-box emergent properties that make a next-token predictor have properties indistinguishable from intelligence?
My conclusion is that at this point, LLMs are not capable of making good decisions supported by deep reasoning. They're capable of mimicking that, yes, and it takes some skill to see through them.
As of right now, the one-shot complex solutions AI comes up with are actually frequently extremely good. It’s only gonna get better, and this was in the last 6 months. You could be outdated on frontier model progress. That’s how quick things are changing.
I don’t have any questions about LLMs. At least not any more than, say, an LLM researcher at Anthropic working on model interpretability.
They weren't wrong though. It objectively is just a next-token predictor and doesn't understand code. That is how the thing works.
Don’t make me cite Geoffrey Hinton or other preeminent experts to show you how wrong you all are.
Use your brain. It is changing the industry from the ground up. It understands.
Geoffrey Hinton was his mentor, and Hinton is the main godfather of AI, while Yann is more of a malfunctioning student still holding onto the stochastic parrot moniker. Here's Hinton saying what you need to know:
https://www.reddit.com/r/agi/comments/1qwoee7/godfather_of_a...
How was he "proven" wrong?
> Yann is more of a malfunctioning student...
lol what?
Yann is malfunctioning because he cant reconcile his past statements with reality. He can’t admit he’s wrong. As time goes on his past statements will look even more and more absurd as progress on AI keeps moving forward.
At the same time we have Terence Tao using AI to develop new math, and Hinton saying the opposite with actual evidence, along with the entire industry. Yann is a clown: https://www.reddit.com/r/singularity/comments/1piro45/people... and his opinions are not mainstream at all.
Actually that is done by bolting more "fact checking" layers on top. Even that does not fix it very well.
So at a fundamental level, LLMs have not really progressed. On a superficial level, they have, but that is only because marketing wanted to show the "progress" over a short amount of time, so that the "uninitiated" will extrapolate that to mean some god like AI in near future, raking in all the investor money...
Smart move though. It is working very well....
Reinforcement training is done as well. And it fixed it quite well, such that we use it on a daily basis now.
>So at a fundamental level, LLMs have not really progressed. On a superficial level, they have,
No those fixes aren't superficial. They're the same fixes you have in your brain. You also fact check, people also hallucinate and people with brain damage hallucinate even more. You can bypass mechanisms in your brain that prevent hallucination by taking drugs.
Essentially the brain is a big hallucination machine with mechanisms to prevent it both low level and high level. We even consciously fact check ourselves and double check our own work. Is that superficial? No.
You look at progress by seeing how LLMs are used. At first they were used as a chatbot. Then it became autocomplete. Now basically most people don't code with their hands anymore and they use it as an agent. That is the most disruptive thing to ever happen to programming. This isn't an investor thing. This is REALITY.
>So at a fundamental level, LLMs have not really progressed. On a superficial level, they have, but that is only because marketing wanted to show the "progress" over a short amount of time, so that the "uninitiated" will extrapolate that to mean some god like AI in near future, raking in all the investor money...
This is you hallucinating. Investor money is pouring in because they are closer than ever to creating AI that can replace developers, and companies will pay top dollar for that. That's why AI is making money. Very few people are speculating on making a god AI... but a few are, and those are the people throwing money at Yann's AMI venture, which is a huge gamble and could see that money end up in the trash.
But LLM technology? We use it everyday. It's already a validated technology.
>Smart move though. It is working very well....
I can ask an LLM, "hey, human society is changing before our very eyes. Nobody programs directly anymore." The LLM is not so stupid as to say that's "superficial" progress. That's a smarter answer than a lot of the people here.
I would say 6 months ago, I would get like 5 or 6 detractors responding to one of my posts like this. Now I think this thread got 2 and a bunch of vote downs. People are realizing they're embarrassingly wrong. It'll hit you eventually. It will happen either in the next couple months or next couple years simply because humanity is pouring so much research into this area there is no way it won't progress.
For a good analogy you just need to look at self driving cars. HN used to be loaded with people saying it was a shit venture and totally useless and no progress has been made.... well now I regularly take waymo cars everywhere. Investors were wrong about crypto, but they weren't wrong about self driving.
I would say the HN crowd is just as stupid if not more stupid than investors.
Not really. LLMs will still happily hallucinate and even provide "sources" for their claims, and when you check the sources, they often don't even exist.
So they will even hallucinate the sources to justify their hallucinated claims. LOL.
The difference is the arrogance. Developers think they know more. Developers think they're smart. And also there's an existential crisis where LLMs are poised to take over developer jobs first. So the developer calls every other layman an idiot and deludes himself into thinking his skills will always be superior to AI.
That's why they hated it. Approving every change is the most frustrating way of using these tools.
I genuinely think that one of the biggest differences between people who enjoy coding agents and people who hate them is whether or not they run in YOLO mode (aka --dangerously-skip-permissions). YOLO mode feels like a whole different product.
I get the desire not to do that because you want to verify everything they do, but you can still do that by reviewing the code later on without the pain of step-by-step approvals.
I found that Claude likes to leave some real gems in there if you get lazy and don't check. Gently sprinkled in between 100 lines of otherwise fine looking code that sows doubt into all of the other lines it's written. Sometimes it makes a horrific architectural decision and if it doesn't get caught right there it's catastrophic for the rest of the session.
For simple scripts and simple self-contained problems, fully agentic YOLO mostly works, but as soon as it's an existing codebase or plans get more complex, I find I have to handhold Claude a lot more, and if I leave it to its own devices I find things later. I have also found that if I have it update the plan with what it did AND afterwards review the plan, it will still find deviations in the codebase.
Like the other day I had in the plan to refactor something due to data model changes, specifying very clearly this was an intentional breaking change (greenfield project under development), and it left behind all the existing code to preserve backwards compatibility, and actually it had many code contortions to make that happen, so much so I had to redo the whole thing.
Sometimes it does feel like Anthropic turns the intelligence up or down (I always run Opus with high reasoning), but sometimes it seems it's just the nature of things: it is not deterministic, and sometimes it will just go off and do what it thinks is best whether or not you prompt it not to (if you ask it later why it did that, it will apologize with some variation of "well, it made sense at the time").
It's a well-known truth in software development that programmers hate having to maintain code written by someone else. We see all the ways in which they wrote terrible code, that we obviously would never write. (In turn, the programmers after us will do the same thing to our code.)
Having to get into the mindset of the person writing the code is difficult and tiring, but it's necessary in order to realise why they wrote things the way they did - which in turn helps you understand the problems they were solving, and why the code they wrote actually isn't as terrible in context as it looked at first glance.
I think it makes sense that this would also apply to the use of generative AI when programming - reviewing the entire codebase after it's already been written is probably more error-prone and difficult than following along with each individual step that went into it, especially when you consider that there's no singular "mindset" you can really identify from AI-generated output. That code could have come from anywhere...
It would be better if an LLM coding harness just helped you set up a proper sandbox for itself (containers, VMs etc.) and then run inside the isolated environment unconstrained.
In setup mode, the only tool accessible to the agent should be running shell scripts, and each script should be reviewed before running.
Inside an isolated environment, there should be no permission system at all.
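A rough sketch of what that setup could look like today with plain git + Docker (the image, branch name, and commands below are placeholders for whatever your stack and agent CLI actually need):

```shell
# 1. Give the agent a copy of the repo, not your live checkout.
git worktree add /tmp/agent-run -b agent/experiment

# 2. Throwaway container with the copy mounted; it dies when the session ends.
#    Pick an image with your project's toolchain. The agent will likely need
#    network access to reach its model API, so scope it rather than cut it.
docker run --rm -it -v /tmp/agent-run:/work -w /work node:22-bookworm bash

# 3. Back on the host, review what it did before anything merges.
git -C /tmp/agent-run diff
```

The point is that inside the container there's nothing left to protect, so no permission prompts are needed; the review happens once, at the boundary.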
Especially when the harness loop works if you let it work. First pass might have syntax issues. The loop will catch it, edit the file, and the next thing pops up. Linter issues. Runtime issues. And so on. Approving every small edit and reading it might lead to frustrations that aren't there if you just look at the final product (that's what you care about, anyway).
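That loop (syntax error, then lint error, then runtime error, each fed back until green) can be sketched in a few lines of Python. Everything here is illustrative: `app.py`, the check commands, and `agent_fix` (the "hand the failure back to the model" step) are placeholders:

```python
import subprocess

def run_checks():
    """Run the project's checks in order; commands are illustrative."""
    for cmd in (
        ["python", "-m", "py_compile", "app.py"],  # syntax
        ["ruff", "check", "app.py"],               # lint
        ["pytest", "-q"],                          # tests / runtime
    ):
        try:
            proc = subprocess.run(cmd, capture_output=True, text=True)
        except FileNotFoundError:
            return False, f"tool not installed: {cmd[0]}"
        if proc.returncode != 0:
            return False, proc.stdout + proc.stderr
    return True, ""

def agent_fix(error_log):
    """Hypothetical stand-in for feeding the failure back to the model."""
    ...

# Keep feeding failures back until checks pass (capped so it can't spin forever).
for attempt in range(10):
    ok, log = run_checks()
    if ok:
        break
    agent_fix(log)
```

Approving every intermediate edit interrupts exactly this loop; letting it run and reviewing the final diff does not.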
If you launch it in YOLO mode in a separate branch in a separate worktree (or, preferably, in total isolation), you can instead spend time reviewing changes from previous tasks or refining requirements for new tasks.
With auto-accept edits plus a decent allowlist for common commands you know are safe, the permission prompts you still get are much more tolerable. This does prevent you from using too many parallel agents at a time, since you do have to keep an eye on them, but I am skeptical of people using more than 3-5 anyway. Or at least, I'm sure there is work amenable to many agents but I don't think most software engineering is like that.
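For Claude Code specifically, that allowlist can live in the project's `.claude/settings.json`. The patterns below are examples only - check the current permissions docs for the exact matching semantics:

```json
{
  "permissions": {
    "allow": [
      "Bash(npm run lint)",
      "Bash(npm run test:*)",
      "Bash(git diff:*)"
    ],
    "deny": [
      "Bash(curl:*)",
      "Read(.env)"
    ]
  }
}
```

Checking the file into the repo means the whole team gets the same safe defaults, and anything not matched still prompts.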
All that said, I am reaching the point where I'm ready to try running CC in a VM so I can go full YOLO.
This again showed me that I can't go in YOLO mode. Things like this are disastrous if left to fester in a codebase.
It came up with a plan and I tried it.
No need to get out the chisel to carve those intricate designs in your chair back. We can just get that made by pressing "1". Sorry, those of you who took pride in chiseling.
I'm definitely in the latter group. I can and do use AI to build things, but it's pretty dull for me.
I've spent hours and hours putting together a TUI window system by hand recently (on my own time) that Claude could have made in minutes. I rewrote it a number of times, learning new things each time. There's a dual goal there: learn things and make a thing.
Times change, certainly. Glad to be in semi-retirement where I still get to hand carve software.
He doesn't have our baggage. He doesn't feel the anxiety the purists feel.
He just pipes all errors right back into his task flow. He does periodic refactoring. He tests everything and also refactors the tests. He does automated penetration testing.
There are great tools for everything he does and they are improving at breakneck speeds.
He creates stuff that is levels above what I ever made and I spent years building it.
I accepted months ago: adapt or die.
How is that measured? Is his stuff maintainable? Is it fast? Are good architectural decisions baked in that won't prevent him from adding a critical new feature?
I don't understand where this masochism comes from. I'm a software developer, I'm an intelligent and flexible person. The LLM jockey might be the same kind of person, but I have years of actual development experience and NOTHING preventing me from stepping down to that level and doing the same thing, starting tomorrow. I've built some nice and complicated stuff in my life, I'm perfectly capable of running a LLM in a loop. Most of the stuff that people like to call prompt/agentic/frontier or whatever engineering is ridiculously simple, and the only reason I'm not spending much time on it is that I don't think it leads to the kind of results my employer expects from me.
There is plenty of code that requires proof of correctness and solid guarantees, like in aviation or space and so on. Torvalds in a recent interview mentioned how little of the code he gets is generated, despite kernel code being readily available to train on.
Yeah I dread the software landscape in 10 years, when people will have generated terabytes of unmaintainable slop code that I need to fix.
“He automated his job so well the company doesn’t need him anymore.”
But the next step for many is championing acceptance, e.g. "that the same kind of success is available outside the world of highly structured language". It actually is visible when you engage with people. I'm going through this transition myself.
I mean, if anything, I would expect it to help bring structure to medicine, which is an often sloppy profession killing somewhere between tens of thousands and hundreds of thousands of people a year through mistakes and out of date practices.
Medicine is currently very subjective. As a scientific field in the realm of the physical sciences, it shouldn't be.
Just basic stuff like smart dictation that listens to the conversation the practitioner is having and auto-creates the medical notes, letters, prescriptions, etc., saving them the time and effort of typing it all up themselves. They were saying that obviously they have to check everything, but it was (and I quote) "scarily perfectly accurate". Freeing up a bunch of their time to actually be with the patient and not have to spend time typing.
I was building a tool to do exploratory data analysis. The data is manufacturing stuff (data from tens of factories, with low-level sensor data, human enrichments, all the way up to pre-aggregated OEE performance KPIs). I didn't even have to give it any documentation on how the factories work - it just knew from the data what it was dealing with, and it is very accurate to the extent I can evaluate. People who actually know the domain are raving about it.
A programmer who is not delighted by programming cannot be very good at it. So the people who are "delighted" by using an LLM are exactly the people who should not be using it.
It would be like putting a person who doesn't know how to drive in the driving seat of a semi-autonomous vehicle.
I'm able to pay rent just fine without one...
Is that really the future you're delighted about? Or are you just so shortsighted that you don't realize that an inevitable outcome of mass unemployment is that the masses will eventually snap, and it will get seriously ugly?
If that's not delusional thinking I don't know what is.