54 pointsby gmays8 hours ago33 comments
  • bottlepalm7 hours ago
    I don't think vibe coders know the difference, but often when I ask AI to add a feature to a large code base, I already know how I'd do it myself, and the answer that Claude comes up with is more often the one I would have done. Codex and Gemini have burned me too many times, and I keep going back to Claude. I trust it's judgement. Anthropic models have always been a step above OpenAI and Google, even 2 years ago it was like that so it must be something fundamental.
    • tracker12 hours ago
      I'm there with you, but only been using it a couple months now. I find that as long as I spend a fair amount of time with Claude specifying the work before starting the work, it tends to go really well. I have a general approach on how I want to run/build the software in development and it goes pretty smoothly with Claude. I do have to review what it does and sanity check things... I've tended to find bugs where I expect to see bugs, just from experience.

      I keep using the analogy of working with a disconnected overseas dev team over email... since I've had to do this before. The difference is turn around in minutes instead of the next day.

      On a current project, I just have it keep expanding on the TODO.md as working through the details... I'd say it's going well so far... Deno driver for MS-SQL using a Rust+FFI library. Still have some sanity checks around pooling, and need to test a couple windows only features (SSPI/Windows Auth and FILESTREAM) in a Windows environment, and I'll be ready to publish... About 3-4 hours of initial planning, 3 hours of initial iteration, then another 1:1:1:1 hours of planning/iteration working through features, etc.

      Aside, I have noticed a few times a day, particularly west coast afternoon and early evening, the entire system seems to go 1/3 the speed... I'm guessing it's the biggest load on Anthropic's network as a whole.

    • dandiep7 hours ago
      For me, Codex does well at pure-coding based tasks, but the moment it involves product judgement, design, or writing – which a lot of my tasks do – I need to pull in Claude. It is like Claude is trained on product management and design, not just coding.
    • quaintdev7 hours ago
      Claude is good with code but I've found gemini is good for researching topics.
    • geldedus4 hours ago
      The title is about developers, not vibe coders (no, it is not the same thing)
    • colechristensen7 hours ago
      Codex and Gemini don't do as good a job or can't do what I ask them.

      The complexity of a project vs. getting lost and confused metric, Claude does a lot better than every time I've tried something else, that's it.

  • mrdependable7 hours ago
    I use Claude for a few reasons.

    1) I don't want to give OpenAI my money. I don't like how they are spending so much money to shape politics to benefit them. That seems to fly in the face of this being a public benefit. If you have to spend money like that because you're afraid of what the public will do, what does that say?

    2) I like how Claude just gives me straight text on one side, examples on the other, and nothing else. ChatGPT and Gemini tend to go overboard with tables, lists, emojis, etc. I can't stand it.

    3) A lot of technical online conversation seems to have been hollowed out in recent years. The amount of people making blog posts explaining how to use something new has basically tanked.

    • daxfohl2 hours ago
      Wow, I'd always considered claude more of a software tool and never really gave it a chance at regular chat, but yeah after one session I think I'm a convert for exactly #2.

      I'm fine with charts, but ChatGPT is so long-winded and redundant. "When would I use such-and-such pattern?" "That's exactly the right question to ask! ... What you're really asking ... Why that's interesting ... Why some people find it critical ... Option 1 ... Option 2 ... Consideration ... Table comparing to so-and-so ... The deep reason ... What it all boils down to ... The one-line answer (tight!) ... The next thing you need to know ... I can also draw a useless picture for you. Would you like me to do that?"

      • tracker12 hours ago
        I like Claude a lot for general and technical questions... I like xAI for some of that as well, especially for very current event summaries.

        I will say with Claude that I often have to make sure to give it the context of a question for it to give me answers I'm looking for. I find in general it does better when you include the why's behind a decision or question.

  • sidrag227 hours ago
    There is also the very lame auto win category that i happen to fall into...

    I dont trust openai, or google. google has beyond proven that they aren't trustworthy well before the LLM coding tool era. I am legitimately not even giving them a chance.

    Sadly I am assuming anthropic will at some point lose my trust, but for now they just feel like the obvious choice for me.

    So obviously i am a terrible overall observer, but i am sure i am not alone in the auto win portion of devs choosing anthropic.

    • nozzlegear6 hours ago
      That was exactly why I had been a paying Anthropic customer as well – I trusted them more than I trusted OpenAI or Google. But I canceled my subscription this morning after the news that they've ditched their core safety promise [†], and they look likely to fold to the Pentagon's demands on autonomous weapons/surveillance as well.

      [†] https://www.cnn.com/2026/02/25/tech/anthropic-safety-policy-...

    • azinman27 hours ago
      I believe anthropic is the only one that lets you opt out of training based on your chats for the developer subscription plans? Is that right?
    • tracker12 hours ago
      The sometimes hot garbage I get from the AI results for technical questions in the past year or so has me not even considering them from the start... I've tried github copilot (whatever the default engine is) and OpenAI and just found it annoying. Claude is the first one that I've felt was more productive than annoying and I just started using it.
  • ChrisMarshallNY7 hours ago
    I've been using ChatGPT (Thinking). I like how it has learned how I do stuff, and keeps that in mind. Yesterday, I asked it to design an API, and it referenced a file I had sent in, for a different server, days earlier, in order to figure out what to do.

    I'm not using it in the same way that many folks do. Maybe if I get to that point, I'll prefer Claude, but for my workflow, ChatGPT has been ideal.

    I guess the best part, is that it seems to be the absolute best, at interpreting my requirements; including accounting for my human error.

    • ryoshu7 hours ago
      Oof. I turned the history referencing off. I use ChatGPT for wildly diverging topics and it will bring things up that have zero relevance to what I'm currently looking for if history is on.
      • ChrisMarshallNY7 hours ago
        Yup. I can see that being a problem. My use is pretty linear, so that isn't an issue. I think that you might be able to establish different "contexts," though I haven't tried.

        I will see whether or not I can “wipe the slate clean,” between projects.

        I would also NEVER use it for any confidential/proprietary stuff that can’t leak.

        The project I’m working on is not open-source, but we don’t really care too much, if it leaks. I don’t send in any credentials; just source.

        Right now, it’s helping me design tests in Postman, to validate the API.

    • el_benhameen7 hours ago
      I like this feature and rely on it too. I get that some people hate it and that it can make some pretty insidious mistakes when it uses it, but I’ve found it valuable for providing implicit context when I have multiple queries for the same project.

      Worth noting that Claude also has a memory feature and uses it intelligently like this, sometimes more thoughtfully than cgpt does (fewer “out of left field” associations, smoother integration).

    • tracker12 hours ago
      The past few weeks, Claude has started doing that as well... ie recognizing my preference to use Deno for scripting or React+mui when scaffolding a ui around something.

      I've been using the browser/desktop for planning sessions on pieces of a larger project I'm putting together and it's been connecting the dots in unexpected ways from the other conversations.

      I think the one disappointment is that I can't seem to resume a conversation from the web/desktop interface to the code interface... I have to have it generate a zip I can extract then work from.

  • ChadMoran7 hours ago
    Model aside, the harness of Claude Code is just a much better experience. Agent teams, liberal use of tasks and small other ergonomics make it a better dev tool for me.
    • dalenw7 hours ago
      I've heard a lot of people prefer OpenCode to Claude Code, myself included. Having tried both, I find myself having a much better time in OpenCode. Have you tried it?

      I'll admit it lacks on the agent teams side but I tend to use AI sparingly compared to others on my team.

      • user81927 hours ago
        I’ve been using Claude Code for about six months and evaluated OpenCode on my Windows work laptop a few weeks ago. Found 3 dealbreakers that sent me back:

        1. No clipboard image paste. In Claude Code I constantly paste screenshots – a broken layout, an error dialog, a hand-drawn schema – and just say “fix this.” OpenCode on Windows Terminal can’t do that without hacky workarounds (save to file, drag-and-drop, helper scripts). I honestly don’t understand how people iterate on UI without this. 2. Ctrl+C kills the process instead of copying. And you can’t resume with --continue either, so an accidental Ctrl+C means losing your session context. Yes, I know about Ctrl+Ins/Shift+Ins, but muscle memory is muscle memory. I also frequently need to select a specific line from the output and say “this part is wrong, do it differently” – that workflow becomes painful. 3. No step-by-step approval for individual edits. Claude Code’s manual edit mode lets me review and approve each change before it’s applied. When I need tight control over implementation details, I can catch issues early and redirect on the spot. OpenCode doesn’t have an equivalent.

        All three might sound minor in isolation, but together they define my daily workflow. OpenCode is impressive for how fast it’s moving, but for my Windows-based workflow it’s just not there yet.

  • pinkmuffinere7 hours ago
    > Half their agentic usage is coding. When that's your reality, you train for it. You optimize the tool use, the file editing, the multi-step workflows - because that's what your paying users are actually doing. Google doesn't have that same pressure.

    I wonder if this is a strategic choice — anthropic has decided to go after the developers, a motivated but limited market. Whereas the general populace might be more attracted to improved search tools, allowing Google/openai/etc to capture that larger market

    • bonoboTP7 hours ago
      They are heavily dogfooding. Coding is needed to orchestrate the training of the next Claude model, data processing, RL environments, evals, scaffolding, UI, APIs, automated experiments, cluster management, etc etc. This allows them to get the next model faster and then get the next one etc.

      Making a model that's great at other kinds of knowledge/office work is coincidental, it doesn't feed back directly into improving the model.

    • WarmWash7 hours ago
      It's more likely that anthropic feels that if they can crack just programming, then their agents can rapidly do the legwork of surpassing the other labs.
      • pinkmuffinere7 hours ago
        Hmm that makes some sense to me as well, like buying eco upgrades early in an rts game. But that assumes that better programming leads to more-generally-competent AI, which I think is tenuous. The thing that sparked this recent AI summer was not better programming, so why would better programming lead to the next/ongoing big improvements?
    • sjsjzbbz6 hours ago
      They’re doing a lot of dev hostile stuff:

      - limiting model access when not using claude code

      - claude code is a poorly made product. inefficient, buggy, etc. it shows they don’t read the code

      - thousands of open GitHub issues, regressions introduced constantly

      - dev hostile changes like the recent change to hide what the agent is actually doing

      However, they are very good at marketing and hype. I’d recommend everyone give pi or opencode a try. My guess is anthropic actually wants vibe coders (a much broader market).

    • j2kun7 hours ago
      > Google doesn't have that same pressure.

      I doubt it. Gemini is heavily used internally for coding with integrations across Google's developer tooling. gemini-cli is not meaningfully different from claude code.

  • anonzzzies6 hours ago
    Gemini is supposed to have this huge context; Gemini cli (paid) often forgets by the next prompt whatever the previous was about and starts doing something completely different , often switching natural or programming language. I use codex and with 5.3 it is better but not there compared to cc for us anyway; it just goes looking for stuff, draws the most bizarre conclusions and ends up lost quite often doing the wrong things. Mistral works quite well on smaller issues. Cerebras gml rocks on quick analysis; if it had more token allowance and less rate limiting , it would probably be what I would use all the time; unfortunately, on a large project, I hit a 24 hour block in less than an hour of coding. It does do a LOT in that time of course because of its bizarre speed.
  • mark_l_watson7 hours ago
    Could it be tooling like Claude Code? I just used Claude Code with qwen3.5:35b running locally to track down two obscure bugs in new Common Lisp code I wrote yesterday.
    • genghisjahn7 hours ago
      I use Claude Code as an orchestrator and have the agents use different models:

        product-designer   ollama-cloud / qwen3.5:cloud
        pm                 ollama-cloud / glm-5:cloud
        test-writer        claude-code  / Sonnet 4.6
        backend-builder    claude-code  / Opus 4.6
        frontend-builder   claude-code  / Opus 4.6
        code-reviewer      codex-cli    / gpt-5.1-codex-mini
        git-committer      ollama-cloud / minimax-m2.5:cloud
      
      I use ollama pro $20/month and OpenAI $20/month. I have an Anthropic max plan at $100/month.
      • alexsmirnov2 hours ago
        I do in similar way, connect claude code to litellm router that dispatches model requests to different providers: bedrock, openai, gemini, openrouter and ollama for opensource models. I have special slash command and script that collect information about session, project and observed problems to evaluation dataset. I can re-evaluate prompts and find models that do a job in particular agent faster/cheaper, or use automated prompt optimization to eliminate problems.
      • geor9e7 hours ago
        Is this because Anthropic models are worse at those tasks, or more expensive, or what?
        • genghisjahn7 hours ago
          They are great, but they are expensive. I can run those against the cheaper ollama cloud models for things that are basically requirements gathering and review of a plan. The Product Designer Agent and the Product Manager basically argue for a few rounds and give an artifact that the coding agents pick up.

          It could all easily be anthropic models and would work well, but running this swarm eats up all my anthropic tokens and these other models are good enough for the roles I've given them.

      • mark_l_watson5 hours ago
        very cool!
    • smt887 hours ago
      Qwen seems fine for analysis to me, but Opus 4.6 is far better to use as a sounding board or for writing code
  • theanonymousone7 hours ago
    Claude the model or Claude (Code) the tool? I'm not sure what to think about an article that doesn't make it clear which one they are talking about...
    • geor9e7 hours ago
      They are talking about Claude Code, the terminal app, which uses Opus and Sonnet for models mainly.
  • a11r7 hours ago
    This resonates with my experience. At Morph we use gemini for well specified point coding tasks, and it does very well across millions of lines of code every day. We also use claude code as an engineering tool for our own codebase and it does better at being adaptive and for working on open ended issues.
  • pgm87057 hours ago
    I also have always gone back to Claude after trying new models... until GPT-5.3-Codex, specifically with the new Codex Mac app. I've been pretty much full time with it for a few weeks now and have not missed Claude Code. It can over complicate things at times, but for the most part, it is providing working solutions on first go and following coding patterns that already exist in my app. With Claude, it would frequently knock out a feature with acceptable code quality, but be completely broken and require a round of debugging.

    I'm even getting by without hitting limits on the $20/month plan, whereas I needed to be on the $100/month one with Claude.

  • mowmiatlas5 hours ago
    Well CC is awesome, there's that.

    Codex is awesome too. Opencode is awesome as well. It's so easy to transition from one tool to another especially when one command in project root is what it needs to get up to speed.

    But I actually feel like asking Opus to review Codex and vice versa gives me best results. Opus does push back on some reviews comments, sometimes Codex is overselling a feature, but at least to me it feels like I have more points of control, and different perspective even if I could simulate it with two terminal sessions lol

  • geldedus4 hours ago
    Because Opus 4.6 it is better than any other AI Coder.
  • aquir7 hours ago
    I’m quite happy with the Codex app.
    • elevaet7 hours ago
      I am too, and haven't really given Anthropic's stuff a fair shake as a result, and am so curious if I'm missing out or if it's the same shit different pile.
      • nineteen9992 hours ago
        I fired up Codex yesterday and asked it to do a security review on an UE5 project i'm working on with Claude. It found some things that Claude already knew about. But nothing made me feel confident it could _write_ UE5 C++ code as well as Claude. I guess it's worth a shot testing it on a minor feature, but what other people are saying here gels with my experience as well.
  • rlk207 hours ago
    All developers are in love with that wonderful Claude:

    https://archive.org/details/1950-Tide-Detergent-Ad

  • 7 hours ago
    undefined
  • mosura7 hours ago
    Mistral are quietly far better than all the noise would suggest.
  • istillcantcode7 hours ago
    I prefer Googles. I can only afford the free models. I normally copy and paste my stuff into 4-5 models and compare the responses. Its probably a waste of time, but very mentally satisfying. I mostly program as a form of mental stimulation instead of trying to become a billionaire. Taking this perspective, using AI agents is not really the same experience, and less mentally stimulating than programming.
  • __alexs7 hours ago
    I don't understand quite how Anthropic have managed to get so much mind share for Claude Code given the UX is pretty bad compared to something like Cursor.
  • IAmGraydon7 hours ago
    Developers prefer Claude because that's their brand, a very intentional choice. If you have a very specific use in mind (like coding), you aren't going to go for the jack of all trades, master of none solution. You're going to go for the coding specialist, which Anthropic has squarely positioned themselves as. Props to them for it - they correctly predicted that LLMs can do many things, but perhaps the most valuable is coding as they're very well suited to it due to the rigidly defined syntax and high cost of engineers.
  • sampton7 hours ago
    I started using Codex 5.3. Compare to Opus 4.6, it's more precise in pinning down bugs and more concise with code. Opus can be best described as distracted and easily agreeable. Codex actually digs deeper for root causes and push back when I'm wrong.
    • nineteen9992 hours ago
      Interesting, I started playing a little with Codex yesterday and it did find some bugs Claude already knew about it, and seemed pretty matter of fact about it. I might have to point it at some of the harder bugs and see how it goes.
  • verdverm7 hours ago
    Dev, very happy with Gemini, especially flash

    Googles AI products suck hard though

    • WarmWash7 hours ago
      Antigravity is pretty good, gemini CLI is rough though
      • verdverm7 hours ago
        I really dislike antigravity, I can get a better experience with a vs code extension (I did before anti even came out)

        $2B, best pork ever, it's not even theirs to begin with, so in a lot of senses, it's not a Google product

  • adithyassekhar7 hours ago
    Does anyone know if using claude with opencode violate their new policies?
  • simianwords7 hours ago
    This article is 100% AI generated. I confirmed with pangram.
    • 5 hours ago
      undefined
  • scotty795 hours ago
    > The benchmarks will tell you one thing. The developers who use these tools every day will tell you another. Usually, you should listen to the developers.

    Isn't it a bit like asking horses what car features they like best?

  • hirvi747 hours ago
    I am torn between Claude and GPT. Though it was recently brought to my attention that I use LLMs in an old-fashioned way [1]. I will say that based on my usages, both models seem very comparable in terms of accuracy and quality. Sometimes one might do things a bit different, but both tend to be more similar than different.

    When I am using an LLM for JS, I can't really tell the difference between the two. For C#, I think GPT might produce slightly better quality code, but Claude produces code that seems more modern. I also feel like Claude makes slightly more minor mistakes like forgetting to negate a boolean conditional check.

    With Swift, I have found both models to be surprisingly awful. I am not sure if it is because of some of the more recent changes with Swift versions greater than 6.0, but both seem to produce wild results for me.

    [1] I do not use Codex CLI nor Claude Code nor any IDE plug-ins. I just type questions into the web app and rarely copy/paste anything back and forth.

  • Traubenfuchs7 hours ago
    At this point I completely stopped using anything else.

    Even for vacation questions or psychotherapy, claude is the best, despite complaining about not receiving a coding task (sometimes).

  • gadflyinyoureye7 hours ago
    Use to love Grok Code Fast 1 because it was free on GHCP. I gave it context but just let it churn on a solution. Claude is far better but a finite resource. I think OpenCode plus GPT-4o might be my next step.
    • jkukul7 hours ago
      I hope you're not too invested into GPT-4o because it has been retired so you'll need to use a different model :)
      • gadflyinyoureye7 hours ago
        That's fair. I just use the free dreg models. Unlimited tokens with those.
    • ahofmann7 hours ago
      What is GHCP?
    • ronsor7 hours ago
      GPT-4o is discontinued now
  • TZubiri7 hours ago
    Was it ever confirmed that Anthropic did paid astroturfing? Or is this organic?
    • ghqst7 hours ago
      I think this is organic. I've observed the exact same thing over the last week: I tried Google Antigravity and really liked it when I was using my Claude quota. I ran out of Claude quota and tried Gemini 3.1 Pro and it was comparatively terrible at using the tools provided by the IDE. (but it's a useful model in the browser for chat)
    • scuff3d7 hours ago
      Given my own experiences with LLMs, I'm convinced about half the comments on any given thread are just bots told to hype <insert_product_here>
      • bigbadfeline5 hours ago
        That's about right, my estimate is 50% to 70% depending on case.

        On the other hand, isn't it clear that people will be more productive with whatever agent they have experience with? That's my reason for going with open source agents and the attempts of Big LLM to kill that workflow only reaffirm my decision.

    • peyton7 hours ago
      The article is AI-generated at least.
    • tayo427 hours ago
      Could be both?

      I've been leaning more towards claude. A lot of the LLM tropes seem to be ChatGPT really? I feel like claude doesn't do as much of the overly intense "its this -- not that" pattern and isn't constantly acting like my hype man. Claude code has been nice 90% of the time. I haven't tried to many competitors though.

  • tristor7 hours ago
    I keep using Claude over other models (through Cursor) because the answers it gives and the plans it creates align with how I would personally approach the same problem if I were doing it myself. The other models /might/ produce a better final result as benchmarked/tested, but how they get there feels like complete nonsense to me.
  • boxingdog7 hours ago
    [dead]
  • bogzz7 hours ago
    [flagged]
  • davidguetta7 hours ago
    I prefer ChatGPT because i can ask it to rewrite entire files with minimal changes in the chat (up to 3k lines) and it will do it.

    Every other AI add random opinionated and unwanted stuff