Basically, it's a company that's not sustainable for two separate reasons. The first one is that they have an extremely high overhead. SG&A of 55% is really bad. The seconds reason is that their R&D costs are truly astronomical. They could probably cut those costs to some extent, but they're not going to cut them to nothing. They're already losing ground to Anthropic even with this much R&D.
To put it differently, even if OpenAI cut its R&D and inference costs by half, they would still be leaking money like a sieve.
Gemini is number 3 in this race
Now you can’t really find taxis anywhere, even at airports it’s a lot more difficult than it used to be.
Once the taxi business was disrupted enough, Uber’s pricing skyrocketed and customers had basically no other options for competition on pricing.
OpenAI basically created a new market. There is no AI chatbot incumbent to disrupt and swallow.
Again like in the Amazon analogy, I don't think they're done growing, and unfortunately, I think they've positioned themselves (perhaps intentionally) as too big to fail, and need to continue growth at all costs.
I'm glad I'm not OAI's CFO sounds like a stressful job trying to justify/account for whatever Sam says to the board, or whatever the board demands. Sam hasn't said hardly anything since about February so I'm guessing the CFO simply bends to the will of the board these days. But that's speculation.
Anthropic is also likely losing money, right?
Revenue is still growing faster than costs and gross margins have continued to improve.
The real question is when they can start spending less on R&D and still compete.
If you're building a model that lasts a few months before it's no longer the most current one, and maybe a year before it's completely unusable by anybody, then that should just be COGS.
Doing that, however, would betray the real problem with this business model.
Unless we are genuinely pushing to find AGI, at which point nothing matters, LLMs in their current form don't replace knowledge workers but are an effective force multiplier. How good is enough?
For instance, I pay about $1-2 a month for DeepSeek. It's not as sophisticated as Claude, but it still doubles my productivity as a SWE.
If Fable comes out and demands 50x the price of DeepSeek in order for Anthropic to make a profit on it, how much more productive would I be compared to my personal experience + DeepSeek? 3x? 50x?
Is it cost effective for a business to hire someone without SWE experience + Fable verses hiring someone with SWE experience and DeepSeek? When does R&D hit diminishing returns?
There's always working on improving the cost of inference, but I don't think this is an area of R&D that will slow down. The reason is:
1. A better competitor model risks eating away at how much they can charge for inference (i.e. revenue) 2. Whoever unlocks AGI will unlock even more growth 3. Even when you unlock AGI, you'll want to throw gobs of money at it to improve itself and all sorts of things.
> If Fable comes out and demands 50x the price of DeepSeek in order for Anthropic to make a profit on it, how much more productive would I be compared to my personal experience + DeepSeek? 3x? 50x?
You're pricing it wrong and looking at it wrong. First, the per token price doesn't consider that a smarter model can end up using fewer tokens overall to achieve a result. Secondly, if the difference is between failing to accomplish the task and accomplishing the task, suddenly that 50x can seem like a bargain.
> Is it cost effective for a business to hire someone without SWE experience + Fable verses hiring someone with SWE experience and DeepSeek? When does R&D hit diminishing returns?
At this time, someone without SWE experience + <name AI model> vs someone good with SWE experience and <name another AI model> is a no-brainer. The AI model is an accelerant but the "no SWE experience" will be accelerated into a wall. Now maybe that doesn't matter for prototyping and certain other things, but anything in production the lack of experience will hurt them with things they won't even know about or even know how to look for it (e.g. slow, insecure, etc).
If you're in the US and you're making 100k a year, that's worth 5k or $416/m. So you can buy two of the most expensive plans on the frontier models.
This focus on cost optimization is insane. Just use the frontier models. Even a marginal bump is worth whatever the hell they're charging, at least for now.
If all construction workers can build houses 5% more efficiently, that's not the same as nothing changing. Depending on supply and demand, it means 5% more houses are built, or houses are 5% cheaper, or maybe 5% bigger, or some combination. Whether or not the construction workers all get a raise or 5% get fired (or both) depends on that supply and demand, but historically they often get a piece of the growing economic pie.
Also where is the evidence that the workers have ever benefited from productivity bumps? The only thing that happens is surplus gets captured by the owners while workers are forced to do more.
Bad deal all around.
Deepseek shines for personal usage because it's possible to use it however you want and whenever you want with no session/weekly limits stress because you use the API and it's priced very reasonably.
I think the third coming out Jesus Christ in closer than AGI. Seriously, I dread how much of Silicon Valley is wrapped in this narrative of AGI and Singularity.
How can all these "rationalists" fail to see that this is what religion looks like: Faith and promises of heaven and hell.
There's a non-negligible percentage of the industry who have a pseudo-religious belief in AGI, so I wouldn't be surprised if that was, in fact, the goal.
Who knows, maybe they'll stop once the money dries up.
If they manage to keep those customers for several years without more sales, that bit looks like a normal "high-touch" business.
They shouldn't look like a "high-touch" business, but their unitary numbers look way better than I expected. They just need to grow some 10 times to star making a profit... Maybe 100 to cover the opportunity cost of their capital.
It's just a matter of finding 5 billion people willing to pay US prices :)
But it is still better than I expected.
It's getting businesses to pay $2k/mo or more per professional employee, like a lot of Anthropic customers.
Anthropic is ahead of them there, but that is how they win.
If they're the only ones who ̶a̶r̶e̶ ̶w̶i̶l̶l̶i̶n̶g̶ ̶t̶o̶ ̶b̶e̶ ̶t̶h̶e̶ ̶e̶n̶g̶i̶n̶e̶ ̶f̶o̶r̶ ̶a̶u̶t̶o̶n̶o̶m̶o̶u̶s̶ ̶k̶i̶l̶l̶b̶o̶t̶s̶ can draw a reciprocation dingle-arm to reduce soinosoidal repleneration, then "I'm sure the government will buy it" [0]
Isn't Anthropic currently killing that market though? I've been hearing about a lot of businesses pulling back after having experienced the reality.
This is how you know ads are inevitable. YouTube is probably a good indicator of how BigLabs will operate for free users.
There’s also the difficulty of proving to the advertising clients that the advertising actually takes place (and how much of it), if it is covert.
It'll be like Facebook; they're not losing money but it's awful to use
With so many free models available the ai companies are going to struggle to convert active free users to paid.
the biggest reason for this is that the digital ad market is a duopoly (charitably a triopoly if you count Amazon in), if all of the LLM companies start to go into ads that's going to be a much more competitive market for ad buyers. It's not going to be so straight forward when both customers and merchants have ten different places to go.
Also not to forget that ChatGPT has zero moat, unlike social Facebook and Google.
I think that AI is going to become just another utility people pay to stay relevant. Same as their internet, electricity or gas.
I'm guessing that might be so in certain professions, but I would expect the employer to pay for that. For the rest of us, it seems unlikely. At least for me, I don't have a need of a device to generate text for me. And I bet most people are are in the same boat as me.
But: how are they calculating the cost of revenue? Do they have rapidly depreciating assets that are also needed to produce that revenue? (Starlink has this issue.) Will their cost per arithmetic operation for inference rise or fall? (Anthropic is paying xAI an absolutely insane amount to lease GPUs. They must be betting that they will not need to repeat that.) Is a large portion of the cost allocated to R&D actually being used to support their revenue?
I certainly believe that the cost of inference can be plenty low for them to make a profit, but a more granular breakdown would make it easier to evaluate.
R&D costs are hurting profit side and while you can cut that one just becomes irrelevant overnight in this space if you do, hence the problem.
You're stuck racing against your competitors with the distinct possibility that your R&D costs will outgrow the market demand, and you can't stop because otherwise your customers will stop investing in your dead end tech and switch.
If it's not materials, not energy or taxes, not manufacturing, not licensing or rental fees, then I can only think of R&D.
Unless these frontier providers feel some type of squeeze or constraint the Chinese are well positioned to leave the US bag holders of an NVidia bound system. And if anyone has to wonder how one provider for a critical piece of infrastructure will go, well...
Would love to hear some details on that one...
Or was that a typo and you meant the $200/mo plan instead maybe? That one I could believe, assuming no or frugal subagent use that is.
> Interesting. I'm mostly using Claude, so perhaps I'm not nearing the limits, but I do use Codex (for coding and reviews occasionally) and use chatgpt for second opinion many times, including "pro" research. Never got to my limits. But again, not my main go to tool.
And then the reality turns out not to be the case - you have to continuously spend on R&D to avoid getting your lunch eaten by someone else.
This isn't a social media network with lockin either. People can and will just switch to whatever whenever they feel like it. Maybe it becomes a defacto standard like google but if someone is much better than you, well...
The problem is you can't just separate training costs from inference costs. If OpenAI just didn't train a new model for the next five years, sure, they'd do OK. Assuming all those dirt cheap Chinese models nipping at their heels don't make up the gap while OpenAI is resting on their laurels.
Without being a frontier model (read: continuous, incredibly expensive training), they effectively don't have much to sell. So inference and training costs are intertwined to some extent.
And the network effect which ruled for the last 20 years seems to have relaxed its death grip just a bit (of course it is still there as having more customers using your tools and models provides more training data, etc., yet the current network effect doesn't seem to have that high exponential value like before)
It will become profitable. Local models and local on-laptop inference will get good enough. This argument has been made for decades. It's not like everyone is walking around hosting email and photos on their personal machines. Sometimes it takes a large investment to make servers and clouds for this stuff possible.
We need to get away from this idea that in order for one thing to succeed, the other must fail. We also need to stop thinking in binary and accept that all these things (profitability, local models, powerful laptops, etc.) can all happily coexist.
That latest drug for pancreatic cancer? Yeah, all human. After the trillions already spent, AI hasn't come up with any new medications, no new inventions to save lives... Nothing
It's already being used in the medical field in many different ways, and I believe it will be able to fold new proteins to help make new drugs. It's coming.
Just because people are using it doesn't mean it's a net good. Lots of people use social media and that's just rotting brains and making people far more polarized than they ever were before.
It's use in medicine hasn't resulted in anything meaningful. Nobody's medical bill has gotten cheaper and nobody has lived longer or healthier because of anything AI did.
This is not happening in a vacuum. A lot of index funds and retirement accounts have bought into AI and AI adjacent companies, many with stakes in OpenAI. If OpenAI keels over, even when private, it will affect a lot of americans. If they IPO, it's even worse.
Some of my coworkers even use Sonnet (the default in Claude Code for the 20 USD subscription) and see no reason to change even though that model is definitely "outdated" compared to current SOTA.
I was watching a World Cup match last week and one of the TV ads during half time was something to the tune of ChatGPT being used by kids to improve their street soccer skills. This was Brazilian TV. Anyone even remotely familiar with Brazil would find this ad deeply, thoroughly out of touch. I can't think of a worse chatbot pitch than that.
Anyway: Zero, as of right now.
I fully expect to be able to run useful LLMs on a machine I can justify buying for other reasons. I already can on the secondhand kit I own, and I don’t expect the cost-benefit analysis of local LLMs to ever really get worse.
If I ever need to pay for it, it will likely be to shift some of the capacity into the cloud for either business or pragmatic personal reasons (so I can just carry an iPad etc.)
I fully intend my expenditure to be negligible. Because once one realises that outspending others is impossible, only spending minimisation makes sense.
I foresee it potentially making sense for me to move some mature tools off a local LLM to openrouter, maybe. But probably to the same or similar models.
AI is so important, I want to have it under my control. Even if I have to pay a penalty in terms of capabilities.
Maybe it’s just your phrasing but people will only pay for what works, no one is loony enough to support a trillion dollar industry out of the kindness of their heart or spirit of innovation
When I bought my last GPU, running AI models locally was a consideration though not the only one, and I have it set up but haven't used it much yet. I mostly use the free tiers of ChatGPT or Google to write the occasional script for me. I guess they're going to have to inject a truly unfathomable number of ads to get their money's worth.
I have a feeling my experience is closer to an average persons' than a dev, but it doesn't seem like they'll be able to monetize just from devs even if each one is spending thousands a month.
Don't give up just keep trying you can truly build personally life changing things. Don't look at it purely from a how do I sell this lense, just empower yourself with these tools while the getting is good
If AI allows me to cut my time to do something in half on average or allows me to do 2x more it would be worth it to pay up to what my monthly income was before assuming my income scaled with my output.
I spend 30 - 60 bucks a year with Horizon Labs.
I spend 25 bucks a month on Cursor. Cursor replaced an OpenAI sub.
Both support hobby projects. If either cost increased I would spend some time testing local alternatives and probably drop them.
Horizon Labs especially, I know that they have been matched by open models and are mostly a convenience at this point.
For work, it depends, but if I have to spend more than a few hundreds bucks probably I'll start looking for alternatives (local models, Chinese providers, ecc)
PS: I'm in Italy, I guess in several parts of the world these figures are even smaller.
If I were really forced to.
LLMs provide me about the same value as a car does.
We have benchmarks on our domain and it does there are models that are 2x to 10x cheaper for a small drop in percentage points in accuracy
It may put me at a disadvantage when it comes to quickly slop something together? But so far the free-to-use chat bots do as well for my needs.
Glad to see more sane takes in the comments. All these articles on their current financials are missing the point.
OpenAI is doing pretty well.
Capital expenditure is required to deliver on 1) better models 2) better infra and 3) better products. Insane CapEx is required to do all the above + compete with Google, Meta, Microsoft, Apple, Anthropic, etc. etc. etc. who are all trying to do the same. These financials are sane, considering the scenario.
Luckily, we've seen this before. Doom and gloom when smartphones came out. And then the same again when mobile development was preferred and there was an outcry from the web dev crowd and constant downvoting of phone apps.
This is because people here are quietly realizing that they fell for the "token-maxxing" marketing drive which was complete BS for you to gamble more money on tokens as the big AI labs gave heavily subsidized token prices they cannot afford.
Jevon's paradox does not exist at those companies, but it certainly exists at the Chinese AI Labs at Deepseek, Alibaba, z.AI and Xiaomi.
Good callout. All these "trends" in AI were definitely from the AI companies themselves in order to push the sales of more tokens. What's after agent orchestration? Whatever it is, it will involve a big spend.
I don't like these products. I have several negative opinions on them. To the extent they work and there is a customer base what marketing could you /possibly/ be engaged in? Doesn't the product sort of market itself? Or another way is this a product that you can market to expand your MAUs?
It's so polarizing I can't imagine how that $5.7B is being spent.
There's perhaps a metaphor or two lurking about bait and switch tactics.
In every way imaginable and then more, looks like beyond the imagination :)
>I don't like these products. I have several negative opinions on them.
You're not alone, and the crowd seems to be building at the same time enthusiasts are proliferating too.
So much widespread negativity I would guess that's about what it's expected to cost to fully overcome resistance and objections. Which must be bigger than we think, they sure have more information than us.
AI companies are black holes for money the way delivery companies are (or were, considering the money people are willing to pay these days).
Most of them will disappear alongside the money people have bet on them.
"Exclusive: OpenAI Losses Increased Nearly 8X in 2025, With Spending Hitting $34 Billion" https://news.ycombinator.com/item?id=48550465 (188 points, 2 days ago, 108 comments)
https://www.wheresyoured.at/exclusive-openai-financials/
Please stop posting Ars. It's just blogspam, sad as that is. We need to let it go.
The whole point of the company is that they are investing a huge amount of money upfront in order to make models that are better and better, and thus have a higher productivity multiplier.
They are very profitable on inference, they just know that the race to AGI requires a huge amount of investment, compute, getting the best researchers, etc.
Look, for coding and a lot of other things, AI is awesome.
But the here's the killer. I have a dinky 16gb VRAM card, and that's kind of the sweet spot for the level of AI I actually want. I don't want it doing too much, I'd rather create slowly than have it one shot something that I have to then pore over later.
Feels like a company investing kazillions in, i don't know, air-conditioning or building wi-fi. Yes, it's going to be around, and also no one's gonna need THAT MUCH.
I feel like the 1080 ti is like a prophet of the current crisis, these companies are buying $10k paperweights per user to MAYBE... LUCKILY... charge what... $200 a year? and that is for every 1/100 users.
this same 10k hardware will be outdated in a couple of years...
It just doesn't make financial sense, if you couldn't sell standalone GPUs that people PAID for with HBM in them, what makes you think that you can sell a POSSIBLE subscription utilizing a $10k+ GPU?
This is the most obvious bubble of all time.