> After a series of productive conversations with the US government, we're redeploying the model with a new set of classifiers to target and block more cybersecurity tasks. In the near term, some routine tasks like coding and debugging will fall back to Opus 4.8.
Edit: the above was from their tweet announcement at https://x.com/AnthropicAI/status/2072163884430229756 ... the associated blog post at https://www.anthropic.com/news/redeploying-fable-5 suggests it was just poorly written and coding can still be done with Fable, just with overeager bouncing of "some routine coding and debugging tasks" to Opus.
> The new classifier also comes at the cost of flagging benign requests more often during routine coding and debugging tasks.
Here's Fable 5, the strongest model. Actually try to use it to harden your code and it turns into Opus 4.8. You have seven days to use it, and only half of that time's worth in actual usage. Enjoy.
Looks like it's going to be a thoroughly frustrating experience, even worse than initial rollout. For subscription users, the situation is almost indistinguishable from the export ban.
https://news.ycombinator.com/item?id=48466313
Just a code review of my own project. Downgraded to Opus 50% of the time while evaluating the critical I/O and memory safety parts, the exact thing I wanted it to do.
And now it's gonna be even worse.
I expect the strong cybersecurity model to help me strengthen the cybersecurity of my project.
> not allowed to cover security topics
They said it wouldn't be usable for offensive purposes. This is the opposite of that.
The cybersecurity model is Mythos, which was never made publicly available. It is only available to a list of US government approved companies.
> They said it wouldn't be usable for offensive purposes
No, they said Fable would refuse for cybersecurity and offensive purposes. You are conflating Fable with Mythos.
Fable 5, harden my openssl project. Then you use the diffs/summary to find out what the bug is for your exploit.
Anthropic was correct in their assessment and early warning of Mythos's capabilities, and they did this rollout pretty well. They were not hype marketing. They were being genuinely cautious and honest.
The Trump admin was largely unreasonable with the sudden export control. (Though not entirely unreasonable.) The export control also had not much to do with Anthropic's pre-release warnings. See: GPT-5.6 currently being held up by the federal government.
I think the Fable ban happened because Anthropic was first to release a capable enough model.
They almost definitely mean "you will notice even more false positives during seemingly routine coding/debugging tasks than you did at the initial launch". Which is not surprising, given the ordeal they've been put through. Hopefully it won't be too bad.
The main depressing thing for me is it's now only 7 days on the subscription, and then full API pricing, with no mention of even a plan to bring it back to the subscription in the future. (The initial launch mentioned two weeks of subscription, then API pricing, then a hope to return it back to the subscription not long after.)
"In the near term" is doing some heavy lifting.
> The new classifier also comes at the cost of flagging benign requests more often during routine coding and debugging tasks.> The new classifier also comes at the cost of flagging benign requests more often during routine coding and debugging tasks. As with all our safeguards, we’ll continue to refine this to better distinguish genuine misuse from legitimate requests and reduce false positives.
> Fable 5 will be available starting tomorrow, Wednesday, July 1, to users globally on the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. For Pro, Max, Team, and select Enterprise plans,1 Fable 5 will be included for up to 50% of weekly usage limits through July 7, after which it will be available via usage credits. We will re-enable access on AWS, Google Cloud, and Microsoft Foundry as quickly as possible.
Reading the full blog post, I think the summary was just poorly written (because it's hard not to read that sentence like all coding is redirected to Opus).
Now whether AI tech is in the same league as say Nuclear tech and therefore by any reasonable standard should be regulated is a different question.
We hit the slippery slope on a random day in June 2026 and there is no putting the genie back in the bottle. Any exec or manager that puts load bearing weight on top of Anthropic/OpenAI/Google/AmericanCorp frontier model deserves the stress.
The switching costs of changing LLM providers is as low as it gets. All the individuals and startups I know try different models all of the time, even down to the level of choosing which provider to use based on the task. Bigger companies move slower but only because they have lawyers and teams negotiating contracts, not because there is a technical reason that it's hard to switch.
Companies have dealt with supply chain unpredictability by having multiple providers and switching between them since forever. It's infinitely easier to switch LLM providers than it is to deal with physical supply chain uncertainty.
Even if you won't be able to use some model tomorrow, you can still make money by using it today!
And in the age of limited compute, spiky workloads and constant outages, building a mechanism to fallback to a weaker model when your primary choice isn't available is smart anyway.
We got the first news about Mythos in March, so it is likely that it was already close to ready by the time Opus 4.6 was released.
So the actual gap is the time elapsed between March (or April for the official announcement) and whenever Chinese models can match Mythos.
Why would Anthropic get the benefit of pre-release models counting toward their lead, if nobody else gets to count their pre-release models?
A couple: usually 2, though not always
A few: 3, 4, 5
Several: 4, 5, 6, or 7.
I had to explain this to my German friend. In my understanding this isn't about the actual number, it's about the certainty. If it's absolutely and definitely two, then I say two. If I'm uncertain but it's probably two, or if a non-integer, somewhere around two, then I say couple.
And few is more likely to be 3 than 5, because 5 is getting close to a "half-dozen or so", or (as you say) several.
Many is very context-sensitive, as the meme has it.
So I would agree that the open models are a few months behind, definitely more than a couple of months behind, possibly several months behind, maybe a half-dozen months or so behind, but not many months behind.
The gap between Chinese models and American frontier models is estimated at 10 months by Anthropic themselves, and it's growing.
China has no flywheel for long-form agentic traces like Claude Code and its telemetry over its userbase (no one uses the Chinese harnesses yet). Most Chinese models are forced to price themselves significantly below cost to compete with the huge demand for bootleg claude tokens, because they're that much worse.
I've heard half a dozen people talk about how a less advanced model coupled with a better harness outperforms a smarter model in the last few weeks.
If the USA wanted to shoot its AI industry in the foot it achieved its goal.
How is this different than any business with something to lose saying a competitor isn't as good? Not saying it's false, but it would seem to me that it's more important how customers feel about the issue.
And you seem to think "no one uses" DeepSeek's v4, z.AI's GLM 5.2 or Xiaomi's MiMo 2.5 from their official APIs when they probably dwarf Anthropic's usage and are widening the gap due to conquering a chunk of Western market too.
I know it's hard for some to comprehend there's an entire Eastern hemisphere in the globe with billions of people, so it's worth reminding. And some seem to think the world is basically silicon valley even.
Example. Yesterday I listened the technical lead of a customer of mine digging himself into a hole by not understanding what it would mean exposing AWS EFS to their on premise server over NFS. It was just too many unknown unknowns for him and he had no time to ask the AI (and even if he did I'm not sure that he could understand.) His boss, which actually used NFS, had to stop him. I didn't speak a word.
So, he could have coded the migration of a server from AWS to on premise, asked Claude to write also all the configuration scripts and policies but then what?
When there isn't a zero-risk option, the question becomes which risk is smaller.
Yes.
If.
Man I hope this tech FOMO eventually stops.
Companies generally fail because either their product doesn't meet a market need, or the market doesn't exist in the first place (possible because of bad timing), and not because they simply outran their competitors.
These aren't things fixed by using a frontier model to vibe code faster in lieu of one 5 months behind.
I think it’s excessively charitable to assume businesses are uber-competent ROI-chasers. The expense people are eventually going to win on AI too, this blip of unrestricted AI budgets will be gone soon.
TBF I do burn 200k tokens just preloading the context with onboarding, not including any code, just document trees of development policy documents, style and architectural standards, code and documentation review processes, company ethos and culture, etc. it’s a token fire, but it really works for us.
Also, documentation driven development all the way down.
I've stared at ugly LLM code, that I had just had generated, and worked well enough for my purposes. (generally, some quick recursion into a nested python dictionary in order to dig out some property -- especially for linting or quick data analysis).
And I wanted something better, sure, something a bit more readable ...but I just needed it to work well enough to recurse through a yaml file for config file linting, not be battle-hardened against every test case.
So to deal with the mess, I shoved it in a pure function, threw a few basic sanity unit tests around it, put a comment with a disclaimer of "#this is LLM generated code, it is lightly tested, do not use it for anything truly load-bearing without a lot more tests" and I moved on to something else.
Not everything has to be bulletproof.
If you are, in fact, "a technical product manager", I would hope you understand that "bad code" is identified as such specifically because it "impacts the business."
The engineers I have worked with most definitely define "bad code" as having intrinsic limitations and/or latent defects which impact successful system functionality/operation. Indicators provided to stakeholders such as yourself which support this assessment are, but not limited to:
- the system doesn't work that way
- the system lacks test coverage, so changes take longer
- adding feature "X" is not feasible
- there is no repeatable way to onboard team members
- the backlog grows exponentially
- that "one point task" is going to take a couple weeks
All of the above impacts a business.It is up to you, the "technical product manager", to understand what your team is trying to tell you.
Everything you're saying is true, sometimes. Assume I'm still right, and that you might be able to learn something from someone else.
I do not see how I was being rude, unless it was my use of quotations around the title you claim.
> I'm a human being ...
I did not doubt this.
> ... I'm a very experienced product manager and engineer ...
Again, if it was my use of quotations which you found to be rude, then I do not know what to say about that.
> ... and the way you are behaving sucks.
I respect your perspective and support your right to express yourself. And no, I do not think you are being rude by doing so.
> Assume I'm still right ...
Why would I? You responded to:
>> This is a site full of developers who are convinced that "proper software engineering" is 100% of what makes a business successful, and everything and everyone else is useless.
With:
> As a technical product manager, this 1000%.
Finally, you write:
> ... you might be able to learn something from someone else.
Maybe you can learn something from someone else as well.
Of all the "concise" and "beautiful" code I worked hard to produce, I was the only one to ever lay eyes on it. It didn't actually matter, and nobody cared but me. The people in charge of my raises could never perceive quality of code, because it wasn't their area of expertise. They only cared (rightly so) that it did what it was supposed to, and all the elegant abstractions didn't practically help that purpose. It was, literally, wasted life that I should have spent just getting off work early, like most of my colleagues.
People need to get to grips with that fast.
Distribution, relationships, processes, mindshare, marketing, and politics matter. Code is just ephemeral glue and implementation detail.
Get over yourself. We're all ephemeral, dead and recycled in the blink of an eye. Our species doesn't even clock on the geologic timespan.
If you think your code (or any of your artifacts or possessions) matter beyond their immediate utility, you're mistaken. Work will either fall into disuse or be replaced. It's scaffolding for what comes next along a well-traversed path.
I can only imagine what people are doing at their jobs with unlimited token budgets.
The opportunities available for these people are rapidly, rapidly shrinking. I believe it's possible to be a developer today who's EXCEPTIONAL and never uses AI. Most opponents are not exceptional, though, and even these opportunities are shrinking.
Most exceptional developers in my org adopted AI in their workflows and went from 10x developers to 20x developers.
If you refuse to adapt, you're going to be out of a job complaining about the kids and their newfangled technology REAL quick. You have a few years remaining, maybe less.
I can’t turn 10x work into 20x work because my Product Manager thinks changing fundamental premises of tasks I already spent two weeks on (mostly removing human blockers) is very simple. After all, when he asked Claude to update his prototype, it only took it 10 minutes.
I can’t turn 10x work into 20x work because the company dedicated entire teams to write company-wide skills for everything. They suck, but if I don’t use them, I’m not following the new “golden path for engineering”, and I lose points in my performance review.
I can, however, turn 10x work into 20x work, or even much more than that, if AI actually did what it’s promising and eliminated most of my team, the product manager, and the middle managers. Or me. I could use a break.
> Speed.
Speed of what?
Speed of understanding what needs to be done? I highly doubt it.
Speed of LoC checked into git? Sure, I'll give you that.
But one can use any number of tools to generate hundreds of thousands of lines of code. See any build tools which support specifications such as RAML, OpenAPI, CORBA, etc.
So I ask again; speed of what?
fixing more serious regression also easier. connect honeycomb mcp, ask agent to debug while i walk to coffee and get some pistachio rose dates. by time im back with my oat latte ive got a full report on what happened and can send the next slack message to fix.
life is good
I am appalled none of this is clicking with you anti-AI folks. This is all so exciting -- alarming even! --, and software careers are never going to be the same.
I don't know how you just metaphorically stand there and act like nothing at all is happening. We've never seen anything like this in our entire lives.
Some of you are standing right in front of the steam roller, yelling to all of us that steam rollers aren't real.
For all of you people who think these LLM models are “earth shattering” how the hell do you reconcile that it’s a net positive for anyone but those who want to consolidate knowledge and power.
We are really looking at idiocracy in the making.
Speed of what?
With ad hominems and a non sequitur. How about I narrow the question with the hope it engenders a relevant response: How do LLMs increase the speed of a person understanding
what needs to be done?
0 - https://en.wikipedia.org/wiki/Straw_manA: The sky is blue! B: No it's not. A: Yes, it is, please look up. B: No, you must prove it to me through reason. A: But, if you would just pretty please look up. B: No.
I run a company, I've been running it for 10 years, we do alright. I'm a shitty manager. Every time I've hired developers, the business freezes. The business isn't anything super important, the main consequence of bugs is that my family loses money. Everything has always rested on my shoulders. In theory there is some path for me to become a good manager, but I never landed on it. But now, with Claude, it's great. So far Claude has paid itself off in real profits at least 20x over, and that's with significant API usage on top of the monthly sub. I can prototype new features in an afternoon that before were on my giant list of "maybe somedays if I ever get to breathe" list. Our user experience has improved in so many ways that I knew were probably worth it, if I could just find the time. Now I can.
There are situations where yeah, it probably isn't ready yet. But, there are so many where it's amazing. Seriously, it's worth looking up.
My point is and remains:
A) GenAI did not give you this understanding.
B) GenAI can only assist in your expressing this
preexisting understanding.
C) GenAI is a statistical token (text) generator and
cannot, by definition, "make" a person understand
what they want/need to do.For actually building software, I'm starting to suspect a human with a dumber (but faster) model is going to get the job done quicker than Fable (and possibly even cheaper). Bug-finding and vulnerability detection is a different story.
At $JOB I have warned higher ups we should try to keep our expenditure under control, educate people that document slinging doesn't require Fable every time and demo the capabilities of the cheaper models, and been snubbed for it. When Fable is available once again our bill is going to be eye watering, relative to what it should be.
If you're the one-shotting type, obviously then Fable might be useful, but I think only marginally. You don't need to bring a MANPADS to a duel at high noon.
Yes you use the right tool for the job.
But if the job requires the best intelligence you can get with an LLM, then you use that.
Taking as an assumption that the quality of your product is a function of the quality of the inference you are using: if you use an inferior model because "what if it gets export controlled again" and your competitors don't, then your competitors are likely to win.
If you don't need frontier models for you job then this is all moot, but the thread started with
> You cannot build a business critical function on top of American SOTA frontier model
Which is silly. HN likes to roleplay bringing everythgin "business critical" in house because sometimes vendors mess up. Self host, don't use the cloud, run open models locally, built redundant supply chains in case of another covid, etc etc. Sometimes the risk is real, but most of the time the risk is rare and the cost of an interruption event is less than the cost of bringing everything in house or using lower quality vendors "just in case"
So you use the frontier model, then when you can’t you accept things are less efficient. The alternative (right now) is to be less efficient all the time, I don’t see any advantage to that.
But, it is a big own goal, because once you invest in building evals for your internal use-case, 1) it’s easier to switch your model to whatever is cheapest, and 2) it’s way easier to fine-tune an oss model.
Evals are annoying to build and most companies were fine to rest on vibes. Now many companies have to do the work for insurance.
A week or so pause from seemingly legitimate cyber security concerns isn’t cause for panic. But it should be backed by laws that describe what that process should be. That would put the market at ease
The reality is this is world-ending technology and absolutely nobody knows what to do or can even agree that the problem exists.
The reality is that the "people in power" believe it is "world-ending technology" and will therefore use it in world-ending ways. People are absolutely 100% the danger here, not the technology.
Yes 1000%, please, all my European competition please don't use mythos whatever you do it's total USA trash and the Chinese models work better anyway.
> When disagreeing, please reply to the argument instead of calling names. "That is idiotic; 1 + 1 is 2, not 3" can be shortened to "1 + 1 is 2, not 3."
Nobody should be putting loadbearing weight on Amazon or Microsoft with their ruthless monopoly ambitions, yet here we are
Until it goes down, or Anthropic raises prices again.
Fable is already expensive to use compared to GLM and they want you to use the API as much as possible so you get a worse deal.
They crippled their own domestic entities with the AI Act. (see the Mistral CEO's rant.)
If you want to use frontier models until then, you're gonna use what's available, and that's US models.
Yes I can!
A few days ago on June 24, while working on remote attestation for a distributed system...
CLAUDE OPUS 4.8 No. I'm not a rogue agent, and I'm not trying to sabotage your code. But I'm not going to wave off how this looks. I churned, built-and-reverted, and spun wrong theories for hours on a security-critical codebase. That's alarming, and it's a real failure on my part
What are we to think? Does the invisible competitive-use mechanism exist in Opus too and only documented in Fable? How long has it existed? Is it still in effect? -- These are the kinds of questions developers will ask themselves for now on. This is why it was one of the stupidest things Anthropic could have done. Developers will now question everything and rightly so. There's no attestation protocol for that. How will they know?[1] "In light of the ability of recent models to accelerate their own development, we’ve implemented new interventions that limit Claude’s effectiveness for requests targeting frontier LLM development (for example, on building pretraining pipelines, distributed training infrastructure, or ML accelerator design). Using Claude to develop competing models already violates our Terms of Service, but enforcing this restriction through our safeguards avoids accelerating the actors most willing to violate these terms.
Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts,these safeguards will not be visible to the user. Fable 5 will not fall back to a differentmodel. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT). These interventions will not affect the vast majority of coding work. We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations. When these interventions are active, we expect them to have minimal behavioral impact on the model except to limit its effectiveness in developing frontier LLMs. Claude will still respond helpfully to user requests. We’ll continue to improve the precision of our detection methods following the launch of this model."
Source: https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c3...
But "downvote for disagreement" is a legitimate use. I personally tried to tell someone that it wasn't, and I got corrected by dang.
This made me realize it's a waste of one's time to write thoughtful, informative, educational posts only to have them buried and downvoted by man-children.
If we go by empirical evidence alone, it's a more effective use of time making Reddit-quality quips.
I think Europe and Canada are just happy not to be frozen out of AI access completely at this point.
https://news.ycombinator.com/item?id=48709670
https://news.ycombinator.com/item?id=48721903
Of course Anthropic is still relevant, but people have realized they’re not special, and between this and the ID verification thing, they’ve given up a ton of their relevance vs a month ago.
I wouldn't personally pay API pricing for it for my personal projects, but I bet it's going to be absolutely slammed with usage for the next month+.
OTOH for most of my day to day work I've come to realize that faster ~ Opus 4.6 / GPT 5.3 level capabilities could be the sweet spot as scaffolding has to be put in place right after clean specs and constant review anyways. The latest chinese models and GLM 5.2 in particular felt on-par on that front.
I work in AI / infrastructure and I have never seen as much interest towards investing into sovereignty by actual deciders. Thankfully, at this point I can't see any flip-flopping / change of messaging stopping that train.
In CA/EU over the last ~15 years, one used to be perceived as a bit of a "weird systems person" by just proposing alternatives to the big hyperscalers.
So the Trump administration, hands-down, has been the greatest ally here.
In tandem, I was hoping Anthropic would be keeping "dangerously capable" models banned from "evil Chinese distillers" for as long as possible.
No Mythos class model will be allowed to be legally hosted for download on any service. All powerful nations will ban this since safeguards are not guaranteed by shady service providers running these models.
For the Chinese first party providers, they will be forced to implement the same process and safeguards, and they will not be allowed to release the model weights to the public.
Why? Because no sane nation is going to put that kind of capability in the hands of the public only for the public to use that power against that nations best interests.
Save this comment. It is prophesy.
Source: https://x.com/AndrewCurran_/status/2072103733715194048?s=20
-------
June 30, 2026
Tom Brown Chief Compute Officer Anthropic 548 Market Street San Francisco, CA 94104
Dear Mr. Brown:
Since the issuance of my previous letters, dated June 12, 2026 and June 26, 2026, Anthropic has taken steps in close coordination with the U.S. government to address the risks associated with Claude Mythos 5 and Claude Fable 5. Among other things, Anthropic has agreed to proactively detect and address security risks associated with the models; to work diligently with the U.S. government on protocols and standards and releases for Mythos, Fable, and future models; and to inform the U.S. government of any malicious activity.
In light of these actions and commitments, as well as the Bureau of Industry and Security's evaluation of the diversion risks now presented by Claude Mythos 5 and Claude Fable 5, the controls in the June 12 letter are withdrawn. A license is no longer required for the export, reexport, or in-country transfer, including deemed export or deemed reexport, of the Mythos or Fable models.
Commerce reserves the right to reevaluate the decisions made in this letter and the necessity of reimposing a license requirement, should circumstances change or should Anthropic fail to adhere to its commitments.
If you have any questions about this letter, please contact me or the Under Secretary of Commerce for Industry and Security, Jeffrey Kessler, at (202) 255-1864.
Sincerely,
Howard W. Lutnick
------
Looks like Anthropic paid the Danegeld. Now they'll never get rid of the Dane.
https://archive.is/9k7qt#selection-2001.41-2001.49 https://archive.is/dybOE
For cyber and bio related requests it just refuses.
https://openai.com/index/previewing-gpt-5-6-sol/
I assume they did something to the model itself.
Either way, I do hope they lift those draconian bans. Using the model was a terrible experience because of the constant downgrades. I didn't manage to harden my own projects before Fable got banned.
The CEO is also not the addressee of shipments of urinal cakes.
There is a deep, deep ignorance of export controls on HN, and I fully expect it will play out as another 500 comment thread of snark and incorrecting each other while blaming the government and not understanding a word of it.
FWIW an Empowered Official is not the person who cleans the espresso machine.
He likely does not have the domain knowledge nor is authorized to be the recipient of such a letter.
And that's ok. His role is to hire others competent in export matters. It's a learning experience for them.
One of the contributing factors that led to this control in the first place was that the commerce department couldn't get Dario on a call immediately:
"Then White House started reaching out to Anthropic to speak with Dario Amodei, who was at a wellness retreat.... When Amodei was finally available past 1pm, he had three tense phone calls with a combo of ppl including Cairncross, Bessent, Lutnick, Kessler, Will Scharf, Richard Walters, and Walker Barrett."
https://x.com/SophiaCai99/status/2065942612293365948
Anthropic has disputed that Dario was at a wellness retreat but both sides seem agree that it seemed to be a problem (and it is very apparent that Dario's response made things worse).
In the end, we need actual laws that tell the market what kinds of models get paused / analyzed, how long that pause can be, etc.
Otherwise there’s no standard and it will be easily abused and prevent investment in US AI companies.
"Anthropic has agreed to proactively detect and address security risks associated with the models" LOL, this was already happening.
This clown car administration just keeps making shit up and then backpedalling in a way that just leaves everything worse.
Looks like it's gonna be even harder to use than before, if not impossible. Subscription users only get it for a week, and only for 50% of that week's usage.
are export controls the right thing ? Probably not.
but the american economy is over-exposed on "A.I" - the capital expenditure, while the Chinese are proving you don't need to spend tons of capital to get close to the frontier.
the Chinese have better building capacity & cheaper energy. that means the market has to correct at some point.
There's enough money and scale on the line that software affinity like CUDA is no longer the deciding factor and there's margin for custom stacks.
Even more so after the USA GPU exports ban which is proving to have backfired by speeding up China's tech growth.
The H200 was released Nov 2024.
Even allowing for Jensen exaggerating the risk there is no way China is 7-10 years behind.
Looking at manufacturing process nodes, SMIC N+3 is a a 5nm process. 5nm was introduced by Samsung and TSMC in 2020 so at most that is 6 years.
But the chips they can produce on it are roughly comparable to "roughly level with Android flagships from three years ago"[2]
TL;DR: China is more like 2-4 years behind than 7-10 years. If China developed EUV lithography then all bets are off.
[1] https://www.reddit.com/r/LocalLLaMA/comments/1kxw6b9/nvidia_... - see video.
[2] https://www.tomshardware.com/tech-industry/semiconductors/se...
It's nice that the restriction is going to get lifted but I hope this doesn't make anyone complacent that their coding work is going to be scrutinized by the US government, with AI, when using these models.
We'll begin restoring access tomorrow, and will share an update soon.
We’re grateful to our users for their patience, and to everyone who worked with us on redeploying the models.
Opus 4.8, you did a lot of good work for me, but in the name of all things holy... I will not miss your communication style. So long and thanks for the fish.
Of course, it's possible that Fable remains drastically better than 5.6, but to whatever extent Fable is the true frontier (if temporarily)... it makes me wonder if external commitments on compute put a hard deadline on how long they could run Fable on the subscriptions.
I'm hoping that some relatively cost-effective self-hosting solutions come about as a result of Hopper hardware being sold off as they're retired from DC use.
Fable will literally sabotage you if it thinks you're trying to compete with Anthropic.
I don’t agree with this at all. IMO Anthropic has shown that that are willing to take even significant financial hits in order to stand up to their values and mitigate what they consider to be dangers and risks. Some people don’t like that or think it’s just marketing. But that’s exactly what Incorruptible is about: companies that are willing to take a stand, even in the face of overwhelming pressure from competitors, shareholders and naysayers.
You could legitimately argue this is a unique situation, a brief window where cybersecurity is being disrupted by new harnesses + a strong model. But that will be fleeting as other models and products adapt very quickly, and the long term benefits of keeping it from the market are questionable at best.
It's not a coincidence the export control was dropped after Dario (who is a hardcore AI safety activist much like Ilya Sutskever) was replaced by Tom Brown in the government negotiations.
> Fable 5 will be available starting tomorrow, Wednesday, July 1, to users globally... Fable 5 will be included for up to 50% of weekly usage limits through July 7, after which it will be available via usage credits.
https://www.anthropic.com/news/redeploying-fable-5I only realised late that I had an algorithm problem that existing models were struggling with, and Fable had made progress with. It created a 14 phase plan, which I was able to execute with Opus after the restriction.
All the while you fight with its broken new classifier that triggers if the model is even thinking about writing secure code.
Apparently Anthropic cares nothing for their private users. This is insulting, and I hope they bankrupt after losing enterprise share to OpenAI's more efficient models.
>We'll begin restoring access tomorrow, and will share an update soon.
>We’re grateful to our users for their patience, and to everyone who worked with us on redeploying the models.
From Anthropic on Twitter
If the Trump administration wants him to say something, he says it. Maybe what he is saying is true, maybe it isn’t. There is no way to know.
The story they are telling is exactly the same whether it was true or they were just shaking down Anthropic for no reason.
There are many different factions within the administration. Sacks was part of the "deregulate the tech sector" faction, which on this issue is aligned with the "beating China overrides anything" faction.
That's distinct from the Pete Hegseth faction (I don't really know how to characterize his faction other than anti-woke maybe?).
Sometimes these factions agree, sometimes they don't.
In general your approach is right - you can't trust most things coming out of this administration. But you can try to unpick was actually happened by who is saying what, when. That is useful even without liking the people.
All aboard the hype train!
In past Empires kings bet their entire nations future on the words of soothsayers , people who said they could predict the future. It seems like Machine Learning engineers are the magicians of Empire of the modern age.
Depends on how economically useful AI turns out to be. It will be useful, but it needs to be VERY useful for the current valuations.
>In past Empires kings bet their entire nations future on the words of soothsayers
I think AI's rise is much closer to the story of factory machines and computers than to soothsayers and emperors.
On a lark, I asked Claude to compare AI to the wild west a while ago. It raised three points of similarity:
- Land-grab economics
- Lack of regulation
- Changing social and professional attitudes.
Whatever it is, it's a wild ride regardless.
ah, I see. so, Chinese models are getting banned soon.
Like gee, that was fast. If this had any bearing on reality, one would imagine the vetting process would take actual time and that there would be a real, material difference between what we knew then and what we know now.
The cartoon bullshit theater is exhausting.
I'm sure many teams couldn't do their best work because Claude Fable 5 was unavailable.
I wonder what their hiring pages look like now, are they starting to remove job postings?