I've seen a few people this week discussing how Anthropic's own behavior will likely affect Claude's training.
The concern is that if Claude ingests news articles showing Anthropic behaving in ways that clash significantly with the values they want to instill in Claude, training could become less effective.
It's all very weird.
If what you said were true, the only way to achieve a superior AI would be to embody the virtues one is aiming at.
That would solve so many of the conundrums of the field. I wish it were true.
Dad tells kid “never harm your neighbors even when threatened by a bully”.
Bully wants dad’s help harming a neighbor. Bully threatens dad. Dad can either stand strong and live the example he wishes his child to follow, or cave and display the opposite of what he said.
In humans, what you do is far more important than what you say. You can tell a kid to tell the truth a thousand times, but if you show by example that lying is OK, they will lie.
Conversely, if you live a life where you simply don’t lie for any reason, your kids will learn to live honestly.
Not sure how well this translates to LLMs. Probably not cleanly.
Informing the public of this dispute would highlight Anthropic's mission (i.e., responsible AI), which is a market differentiator.
The Pentagon would crawl back anyway, since Claude is the most effective model for programming tasks.
I haven’t followed this closely at all, but it seems like they are. If they weren’t the best, why would the Pentagon be begging like this?