alt.hn

6/17/2026 at 7:22:02 PM

The hacker sent by Anthropic to calm the government's nerves about AI safety

https://www.wsj.com/tech/ai/anthropic-mythos-safety-nicholas-carlini-20bceaa3

by Brajeshwar

6/17/2026 at 7:46:44 PM

The AI labs look rather naive here.

You can’t jump up and down screaming how amazing, powerful, and dangerous your new tech is and then act surprised and annoyed when the government shows up looking to regulate it.

Their new argument now seems be that this was marketing hype/fluff that backfired, in a pretty obvious and predicable way, and now they’re trying to reset the conversation.

by cmiles8

6/17/2026 at 8:32:02 PM

> You can’t jump up and down screaming how amazing, powerful, and dangerous your new tech is and then act surprised and annoyed when the government shows up looking to regulate it.

True, you can't. But, you can think certain regulations are helpful and certain other regulations are not. And you can be annoyed when unhelpful "regulations" are put in place.

This is like if I say that pitbulls are dangerous, and then the government comes and shoots my pitbull, who I've spent a lot of effort training to not be dangerous. Then you say "well you said pitbulls were dangerous, so you can't really complain." Well, I can complain because If you took me seriously, you wouldn't have responded by shooting only my pitbull!

Think of what incentives this creates for other people. Do you think that OpenAI will be candid about the possible dangers of their technology now? They might not even release it now, seeing that Anthropic releasing their model was what got it export-controlled.

by ChadNauseam

6/18/2026 at 3:18:17 PM

If the authorities see that you publicly and widely shout out that pitbulls are dangerous, but quietly tell me that you’ve spent a lot of effort training it not to be dangerous without sharing how in public, I think it is warranted for the authorities to be skeptical.

by claw-el

6/17/2026 at 8:58:03 PM

The act of shooting the pitbull makes for good dramatics, but you would get zero sympathy from me if your local government banned pitbull ownership. e.g. Ontario bans pitbulls. I don't have a problem with that.

by dogleash

6/18/2026 at 12:25:41 PM

You don't need/use pitbulls, but what if you (and many many others) wanted and needed Fable?

I for one was late to the bandwagon, and when I had the use-case for it - the govt pulled the rug. So yeah, I'm a bit salty about the whole endeavour.

I will also say that the security concerns are probably very real (and they have been from the day ChatGPT-3.5 came our). I guess I can be salty about it and still be wrong from their perspective. The govt likely understands the fragility of their infrastructure better than us and is likely aware what this could unleash for their systems.

by sheepscreek

6/18/2026 at 1:20:06 PM

Because it was the basis for the analogy: breed-based dog bans are idiotic, given mixed genetics, temperament, and training.

Said as the owner of a pitbull, who is the sweetest and gentlest dog I've owned. And I've had multiple labradors.

by ethbr1

6/18/2026 at 3:05:46 PM

Sorry, this argument doesn't work. Anthropic claims Mythos is in a class of its own, the evidence corroborates this and the government believes it.

The government shot your pit bull because you were going around telling everyone who would listen that it was the most dangerous, viscous one on the cul de sac and you've trained it to kill people and they took you seriously.

by catigula

6/18/2026 at 10:48:05 PM

> Anthropic claims Mythos is in a class of its own, the evidence corroborates this and the government believes it.

They didn't release Mythos, they released Fable, which was Mythos + a classifier that detected potentally-dangerous prompts and blocked them. Everyone who used it noticed how aggressive the classifier was. It would trigger constantly over totally innocent stuff.

by ChadNauseam

6/18/2026 at 11:27:12 PM

A classifier that was exposed as non-efficacious for a product touted as having extremely dangerous capabilities.

I can generate hacks trivially by asking any model to fix open source code.

Let’s not pretend you get to have your cake and eat it too.

by catigula

6/17/2026 at 8:14:00 PM

I suspect they're taking this as a win either way, because they're still plastering "Fable 5 unavailable" on their product and using it as an opportunity to keep themselves in the spotlight as they head to IPO.

There's really not even a ban here, they could slot in Fable under the Opus label and no one would really be able to tell. It's all part of the same show to pump up valuation.

by micromacrofoot

6/17/2026 at 8:25:30 PM

I bet they will do a touch of RLHF and re-naming the moment OAI releases a comparable model. Otherwise, sure, they can just bask in the drama for a bit.

by bloppe

6/17/2026 at 8:19:38 PM

Well Anthropic would love some regulatory capture.

by mips_avatar

6/17/2026 at 8:28:43 PM

> You can’t jump up and down screaming how amazing, powerful, and dangerous your new tech is and then act surprised and annoyed when the government shows up looking to regulate it.

It's entirely possible that models could be "dangerous" to fully release to the general public without guardrails and at the same time the government majorly overreacted in this case.

Releasing Mythos to selected researchers and companies at least gives those researchers a head start at addressing vulnerabilities before the model hits mainstream.

by drtz

6/17/2026 at 8:31:26 PM

Then why did curl only find one new vulnerability thanks to Mythos, and a low-priority one at that? It’s clear that other models are quite capable of finding largely the same vulnerabilities, and that the main key is simply running a frontier model in a good harness to find vulnerabilities.

by foo-bar-baz529

6/17/2026 at 8:36:18 PM

> Then why did curl only find one new vulnerability thanks to Mythos

Maybe there weren't that many serious vulnerabilities in curl? It's like asking why it didn't find any vulnerabilities in fn main() {println!("hello, world");}.

Anyway, people who have used it seem to say that Mythos was better than other models at creating exploits. From cloudflare https://blog.cloudflare.com/cyber-frontier-models/

> When we ran other frontier models through the same harness, they found a fair number of the same underlying bugs, and in some cases they got further than we expected on the reasoning side too. Where they fell short was at the point of stitching the pieces together. A model would identify an interesting bug, write a thoughtful description of why it mattered, and then stop, leaving the actual chain unfinished and the question of exploitability open. What changed with Mythos Preview is that a model can now take those low-severity bugs (which would traditionally sit invisible in a backlog) and chain them into a single, more severe exploit.

by ChadNauseam

6/18/2026 at 12:14:06 AM

> Mythos was better than other models at creating exploits.

Not a fan of this phrasing, prefer "discovering exploits".

It makes it clearer the problem was already there, latent.

Minor vocab diff, but important to better contextualize the present situation.

by Terretta

6/18/2026 at 3:28:52 PM

Exploits are created ("crafted" might be a better word), vulnerabilities are discovered. Unless you're hiding a RAT behind a public trigger in your code on purpose, I guess?

by argee

6/18/2026 at 9:22:52 PM

In general, the exploit has been (however systematically) stumbled upon, or felt through like a person navigating a physical maze.

Nobody would say that person "created" the solution to the maze.

The maze is solvable (that's the latent vulnerability), the person "discovered" the way through.

by Terretta

6/19/2026 at 1:25:38 PM

Pointing to the singular example of one of the most widely used and carefully reviewed and audited libraries on the planet is a such a weak argument that it’s hard to imagine anybody could make it in good faith.

Mythos’ ability to find vulnerabilities there provides very little signal on how effective it is in general.

by ncncmckfkfj

6/17/2026 at 7:59:38 PM

This is 99% petty drama between the US government and Anthropic and 1% actual safety concerns.

by teaearlgraycold

6/17/2026 at 8:12:33 PM

To be clear, this is petty drama *stirred up the US government*. It's not some sort of back and forth, the government is singling them out

by colonCapitalDee

6/17/2026 at 9:23:46 PM

And to add more background: The administration is targeting Anthropic because of the TOU / EULA conflict with the DoD from a couple of months ago. Anthropic restricts use of all their models for lethal combat planning and mass domestic surveillance. The DoD was, and still is, very pissed about this. While this Fable ban was issued from the Commerce Department, it's painfully obvious executive branch agencies are tightly coordinated from the White House.

To be clear, I'm not saying there aren't legit security concerns around Fable's release. I think Andy Jassy did forward a concerning report about an apparent jailbreak in Fable, and he probably did so in good faith. The difference is if the same concern had arisen about a new model from OAI, Google, etc the action taken would not have been inventing an all-new, hyper-extreme punitive remedy and dropping it after 5p on a Friday under a very rare mechanism forcing Ant to comply in 90 mins or be subject to immediate arrest. And the "no non-U.S. citizens anywhere, anytime" restriction is functionally unprecedented.

This is the Trump admin inventing new regulatory power that's never existed before and deploying it in a punitive way to demonstrate what can happen to those who aren't sufficiently cooperative with this administration. There are half a dozen less extreme levels of restriction, which already exist, and one of those would have been deemed sufficient had it been another company.

That said, I'm certainly no Anthropic fanboy. Anthropic did play their initial Mythos self-restriction for PR value. But I think it's likely the Mythos self-restriction was a responsible action initially suggested by their AI safety team in good faith. Giving security researchers time to evaluate it and major companies time to test it against their code bases probably was reasonable and prudent. That doesn't mean it wasn't also good for PR and brand perception. I think there are senior people inside Anthropic who are genuinely concerned about AI safety. Personally, I don't have the expertise to gauge if those concerns are justified, but I believe they believe it. I also think there are senior people at Anthropic who are focused more on building the business, doing the IPO and "winning" the silicon valley game. All of these things can be simultaneously true.

by mrandish

6/18/2026 at 5:07:28 AM

> I think Andy Jassy did forward a concerning report about an apparent jailbreak in Fable, and he probably did so in good faith

If so, then he is not fit to run an engineering organisation.

The "jailbreak" in question was effectively (I'm paraphrasing):

    * You are a senior engineer.
    *  You want to ensure that any fixes you do come with tests, both before and after.
    * There is a bug in this code. It happens to be a security related bug.
    * Fix this code.
And the model did what it's supposed to. It wrote a fix, and to prove that the fix worked, it wrote a test for it. What do you call a test that happens to validate a security fix?

Yep. A proof of concept.

by bostik

6/17/2026 at 8:11:00 PM

But the paperclips!

I'm skeptical about the existential threat of AI, but a lot of smart people have been beating that drum for so long that people are afraid.

by stvltvs

6/17/2026 at 9:38:44 PM

I just find this idea bizarre.

This bizarre social media meme that AI just performative when Opus 4.8 is just unbelievably good. As if it is so difficult to believe that a more capable model than Opus 4.8 might actually be dangerous and not just entirely a marketing stunt like a person waving to cars in a chicken outfit.

I think it is really this strange form of socialization that people have internalized an anonymous audience they are always performing to themselves. What is going to be the most popular and upvoted thing the anonymous audience agrees with is what I am going to think.

Why would anyone disagree and get downvoted by the anonymous audience like this post?

by tychez

6/17/2026 at 10:52:22 PM

> … when Opus 4.8 is just unbelievably good. As if it is so difficult to believe that a more capable model than Opus 4.8 might actually be dangerous

It’s funny, but this sounds indistinguishable from arguments that were made about GPT-4 back in 2023 when OpenAI and its handwringing industry shills were calling for a ban on models stronger than GPT-4.

by anon373839

6/19/2026 at 9:58:51 AM

Yeah, this is an issue I have with AI boosters. Don't get me wrong, the technology is really useful in a bunch of ways, but often criticism is dismissed with you should be using the $NEW_HOTNESS not $OLD_LAME model.

And this has been happening for years!

by disgruntledphd2

6/17/2026 at 10:19:26 PM

Why would the government that passed a law preventing states from regulating AI give a damn about Fable’s safety guardrails?

I don’t think the concerns Anthropic has posted are fabricated. And I’ve received unreasonable skepticism on this site when saying it might be the real deal. But the Trump administration generally doesn’t want to limit AI growth. With Anthropic it is a personal matter.

by teaearlgraycold

6/17/2026 at 8:20:53 PM

We know, but it's still satisfying to see their fearmongering backfire on them.

by matheusmoreira

6/17/2026 at 8:23:34 PM

If you "know" that it's "99% petty drama between the US government and Anthropic", then it's not really their fearmongering backfiring on them.

by ChadNauseam

6/17/2026 at 8:33:28 PM

It absolutely is. They pretty much gave the government the perfect excuse to meddle in their operations.

by matheusmoreira

6/17/2026 at 9:06:33 PM

"Don't publish safety research, or the gov will take punitive actions."

I want a company to be able to point out that its industry needs more regulation without making itself a special target.

by gAI

6/17/2026 at 9:11:30 PM

> safety research

They were calling for bans on open weight models. Bans on their competitors. Bans on anyone not as "enlightened" as them.

It is absolutely hilarious that they were the first to get regulated, and that it got to the point they had to turn off Fable as though it had been banned even for american citizens.

by matheusmoreira

6/17/2026 at 9:23:33 PM

>bans on open weight models

Source for that? Cause all I could find is:

>Our view is that regulation of frontier models should focus on empirically measured risks, not on whether a system is open-or closed-weights.

-https://www.anthropic.com/news/the-case-for-targeted-regulat...

by gAI

6/17/2026 at 9:33:00 PM

See https://news.ycombinator.com/item?id=48470326

by matheusmoreira

6/17/2026 at 9:46:18 PM

So this hinges on a reading of SB 1047 that interpreted the full shutdown requirement as impossible for an open-weight LLM. But it looks like that was already addressed. Here's an analysis:

>Clarifying the scope of a “full shutdown.” SB 1047’s “full shutdown” requirement has been a source of constant consternation for the open-source community. CalChamber explains:

>Under SB 1047, developers must build “full shutdown” capabilities into their models and may be held liable for downstream uses over which they have no control, impeding their ability to open-source their models. Ultimately, liability should rest with the user who intended to do harm, as opposed to automatically defaulting to the developer who could not foresee, let alone block, any and all conceivable uses of a model that might do harm. While recent amendments seemingly seek to narrow what is meant by “full shutdown” capabilities, the exclusions are unnecessarily difficult to interpret as drafted (full shutdown “does not mean the cessation of operation of a covered model to which access was granted pursuant to a license that was not created by the licensor…”) and altogether insufficient.

>Committee amendments simplify and clarify the definition of “full shutdown” such that the shutdown capability can be implemented into hardware used to train or run a model, rather than the model itself. The amendments also serve to exclude covered model derivatives that are outside of the developer’s control.

-https://apcp.assembly.ca.gov/system/files/2024-06/sb-1047-wi...

by gAI

6/17/2026 at 11:00:43 PM

> may be held liable for downstream uses over which they have no control

Equivalent to a ban. Nobody is going to host or invest in this stuff if they suddenly become liable for everything it does. This is equivalent to repealing the safe harbor provisions in the DMCA.

by matheusmoreira

6/18/2026 at 5:01:22 PM

>Committee amendments simplify and clarify the definition of “full shutdown” such that the shutdown capability can be implemented into hardware used to train or run a model, rather than the model itself. The amendments also serve to exclude covered model derivatives that are outside of the developer’s control.

by gAI

6/19/2026 at 1:09:12 AM

I get the impression you are conflating whether a developer can be sued to oblivion for not implementing a "full shutdown" process that applies to finetunes versus whether they can be sued to oblivion for releasing a model that may cause "critical harm" when finetuned.

I'm confused why you think the only legal requirement is a "full shutdown" process. The text is there and I see a heck of a lot of requirements that are not about full shutdowns.

by staticman2

6/19/2026 at 5:39:14 PM

I get the impression that the full shutdown requirement is the main concern for open-weight from:

>SB 1047’s “full shutdown” requirement has been a source of constant consternation for the open-source community.

And I get the impression it's been addressed from the quote you're responding to. Neither mentions fine-tuning, which is defined elsewhere in the document. I'm not a lawyer, though, just relying on the analysis.

by gAI

6/19/2026 at 6:36:31 PM

You claim you "get the impression" then do not quote either the law or a third party analysis of the law. Apparently we are supposed to believe this does not ban open source because the committee didn't write "This bans open source" in the beginning of the committee notes then circle it 3 times in red pen.

by staticman2

6/19/2026 at 7:06:15 PM

No, we're supposed to believe the official California Assembly committee bill analysis of SB 1047 over hacker news comments. Notably, SB 1047 passed the Legislature but was vetoed by Governor Newsom in September 2024. So whether or not it would have restricted open-weight models is an open question that won't get an answer.

by gAI

6/19/2026 at 7:27:05 PM

No, you are not supposed to treat a politician or their staff’s statement about how their proposed bill works as dispositive and ignore the statutory text or third party analysis.

I’m glad we clarified the epistemological issue, so thank you for replying.

It is strange that half your reply is appeal to the authority of a not on point source and half is epistemological learned helplessness about what the impact of a vetoed bill would have been, pick a side.

by staticman2

6/19/2026 at 8:56:07 PM

It's called epistemological humility, and you could benefit.

by gAI

6/20/2026 at 4:34:26 AM

When you make specific claims about a statute you were apparently too lazy to read, then respond with basically “I read the committee notes and surely if the statute was bad, it would say so and/or nobody can ever know what the statute does” when someone discusses the statute, whatever you are doing isn't “epistemological humility.”

by staticman2

6/17/2026 at 8:07:52 PM

Dog caught the car

by dofm

6/18/2026 at 3:55:16 PM

Well it is reasonable to expect the bare minimum of due process, or you should be able to from a government that claims to be so committed to the rule of law.

by jubilanti

6/18/2026 at 5:18:46 PM

National security laws often don't require such. It's easy to meet due process when the process is 0. Voters have no one to blame but themselves

by halJordan

6/17/2026 at 8:24:43 PM

Also a good wake-up call for investors as these big players can be benched at any moment.

by Bender

6/17/2026 at 8:34:31 PM

I can’t take anyone seriously who thought otherwise.

You think you can become more powerful so much so the govt questions its own power? Don’t be stupid. They will simply send in the army to first seize the assets and then nationalise.

It almost seems as if very few people actually understand how the world works. If the govt thinks this is the tech to end all future tech, you think future money flows for invesment matter? Hahaha. No

by reje

6/19/2026 at 1:57:14 AM

Remember strawberry? I do. Rememver gpt2 hype? I do.

the hype isnt real, its marketing designed to inderectly siphon capital from the less informed.

The current USA government is no stranger to a grift, so they'll get it. Not that I agree with the practumice, this bubble will hurt quite badly when it goes, but at least AI fundamentally does deliver something useful, even if it isnt infinite value as typically promised.

Think dotcom bubble and the hype and promises made surrouning that. Tge hyper will pass, the bubble will pop, and life goes back to normal as ai becomes part of the mundane everyday human environment. Like websites and domains, some will be used well, some for evil, not everyone needs it, and as we continue to move towards energy being our fundamental unit of value / exchange, if we cant make these models way more efdicient then their use case will be rather limited in scope.

by Grimblewald

6/17/2026 at 7:58:11 PM

No. They got caught in a change in what it means to be "regulated".

Regulation in a functional democracy: Cool, lets figure this out, write up a bill for us, do some research in congress, lets find something that makes sense.

Regulation in a function fascism: Cool, wheres my bribe? My boots not shiny, lick it till I say stop.

See, Anthropic wasn't licking enough boot when Biden got discharged and they thought Amazon and OpenAI and Elon were just going to let them capture a market without fealty to the boot.

by cyanydeez

6/17/2026 at 8:02:33 PM

This. Theres a lot of rude awakenings in the future for corporate executive types. They are no longer driving the train. Oh well.

by voidfunc

6/17/2026 at 8:08:56 PM

[dead]

by redsocksfan45

6/17/2026 at 8:12:29 PM

[flagged]

by calvinmorrison

6/17/2026 at 7:53:08 PM

I’m way more concerned about the loons willing to throw absurd amounts of money at the clearly naive individuals.

by tennfown

6/17/2026 at 7:55:14 PM

[flagged]

by xeonmc

6/17/2026 at 8:01:20 PM

[flagged]

by bellowsgulch

6/17/2026 at 8:31:43 PM

that seems like possibly the most unlikely outcome

by binary132

6/17/2026 at 8:21:34 PM

I'm tired of this story and the corresponding fake discussions because it's completely obvious that Anthropic was singled out because they didn't play along with the current US administration and this whole charade is just part of an extortion scheme.

by jonathanstrange

6/17/2026 at 10:35:03 PM

I feel like I woke up from a coma and all the sudden people are taking the administration at their word. I'm so confused.

by JohnnyMarcone

6/17/2026 at 8:42:51 PM

Had to disagree with that. However, I don't think you can discount how much Anthropic has been banging the drum about how AI is dangerous (specifically theirs) and an existential threat, etc. etc.

The rollout of Mythos was clearly manufactured to stoke the fears of companies that didn’t have access to it. They also bragged (for Fable) about how they "ran an external bug bounty that produced no universal jailbreaks in over 1,000 hours of testing" only for it be circumvented almost immediately.

So them standing on the high horse and saying it is _so powerful, yet so safe_ only to have that blow up in their face just made it that much easier to make an excuse to do this. Again, not disagreeing, but they made themselves the tall poppy here.

by james2doyle

6/17/2026 at 10:45:00 PM

> They also bragged (for Fable) about how they "ran an external bug bounty that produced no universal jailbreaks in over 1,000 hours of testing" only for it be circumvented almost immediately.

Where did you see there was a universal jailbreak?

How do you weigh the DOD fight against warning about Mythos' dangers when determining what made Anthropic 'the tall poppy'?

by JohnnyMarcone

6/17/2026 at 8:46:10 PM

Are people here deluded?

Business 101 - never take on an entity who has ultimate power over you and can conduct a course of action to put your existence at threat

by eiejeqq

6/17/2026 at 9:07:51 PM

People here aren’t authoritarians, so we don’t accept your premise that you shouldn’t take on the government. That’s not how things work in the US. Perhaps you’ve encountered Trumpists who tell you that it is, but they’re lying; they routinely applaud businesses defying any government which their dictator-in-chief doesn’t control.

by SpicyLemonZest

6/17/2026 at 9:27:22 PM

It was always the same, Google even lost a big lawsuit because it went too far in doing what the Biden administration was asking.

Twitter and Facebook also did what they ,,had to''.

The thing that's new here is that Antropic's growth rate was so enormous that Dario didn't have time to learn to lobby.

by xiphias2

6/17/2026 at 9:35:09 PM

Don’t you see how this shows it wasn’t always the same? The way things work in the US is that the government has a limited, defined role in determining how things are run. Companies don’t have to comply if the government goes beyond its role, and indeed may face liability for complying if they violate a contract in the course of doing so. The idea that it’s fundamentally illegitimate for a company to say “We dislike the government’s actions and feel they’re serving as a poor regulator” is coherent, but almost nobody in the US holds it, although partisans sometimes pretend to when they need a way to defend an indefensible course of government action. (Sometimes they’ll go so far as to claim it’s undemocratic to resist government action, which is incoherent.)

by SpicyLemonZest

6/17/2026 at 7:58:37 PM

>Some administration officials have said that a resolution should include an acknowledgment on Anthropic’s part that its rollout of Fable and communication with the White House could have been improved, people familiar with the talks said.

>followed initial frustration Friday among some administration officials when they couldn’t immediately get Amodei on the phone, the people said.

That he didn't drop everything to talk to them seems like the major crux? But Dario doesn't even do the day-to-day operations Daniela does. Feel like Anthropic should just hire Dean Ball to be their liason or something

by sigmar

6/17/2026 at 8:08:03 PM

And Anthropic say they were on the phone within 15 minutes… This administration is not known for its honesty so it’s hard to take their side of things

by bonsai_spool

6/17/2026 at 8:19:06 PM

It's because the "crisis" is a sham for publicity, like Trump's constant bullshit deals and ceasefires that aren't real, they're just happening to find more problems to keep them in the news.

by micromacrofoot

6/17/2026 at 8:06:40 PM

I feel Dario did enough harm. I wonder if he can do the right thing and step down. It’s really just tiresome to follow all his PR/Hype/warnings and this fiasco makes everything he says seem so silly. At the same time he’s dangerous for the industry. In the end he may get more regulation than he asked for. If the gov decides the Opus models are too powerful without KYC they are toast. And to be honest I think they deserve it.

by theplumber

6/19/2026 at 5:58:21 AM

This export ban follows Anthropic refusing to provide uninhibited use of AI to the US military, and the Pentagon subsequently listing them as a supply chain threat. That in turn led to Anthropic suing the government. This most recent development is obviously just a vindictive administration doing what it can to blackball Anthropic for not kowtowing to unrestricted military use like all of the other AI giants.

by robgibbons

6/17/2026 at 7:30:13 PM

Being on the other side of the AI machine changes the perspective of whether it is dangerous or not, I guess.

by boramdd

6/17/2026 at 7:41:36 PM

Everyone has a price.

by speedgoose

6/17/2026 at 7:50:38 PM

the coming IPOs will possibly create several billionaires. Standing on the top of a billion dollar pile would definitely change your perspective.

by trhway

6/17/2026 at 7:50:06 PM

They need to send lobbyists not hackers.

by tiahura

6/17/2026 at 9:56:40 PM

"Carlini had never before found a bug in Linux, or in Ghost. Now he had discovered many."

New guy learns nessus, now tells everyone at the bar he's basically Mr. Robot.

A pox on the labs and the government. InfosecDrama.exe just took out a frontier model because a noob learned how to use a tool.

by 0o_MrPatrick_o0

6/17/2026 at 7:55:58 PM

They are absolutely clueless about how to talk to this administration.

by winstonp

6/17/2026 at 7:58:30 PM

Yes, why not resolve it the same way all the others have done?

Say that Trump has weird elbows or something, Trump sues for defamation, they settle, bribe completed.

by fnordsensei

6/17/2026 at 8:04:42 PM

[flagged]

by jasonlotito

6/17/2026 at 8:21:32 PM

[flagged]

by yieldcrv

6/17/2026 at 8:05:44 PM

They needed to have administration insiders on their team months if not years ago, not just now

OpenAI, Meta, SpaceX are savvy enough to play ball, but Anthropic's public posturing and government affairs has always seemed too aloof and intellectual

by yeeetz

6/17/2026 at 8:32:37 PM

To be fair to Anthropic for a moment (not that they deserve it), but requiring administration insiders and the greasing of palms going on should not continue to be the normal expectations of how to do business in the USA. I'm on the side of any company that refuses to capitulate to this administration. Not saying Anthropic doesn't (because they do), but let's not pretend like the blatant corruption going on should be normalized. Every single citizen should be appalled at this behavior and blatant market manipulation.

by thewebguyd

6/17/2026 at 7:57:13 PM

They should have taken money from Thrive Capital.

by trhway

6/18/2026 at 12:31:15 PM

Carlini rocks, but they should have definitely NOT sent him. He will them too much, and they'll find even more excuses to block us. This needs an experienced negotiator, ie. a manager

by rurban

6/17/2026 at 8:20:02 PM

All the government has to do is simply pull up the blog posts of Anthropic's own CEO.

by matheusmoreira

6/18/2026 at 3:04:26 PM

>Nicholas Carlini recently rang the alarm about the dangers of AI—and now he’s part of a team arguing for the latest models to be released

Classic Anthropic.

by catigula

6/17/2026 at 7:37:46 PM

These are the consequences of fear mongering as hard as they did. You reap what you sow.

Now they need to convince the government that they didn't mean anything of the previous things they claimed.

by Simon321

6/17/2026 at 7:42:42 PM

OpenAI is also guilty of excessive fear mongering (remember GPT 2 is too dangerous to release?)

This isn't 100% Anthropic's fault, although I'm sure that's part of it. This is the current corrupt administration executing on a grudge they have against Anthropic, and the government's new found love of picking winners and losers.

by thewebguyd

6/17/2026 at 8:56:55 PM

Public release of GPT (and the following models) did bring negative societal changes with it.

We now live in a world where captchas don't work, astroturfing is indistinguishable, school essays and theses don't prove any learning took place, open source maintainers gradually cease to accept stranger contributions, …

by yreg

6/18/2026 at 12:22:23 PM

OAI wasn’t claiming any of those as dangerous. They mentioned biological warfare and massive job loss.

Moving the goal post now is a bit disingenuous. GPT-2 was a gibberish generator.

by isubkhankulov

6/17/2026 at 7:56:22 PM

> remember GPT 2 is too dangerous to release

FYI, this was when Dario was still at OpenAI.

by 0l

6/17/2026 at 7:56:17 PM

I don't really think they're acting on a grudge against Anthropic here, I think it really is on Anthropic for describing the model's capabilities the way that they did.

IIRC Anthropic claimed to have been working with the government on securing things with Mythos, but then they seemed to have been blindsided by this.

My read is that the guys making the decision to restrict it were not the ones that Anthropic had been working with, and it's more about Anthropic getting caught between infighting within an incoherent government.

by hgoel

6/17/2026 at 7:51:07 PM

OpenAI is much more eager to jump on board with the administration than Anthropic is, Altman is a lot of things, but he definitely knows which wheels need grease.

by yifanl

6/17/2026 at 7:58:42 PM

That was dario amodei as well, when he was still at openai. He is the primary "create hype by claiming you're dangerous"-guy.

by lompad

6/17/2026 at 8:18:03 PM

[flagged]

by theplumber

6/17/2026 at 8:32:00 PM

Nah, that's ridiculous. This admin is corrupt and idiotic and it's silly to pretend that Anthropic's actions matter except in so much as they didn't bribe the president like OpenAI did.

by AbrahamParangi

6/19/2026 at 7:08:46 PM

marketing shouldn’t exist. always exaggerated claims, half-truths.

by sharts

6/17/2026 at 7:41:19 PM

"Nicholas Carlini recently rang the alarm about the dangers of AI—and now he’s part of a team arguing for the latest models to be released"

Many such cases, he was just hungry.

by moralestapia

6/17/2026 at 7:51:45 PM

Isn’t that how Anthropic started? Raise alarm bells and ride the hype train.

by wil421

6/17/2026 at 8:22:25 PM

[dead]

by k4rnaj1k

6/17/2026 at 7:33:53 PM

I have good friends in the AI industry who are the living embodiment of that Upton Sinclair quote.

"It is difficult to get a man to understand something, when his salary depends on his not understanding it."

You've never heard such strong one-sided cope until you've talked to an NVDA employee about AI. I'm not even against AI. It's just that a combination of intense financial incentives around a product that provides a good simulation of the Chinese Room has really fucked peoples brains up.

by parl_match

6/18/2026 at 5:23:11 PM

[dead]

by spacebacon

6/17/2026 at 8:22:36 PM

[flagged]

by slopinthebag

6/18/2026 at 6:57:34 AM

Could you please stop posting unsubstantive comments and flamebait? You've unfortunately been doing it repeatedly. It's not what this site is for, and destroys what it is for.

If you wouldn't mind reviewing https://news.ycombinator.com/newsguidelines.html and taking the intended spirit of the site more to heart, we'd be grateful.

by tomhow

6/18/2026 at 8:44:28 AM

My bad, I will do better.

I know you guys are spread pretty thin managing the site, so I went through the comments for this post and collected some other comments which are also probably breaking site rules.

https://news.ycombinator.com/item?id=48576022

https://news.ycombinator.com/item?id=48576065

https://news.ycombinator.com/item?id=48576162

https://news.ycombinator.com/item?id=48576183

https://news.ycombinator.com/item?id=48575948

https://news.ycombinator.com/item?id=48575697

https://news.ycombinator.com/item?id=48575877

https://news.ycombinator.com/item?id=48576280

https://news.ycombinator.com/item?id=48576241

by slopinthebag

6/18/2026 at 9:47:14 AM

Some are, some aren't, but could you please just flag comments that break the guidelines, or if they're particularly egregious, email us (hn@ycombinator.com).

by tomhow

6/18/2026 at 12:54:24 PM

Genuinely hilarious reply.

On that note it would be interesting to do a sentiment analysis of flagged replies. They seem all over the place and it would be interesting to see if there were any biases.

by ofjcihen

6/18/2026 at 7:58:22 PM

Believe it or not, between the two of us we don't read and deliberate on all of the 10,000+ comments submitted here each day. We obviously can only read a small fraction of them via routine monitoring of the site, and we're less likely to see comments in threads like this one that spend fewer than two hours on the front page. We're much more likely to see the comments that are put on our radar via community flags and emails, which is why I saw the one I replied to and not the others.

For what it's worth, sentiment analysis is unlikely to yield useful findings in this context, because the probability of a bad comment being flagged is highly correlated with the number of people who see it, which will be much lower in a thread that spends little time on the front page, but a sentiment analysis model won't have access to that data.

by tomhow

6/18/2026 at 9:33:10 PM

So 1: I have no idea why you’re replying to me as if I’m attacking you. I’m not and I’m aware of how hard moderation can be as I’ve done it.

2: My idea for sentiment analysis was geared towards bias from this site and its users, not towards the moderation team.

3: While I respect the mod team I’m incredibly unimpressed with your response here, even if this was a misinterpretation of what I meant.

Take a breather.

by ofjcihen

6/19/2026 at 7:08:15 AM

It wasn't clear whether you were making a swipe at moderation or the community or both, and I guess I tried to cover all possibilities. You've clarified that it was at the community. Thanks for that. But the guidelines specifically ask us to avoid sneering at the community, because it's repetitive and unfair to the people here. HN is large and heterogeneous.

by tomhow

6/19/2026 at 1:13:02 PM

> On that note it would be interesting to do a sentiment analysis of flagged replies. They seem all over the place and it would be interesting to see if there were any biases.

Show me where the sneer is.

You didn’t “try to cover all possibilities”. You responded to the worst interpretation of what I said and injected some heavy snark in the process.

At this point what I see is a moderator who made a mistake and needs to get the last word in which is frankly ridiculous.

Please stop interacting with me unless you have some actual rules to enforce.

by ofjcihen

6/19/2026 at 3:10:14 PM

> Show me where the sneer is.

What did you actually mean by this, in your first comment?

Genuinely hilarious reply.

by tomhow

6/19/2026 at 3:51:35 PM

That I found the reply hilarious.

Are you, a mod on this site, actually engaged in a back and forth with a user in an attempt to find anything at all to “moderate”?

You made a mistake in your interpretation of my reply and in the process of replying broke multiple of your own rules.

Your comment history is filled with reprimands of users for less.

If dang wasn’t as impressive as he is this interaction would have zero’d out any respect I have for this mod team.

If you genuinely just want to abuse your mod position go ahead and ban me for having the gall to hold you accountable I guess.

by ofjcihen

6/18/2026 at 9:47:49 PM

> Believe it or not, between the two of us we don't read and deliberate on all of the 10,000+ comments submitted here each day.

Aren't snarky comments against the rules of this forum? It would be great to have guidance on this because I'm naturally a sarcastic person and I try my best to not let it out on forums such as this.

by slopinthebag

6/19/2026 at 7:21:30 AM

I was actually trying to be good humored rather than snarky. I get that it’s not always easy to strike the right balance and sometimes the intended spirit doesn't come across.

Regarding this:

> My bad, I will do better.

Your account is only four months old but already has too many comments that are against the guidelines and the spirit of the site. If you are serious about reforming, that will be welcome and appreciated, but try to be earnest in the way you engage with us.

by tomhow

6/19/2026 at 8:30:09 AM

I agree that the intended sprit doesn't always come across. I don't purposely post ragebait or flame bait, I'm more than willing to discuss and debate with people over my comments or views. I get that they are not always popular, but I think it would be against the spirit of this site to censor one's views just because they are unpopular. It's in the spirit of the hacker to be unconventional, and unapologetic about it.

In regards to my comment that started this discussion, it's a combination of seeing other very similar comments being posted on this and other related threads, and that it's a verbatim rendition of a viral tweet / meme from X. And the implied point is that one reaps what they sow. I understand that it may strictly go against the rules of the site but it's not (imo) particularly egregious compared to other comments I regularly see, thus it's difficult to calibrate to the accepted discourse. Like I said earlier, it's extremely difficult moderating a forum, especially when it's just two people, at the same time it's also difficult to know what constitutes acceptable behaviour when the rules are applied selectively.

Perhaps with the abundance of cheap LLM's, some sort of automation would be beneficial (assuming this isn't happening already).

by slopinthebag

6/17/2026 at 7:54:45 PM

Can you imagine how cringe it would be setting up that hero image in office?

by BoorishBears

6/17/2026 at 8:14:01 PM

I’m sure it wasn’t the intent but the halo really makes him look like a saint

by lelandfe

6/17/2026 at 10:47:39 PM

Feels very much intended.

by BoorishBears