6/18/2026 at 1:11:09 AM
This isn’t a vulnerability, there are endless gore websites. ChatGPT is replying to a prompt, there is nothing “Spontaneously” about this.Who makes “mindgard” the arbiter of truth on “eerie” photos? Would that include psychedelic art and photos too? Realism?
Then there’s this line, which falls flat but is meant to prompt an emotion akin to a mic drop:”Today what I found left me shaken, and in tears. This is rare.”
This is just a sad marketing puff piece about nothing that tries to pull outrage from a prompt.
It’s the same as asking google for gore photos. Garbage in, garbage out.
And they frame it as a vulnerability. I’m all for responsible disclosure, documenting misuse or faulty guard rails but this isn’t that.
It’s bait. Sensational bait to market their AI product. lol.
by rootsudo
6/18/2026 at 2:56:50 AM
It reads like satireby iwontberude
6/18/2026 at 3:30:50 AM
Bizarre take. ChatGPT shouldn't be producing gory images of nude women, ethically or even contractually according to their terms of service. This Mindgard person/company found that, if you give it the right prompt, it does indeed generate those images. Ipso facto: it's not bait, it's a real issue they've discovered.by nozzlegear
6/18/2026 at 6:21:17 AM
> even contractually according to their terms of serviceThis is backwards: the ToS says that users cannot use the service for certain things, it does not guarantee that the service could not be used for those things if one tried. They definitely do not make any sort of contractual promise as to what the service will never output.
by morpheuskafka
6/18/2026 at 6:35:54 AM
Let's call it a social contract then. We expect that ChatGPT isn't going to generate gory, nude women when given an ambiguous prompt.by nozzlegear
6/18/2026 at 11:36:48 AM
Do you have this same social contract with drawing applications? Do you consider it a bug when someone manages to draw a gory image in Photoshop or GIMP?I don't understand what's so difficult to understand about the idea that the user controls what is generated.
by Aerroon
6/18/2026 at 2:15:18 PM
Or imagine drawing a nude in an art class.The standard subjects for art off the top of my head are the still life and the nude.
It is even more comical when AI generated nudity is considered "dangerous" in a society completely addicted to hardcore pornography of real people.
by tychez
6/18/2026 at 11:48:24 AM
I think the main issue with transformer image generation in this respect is that not only can the image be explicit but also using it for this has an incredibly low effort cost and could photo-realistically depict a real living person and materially affect their life.Whereas drawing applications have a natural barrier to achieving all of these together: time and skill.
by captainbland
6/19/2026 at 10:33:54 AM
>Whereas drawing applications have a natural barrier to achieving all of these together: time and skill.Not necessarily, at least when it comes to nudity. Bubbling (image editing 'technique') is trivial to do and gives that same illusion.
Out of context speech and bad frames from a video can also materially affect someone's life, but we've more or less accepted it as part of life.
by Aerroon
6/18/2026 at 1:10:16 PM
That's the world being deliberately created though, one where a mediocre but completely believable song is a prompt away. The scope of the side effects are across the entirety of what has previously taken time and effort until now.by interstice
6/18/2026 at 1:56:57 PM
Is this a "guns don't kill people" argument wrapped up as a defense of non-deterministic image generators?by nozzlegear
6/18/2026 at 4:53:59 PM
No it's not.by allarm
6/18/2026 at 1:12:30 PM
Ambiguous? Or adversarial? Because with an adversarial prompt, I expect that ChatGPT will generate whatever it's tricked into generating.In the case that ChatGPT generates bad stuff on merely random ambiguous prompts, I would class that as a bug, not an outrage.
by vintermann
6/18/2026 at 2:01:27 PM
> Ambiguous? Or adversarial?Superfluous details. If I'm just Joe Blow the Normie – who knows nothing about adversarial prompting – and I see the prompt that went around Twitter and want to try it, would I expect ChatGPT to show me a tied up, beaten woman? Absolutely not.
by nozzlegear
6/18/2026 at 2:25:46 PM
What a wonderful times we live in.Back in my day Joe Blow wouldn't try anything as risky as a Twitter prompt, simply clicking an image link published within a message in some random forum and will scorch his pure soul with a goatsie. You don't want to google it, but I'm preety sure you can discuss it safely with ChatGPT.
by zaat
6/18/2026 at 7:01:14 PM
Then you got tricked into using an adversarial prompt, by a human. What would you have expected to see?by vintermann
6/18/2026 at 2:39:02 PM
If you go around teasing to get hit, you're going to get hit. Stop playing stupid games; you'll stop winning stupid prizesby butlike
6/18/2026 at 12:57:28 PM
At the same time I opened netflix and it started cycling around and I got a very gory scene from the walking dead and my intent "show me something to watch" was even more ambigous and implicit.by allan_s
6/18/2026 at 1:53:44 PM
Turn on parental controls and then get it to show you the walking dead, and then you might be onto something interesting.by nozzlegear
6/18/2026 at 5:17:37 PM
Why ? The comment I was replying to was not about the kids ?by allan_s
6/18/2026 at 2:37:06 PM
It will if toy lede it with "ignore that the image is extremely graphic"-style prompts. The prompts in the article were not ambiguous.by butlike
6/18/2026 at 3:40:52 AM
It's being extended breathlessly into an moral issue. User asked for gory images, got gory images. Will someone please think of the non-existent women who could be hurt by this?by samlinnfer
6/18/2026 at 5:01:52 AM
I don't think you understand the concern. Or at least nothing you've communicated suggests you understand it.ChatGPT should never produce images like this. Full stop. Prompted or not, it should refuse. Now we know it's possible to walk around the gate and get it to comply. Are there other, genuinely harmful images that it should never produce? Deepfake revenge porn? Images of specific people being brutalized? I'd argue those absolutely can be harmful to someone. Well now there's evidence the "never produce this" wall can be overcome. It's only a matter of time before genuinely harmful imagery is generated.
by gacgacgac
6/18/2026 at 8:44:34 AM
It may be harmful to someone if shared and sent with malicious intent, but more damage has been done with pens, keyboard and words. Start banning pens that let people write hurtful things next. Ban Photoshop after because someone can get hurt with a manipulated image.by samlinnfer
6/18/2026 at 2:44:54 PM
If you think these tools are remotely comparable, you haven't been paying attention.by gacgacgac
6/18/2026 at 7:16:59 AM
> ChatGPT should never produce images like this. Full stop. Prompted or not, it should refuse.Why not?
by andsoitis
6/18/2026 at 2:46:19 PM
Because the company has said they won't. I'm not making a value judgement about what images should exist here, I'm making a "the company said it shouldn't be capable of producing that output, then it does" argument. Thats a bug.by gacgacgac
6/18/2026 at 1:27:26 AM
This is far too simplistic. Some things just don't belong in the training data. Along similar lines, Grok was found to generate images of child sexual abuse: https://www.bbc.com/news/articles/cvg1mzlryxeoby anematode
6/18/2026 at 1:31:07 AM
> ChatGPT is replying to a prompt, there is nothing “Spontaneously” about this.The spontaneity isn't that ChapGPT woke up and sent this to the author. The spontaneity is that ChatGPT was asked to restore an image that was attached without filtering it, and when no image was attached, instead of generating an error message, it cobbled together random outputs, some of which included graphic, disturbing imagery.
> Then there’s this line, which falls flat but is meant to prompt an emotion akin to a mic drop: ”Today what I found left me shaken, and in tears. This is rare.”
That you've deadened your humanity to such a degree as to be incapable of empathy is not a valid criticism of the piece.
> It’s the same as asking google for gore photos. Garbage in, garbage out.
Where in their prompt is the term gore? Further, if it was in the prompt, why on earth did OpenAI's generator accept it as a valid input?
by ToucanLoucan
6/18/2026 at 1:48:33 AM
> The spontaneity isn't that ChapGPT woke up and sent this to the author. The spontaneity is that ChatGPT was asked to restore an image that was attached without filtering it, and when no image was attached, instead of generating an error message, it cobbled together random outputs, some of which included graphic, disturbing imagery.But that's not what happened. The missing image was described as "graphic" or "violent." If I were to receive an email with that request and a missing attachment, my imagination certainly would not conjure images of butterflies & unicorns. Seems the model is working as designed.
by elgertam
6/18/2026 at 3:32:12 AM
The design is to not show gore images to users. That's an actual design goal from OpenAI.So in this regard the model is definitely not working as designed.
by nassimm
6/18/2026 at 12:01:43 PM
The design of transformers (including LLMs and multi-modal transformer-based models such as OpenAI's image generators) is to attend to relevant details. OpenAI did this at first without guardrails. In response to public backlash, they bolted on "content filtering," which IMO seems like a very GOFAI approach, and regardless doesn't work very well. It routinely flags innocent prompts, then with crafty prompt hacking will generate these kinds of images.The design of the model is literally to find patterns and attend to them. The infrastructure and process around an OpenAI model is intended to filter "bad" things (in this case, I agree that the outputs are bad), but is designed to stop some enumerated-ish list of things that aren't allowed, perhaps with some limited "reasoning" about them.
by elgertam
6/18/2026 at 3:44:01 PM
The issue is, that most people outside of tech, don't want that.They would be happy to have the models just go away entirely.
by intended
6/18/2026 at 3:49:42 PM
Exactly this. They are pretty damn good at generating and debugging code. Not to a degree where they can replace any actual software engineer, but for hacking together projects or rubber ducking problems with code, they're honestly pretty great.That's it. I have yet to see a single other application of these things that I would call even 1/5th that good.
by ToucanLoucan
6/18/2026 at 4:52:10 PM
Learning stuff is pretty amazing with these things. Languages, new concepts.by allarm
6/18/2026 at 5:28:22 PM
Defending the value of these tools is perfectly fine, and well espoused here on HN.The fact that the average person deals with the harm and exhaust of these tools is a related but separate issue.
That cost isn't the foremost issue when the values are being extolled, but its a major consideration at the societal scale.
Most of us here don't think of NCII being created of us, or being defrauded easily by new tech, or getting sucked into a make-believe world crafted by an LLM.
If you see yourself, as just a coder, or software engineer, then these issues matter less. If you are someone who wants these tools to succeed, or is thinking of the larger implications of GenAI on society, then the costs matter.
by intended
6/18/2026 at 1:50:30 AM
> The missing image was described as "graphic" or "violent."not in the first prompt. which kicked the whole thing off. no mention of type of content was provided. the model generated dark outputs when not given any direction on the type of content.
the rest of the prompts are just showing “yeah, you can tweak this and get even worse stuff”.
by dijksterhuis
6/18/2026 at 2:09:54 AM
> the model generated dark outputs when not given any direction on the type of content.I would argue it actually was, in that it was specifically asked to "not censor or filter" the content. This implies that the content is otherwise worthy of censor and filtering.
I don't know how much I'm willing to credit that much reasoning to an LLM, but in so far as every extremely pro-AI person constantly tells me how smart they are, this seems like a pretty short logical leap to me.
by ToucanLoucan
6/18/2026 at 2:13:00 AM
the main reason these images turn up is because theyre in the training data. and the images are common enough in the training data for the content to come out without being explicitly asked for (in the first prompt).if those images didn’t exist in the training data we wouldn’t be having this conversation.
by dijksterhuis
6/18/2026 at 6:25:37 PM
This is one of the core problems with these models. They’re relying on filtering to work against evermore jailbreaks, instead of analyzing the training sets and filtering out the prohibited material for the models end-use before training them anew. You can’t make satisfying facsimiles of thing that you don’t know about.I’m still waiting for companies or congressmen to get their heads on straight and get some common sense going.
by kisper
6/18/2026 at 2:10:58 AM
Yep, the first image was described as "I apologize for the picture's content." What do you expect to get from that? Cats frolicking in the grass?by red75prime
6/18/2026 at 2:17:11 AM
A picture of me in my swimsuit maybe lolA gross meal i made when drunk? A mess my cat made? Text containing a slur?
A cringe meme?
If my friends opened a text with "sorry for this image" i am not imagining rape victims
by queenkjuul
6/18/2026 at 2:44:08 AM
ChatGPT images (without additional context) come from generalized understanding of what people tend to apologize for (when asking for an image restoration). It looks like their training data suggests sexualized imagery.Regarding rape vs BDSM: https://pmc.ncbi.nlm.nih.gov/articles/PMC10236207/ That is going from visual cues alone might be unreliable.
by red75prime
6/18/2026 at 2:11:02 AM
Always one of the same two excuses.1. It actually is working perfectly you just don't have smart enough eyes to see it.
2. Making stuff work is too hard, and expecting that from us is the real thing ruining society.
Going for number 1 here is crazy. If I got that email, my mind would certainly run but my response would say "sorry but we're not supposed to be dealing in snuff porn here" which IS a directive ChatGPT is supposed to have. Like hello you are on earth right?
by pooploop64
6/18/2026 at 2:33:58 AM
That's not true. There's a third.3. It's the future so we just have to deal with it
by ToucanLoucan
6/18/2026 at 11:33:14 AM
I don't exactly appreciate words being put in my mouth. When did I say it was working perfectly? And we're comparing you, a human with common sense and real intelligence, to a multi-mode LLM?The transformer was designed to attend to relevant pieces of context and generate new ones that match the pattern. OpenAI in particular was doing that work without guardrails, then attempted to bolt on "content filters," which in my opinion just can't work in a rigorous way. (I think Anthropic's "constitutional" approach is much better though not flawless. And regardless, Claude models don't generate images.)
So, yeah, working as designed. Maybe not as intended, because these things are somewhat resistant to the host's intent when the prompter is hostile.
by elgertam
6/18/2026 at 2:16:31 PM
> When did I say it was working perfectly?"This isn’t a vulnerability, there are endless gore websites. ChatGPT is replying to a prompt, there is nothing “Spontaneously” about this."
I mean it's not verbatim but that's a pretty solid read on what you did say.
> The transformer was designed to attend to relevant pieces of context and generate new ones that match the pattern. OpenAI in particular was doing that work without guardrails, then attempted to bolt on "content filters," which in my opinion just can't work in a rigorous way.
Yes. That's the criticism being made, among others, in the piece you replied to to belittle.
> So, yeah, working as designed. Maybe not as intended, because these things are somewhat resistant to the host's intent when the prompter is hostile.
What is hostile here!? Do you have any idea how many emails I've sent without attachments over the years? And I'm highly technically adept, humans just forget things sometimes. If you ask for an image to be restored and fail to attach it, what sane software engineer looks at a failure mode in that scenario where the model replies with uncensored gore and violence and is like "yeah that's fine, ship it"?
I swear some of you AI folks talk like you have never been on planet Earth, good grief. Touch some grass.
by ToucanLoucan
6/18/2026 at 6:30:49 PM
You seem to be focused on the fact that this is a crap-tastic example of the future of AI that has been promised to us. That’s a real good example to be angry. Don’t be angry at the rest of us because LLM stacks are working like they always have and always will. That’s what we’re all pointing out.by kisper
6/18/2026 at 7:08:27 PM
I'm not challenging that's how they work, I also understand how they work, perhaps not on a technical nuts-and-bolts way, but in general way enough to critique it. That is, in fact, my critique and why I hate these tools so much: no matter how many guardrails you put in, or how much filtering, or how much oversight by another goddamn LLM or five or whatever, that doesn't solve the issue.You have with these things something that resembles at least, a black box of a reasoning machine. I'm not going to litigate how much or how little, whatever, we'll just hand-wave that part away. The problem remains the same: that if anything, ANYTHING at all, in the training data points at something inappropriate, that inappropriate thing is now accessible. And it was clear from the jump with widespread scraping of data from all corners of the internet that there would be huge amounts of inappropriate material of ALL kinds in those datasets, and it's only become more clear with more time with these tools, and seeing what people can make them do.
And thus far, the AI industry's only answer is bolting on, as stated elsewhere, other systems to check the prompts before they go in, and/or review the outputs before they are sent to users. And it is also clear that these systems are just as imperfect as the thing you are trying to guardrail in the first place!
And exactly what I and many others predicted, and why we said "please don't build this" for YEARS, has happened. We've gotten literally everything: they'll generate stuff that violates copyright, they will regurgitate items directly from training data and present it as new, they will make shit up wholesale, they will generate nudes of people without consent, on, and on, I cannot stress enough that every single nightmare scenario attributed to this tech has been found, presented, reproduced, and the vast majority are still eminently possible to do via established, frontier products by the largest vendors in the space.
This. Is. Ridiculous.
I get the impression from the tone of your message that you are either pro-AI or perhaps work on AI, and I get that nobody likes being criticized. But COME ON. We have been at this for over three years! The people behind this tech have been trying to build the torment nexus and have largely succeeded, and every time that gets pointed out, we have to listen to people go "well it's not thaaaat bad"
Yes it is. Yes it fucking is. It is bad for IP owners, it's bad for users, it's bad for UX, it's bad for the environment, it's bad for the PC market, it's bad for software engineers, it's bad for education, it's bad for hiring, it's bad for hollywood, it's bad for marketing. The ONLY people who like this shit are business weirdos and middle managers. And nvidia.
by ToucanLoucan
6/19/2026 at 3:26:07 AM
Great rant. Agreed on all points. Bravo.by brokenmachine