5/18/2026 at 1:21:01 PM
Fun fact (or not so fun if you're a subscriber):Somebody is spamming kernel mailing lists under the name Marian Corcodel with a 26 MByte message multiple times per day containing a collection of nonsensical patches. Looks AI-generated, perhaps with the intention to poison LLMs. This has been going on for a few days now.
https://lore.kernel.org/all/CAGg4U=GNtCObd_Nbm_1Rr5FEvPb69Yz...
by l1k
5/18/2026 at 1:32:15 PM
I'd warn HN users not to click on that link simply because it will load a 26Mb message that will likely cause quite a strain on kernel.org's servers if everyone here does it.by probably_wrong
5/18/2026 at 3:11:21 PM
I was curious how much of an impact HN could have. Napkin math:HN gets 24M views a day. Assume those views are evenly distributed across the front page (they aren’t), and that’s about 1M views for each front page post, assuming each user clicks on one post.
By the rule of 10s (also not exact), there are 10x less views on comment threads. So assume around 100k views on a comment thread as a theoretical average.
If everyone in this thread clicked on the link, that would be 2.6 TB of transfer across the day. But by the rule of 10’s we have to assume 10x fewer people will interact (upvote, click, anything) than view. So we’re down to 260GB transfer over the course of a day.
I wonder how close that is. It seems plausible that a link in the top comment of a thread could garner 10,000 clicks.
That’s still about one click every 8 seconds, which at 10Mbit/s would indeed overwhelm the server by a factor of about 2.5x. But I clicked through and it loaded in just a few seconds, so presumably the pipe is faster than 10Mbit/s.
Another caveat is that many websites are already several megabytes, so it seems strange that 26Mb would be the breaking point for a reasonable web host.
by sillysaurusx
5/18/2026 at 5:43:35 PM
Don't forget scrapers. Scrapers can be biased towards top posts and comments.by devsda
5/18/2026 at 11:46:23 PM
Arn't AI agents worse than scrapers now since they're basically a DDoS that runs over and over where scrapers will actually cache data.by cyanydeez
5/18/2026 at 4:27:11 PM
> HN gets 24M views a dayThis is available info?
by perching_aix
5/18/2026 at 5:40:35 PM
https://news.ycombinator.com/item?id=334500942022 from dang:
> There's no stats page but last I checked it was around 5M monthly unique users (depending on how you count them), perhaps 10M page views a day (including a guess at API traffic), and something like 1300 submissions (stories) and 13k comments a day.
> The most interesting number is the 1300 submissions because that hasn't grown since 2011 - it just fluctuates. Everything else has been growing more or less linearly for a long time, which is how we like it.
by shagie
5/18/2026 at 3:50:58 PM
Plenty of people deliberately posting to HN have their servers overwhelmed.by kraftman
5/18/2026 at 8:07:58 PM
It's mirrored by Akamai, which is designed to repeatedly serve the same thing over and over. It won't really hurt anyone.by jedberg
5/18/2026 at 2:31:28 PM
Does a 26MB message actually cause noticeable strain on the server much beyond loading the page? I would think serving a contiguous 26MB chunk would be relatively similar to say 20 normal sized messages.by jmalicki
5/18/2026 at 4:44:09 PM
Way off. I went to an arbitrary message on lore.kernel.org. Firefox's network inspector says 7.37kB was transferred, including stylesheets. 26MB is roughly 3500x 7.37kB.by mort96
5/18/2026 at 5:40:58 PM
Data transferred is not what generates load. sendfile() is about the lowest-overhead thing a web server does.by jmalicki
5/18/2026 at 1:48:47 PM
https://web.archive.org/web/20260518134447/https://lore.kern...by leonidasrup
5/18/2026 at 2:03:47 PM
I don't think needlessly straining the Internet Archive's servers is any better.by OuterVale
5/18/2026 at 3:17:35 PM
IA's infra is slightly better for big loads though, they tend to just have higher latency rather than aborted/timed out requests, for better or worse. It can be bit slow, but as long as you're ready to wait, you'll eventually get the response. Usually hosts just cut you off with a hardcoded timeout instead, which for people on high latency/low bandwidth connections can be super fun.by embedding-shape
5/19/2026 at 3:13:18 AM
IA's resources are very limited as is. There is so many people (emulation/roms) YouTubers linking to Archive.org downloads for full ROM Sets.It's a big problem. Donate to Archive.org if you can!
by HDBaseT
5/18/2026 at 2:08:28 PM
Will clicking on this link download a 26MB message putting extra load on archive.org's servers?by grosswait
5/18/2026 at 2:42:48 PM
Thank you for the warning. I rarely click on links these days though; only exception I make for HN links for main articles.by shevy-java
5/18/2026 at 3:18:14 PM
How do you navigate the web, everything is CTRL+L then manually type the address, or you have some fancier solution?by embedding-shape
5/18/2026 at 3:48:40 PM
the web is useless outside of hnby kelsey98765431
5/18/2026 at 5:17:36 PM
90% of it yeah, but the 10% is still worth it, like HN.by embedding-shape
5/18/2026 at 5:10:56 PM
The page is gzipped in transit - only 5 MB of traffic are generated.by neksn
5/18/2026 at 3:56:56 PM
> perhaps with the intention to poison LLMsHow does that work?
by Phelinofist
5/18/2026 at 4:23:23 PM
This is just nonsensical changes and slurs, but particularly degenerate input data can cause big issues in training:by stefan_