4/17/2026 at 2:58:37 PM
A lot of geolocation data on the market is anonymized, keyed to medium-lived unique IDs that can't be mapped to other identifiers. The problem is that if you have precise locations, or enough samples that you can apply statistics to find precise locations, in many cases you can de-anonymize the IDs. You can purchase address and resident listings from a number of different data vendors, and by checking where the device returns to at night you can figure out its home address. Then if you find information on the residents (work locations, schools, etc.), you can check whether the device goes where each resident of that home address is likely to go, and you now have a pretty good idea of exactly who the device belongs to.
by Johnbot
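The "where does the device return to at night" step can be sketched in a few lines. This is a minimal illustration, not any vendor's actual method: the function name `infer_home_cell`, the 22:00 to 06:00 night window, and the three-decimal rounding (roughly 100 m cells) are all assumptions, and timestamps are assumed to already be in local time.

```python
from collections import Counter
from datetime import datetime


def infer_home_cell(pings, night_start=22, night_end=6, precision=3):
    """Guess a home location for ONE pseudonymous ID.

    pings: iterable of (iso_timestamp, lat, lon), timestamps in local time.
    Returns the rounded (lat, lon) cell seen most often at night, or None.
    """
    cells = Counter()
    for ts, lat, lon in pings:
        hour = datetime.fromisoformat(ts).hour
        # Keep only pings inside the assumed night window.
        if hour >= night_start or hour < night_end:
            cells[(round(lat, precision), round(lon, precision))] += 1
    return cells.most_common(1)[0][0] if cells else None


# Made-up pings for a single ID: three at night, one at midday.
pings = [
    ("2026-04-13T23:10:00", 40.71284, -74.00601),
    ("2026-04-14T02:30:00", 40.71279, -74.00597),
    ("2026-04-14T13:00:00", 40.75800, -73.98550),
    ("2026-04-14T23:45:00", 40.71290, -74.00603),
]
home = infer_home_cell(pings)
print(home)
```

The resulting cell would then be joined against a purchased address listing, which is the part that turns a pseudonymous ID into a short list of names.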
4/17/2026 at 3:31:28 PM
There is no such thing as anonymized location data when you have the location of something where and when they sleep and work.
It's a rhetorical fiction the ad industry tells itself.
by rockskon
4/17/2026 at 5:14:59 PM
Right, there's probably no other phone in the world that typically stops for hours within 1000 feet of my bed and typically stops on Monday-Friday within 1000 feet of my work desk.
by Terr_
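One way to check that intuition against a dataset is to count how many pseudonymous IDs share the same (home, work) pair. The sketch below uses made-up cell labels and a hypothetical helper, `unique_fraction`; it only illustrates the counting, not any real dataset.

```python
from collections import Counter


def unique_fraction(devices):
    """devices: dict of pseudonymous_id -> (home_cell, work_cell).

    Returns the fraction of IDs whose (home, work) pair appears exactly once.
    """
    counts = Counter(devices.values())
    unique = sum(1 for pair in devices.values() if counts[pair] == 1)
    return unique / len(devices)


# Made-up example: two IDs share a pair (say, a couple who commute together).
devices = {
    "id-a": ("H1", "W1"),
    "id-b": ("H1", "W2"),
    "id-c": ("H2", "W1"),
    "id-d": ("H3", "W3"),
    "id-e": ("H3", "W3"),
}
print(unique_fraction(devices))
```

Sharing a home cell or a work cell with someone else does not help; only sharing both keeps an ID in a crowd, and that crowd is usually tiny.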
4/17/2026 at 6:36:03 PM
Now think what Lavrentiy Beria and an LLM could have done with that.
by mapt
4/17/2026 at 7:13:19 PM
Somebody once said that if Stalin had had access to television, he would never have had to kill 20+ million people. What would he do with all that data? No idea.
by wafflemaker
4/18/2026 at 2:03:54 PM
If all you've got is full political power and control over propaganda networks, you won't get the USSR. You'll get Hungary between 2010 and 2026. It works well, but in the critical moments when things start going wrong you need to kill people to maintain power, or else your nascent autocracy collapses as quickly as Orban's.
by kspacewalk2
4/18/2026 at 6:50:22 AM
I'm no fun of Stalin, but this meme about 20+ million victims needs to be purged.
"The scholarly consensus affirms that archival materials declassified in 1991 contain irrefutable data far superior to sources used prior to 1991, such as statements from emigres and other informants.
Before the dissolution of the Soviet Union and the archival revelations, some historians estimated that the numbers killed by Stalin's regime were 20 million or higher. After the Soviet Union dissolved, evidence from the Soviet archives was declassified and researchers were allowed to study it. This contained official records of 799,455 executions (1921–1953), around 1.5 to 1.7 million deaths in the Gulag, some 390,000 deaths during the dekulakization forced resettlement, and up to 400,000 deaths of persons deported during the 1940s, with a total of about 3.3 million officially recorded victims in these categories. According to historian Stephen Wheatcroft, approximately 1 million of these deaths were "purposive" while the rest happened through neglect and irresponsibility. The deaths of at least 5.5 to 6.5 million persons in the Soviet famine of 1932–1933 are sometimes included with the victims of the Stalin era." [0]
[0] https://en.wikipedia.org/wiki/Excess_mortality_under_Joseph_...
by drysine
4/19/2026 at 10:29:12 AM
lol naturally the criminals were obsessed with honestly keeping comprehensive official records of their misdeeds
by nacozarina
4/19/2026 at 1:01:02 PM
> I'm no fun of Stalin
I would argue for the generality of this characterization
by maximinus_thrax
4/18/2026 at 12:44:53 AM
Only thing better to rule with is a network-connected telescreen that monitors and issues orders to the proles.
by kevin_thibedeau
4/18/2026 at 7:51:11 AM
So Instagram and TikTok?
by nlitened
4/18/2026 at 10:07:48 PM
The party needs a modern alternative to FauxNews to manipulate the youth. They've already sunk their claws into TikTok.
by kevin_thibedeau
4/18/2026 at 10:29:02 AM
Please kill me first
by cwmoore
4/17/2026 at 10:50:28 PM
Pretty sure it would be hard to enslave these people through television
by breppp
4/17/2026 at 11:12:48 PM
Would it be? I'd argue the current US administration is entirely propped up by television. Hell, the president seems to "rule" based on what Fox News said last night.
by hightrix
4/18/2026 at 12:26:11 AM
A slightly different and no more charitable perspective is that the people pulling the president's strings are the same people pulling Fox News's strings.
by lithocarpus
4/18/2026 at 7:14:12 AM
Never saw the current US administration shipping people to labor camps with a life expectancy of a single winter
by breppp
4/18/2026 at 10:20:23 AM
What is the life expectancy in CECOT?
by tdeck
4/18/2026 at 5:12:08 PM
Let's see, during Stalin's rule 18 million people went through forced labor camps and roughly 10% died, around 1.8 million.
Let's add around 5 million for man-made famine, and probably 2 million for arbitrary executions and deportations, while many estimate the full death count as between 15-20 million.
As far as I can understand, the top range of estimates for CECOT, which is a non-American facility, is that 500 died out of around 20,000 inmates. So the scale is a bit... different
I think the issue here is that contrary to popular belief, not every wrong thing is the same
by breppp
4/18/2026 at 9:51:48 PM
Death rates are particularly hard to compare because part of the idea of El Salvador's system is that people are expected to die there - there is no release policy - yet most of them are young healthy men recently detained.
If we just look at incarceration rates:
CECOT is one facility, but around 2% of El Salvador's population has been imprisoned by Bukele's operation.
In 1950 the USSR had a population of around 180 million, and the gulag system was at its height with a population of 2.5 million, very similar.
The US prison system has been around 1% from the peak of the War On Drugs until recent fads in liberalized sentencing, currently holding at 0.7%, one of the highest in the world if you exclude ethnic purges like Xinjiang or Gaza.
by mapt
4/19/2026 at 5:46:17 PM
Imagine how lost your moral compass needs to be to defend Stalin because you don't like Trump.
Apart from the fact that people were released from El Salvador's system, the population percentages are wrong for El Salvador, the USSR and the US, there is a difference between slave labor camps and a penal system, and Gaza is not a prison.
But what are you really saying, that the 200-500 dead in El Salvador, most not associated with Trump, make Trump equivalent to Stalin's 15 million dead? Does that make sense?
by breppp
4/18/2026 at 7:55:06 AM
Ever seen entire countries of people locked up in their homes within a week, for months?
by nlitened
4/18/2026 at 3:12:42 PM
I’m pretty sure most phones have a higher location accuracy than 1000 feet.
by xigoi
4/17/2026 at 3:58:32 PM
And with LLMs now it’s easier than ever to piece the parts together. Companies were doing it before we even knew what LLMs were capable of.
Edit: It's a rhetorical fiction the ad industry tells us.
by Forgeties79
4/17/2026 at 10:27:25 PM
I think this begs the question of what anonymous data means. Sure, my visit to HN is "anonymous" in that it doesn't say "abustamam visited this site", but piece together all the other visits that carry my "anonymous ID" and eventually they paint a pretty nice picture of who I am.
by abustamam
4/18/2026 at 12:29:41 AM
Does it map to a single, identifiable person or something close enough that the distinction is meaningless?
Then it's not anonymous.
Simple as that.
by rockskon
4/18/2026 at 4:42:50 AM
My point is that even completely anonymous data that conforms to what you just said can easily become de-anonymized when contextualized to other "anonymous" data.
by abustamam
4/18/2026 at 5:31:55 AM
A marketer's definition of anonymized is worthless. It's a fantasy they want everyone else to believe in.
If it can be "de-anonymized" then it was never anonymous to begin with.
"De-anonymized" is quite literally an oxymoron.
by rockskon
4/18/2026 at 6:08:16 PM
> A marketer's definition of anonymized is worthless. It's a fantasy they want everyone else to believe in.
I'm using your definition.
> Does it map to a single, identifiable person or something close enough that the distinction is meaningless?
Also
> If it can be "de-anonymized" then it was never anonymous to begin with.
Well sure, that's the point I was trying to make in my rhetorical question above. Individual pieces of data may be "anonymous" but put together with other anonymous data that can be traced to a single source and suddenly you can figure out quite easily who this person is. The data itself is still technically anonymous but it can be pieced together.
by abustamam
4/18/2026 at 2:50:57 PM
Does that mean that no non-post-quantum encryption was ever actually encryption because in 20 years someone will be able to decrypt things?
by thfuran
4/17/2026 at 4:17:54 PM
We should have learned this lesson 20 years ago when researchers were able to deanonymize a lot of the Netflix Prize dataset, which contained nothing except movie ratings and their associated dates.
https://arxiv.org/abs/cs/0610105
If movie ratings are vulnerable to pattern-matching from noisy external sources, then it should be obvious that location data is enormously more vulnerable.
by teraflop
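The pattern-matching idea can be illustrated with a toy matcher. This is a deliberately simplified sketch and not the scoring algorithm from the linked paper: the function `best_match`, the tolerances, and the data are all hypothetical. It scores each anonymized record by how many of the adversary's noisy observations it agrees with, and only reports a match when one record clearly stands out.

```python
def best_match(known, records, rating_tol=1, day_tol=14):
    """Toy linkage attack on a sparse ratings dataset.

    known:   {movie: (rating, day)} noisy facts about the target from an
             outside source (e.g. public reviews).
    records: {anon_id: {movie: (rating, day)}} the "anonymized" dataset.
    Returns the anon_id with the strictly highest agreement, else None.
    """
    def score(rec):
        return sum(
            1
            for movie, (rating, day) in known.items()
            if movie in rec
            and abs(rec[movie][0] - rating) <= rating_tol
            and abs(rec[movie][1] - day) <= day_tol
        )

    ranked = sorted(records, key=lambda k: score(records[k]), reverse=True)
    if not ranked:
        return None
    top = ranked[0]
    runner_up = score(records[ranked[1]]) if len(ranked) > 1 else 0
    # Refuse to guess when the best candidate does not stand out.
    return top if score(records[top]) > runner_up else None


known = {"A": (5, 100), "B": (2, 130), "C": (4, 200)}
records = {
    "u1": {"A": (4, 103), "B": (2, 128), "C": (4, 210)},
    "u2": {"A": (5, 400), "D": (3, 10)},
    "u3": {"B": (1, 135)},
}
print(best_match(known, records))
```

Swap "movie" for "map cell" and "day" for "hour of week" and the same matcher applies to location traces, which tend to be sparser per person and so easier to single out.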
4/18/2026 at 3:00:35 AM
> In contrast to previous attacks on micro-data privacy [22], our de-anonymization algorithm does not assume that the attributes are divided a priori into quasi-identifiers and sensitive attributes. Examples include anonymized transaction records (if the adversary knows a few of the individual's purchases, can he learn all of her purchases?), recommendation and rating services (if the adversary knows a few movies that the individual watched, can he learn all movies she watched?), Web browsing and search histories [12], and so on. In such datasets, it is impossible to tell in advance which attributes might be available to the adversary;
Is location data high-dimensional, though?
by totetsu
4/17/2026 at 3:37:15 PM
exactly. calling it 'anonymized' is pure security theater once you have enough data points to map out someone's daily routine.
waiting for legislation or eulas to fix this is a lost cause since adtech always finds a loophole. the fix has to be architectural: moving toward stateless proxies that strip device identifiers at the edge before they even hit upstream servers. if the payload never touches a persistent db there is literally nothing to de-anonymize. stateless infra is the only sane way forward
by vovanidze
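A minimal sketch of the "strip identifiers at the edge" idea, assuming a JSON payload. The field names in `IDENTIFIER_FIELDS` and the function `strip_identifiers` are hypothetical, chosen only for illustration; a real proxy would also have to deal with headers, source IPs and anything it logs.

```python
import json

# Hypothetical identifier fields; a real payload schema will differ.
IDENTIFIER_FIELDS = {"device_id", "advertising_id", "ip", "user_agent"}


def strip_identifiers(raw_body: bytes) -> bytes:
    """Drop identifier fields from a JSON payload before forwarding it.

    Nothing is stored: the function is pure, so the proxy keeps no state.
    """
    payload = json.loads(raw_body)
    cleaned = {k: v for k, v in payload.items() if k not in IDENTIFIER_FIELDS}
    return json.dumps(cleaned, sort_keys=True).encode()


incoming = b'{"device_id": "abc123", "lat": 40.7, "lon": -74.0, "event": "open"}'
forwarded = strip_identifiers(incoming)
print(forwarded.decode())
```

Note that this only removes the explicit ID. The coordinates still pass through, and as the rest of the thread argues, a stream of precise locations can act as an identifier on its own if the upstream side keeps it.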
4/17/2026 at 3:52:13 PM
To be honest, I feel like this is where iOS and Android are failing us. Why is every app allowed to embed a bunch of trackers? Only blocking cross-app tracking on user request as iOS does is not enough (and data of different apps/websites can be correlated externally).
by microtonal
4/17/2026 at 4:14:50 PM
Because we don’t enforce antitrust law in this country and the people that make those decisions profit from the ads.
by CPLX
4/17/2026 at 6:09:28 PM
> To be honest, I feel like this is where iOS and Android are failing us. Why is every app allowed to embed a bunch of trackers? Only blocking cross-app tracking on user request as iOS does is not enough (and data of different apps/websites can be correlated externally).
Even if Google and Apple both want to commit to fighting this, it becomes a game of whack-a-mole, because there are all sorts of different ways to track users that the platforms can't control.
As an easy example: every time you share an Instagram post/video/reel, they generate a unique link that is tracked back to you so they can track your social graph by seeing which users end up viewing that link. (TikTok does the same thing, although they at least make it more obvious by showing that in the UI with "____ shared this video with you").
by chimeracoder
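The per-share link mechanism can be sketched generically. This is an assumption about how such a scheme could work, not a description of Instagram's or TikTok's actual implementation; the class `ShareLinkTracker`, the `?t=` parameter and the example.com URL are all made up.

```python
import secrets


class ShareLinkTracker:
    """Toy model of per-share tracking links (hypothetical design)."""

    def __init__(self, base_url="https://example.com/s/"):
        self.base_url = base_url
        self.sharer_by_token = {}  # token -> (sharer, post_id)
        self.edges = set()         # inferred (sharer, viewer) pairs

    def create_link(self, sharer, post_id):
        # Every share gets a fresh random token tied to the sharer.
        token = secrets.token_urlsafe(8)
        self.sharer_by_token[token] = (sharer, post_id)
        return f"{self.base_url}{post_id}?t={token}"

    def record_view(self, url, viewer):
        # Whoever opens the link is linked back to whoever created it.
        token = url.rsplit("?t=", 1)[1]
        sharer, _post_id = self.sharer_by_token[token]
        self.edges.add((sharer, viewer))
        return sharer


tracker = ShareLinkTracker()
link = tracker.create_link("alice", "p1")
print(tracker.record_view(link, "bob"))
```

Because the token travels inside the URL, it survives being pasted into any other app, which is why OS-level tracker blocking does not touch it.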
4/17/2026 at 4:27:09 PM
i'm not sure about allowed. perhaps required may be closer.
why would someone include tech that makes people think twice about using the app, unless it is required if you want to "sell" in a particular venue.
if you're developing geolocation based apps, location tracking is a core function.
a calendar absolutely does not require location tracking beyond which side of the prime meridian you are on.
by rolph
4/17/2026 at 5:01:01 PM
> if your developing geolocation based apps, location tracking is a core function.
But the subsequent sale of that data is not, and that is the discussion here.
by nickburns
4/17/2026 at 6:08:55 PM
and the reason why that data is available for sale starts with forced collection of data, if you want to participate in an app store as a developer.
you can't sell what you don't have unless you lie lower than a rug.
fix the data collection problem and a second order effect of no data for sale emerges.
by rolph
4/17/2026 at 6:59:39 PM
Are you suggesting Android/iOS app developers are forced into data collection somehow? If so, how? I'm genuinely curious.
by nickburns
4/17/2026 at 8:00:25 PM
> why would someone include tech that makes people think twice about using the app, unless it is required if you want to "sell" in a particular venue.
Because the overwhelming majority of people don't think twice about this tech.
I do, and that's why I use a lot of web tools or old-fashioned phone calls, but most people think metadata=unimportant and assume that the purpose of the app is what it does for them rather than to gather their personal information for sale.
by LeifCarrotson
4/17/2026 at 6:06:32 PM
How is this legal under the GDPR? There are clear examples in the Citizen Lab document of a user being tracked inside the EU from outside.
Is there not also a requirement for clean consent? I.e., a weather app can’t track your precise location?
by uxhacker
4/17/2026 at 3:29:37 PM
Companies exist that de-anonymize other data brokers' data. That lets the other data brokers claim they have anonymized data while end users get everything.
by sroussey
4/17/2026 at 3:47:56 PM
you could probably run an anonymization company at the same time you run a de-anonymization company
by ImPostingOnHN
4/17/2026 at 5:37:37 PM
Best of both worlds - legal and profitable /s
by gessha
4/17/2026 at 7:15:09 PM
> enough samples that you can apply statistics to find precise locations, in many cases you can de-anonymize the IDs
I think a lot of people don't realize the power of a big enough sample size. With enough samples even something pretty innocent looking like your daily step counter could make you identifiable.
As far as I know we don't have large enough databases to make this happen in practice, but I don't think this is impossible in the future.
by nzach
4/17/2026 at 7:34:22 PM
How large are you estimating is "large enough"?
by jandrewrogers
4/17/2026 at 3:57:40 PM
Location and identity are inextricably linked. You can't destroy identity without also destroying location and location is critical for myriad purposes.
The analytic reconstruction of identity from location is far more sophisticated than the scenarios people imagine. You don't need to know where they live to figure out who they are. Every human leaves a fingerprint in space-time.
by jandrewrogers
4/17/2026 at 4:03:28 PM
> and location is critical for myriad purposes.
It's not though.
Critical for myriad elective purposes? Sure.
by nickburns
4/17/2026 at 4:15:58 PM
Only if you consider the entire concept of logistics in civilization as "elective".
by jandrewrogers
4/17/2026 at 4:53:08 PM
Seems hyperbolic. We had logistics that functioned extremely well before we had customer location data for sale on 3rd party sites.
by xphos
4/17/2026 at 6:04:11 PM
If you re-read the comment, they didn't say that selling it was intrinsic.
by philipallstar
4/18/2026 at 12:30:35 AM
The article is about privacy tracking spyware cookies. I think making statements in that context about how modern logistics doesn't work without location data implies you mean location data from those sources. I mean, I suppose it doesn't have to, but then it just feels off topic, no?
by xphos
4/17/2026 at 4:19:09 PM
I don't follow what you mean by 'logistics in civilization' as that's pretty vague and amorphous.
Could you be more specific with maybe a single example of where my physical geographic location is electronically critical for a purpose that isn't elective/optional/avoidable?
(And I'm not just trying to be obtuse. I think you're touching on at least part of the 'heart' of both this conversation and that of digital ID verification.)
by nickburns
4/17/2026 at 4:49:15 PM
How does tracking the movements of individual humans aid shipping and logistics, other than providing traffic data to freight companies? How did we manage to have global supply chains prior to GPS being invented?
Edit: I assume I am missing a crucial part of logistics that you’re familiar with, genuinely curious.
by quickthrowman
4/17/2026 at 4:10:21 PM
In what sense can the latitude and longitude of my house be called anonymous data?
by ninalanyon
4/17/2026 at 4:19:34 PM
Ultimately, a map is anonymous data containing the lat/lon of everyone's house.
Alone, these points are not deanonymizing; it's when there's other data associated that they become so.
by kube-system
4/17/2026 at 7:07:07 PM
From what I've seen, none of this is that complex. One could simply 'draw a circle around your house', get all the "anonymized" device pings inside it, and just trace those.
by ramoz
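The "draw a circle" query is just a radius filter over the ping table. A minimal sketch, assuming pings arrive as (id, lat, lon) tuples; the 50 m default radius and the function names are illustrative choices, and distance uses the standard haversine formula on a spherical Earth.

```python
import math


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two lat/lon points."""
    r = 6371000.0  # mean Earth radius in metres
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def ids_near(pings, lat, lon, radius_m=50.0):
    """Return the set of pseudonymous IDs with any ping inside the circle."""
    return {
        pid
        for pid, plat, plon in pings
        if haversine_m(lat, lon, plat, plon) <= radius_m
    }


# Made-up pings: two within a few dozen metres of the target, one ~2 km away.
pings = [
    ("a", 40.7128, -74.0060),
    ("b", 40.7130, -74.0061),
    ("c", 40.7300, -74.0060),
]
print(sorted(ids_near(pings, 40.7128, -74.0060)))
```

Every ID that comes back can then be followed through the rest of the dataset, so the query starts from a known address instead of ending at one.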
4/17/2026 at 3:21:13 PM
Yep. With side channel/one order of thinking above the laws, it's trivial to get around said laws. Need better laws.
by 1121redblackgo
4/17/2026 at 3:48:04 PM
> A lot of geolocation data on the market is anonymized
A lot isn't good enough.
by malfist