Framework for Artificial Intelligence Diffusion

1/16/2025 at 9:04:43 PM

I have no idea if comments actually have any impact, but here is the comment I left on the document:

I am Christopher Kanan, a professor and AI researcher at the University of Rochester with over 20 years of experience in artificial intelligence and deep learning. Previously, I led AI research and development at Paige, a medical AI company, where I worked on FDA-regulated AI systems for medical imaging. Based on this experience, I would like to provide feedback on the proposed export control regulations regarding compute thresholds for AI training, particularly models requiring 10^26 computational operations.

The current regulation seems misguided for several reasons. First, it assumes that scaling models automatically leads to something dangerous. This is a flawed assumption, as simply increasing model size and compute does not necessarily result in harmful capabilities. Second, the 10^26 operations threshold appears to be based on what may be required to train future large language models using today’s methods. However, future advances in algorithms and architectures could significantly reduce the computational demands for training such models. It is unlikely that AI progress will remain tied to inefficient transformer-based models trained on massive datasets. Lastly, many companies trying to scale large language models beyond systems like GPT-4 have hit diminishing returns, shifting their focus to test-time compute. This involves using more compute to "think" about responses during inference rather than in model training, and the regulation does not address this trend at all.

Even if future amendments try to address test-time compute, the proposed regulation seems premature. There are too many unknowns in future AI development to justify using a fixed compute-based threshold as a reliable indicator of potential risk. Instead of focusing on compute thresholds or model sizes, policymakers should focus on regulating specific high-risk AI applications, similar to how the FDA regulates AI software as a medical device. This approach targets the actual use of AI systems rather than their development, which is more aligned with addressing real-world risks.

Without careful refinement, these rules risk stifling innovation, especially for small companies and academic researchers, while leaving important developments unregulated. I urge policymakers to engage with industry and academic experts to refocus regulations on specific applications rather than broadly targeting compute usage. AI regulation must evolve with the field to remain effective and balanced.

---

Of course, I have no skin in the game since I barely have any compute available to me as an academic, but the proposed rules on compute just don't make any sense to me.

by chriskanan

1/16/2025 at 9:59:28 PM

  "First, it assumes that scaling models automatically leads to something dangerous"

The regulation doesn't exactly make this assumption. Not only are large models stifled, the ability to serve models via API to many users, and the ability to have many researchers working in parallel on upgrading the model is also stifled. It wholesale stifles AI progress for the targeted nations.

This is an appropriate restriction on what will likely be a core part of military technology in the coming decade (eg drone piloting).

Look, if Russia didn't invade Ukraine and China didn't keep saying they wanted to invade Taiwan, I wouldn't have any issues with sending them millions of Blackwell chips. But that's not the world we live in. Unfortunately, this is the foreign policy reality that exists outside of the tech bubble we live in. If China ever wants to drop their ambitions over Taiwan then the export restrictions should be dropped, but not a moment sooner.

by energy123

1/17/2025 at 6:03:11 AM

Limiting US GPU exports to unaligned countries is completely counterproductive as it creates a market in those countries for Chinese GPUs, accelerating their development even faster. Because a mediocre Huawei GPU is better than no GPU. And it harms the revenue of US-aligned GPU companies, slowing their development.

by logicchains

1/17/2025 at 11:49:03 AM

Interesting theory. Any evidence that this is how the world really works? (And, is there a catchy name for the phenomenon?)

by dr_dshiv

1/17/2025 at 12:32:53 PM

Import substitution. Before the free trade ideology it was quite a popular and (usually) deliberate method for industrialisation and technological progress.

https://en.m.wikipedia.org/wiki/Import_substitution_industri...

by jampekka

1/17/2025 at 3:06:26 PM

But also no external competition causes no pressure to improve.

by tartoran

1/17/2025 at 4:35:23 PM

That’s why the Huawei story seems different.

I think it’s about this attempt to block that actually makes the opponent stronger. The export restrictions were designed to weaken their competitiveness not enhance!

The restrictions blocked asml and Samsung and others from trading with China. Now China can just replace them. It seems.

by dr_dshiv

1/17/2025 at 5:45:05 PM

Seemed to work for the space and arms races for example.

by jampekka

1/16/2025 at 10:44:02 PM

right. China. but Switzerland? Israel? what is going on here?

by babkayaga

1/17/2025 at 4:08:29 AM

Israel is a known industrial espionage threat to the us, how'd you think they got nuclear weapons? some analysts say they're the largest threat after china. Not to mention theyre currently using ai in targeting systems while under investigation for war crimes.

by thatcat

1/17/2025 at 1:07:15 PM

> how'd you think they got nuclear weapons?

I rather assumed they were able to re-invent them from scratch by the work of their own scientists. I mean, the US did it before the invention of the transistor and what I've heard about the USSR project is their espionage only helped them know the critical mass without needing so many test runs, so it doesn't seem like it would be implausible that Israel could do it themselves about 20 years later.

by ben_w

1/17/2025 at 1:51:19 PM

Whether it is plausible or not is unrelated to what happened.

In 1965 NUMEC owner, Zalman Shapiro - in coordination with israeli intelligence, diverted 100 kg of 95% enriched uranium from the facility and shipped it to Israel. Enriching the material to weapons grade is the technically difficult part - which I would think israel would certainly not have the budget for given the size of its economy.

by thatcat

1/17/2025 at 3:59:28 PM

Thanks for the specifics.

> which I would think israel would certainly not have the budget for given the size of its economy.

Hmm.

I'll buy that. I've seen a lot of wildly different cost estimates for separation work units, so I can only guess how much it might have cost at the time.

If it would have otherwise been the full Manhattan Project, I don't even need to guess, you're definitely correct they couldn't have afforded it: https://www.wolframalpha.com/input?i=gdp+israel+1965

by ben_w

1/16/2025 at 10:55:16 PM

It could be related to 14eyes with modifications (finland and ireland, plus close asian allies).

https://res.cloudinary.com/dbulfrlrz/images/w_1024,h_661,c_s... (from https://protonvpn.com/blog/5-eyes-global-surveillance).

Israel, Poland, Portugal and Switzerland are also missing from it

by jagrsw

1/17/2025 at 11:08:20 AM

As someone who is in the verge of being killed with no side in this entire reality it's cool that in addition to trade and economics we now get compute as a geopolitical indicator maybe it can really all be automated

by gunian

1/17/2025 at 12:41:55 AM

> Switzerland? Israel?

I hope someone with a better understanding of the details can jump in, but they are both Tier 2 (not Tier 3) restricted, so maybe there are some available loopholes or Presidential override authority or something. Also I believe they can still access uncapped compute if they go via data centers built in the US.

by energy123

1/16/2025 at 9:41:35 PM

> Even if future amendments try to address test-time compute, the proposed regulation seems premature. There are too many unknowns in future AI development to justify using a fixed compute-based threshold as a reliable indicator of potential risk.

I'm disinclined to let that be a barrier to regulation, especially of the export-control variety. It seems like letting the perfect be the enemy of the good: refusing to close the barn door you have, because you think you might have a better barn door in the future.

> Instead of focusing on compute thresholds or model sizes, policymakers should focus on regulating specific high-risk AI applications, similar to how the FDA regulates AI software as a medical device. This approach targets the actual use of AI systems rather than their development, which is more aligned with addressing real-world risks.

How to you envision that working, specifically? Especially when a lot of models are pretty general and not very application-specific?

by tivert

1/16/2025 at 9:50:20 PM

<< It seems like letting the perfect be the enemy of the good: refusing to close the barn door you have, because you think you might have a better barn door in the future.

Am I missing something? I am not an expert in the field, but from where I sit, there literally is no barn door at this point to even close too late..

by iugtmkbdfil834

1/16/2025 at 9:27:49 PM

> First, it assumes that scaling models automatically leads to something dangerous.

The impression I had is with reversed causation: that it can't be all that dangerous if it's smaller than this.

Assuming this alternative interpretation is correct, the idea may still be flawed, for the same reasons you say.

by ben_w

1/16/2025 at 10:52:10 PM

I also suspect that the only real leverage the U.S. has is on big compute (i.e. requires the best chips), and less capable chips are not as controllable.

by SubiculumCode

1/16/2025 at 10:49:46 PM

While your critiques most likely have some validity (and I am not positioned to judge their validity), you failed to offer a concrete policy alternative. The rules were undoubtedly made with substantial engagement from the industry and academic researchers, as there is too much at stake for them not to engage, and vigorously. Likely there were no perfect policy solutions, but decided to not let the perfect stop the good enough since timeliness matters as much or more than the policy specifics.

by SubiculumCode

1/17/2025 at 6:06:58 AM

Doing nothing is a better alternative, because these restrictions will just encourage neutral countries to purchase Chinese GPUs, because their access to US GPUs is limited by these regulations. This will accelerate the growth of Chinese GPU companies and slow the growth of US-aligned ones; it's basically equivalent to the majority of nations in the world placing sanctions on NVidia.

by logicchains

1/16/2025 at 11:09:27 PM

[dead]

by sieabahlpark

1/17/2025 at 3:19:23 AM

> these rules risk stifling innovation

These rules intentionally "stifle innovation" for foreigners - this is a feature, not a bug.

by 15155

1/16/2025 at 9:19:16 PM

You wrote three separate comments for this course. Can you just combine them?

by behnamoh

1/16/2025 at 9:21:29 PM

The three comments are communicating three separate things, I think it's clearer that way.

by p1necone

1/16/2025 at 8:05:36 PM

The most salient thing in the document is that it put export controls on releasing the weights of models trained with 10^26 operations. While there may be some errors in my math, I think that corresponds to training a model with over 70,000 H100s for a month.

I personally think the regulation is misguided, as it assumes we won't identify better algorithms/architectures. There is no reason to assume that the level of compute leads to these problems.

Moreover, given the emphasis on test-time compute nowadays and that it seems like a lot of companies have hit a wall with performance gains with trying to scale LLMs at train-time, I especially think this regulation isn't especially meaningful.

by chriskanan

1/16/2025 at 8:59:39 PM

Traditional export control applied to advanced hardware is because the US doesn't want its adversaries to have access to things that erode the US military advantage. But most hardware is only controlled at the high-end of the market. Once a technology is commodotized, the low-end stuff is usually widely proliferated. Night vision goggles are an example, only the latest generation technology is controlled, and low-end stuff can be bought online and shipped worldwide.

Applying this to your thoughts about AI, is that as the efficiency of training gets better, the ability to train models is commodotized, and those models would not be considered to be advantageous and would not need to be controlled. So maybe setting the export control based on the number of operations is a good idea- it naturally allows efficiently trained models to be exported since they wouldn't be hard to train in other countries anyway.

As computing power scales maybe the 10^26 limit will need to be revised, but setting the limit based on the scale of the training is a good idea since it is actually measurable. You couldn't realistically set the limit based on the capability of the model since benchmarks seem become irrelevant every few months due to contamination.

by parsimo2010

1/17/2025 at 1:41:40 AM

I wonder what makes people believe that the US currently enjoys any kind of a meaningful "military advantage" over e.g. China? After failing to defeat the Taliban and running from the Houthis especially. This seems like a very dangerous belief to have. China has 4x the population and outproduces us 10:1 in widgets (2:1 in dollars). Considering just e.g. steel, China produces about 1 billion metric tons of it per year. We produce 80 million tons. Concrete? 2.4B tons vs 96M tons. 70+% of the world's electronics. Their shipbuilding industry is 230x more productive (not a typo). Etc, etc.

The short term profits US businesses have been enjoying over the past 25 years came at a staggering long term cost. The sanctions won't even slow down the Chinese MIC, and in the long run they will cause them to develop their own high end silicon sector (obviating the need for our own worldwide). They're already at 7nm, at a low yield. That is more than sufficient for their MIC, including the AI chips used there, currently and in the foreseeable future.

by ein0p

1/17/2025 at 2:25:44 AM

a) just because the government has policies that doesn’t mean they are 100% effective

b) export controls aren’t expected to completely prevent a country from gaining access to a technology, just make it take longer and require more resources to achieve

You may also be misunderstanding how much money China will spend to develop their semiconductor industry. Sure, they will eventually catch up to the West, but the money they spend along the way won’t be spent on fighter jet, missiles, and ships. It’s still preferable (from the US perspective) to having no export controls and China being able to import semiconductor designs, manufacturing hardware, and AI models trained using US resources. At least this way China is a few months behind and will have to spend a few billion Yuan to achieve it.

by parsimo2010

1/17/2025 at 4:58:35 AM

Everyone also thought Russia had a strong military yet look how that worked out

by airstrike

1/17/2025 at 6:35:04 PM

NATO is currently losing a conventional proxy war against it, that's how. Which only reinforces my point: there's zero reason to believe that either proxy or direct confrontation with China can be "won".

by ein0p

1/17/2025 at 8:16:00 PM

NATO is not fighting that war. You're delusional. They are supporting Ukraine without crossing the threshold of actually fighting the war, as bullshit as it sounds.

by airstrike

1/17/2025 at 8:30:26 PM

$250B and depleted arsenals say otherwise. It's a proxy war. It's been a proxy war by design all along: https://www.rand.org/content/dam/rand/pubs/research_briefs/R...

by ein0p

1/17/2025 at 9:40:20 PM

Spending has nothing to do with military reality. Russia is a paper tiger. If the US really supported Ukraine in this war without worrying about crossing some imaginary Rubicon, the war would have been over a long time ago—except Russia might be tempted to use their nukes, so we can't do that.

The way to win against Russia is not via sanctions but rather via destabilizing the regime through guerrilla propaganda. The Russians, the Chinese, and the Soviets before them have always known this. The West is just too slow to catch on.

by airstrike

1/17/2025 at 11:29:41 PM

> The way to win against Russia is not via sanctions but rather via destabilizing the regime through guerrilla propaganda. The Russians, the Chinese, and the Soviets before them have always known this. The West is just too slow to catch on.

This is a totally uninformed vibes-based opinion, but I can't help but feel like the sort of "guerilla propaganda" you're talking about must be a major factor in the current fracturing of cultural and political discourse in the US.

by jrussino

1/17/2025 at 11:50:04 PM

You know what's by definition uninformed and vibes-based? You claiming my opinion is uninformed and vibes-based. I'm not only not American to be subject to "the current fracturing of cultural and political discourse in the US" but I also have a degree in International Relations, so I think my opinion is pretty informed.

Now if you'd like to also be informed, start by reading (or reading about) Antonio Gramsci and https://en.wikipedia.org/wiki/Cultural_hegemony

by airstrike

1/18/2025 at 1:59:14 AM

I would delete this comment if I could, as I totally misinterpreted your prior comment. I am sorry.

Yes, the guerrilla propaganda is real and has been practiced for decades now...and one of its pervasive traits is that it focuses not only on media but also in changing the minds of students while they are in university, when they are particularly susceptible to influence. It's textbook Gramsci stuff.

Again, sorry for my knee-jerk reaction. I probably had too much or too little coffee, and I'd buy you one to make up for it if I could.

by airstrike

1/16/2025 at 9:18:25 PM

The practical problem I see is that unless US AI labs have perfect security (against both cyber attacks and physical espionage), which they don’t, there is no way to prevent foreign intelligence agencies from just stealing the weights whenever they want.

by thorum

1/16/2025 at 10:00:56 PM

Of course. They're mitigations, not preventions. Few defenses are truly preventative. The point is to make it difficult. They know bad actors will try to circumvent it.

This isn't lost on the authors. It is explicitly recognized in the document:

> The risk is even greater with AI model weights, which, once exfiltrated by malicious actors, can be copied and sent anywhere in the world instantaneously.

by kube-system

1/16/2025 at 11:40:24 PM

> The point is to make it difficult.

Does it, though?

by thorum

1/16/2025 at 9:54:33 PM

This. We put toasters on the internet and are no longer surprised, when services we use send us breach notices at regular intervals. The only thing this regulation would do, as written, is add an interesting choke point for compliance regulators to obsess over.

by iugtmkbdfil834

1/17/2025 at 6:37:20 AM

>The most salient thing in the document is that it put export controls on releasing the weights of models trained with 10^26 operations.

Does this affect open source? If so, it'll be absolutely disastrous for the US in the longer term, as eventually China will be able to train open weights models with more than that many operations, and everyone using open weights models will switch to Chinese models because they're not artificially gimped like the US-aligned ones. China already has the best open weights models currently available, and regulation like this will just further their advantage.

by logicchains

1/17/2025 at 2:05:43 PM

"consistent with its general practice, BIS will not require a license for the export of the model weights of open-weight models"

by gyre

1/16/2025 at 9:04:00 PM

Could be nice with some artificial pressure to use more efficient algorithms though. The current game of just throwing in more data centers and power plants may be kind of convenient for those who can afford it, but it's also intellectually embarrassing.

by etiam

1/16/2025 at 9:52:10 PM

this is like saying that regulating automatic weapons is misguided because someone might invent a gun that is equally dangerous without being automatic

by permo-w

1/16/2025 at 9:57:06 PM

This appears to be a very shallow take and lazy argument that does not capture even basic nuance of the issue at hand. For the sake of expanding it a little and hopefully moving it in the right direction, I will point out that BIS framework discusses use of advanced models as dual use goods ( ie. not automatically automatic weapons ).

edit(removed exasparated sigh; it does not add anything )

by iugtmkbdfil834

1/16/2025 at 8:57:05 PM

We can't let perfect be the enemy of good, regulations can be updated. Capping FLOPs is a decent starter reg.

by HeatrayEnjoyer

1/16/2025 at 9:04:00 PM

Counterpoint would be the $7.25 minimum wage. It can be updated, but politicians aren't good at doing that. In both cases (FLOPS and minimum wage), at least a lower bound for inflation should be included:

Something like: 10^26 FLOPS * 1.5^n where n is the number of years since the regulation was published.

by reaperman

1/16/2025 at 10:01:16 PM

> Something like: 10^26 FLOPS * 1.5^n where n is the number of years since the regulation was published.

Why would you want to automatically increase the cap algorithmically like that?

The purpose of a regulation like this is totally different than the minimum wage. If the point is to keep and adversary behind, you want them to stay as far behind as you can manage for as long as possible.

So if you increase the cap, you want only increase it when it won't help the adversary (because they have alternatives, for instance).

by tivert

1/16/2025 at 9:20:16 PM

I don’t see an issue here, because our legislators probably care more about FLOPS than humans.

by Cyph0n

1/16/2025 at 8:47:05 PM

This smells a lot like the misguided crypto export laws in the 90s that hampered browser security for years.

by geuis

1/16/2025 at 9:12:33 PM

And don't forget the amazing workaround Zimmerman of PGP fame came up with - the source code in printed form was protected 1A speech, so it was published, distributed, and then scanned and OCR'd outside the US - https://en.wikipedia.org/wiki/Pretty_Good_Privacy#Criminal_i...

by philjohn

1/16/2025 at 9:26:56 PM

And don’t forget Thawte which ended up selling strong SSL outside the US, cornering the international market thanks to US restrictions, and getting bought by Verisign for $600M.

by mmaunder

1/17/2025 at 1:24:11 AM

I hope this time we finally get a Supreme Court ruling that export controls on code are unconstitutional, instead of the feds chickening out like last time

by rangestransform

1/17/2025 at 2:28:04 AM

I doubt that would work for model weights because they are generated algorithmically rather than being written by humans, which probably means that they are not speech.

by tzs

1/17/2025 at 2:30:30 PM

Not to mention how much paper it would take to print 500B weights

by mbil

1/16/2025 at 9:31:25 PM

I for one would love to see model weights published in hardcover book form.

by z2

1/16/2025 at 9:44:53 PM

What a throwback to the time when some edgy folks would share printed codes in Pascal… I even remember seeing a hard copy of a binary in hex which was best not to execute.

by dkga

1/16/2025 at 10:12:09 PM

It might be somewhat prohibitive to print the model weights for any sufficiently large model, though.

by cheald

1/16/2025 at 10:52:10 PM

Using 2d barcodes you can fit ~20MB per page. Front and back you could probably fit a model that violated the rule on less than a thousand pages.

Edit: maybe 10k pages

by galangalalgol

1/16/2025 at 11:36:59 PM

Skip the barcodes - just cast it to 24bit RGB data, add in some error correction, and print as a high-DPI bitmap (or a series of them, one per page).

by TeMPOraL

1/17/2025 at 12:22:02 AM

Printers that can keep all the color channels lined up all the way down to their tightest dot pitch are few and far between. That is why piql used black and white for the boxing code they used to print out all of github. I'm not sure why greyscale wasn't used.

by galangalalgol

1/17/2025 at 8:23:14 AM

> Printers that can keep all the color channels lined up all the way down to their tightest dot pitch are few and far between.

It only takes one for a PoC. It only takes few to establish a service. Spatial and color accuracy is going to be a challenge, but I imagine there already exist methods to correct for that - e.g. printing reference patterns along with the data, so the scanner/digitizer can calibrate to it on the fly.

by TeMPOraL

1/16/2025 at 11:58:18 PM

I’m doubtful that barcodes would qualify as free speech.

by layer8

1/17/2025 at 12:11:48 AM

how about printing URL to download the weights file?

by slt2021

1/16/2025 at 10:43:45 PM

It hampered the security of a lot of things. That wasn't misguided -- that was the point.

China, Russia, and Iran used Internet Explorer too.

by kube-system

1/16/2025 at 9:10:12 PM

It’s worth noting that this splits countries into three levels - first without restrictions, second with medium restrictions, third with harsh restrictions.

And the second level, for some reason, includes (among others) a bunch of countries that would normally be seen as close US allies - e.g. some NATO countries (most of Central/Eastern Europe).

by cube2222

1/16/2025 at 9:38:52 PM

I see the return of cold war computing model, where many countries had their own computer platforms and programming languages.

Which apparently might be a good outcome to FOSS operating systems, with national distributions like Kylin.

As European I vote for SuSE.

by pjmlp

1/16/2025 at 9:26:58 PM

Someone still sees Eastern Europe as a provider of cheap brainpower. This is insulting.

by surfingdino

1/17/2025 at 3:23:37 AM

The value of a service is whatever someone is willing to pay for it - and conversely, the price someone is willing to render the service for.

These folks aren't "forced" to provide "cheap brainpower:" they are offering services at their market rate.

by 15155

1/16/2025 at 9:40:16 PM

But is it untrue?

by Chance-Device

1/16/2025 at 10:02:42 PM

This smells about as well informed as the genius move that forced Ukrainian owner, Max Polyakov, to divest from Firefly Aerospace. A US government position that was widely derided by space industry watchers, and has now been reversed.

This might be a product of the USA being a gerontocracy.

by consumer451

1/16/2025 at 9:20:01 PM

includes (among others) a bunch of countries that would normally be seen as close US allies - e.g. some NATO countries (most of Central/Eastern Europe).

Yeah, this is really a bit insulting.

by inglor_cz

1/16/2025 at 10:18:05 PM

>> And the second level, for some reason, includes (among others) a bunch of countries that would normally be seen as close US allies - e.g. some NATO countries (most of Central/Eastern Europe).

> Yeah, this is really a bit insulting.

So you're insulted some country of other wasn't included in:

> First, this rule creates an exception in new § 740.27 for all transactions involving certain types of end users in certain low-risk destinations. Specifically, these are destinations in which: (1) the government has implemented measures to prevent diversion of advanced technologies, and (2) there is an ecosystem that will enable and encourage firms to use advanced AI models to advance the common national security and foreign policy interests of the United States and its allies and partners.

IMHO, it's silly to get insulted over something like that. Your feelings are not a priority for an export control law.

Taiwan, even though it's a US ally, is only allowed limited access to certain sensitive US technology it deploys (IIRC, something about Patriot Missile seeker heads, for instance), because their military is full of PRC spies (e.g. https://www.smh.com.au/technology/chinese-spies-target-taiwa...), so if they had access the system would likely be compromised. It's as simple as that.

by tivert

1/16/2025 at 8:37:47 PM

We're sorry, an error has occurred A general error occurred while processing your request.

by intunderflow

1/16/2025 at 8:47:18 PM

What do the regulators writing this intend for this to slow down/stop?

I can't seem to find any information about that anywhere.

by mlfreeman

1/16/2025 at 9:00:26 PM

Obviously to prevent proliferation of dual-use technologies to potentially adversarial actors. The same intent behind restricting high-fidelity infrared camera and phased radar equipment.

by HeatrayEnjoyer

1/16/2025 at 9:36:46 PM

China is leading AI race with their open source deepseek-v3. It is laughable to think they this regulation with stop them. USA should actually collaborate, not isolate.

China engineers have capability to get around these silly sanctions, by renting cloud GPUs from USA companies for example, to get access to as many compute as they want. or to use consumer grade compute, or their homegrown Chinese CPUs/GPUs.

USA should actually embrace open source and collaborate together, as we are still in the very beginning of AI revolution

by slt2021

1/16/2025 at 10:23:29 PM

The entire point is to not collaborate, because this tech is being used for military purposes. The US wants to throw up roadblocks to make it more difficult. Obviously, against a foreign military, anything is a mitigation and not a prevention.

> China engineers have capability to get around these silly sanctions, by renting cloud GPUs from USA companies for example

That's why they're also moving towards KYC for cloud providers.

https://www.federalregister.gov/documents/2024/01/29/2024-01...

by kube-system

1/16/2025 at 10:40:32 PM

Also when you look at the list of nations targeted it's easy to see that this is not necessarily only about China. I think on HN people get China blindness and maybe have a hard time seeing other threats out there.

The list is controversial obviously, but I have to think nations wouldn't be on any tier of the list except the no-restrictions tier if there wasn't something our intel people weren't worried about. Maybe the concerns are not legitimate, but there's definitely a reason we don't want those nations having access to SOTA AI models.

by bilbo0s

1/16/2025 at 11:05:45 PM

Yes, USA wants to ensure global domination of large big tech in AI world so that openai/anthropic/etc did not have competitors

by slt2021

1/16/2025 at 8:54:37 PM

Maybe they intend for it to speed up/start implementation of federal agencies and regulations. The intent is to exert control over an emerging market while it’s still comprised of cooperative participants. Regulators want to define the regulatory frameworks rather than relying on self-policing before it’s “too late.”

Let’s see if this survives the next administration. Normally I’d be skeptical, but Musk has openly advocated about the “dangers” of AI and will likely embrace attempts to regulate it, especially since he’s in a position to benefit from the regulatory capture. In fact he’s doubly well-placed to take advantage of it. Regardless of his politics, xAI is a market leader and would already be naturally predisposed to participate in regulatory capture. But now he also enjoys unprecedented influence over policymaking (Mar a Lago) and regulatory reform (DOGE). It’s hard to see how he wouldn’t capitalize on that position.

by chatmasta

1/16/2025 at 9:37:04 PM

> Regardless of his politics, xAI is a market leader

Lol what?

The only people who think this are Elon fanboys.

I guess you think Tesla is the self-driving market leader, too. Okay.

by chronic4930018

1/16/2025 at 9:46:53 PM

I don’t even use it. But in terms of funding, it’s in the top 5, according to Crunchbase data [0].

[0] https://news.crunchbase.com/ai/startup-billion-dollar-fundra...

by chatmasta

1/16/2025 at 9:54:14 PM

> But in terms of funding, it’s in the top 5.

WeWork had more funding than xAI.

Not saying xAI will die, but you can’t look at funding.

by cheonic516442

1/16/2025 at 10:11:41 PM

You can look at funding to see who will benefit most from regulatory capture, a phenomenon that raises the cost of entry into a market. Obviously the most well-funded participants will have the greatest motivation to increase barrier to entry.

If anything, a well-funded company with a bad product is more likely to engage in regulatory capture because it limits their risk of exposure to a new entrant with a good product.

by chatmasta

1/16/2025 at 10:44:06 PM

In this particular market though it's a bit useless because only well funded entities are really able to play here.

Where are any of us gonna get 10,000 H100's?

So it's not like you're overwhelming small players by throwing up regulations. Those players are extremely unlikely to get the compute they'd need to even experiment in the first place.

by bilbo0s

1/17/2025 at 7:56:31 AM

It's also quite decent, giving a lot of free request so I have switched from Claude.

by miohtama

1/16/2025 at 9:55:01 PM

Hard to take comments like this seriously when you can’t even be bothered to be associated with it from your primary account.

by DSingularity

1/16/2025 at 9:15:23 PM

This feels like déjà vu from the crypto wars (1990s). If that experience helps, it is impossible to repress knowledge without violence, and it motivates more people to hack the system. Good times ahead "PGP released its source code as a book to get around US export law" <https://news.ycombinator.com/item?id=7885238>

by wslh

1/16/2025 at 11:40:41 PM

Not the same situation at all. PGP would run on any computer you happened to have around. The source code was small enough to fit in a book. The people who already had the code wanted to release it. Lots of people could have rewritten it relatively quickly.

The ML stuff they're worried about takes a giant data center to train and an unusually beefy computer even to run. The weights are enormous and the training data even more enormous. Most of the people who have the models, especially the leading ones, treat them as trade secrets and also try to claim copyright in them. You can only recreate them if you have millions to spend and the aforementioned big data center.

by Hizonner

1/17/2025 at 2:32:36 PM

> The ML stuff they're worried about takes a giant data center to train and an unusually beefy computer even to run.

Now, consider this: the Palm [1] couldn’t even create an RSA [2] public/private key pair in “user time” The pace of technological advancement is astonishing, and new techniques continually emerge to overcome current limitations. For example, in 1980, Intel was selling mathematical coprocessors [3] that were cutting-edge at the time but would be laughable today. It’s reasonable to expect that the field of machine learning will follow a similar trajectory, making what seems computationally impractical now far more accessible in the future.

[1] https://en.wikipedia.org/wiki/Palm_(PDA)

[2] https://en.wikipedia.org/wiki/RSA_(cryptosystem)

[3] https://es.wikipedia.org/wiki/Intel_8087

by wslh

1/16/2025 at 9:20:12 PM

One interesting geopolitical fact about this document that's not being discussed much is the way it includes Taiwan in lists of "countries".

Usually, the US government tries not to do that.

by clhodapp

1/16/2025 at 10:11:59 PM

Only in some diplomatic contexts.

https://www.cia.gov/the-world-factbook/countries/taiwan/

by kube-system

1/17/2025 at 3:14:51 PM

Most of the comments here only make sense under a model where AI isn't going to become extremely powerful AI in the near term.

If you think upcoming models aren't going to be very powerful, then you'll probably endorse business-as-usual policies such as rejecting any policy that isn't perfect or insisting on a high bar of evidence before regulating.

On the other hand, if you have a world model where AI is going to provide malicious actors with extremely powerful and dangerous technologies within the next few years, then instead of being radical, proposal like this start appearing extremely timid.

by casebash

1/17/2025 at 7:55:57 PM

Unblocked:

https://web.archive.org/web/20250117042813if_/https://www.fe...

by 1vuio0pswjnm7

1/16/2025 at 9:27:26 PM

The compute limit is dead on arrival, because models are becoming more capable with less training anyways. (See DeepSeek, Phi-4)

by veggieroll

1/16/2025 at 9:56:56 PM

Looks down to me.

https://archive.is/F78tf

by PostOnce

1/16/2025 at 8:34:43 PM

Strong opposition to this regulation seems to be one of the main things that led a16z, Oracle, etc. to go all in for Donald Trump. It's interesting that Meta too fought the regulation by its unprecedented open sourcing of model weights.

Regardless of who is currently in the lead, China has its own GPUs and a lot of very smart people figuring out algorithmic and model design optimizations, so China will likely be in the lead more obviously within 1-2 years, both in hardware and model design.

This law is likely not going to be effective in its intended purpose, and it will prevent peaceful collaboration between US and Chinese firms, the kind that helps prevent war.

The US is moving toward a system where government controls and throttles technology and picks winners. We should all fight to stop this.

by resters

1/16/2025 at 9:28:50 PM

I agree this law won’t be effective in its intended purpose, and that China will develop models of their own that are sufficiently competitive (as we’ve already seen). However, I think seeking “peaceful collaboration” between the US (or Europe or many others) and China - either between governments or private firms - is a naive strategy that will simply lead to the US being replaced by a more dangerous superpower that does not respect the values of free and democratic societies.

I also think that to a great extent, we’re already at war. China has not respected intellectual property rights, conducted espionage against both companies and government agencies, repeatedly performed successful cyberattacks, helped Russia in the Ukraine conflict, severed telecommunications cables, and more. They’ve also built up the world’s largest navy, expanded their nuclear arsenal, and are working on projects to undermine the status of the US Dollar. All of this should have been met with a much stronger and forceful reaction, since clearly it does not fit into the notion of “peaceful collaboration”.

China’s unpeaceful actions aren’t limited to the West. China annexed much of its current territory illegally and through force (see Xinjiang and Tibet). When Hong Kong was handed back, it was under a treaty that China now says is not valid. China has been trying to steal territory from neighboring countries repeatedly, for example with Bhutan or India. They’ve also threatened to take over Taiwan many times now, and may do so soon. They’re about to build a dam that will prevent water from reaching Bangladesh and force them to become subjugated. The only peaceful and just outcome is for those territories to be freed from the control of China - which will require help from the West (sanctions, tariffs, blockades, and maybe even direct intervention).

Even within China, the CCP rules with an iron fist and violates virtually all principles of free societies and classically liberal values that we value in the West. I don’t see that changing. And if it doesn’t, how can they be trusted with more economic and military power? That’s why I don’t think we should seek peaceful collaboration with China. We just need smarter strategies than this hasty AI declaration.

by blackeyeblitzar

1/17/2025 at 7:58:50 AM

China models are not only competitive, but better, because they do not care about copyright violations and they do not need to litigate with everyone. They can use better training material faster.

by miohtama

1/17/2025 at 10:32:01 AM

Where I'm looking from (Turkey) China is mostly harmless and the USA is a foreign invader that has invaded multiple neighboring countries. Between the two, I find the USA to be much more illiberal and dangerous.

by buyucu

1/17/2025 at 3:01:46 PM

You think China is harmless? Go tell that to Tibet, India, or Taiwan.

by 9283409232

1/17/2025 at 3:35:54 PM

Less harmful than USA for sure. The list of places US and allies have invaded is very long.

by buyucu

1/17/2025 at 3:41:48 PM

In recent years China does not do much blatant invading and prefer much more subtle means daring people to respond to them like in the South China Sea.

by 9283409232

1/17/2025 at 3:50:38 PM

still not as bad as 25 years of wars in the Middle East.

I would take those subtle means over the blatant invading that America has been doing.

by buyucu

1/17/2025 at 5:06:19 PM

You do know both can be bad right? You don't need to pick the lesser evil.

by 9283409232

1/19/2025 at 12:27:24 PM

where did I pick anything?

USA is the greater evil, with a much higher bodycount. that is something we agree on.

by buyucu

1/17/2025 at 2:48:59 AM

1. Sikkim was annexed by India 2. Bangladesh has many floods every year

by almaight

1/17/2025 at 6:44:18 AM

I am not sure why Sikkim is relevant. I am not familiar with it, so I read about it now. And it looks like India’s prime minister actually pushed a resolution through noting that Sikkim is independent. But then Sikkim had a domestic movement to join India and voluntarily did that. This seems like the opposite of annexation.

Regarding floods - if Bangladesh wants an upstream dam, why aren’t they included as a decision maker on whether the Chinese dam goes ahead? Clearly this is because they would say no to it. The issue isn’t floods - it’s that China can withhold water for drinking and irrigation and threaten the country with starvation and famine. It’s a huge national security threat.

by blackeyeblitzar

1/16/2025 at 9:20:02 PM

> The US is moving toward a system where government controls and throttles technology and picks winners

What else can it do? They don’t want to lose their lead, and whatever restrictions they’ve been putting on China et al. have let the exact desired outcomes so far. The idea is to try to slow down the beast that has very set goals (e.g. to become high tech manufacturing and innovation center), and try to play catch up (like on-shoring some manufacturing).

Personally, I’m skeptical that it will work, because by raw number of hands on deck, they have the advantage. And it’s fairly hard when your institutional knowledge of doing big things is a bit outdated. I would argue, a good bet in North America would be finding a financially engineered solution to get Asian companies bring their workers and knowledge to ramp us up. Kinda like the TSMC factory. Basically the same thing as China did in 2000s with western companies.

by tokioyoyo

1/17/2025 at 7:59:35 AM

> They don’t want to lose their lead, and whatever restrictions they’ve been putting on China et al. have let the exact desired outcomes so far.

They absolutely have not. The best open weights LLM is Chinese (and it's competitive with the leading US closed source ones), and around 10x cheaper both to train and to serve than its western competitors. This innovation in efficiency was largely brought about by US sanctions limiting GPU availability.

by logicchains

1/17/2025 at 10:30:16 AM

> They don’t want to lose their lead

What lead? The best open-source language models right now are Chinese. deep-seek is amazin, so is qwq.

by buyucu

1/18/2025 at 10:46:31 AM

Training an AGI/ASI does not requires the biggest datacenters/massive GPUs, nor it takes years already. Early algorithmic advances and narrow AGI AIs have radically shortened the requirements in hardware and time of training.

You can only expect more algorithmic advances from now on.

The attempt of regulation falls within the limits of the (publicly available) SOTA AI technology from maybe a month ago, so it has been surpased by the reality of no one capable of being in control of the brain functions outside the selected countries for free interoperation of AI tech.

Those brains outside the wire were six months ago already creating the algorithmic breakthroughs we are currently witnessing, of course there's not only one (there are most certainly many improvements currently being pipelined for future models a few months from now), and they are actually fully independant of regulations from any country, you can expect just in this year, lots of radical breakthroughs, and given the new regulations, more players just further advancing the algorithmic side of the technology.

The regulation could have been effective only in an scenario where US and selected countries would control the fully indispensable hardware required to train and run advanced AI, which is not anymore indispensable.

The most radical forecastings clock the future (months, not years), training and deploying of frontier AIs (AGI/ASI level), at maybe weeks to few months of training using sintetic generated data (from opensource models already available), simply relying on standard datacenter level CPUs (not GPUs) for the backbone of the training infrastructure, and a light, precise use of limited GPUs (two, three years old datacenter GPU hardware), and distributing the training across several massive datacenters, if you care at all about speed (having the most advanced AI the faster you can). But anyway, the jumping forward framework could be just doing incremental advances, and letting the advanced AIs to just improve the algorithmic side of the technology development, so to just further making even more efficient the available hardware, one cycle of improvement at a time.

It's not a game over with hardware, US and allies could try to use to jump faster to more sophisticated AI, but the game cannot be controlled just by limiting the hardware, nor the difussion of advanced models.

by 343rwerfd

1/16/2025 at 10:31:36 PM

> The US is moving toward a system where government controls and throttles technology and picks winners.

Moving towards? The US has a pretty solid history of doing a great deal of this (and more) in the 20th century. But so did all of the world's powers... as they all continue to do today. It seems to be an inherent part of being a world power.

by kube-system

1/16/2025 at 9:01:49 PM

I don't think you're wrong but Big Tech is bending the knee to Trump because he will be picking the winners.

by 9283409232

1/16/2025 at 9:49:35 PM

More information here:

https://www.federalregister.gov/documents/2025/01/15/2025-00...

by djoldman

1/16/2025 at 9:32:28 PM

It is going to be like it was in the 1990s with PGP and such all over again.

by pjmlp

1/16/2025 at 9:42:02 PM

WH Executive Order Affecting Chips and AI Models

https://news.ycombinator.com/item?id=42683251

by ChrisArchitect

1/16/2025 at 9:48:04 PM

It seems some EU countries are unrestricted but others are. How is this compatible with the EU single market/customs union?

by United857

1/17/2025 at 6:57:46 AM

Perhaps this regulation will be a major force for next gen symbolic AI systems.

by miovoid

1/16/2025 at 8:35:16 PM

This `regulations.gov` is leaking info on who accesses what, to Google (via `www.google-analytics.com` tracker).

There should be a federal regulation about that.

by neilv

1/16/2025 at 9:44:16 PM

It's for the government's google analytics account, which is open data: https://analytics.usa.gov/general-services-administration

by datadrivenangel

1/16/2025 at 10:00:09 PM

Parent is unhappy with the fact the data is being tracked and analyzed at all, not about the closed or open nature of the analytics.

by dooglius

1/16/2025 at 10:28:58 PM

I'm unhappy that: (1) Google is getting this information; (2) because Google is involved, who reads what is attributable to individuals; (3) citizens don't realize this.

by neilv

1/17/2025 at 12:48:01 PM

What’s the point in this? Isn’t Trump going to just cancel this immediately on Monday?

I don’t see how we can assume it will be enacted at all.

by saberience

1/16/2025 at 9:08:02 PM

I'm not sure why the link no longer works, but this one works. The link should be updated to this one: https://www.federalregister.gov/documents/2025/01/15/2025-00...

by chriskanan

1/16/2025 at 9:12:43 PM

@dang

by cshimmin

1/16/2025 at 11:59:30 PM

“@dang” doesn’t do anything. You need to email hn@ycombinator.com.

by layer8

1/17/2025 at 4:21:03 AM

Fixed. Thanks!

by dang

1/19/2025 at 4:50:32 AM

[dead]

by lincpa