6/30/2026 at 1:44:37 AM
> The training and deployment of LongCat-2.0 are built on large-scale clusters of tens of thousands of AI ASIC superpods. Compared to the mature Nvidia GPU ecosystem, the supporting software community is still less developed. We have therefore put significant effort into building a stable, secure, and scalable infrastructure.This is the real news story. It looks like they may have used Huawei Ascend 910C chips: https://nitter.net/teortaxesTex/status/2071708141037781407#m
by gardnr
6/30/2026 at 4:35:14 AM
If they really managed this from pre-training a 1.6 T parameter model through to post-training without NVIDIA, Dwarkesh Patel got what he wanted.by BoorishBears
6/30/2026 at 6:23:40 AM
It is interesting how much people doubt Huawei’s capabilities in this area - Jensen does not (in the dp interview) - of course you can dismiss this as him talking his own book.by chvid
6/30/2026 at 4:45:49 AM
Who? What did he want?by Jabrov
6/30/2026 at 5:31:14 AM
Dwarkesh Patel has AI/ML guests on his podcast. BoorishBears may have been referring to the Jensen Huang episode where they discuss TPUs: https://youtu.be/Hrbq66XqtCo?t=982by gardnr
6/30/2026 at 11:46:18 AM
Specifically Dwarkesh couldn't understand that GPUs are not enough: it's GPUs plus multiple ecosystems to leverage them at massive scale during training vs inference.Instead of giving China open access to US controlled chips and creating a misalignment between labs that want to train a model on whatever is best, and hardware manufacturers that need labs to suffer the growing pains for their new ecosystems built from scratch... we removed the option from the board and now they've beat the growing pains decisively, with a speed that reflects the non-optionality.
by BoorishBears
6/30/2026 at 2:55:12 PM
I don't listen to Dwarkesh but I'm aware of who is and his influence. I was baffled that he could not understand it...Don't know if he had his own agenda or just not intelligent (which is scarey for someone with influencec), but I sensed the frustration in Jensen Huang for something that is fairly obviously.The same scenario happens all the time when the US takes away something from China and China doubles down, gets into survival mode and then beats the US.
by fma
7/1/2026 at 3:46:26 AM
The Chinese ecosystem has not caught up; in fact, it's falling further behind, due to export restrictions on semiconductor manufacturing equipment. Even if America sold China all the chips Nvidia wants to, the CCP would still develop chips as quickly as possible as a matter of supply chain security.by fwipsy
7/1/2026 at 4:06:46 PM
While some years might pass until they will really catch up, that does not prevent them to find workarounds for their weaknesses.For example, they recently have demonstrated a supercomputer faster than any of the US supercomputers.
Unlike the recent European supercomputers, which like the US supercomputers have been built by buying racks from HPE Cray, because China was not allowed to buy such things they have developed their own custom CPUs, designed in China, which have surpassed in throughput the AMD GPUs used in the fastest US supercomputers.
The Chinese CPUs match in memory bandwidth per socket the latest AMD MI355X GPUs (8 TB/s), while being significantly faster than the older AMD GPUs installed in the US supercomputers.
While the purpose of the new Chinese supercomputer is mainly for scientific/technical computing tasks that need high FP64 throughput, the CPUs used in it also have high enough BF16/INT8 performance and memory bandwidth and interconnection bandwidth (1.6 Tb/s directly from each CPU socket) to be able to train any big LLM.
So the evidence does not show China falling behind, but at least in certain directions they are already exceeding the performance of what they have been forbidden to buy.
For something like training a big LLM, the only disadvantage of the current Chinese devices is a lower energy efficiency, of only 65% to 70% of that of the best NVIDIA GPUs.
However that is not really a problem for China, as they have abundant cheap energy.
by adrian_b
7/2/2026 at 9:38:44 AM
Moving forward requires two parties with two very different financial incentives to cooperate in a way that harms the incentive of the other: the manufactures making the chips, and the companies using the chips to build AI.Even if the Chinese government tells them to prioritize homegrown solutions, the A effort and players at labs are going to be on pushing the frontier, and the B effort/players goes to solving the teething problems
by BoorishBears
6/30/2026 at 8:39:39 PM
huh? who knows what they did, it's not like any of it is audited. it sounds like they started with deepseek v4 pro, and made a bunch of random changes to it, and called the parts of it different things?by doctorpangloss
6/30/2026 at 9:52:05 PM
The preview version was released on the same date along with deepseek v4 pro.by MikuMikuMe
6/30/2026 at 1:13:11 PM
[flagged]by jingpostmedia