6/16/2026 at 10:12:35 PM
Given that DeepSwe is one of the very few coding benchmarks worth taking a look at, this achieves a rather excellent result on it (not far from Opus 4.8). From looking at the results and my own impression of 5.1 and other models, I think this is the best Chinese coding model by some non-insignificant margin.
by osti
6/16/2026 at 10:19:46 PM
I've been very pleased with its performance over the last few days. It's definitely not near Opus 4.8 level, but it's very impressive nonetheless, and it does do design extremely well.
by LaurensBER
6/16/2026 at 11:10:40 PM
> it does do design extremely well
Better than Opus?
by ebbi
6/17/2026 at 2:50:56 AM
I don't know what people mean when they say design lol, is it for frontends?
by osti
6/17/2026 at 6:39:46 AM
Yeah, that's what I mean anyway. Each model has certain design tropes it repeats everywhere, and some of them are very old-school or not really UI best practice. And then in the more ambitious cases, where you ask for a feature without being prescriptive about UI needs, the end result is sometimes atrocious, with weird font use, colours, etc.
by ebbi