6/4/2026 at 3:54:56 PM
Better performance than TQ and better quality than FP16? Am I reading this right?
by throwa356262
6/4/2026 at 5:04:42 PM
It's not better quality: 59.3% vs. 59.4% for FP16 on AIME 25.
by qeternity
6/5/2026 at 12:42:54 AM
0.1% is within the margin of error. Depending on the performance boost, it might be worth taking a minuscule quality hit.
by sheepscreek
6/6/2026 at 11:07:00 AM
I think it very much is worth it! But the point was that quality didn't magically increase.
by qeternity
6/4/2026 at 9:33:54 PM
Any divergence from full precision (even if the benchmark score is better) is error.
by electroglyph
6/5/2026 at 4:32:20 AM
Just pretend that it is the next update step in training. You didn’t train your model to step=inf, I hope?
by 7e
6/4/2026 at 5:02:26 PM
Faster than FP16, not better quality, I guess.
by thefox96
6/4/2026 at 4:55:19 PM
[dead]
by pbich