4/14/2026 at 4:24:37 AM
Last week I got together with my math alumni friend. We cracked some beers, we chatted with voice mode ChatGPT and toyed around with Collatz Conjecture and we sent some prompt to a coding agent to build visualizations and simulation. It was a lot of fun directing these agents while we bounced off ideas and the models could explore them.I think with the right problem and the right agentic loop it’s clear to me improvements will speed up.
by bgirard
4/14/2026 at 4:45:24 AM
I think voice mode uses weaker models, just an FYI relative to the SOTAby drakenot
4/14/2026 at 4:04:52 PM
The bigger problem for me is that the realtime voice modes lack tool use, so they can't look anything up or do anything. Model strength definitely also matters, but even dumb models can be helpful when they can look things up and try things out. And smart models that don't do those things kinda suck.by pxc
4/14/2026 at 12:13:34 PM
Can get around this with a local STT model and use text input but UX is probably clunkierby SOLAR_FIELDS
4/14/2026 at 5:41:24 AM
Definitely, seems like gpt 3by scrollop