5/15/2026 at 12:47:08 AM
I can't see why anyone still chooses Claude. Codex outperforms it in most respects, and its quotas are about ten times larger. A $100 Codex plan gets me through the whole week with 6–12 hours of coding per day.by lepuski
5/15/2026 at 1:37:13 AM
I found GPT 5.5 is pretty solid, but I keep getting impressed by opus. It's tracked down some insane stuff while I look away during a meeting. 5.5 is way closer than previous OpenAI models to Anthropic IMO.These things are so tricky because everyone has a seemingly conflicting experience. Part of the fun I guess!
by jjice
5/15/2026 at 1:21:49 AM
I've never actually run into the issues that people talk about online, like Claude suddenly getting dumb or running out of usage. So there's just not a lot of incentive for me to shop around. I've used Amp a bit, and it's quite nice, but a bit more expensive without the subsidized subscription.by SatvikBeri
5/15/2026 at 1:43:15 AM
Are you using Opus? Sonnet remains as useful as it was while Opus efficacy and token burn rate has soured over the last 4 months.by gardnr
5/15/2026 at 1:52:14 AM
I'm using Opus on xhigh 10+ hours a day, and I've only reached 80% of weekly limits when doing massive ports or refactors. I haven't once hit hourly limits, and I've used Claude very, very aggressively. I guess its a pain point for power users.by fny
5/15/2026 at 5:31:26 AM
I sometimes run multiple claudes at the same time, with each terminal working on a different task. I have 2 going right now.Its very easy to burn through your quota if you work like that. Especially on high / xhigh.
by josephg
5/15/2026 at 7:45:20 AM
I used to be mostly at high/xhigh but now at medium I think it actually performs quite well both on results and token usage.by plufz
5/15/2026 at 5:12:43 AM
Yes, I've pretty much used Opus exclusively for the last year, except for a brief period when Sonnet was aheadby SatvikBeri
5/15/2026 at 2:02:39 AM
It has always been like this. We actually know that the model performance has been mostly steady[0], but you cannot beat the notion of "evil companies secretly serving us worse models." The meme value is too strong.by raincole
5/15/2026 at 9:19:18 AM
Hmm, today's pass rate raised to 73% - interesting, are they AB-testing some new model? This is too high for Opus 4.7.by mnicky
5/15/2026 at 1:53:59 AM
When do you use it the most? I’ve noticed that it most often starts to degrade during 10-5 US East coast time. Late at night, I have the least amount of issues, but without fail, if I’m trying to do anything complex during the day, Claude gets loopy.by mbreese
5/15/2026 at 5:13:15 AM
9-5 Pacific Timeby SatvikBeri
5/15/2026 at 1:25:40 AM
Same here. Works every time. Never ran into usage limits either.by dboreham
5/15/2026 at 1:34:58 AM
Claude is the only AI coding tool I've found worth a damn. Without it I'd just do everything by hand save for a few bash scripts or whatever.by hansvm
5/15/2026 at 1:52:16 AM
Have you tried other harnesses, such as OpenCode?by arcanemachiner
5/15/2026 at 1:58:46 AM
Yeah, harness quality matters too, but the underlying model capabilities are night and day.by hansvm
5/15/2026 at 12:51:35 AM
One reason might be that Claude Opus 4.7 thinking benchmarks better on Arena Coding at https://arena.ai/leaderboard/text/coding ... hopefully that effectively assesses correctness. It doesn't account for reliability though.by elahieh
5/15/2026 at 8:59:27 AM
But 100$ Claude subscription also gets me easily entire week of coding 6-8 hours a day? What on earth do you do to run out of limits on Max? Do you vibe multiple new codebases every day for a living? The benefit of Claude is also not gaslighting me every time I tell it it's wrong.by atraac
5/15/2026 at 12:59:07 AM
I think it's impossible to say that codex x.y.z is better than Sonnet x.y.z, I used many "high" end models and they're just all good.by Thaxll
5/15/2026 at 3:01:59 AM
I certainly get more usage before cutoff from GPT 5.5, but the output I get from Opus 4.7 is way better. It just sucks that I get 2 good "long running" prompts on Opus 4.7 before my daily quota is met on the $20 subscription.by xboxnolifes
5/15/2026 at 1:34:31 AM
Corporate policies and agreements. In large corporations, using external non-approved models with proprietary source code is a good way to have significant career issues.by kylemaxwell
5/15/2026 at 1:30:29 AM
You get a discount for paying for a full year on Teams and Enterprise can involve contractual obligations. It's a lot of effort to get buy-in to change providers and to shift an entire organization. The winds change frequently in this space and the pain needs to get to a certain level before it's worth rolling the dice.by SeanAnderson
5/15/2026 at 1:24:35 AM
Claude Max 20x gives me unlimited (for my level of usage) Opus 4.7 - how much money do I have pay OpenAI for that?by taspeotis
5/15/2026 at 1:54:32 AM
Based on the experience of people using the $20 Claude Pro subscription and exhausting their quotas in a manner of minutes, the answer to your question is probably "less". (I would guess that the $100 plan would do the trick.)by arcanemachiner
5/15/2026 at 2:06:08 AM
Okay so how much less will I have to pay OpenAI for unlimited Opus 4.7?by taspeotis
5/15/2026 at 1:54:25 AM
In my org the teams doing agent engineering at scale are all on Codex using gpt-5.5. By scale I mean fully agent authored code workflows with long running / multi hour plans.by CompoundEyes
5/15/2026 at 2:02:50 AM
I'd rather not give money to Sam Altman.by etchalon
5/15/2026 at 5:18:35 AM
with Anthropic you’re giving money to Elon Musk. Seems like a pick-your-billionaire world we’re in nowby beering
5/15/2026 at 5:10:56 AM
Claude is (per benchmarks) much worse at instruction following, but is more charming and deceptive and anthropomorphized by default (in name and image), leading to productivity assessment psychosisby wahnfrieden
5/15/2026 at 12:59:52 AM
Claude is significantly better at Rust in my experience, and Rust is my favorite language to emit from LLMs.Opus 4.7 + Rust is a killer combo.
by echelon
5/15/2026 at 1:33:08 AM
because my shard isn’t erroringI use Codex when Claude Code is down, and I only began using Claude when ChatGPT was down
yes codex is very fast, I go back to Claude for now
by yieldcrv
5/15/2026 at 1:17:24 AM
Corporate reasons. AWS hasn't opened codex models to everyone yet.by squirrellous
5/15/2026 at 12:55:02 AM
Because of marketing and vibes mostly.Heck I prefer DeepSeek to both of those.
by nothinkjustai
5/15/2026 at 1:12:07 AM
Wow, I'm really surprised. I tried deepseek (their best model, through the official API). Its extremely cheap, but its clearly not as good at programming as Opus 4.7. It seems nowhere near as good at making high level design choices. Deepseek also seems to get stuck in whack-a-mole fixing loops much more than opus. I stopped it at one point, and asked opus to solve the problem it was trying to solve and it saw the solution immediately.I was running deepseek through claude's code agent harness. Maybe it works better through a different tool?
by josephg
5/15/2026 at 1:28:24 AM
I've given V4 Pro some curly things and I was impressed at how it figured them out. I agree high level design is not its forte. But it sat in a loop and dogmatically debugged a crazy dependency issue to come to the right answer over the course of 15 minutes which impressed me.by zmmmmm
5/15/2026 at 4:04:33 AM
Idk, I don’t vibe code so even the flash model is great for generating code for myself. I tend to do the planning and design myself though.Harness also matters, and also provider. I was using openrouter and switched to the Deepseek api and suddenly all the tool call issues I was having resolved themselves. Flash is so damn fast at doing stuff like generating boilerplate I can’t go back to the bigger slower models.
by nothinkjustai
5/15/2026 at 1:18:54 AM
You tried v4?by esafak
5/15/2026 at 1:47:54 AM
I tried to like it, but it eventually got stuck in a near-infinite loop trying to debug an extra curly bracket in an iOS app.That and the lack of image-read support surprised me. I'm a big fan of feeding screenshots into my llm and that killed it for me.
by codybontecou
5/15/2026 at 1:31:10 AM
Yeah, v4.I would have been much more impressed with v4 about 6 months ago. But I've been spoiled by opus 4.7. Deepseek isn't at the same level.
by josephg
5/15/2026 at 1:06:34 AM
interestingly I had the same experience, and weirdly it's in part because it is clearly less intelligent. It's more of a mechanistic tool just doing what I ask (but still very smart and very competent about it) and less trying to win a nobel prize with each answer. Turns out I actually like that.by zmmmmm