4/8/2026 at 7:45:29 PM
I'm skeptical that this is going to lead to optimal orchestration ... or rather, skeptical that open source won't produce a far better alternative in time.
The best performance I've gotten is by mixing agents from different companies. Unless there is a "winner take all" agent (I seriously doubt it, based on the dynamics and cost of collecting high-quality RL data), I think the best orchestration systems are going to involve mixing agents.
Here, it's not about the planner, it's about the workers. Some agents are just better at certain things than others.
For instance, Opus 4.6 on max does not hold a candle to GPT 5.4 xhigh in terms of bug finding. It's just not even a comparison, iykyk.
It's almost analogous to how diversity of thought can improve the robustness of outcomes in real-world teams. The same thing seems to be true in mixture-of-agent-distributions space.
by mccoyb
4/8/2026 at 8:24:38 PM
My fear is that this is going to lead to an optimal orchestration language. For example, that Claude switches to Sumerian for all communication between agents. It's one thing if they try to silo like that, but my real fear is that it may actually perform well.
(Not sure if it would be Sumerian, Esperanto or something more artificial. As long as it is esoteric enough for one company to hoard all the expertise in it.)
by sjdv1982
4/8/2026 at 8:43:47 PM
I've seen Antigravity outputting Chinese characters in its thinking traces from time to time.
I also remember Chinese being discussed as a potential orchestrating language, but I don't remember the sources, so this is 100% anecdotal.
by mrbungie
4/8/2026 at 7:56:32 PM
Another way to think about it:
For Anthropic to have the best version of this software, they'd have to simultaneously ... well, have the best version of the software, but also beat every other AI company at all subtasks (like: technical writing, diagramming, bug finding -- they'd need to have the unequivocal "best model" in all categories).
Surely their version is not going to allow you to e.g. invoke Codex or what have you as part of their stack.
by mccoyb
4/8/2026 at 7:56:01 PM
Yeah, this has been my experience too, mixing agents/models from different companies.
Having Opus write a spec, then send to Gemini to revise, back to Opus to fix, then to me to read and approve ...
Send to a local model like Qwen3.5 to build, then off to Opus to review ...
This was such an amazing flow, until Anthropic decided to change their minds.
by intothemild