Here's my question:
if agents continue to get better with RL, what is future proof about this environment or UI?
I think we all know that managing 5-10 agents ... is not pretty. Are we really landing good PRs with 100% cognitive focus from 5-10 agents? Chances are, I'm making mistakes (and I assume other humans are too)? Why not 1 agent managing 5-10 agents for you? And so on?
Most of the development loop is in bash ... so as long as agents get better at using bash (amongst other things), what happens to this in 6 months?
I don't think this is operating at a higher-level of abstraction if agents themselves can coordinate agents across worktrees, etc.
Having built and run agentic systems in production — the "why not agents managing agents" question has a practical answer: the orchestration layer is where the most expensive mistakes happen. Trust boundaries, escalation decisions, when an agent should act vs ask — that's engineering judgment, not boilerplate.
What's future-proof isn't the UI chrome, it's maintaining human visibility into the decision layer. The agents will get better at everything below that line. The line itself is the thing worth building tools around.
Interesting thoughts - thank you! And directionally agree - given that agents are becoming ever better, they'll take more and more of the orchestration on themselves. Still, we believe that developers need an interface to interact with these agents; see their status and review / test their work. Emdash is our approach for building this interface of the future - the ADE :)
> Still, we believe that developers need an interface to interact with these agents;
CLIs like claude code equally improve over time. tmux helps running remote sessions like there were local.
Why should we invest long time into your „ADE“, really?
> see their status and review / test their work
Won’t that be addressed eventually by the CLIs themselves?
Maybe you’re betting on being purchased by one of the agentic coding providers given your tool has long term value on its own?