Why can't you do the planning ? Figure out what needs to be done , break it down into small tasks and then ask the agent to execute those small tasks?
When we executed projects in the past, this is what I would do as a lead: figure out the overall software architecture and delegate the tasks to developers.
This way I always knew how the system worked and could extend it as needed. I am not in development role anymore but I am trying to understand why we are delegating planning and software architecture to coding agents?
Maybe I spent 2-4 hours reviewing it, checking things with colleagues etc.
Then I press "go" and maybe an hour later I have a tested system ready for manual review.
It's plans are at least as good as any I've seen. Their weakness is if there are unstated assumptions I have about how things need to be done, so most of my time is now getting those assumptions stated properly and then reviewing.
Why wouldn't I use this? It's the best tool I've used in my 30 years of professional programming.
Even in that situation, I think I would still only feel comfortable approaching the task as I would do it without AI, and using the AI to accelerate the parts that would be time-consuming. E.g. finding where/how feature X is implemented, how it would affect the overall system if I were to change it this way, etc.
…but it’s than just that (in most cases; I am just lazy sometimes); but fundamentally there’s a limit to how much complexity people can comprehend.
We are good at working at high level abstractions, modules with clear apis that can be sprung to together into some kind of feature.
You don’t need to look inside the black box of the module if you trust the implementer; Ive never opened up the internals of a calendar be like “how does this work?”. I just don’t care. It’s a calendar. I use the api.
I think most people are using these tools in this way; very few people are having an agent write a plan, then a sub agent review it, no human in the loop. Those are for prototypes and are yolo cowboys using open claw and playing with the phones instead of working; we have a few at work, but their PRs are regularly rejected as slop.
…but, realistically; many people aren’t software architects. They may not even know coding patterns, forget architecture patterns.
Having an agent spit out generic software architecture is probably better than what they were producing before.
Writing a module / feature using generic architecture and planning is probably better than random code spaghetti right?
It’s easy to lament the loss of craft here, but at the end of the day, the models today do an ok job of this. The models of tomorrow will probably be better at it than many people.
Architecture is easy composed to actually implementing things. You just wave your hands from your ivory tower and say “more event sourcing”.