Except I find that agents are much better at debugging than writing code. When the code gets to the state where the agents can no longer debug it and keep it running (as happened in Anthropic's failed C compiler attempt) humans will likely not be able to save the situation (in an efficient way). And without careful human supervision from the start, the code ultimately gets to that place.