Sure, but also, who cares? The machine code is completely incidental for most purposes.
Or maybe just compare Hermes vs OpenClaw for long-horizon personal agentic tasks. Which one performs better in offline inference personal finance analysis tasks?
Or read up on how the `/code-review` workflow works in Opus 4.8 and give me a guess as to how long it'll take Codex to implement it and which tool would be more appropriate for your engineering team (don't forget to include enterprise API token costs in workflows – it can spin up 100 agents in thirty seconds).
If you can figure out how to secure agents with simultaneous access to personal data and the internet to run unsupervised while avoiding the lethal trifecta (Willison, 2025) let me know.
It's like having a naive but super knowledgeable junior developer starting under you. It's obvious you'd learn a lot in how to communicate, framing, specifications, and what kind of follow-up you'd need to do to ensure good results.