Hacker News

Show HN: Axe – A 12MB binary that replaces your AI framework

https://github.com/jrswab/axe
Axe treats LLM agents like Unix programs—small, composable, version-controllable. Are we finally doing AI the Unix way?
Great work! Kind of reminds me of ell (https://github.com/MadcowD/ell), which had this concept of treating prompts as small individual programs that you can pipe together. Not sure if that particular tool is still maintained, but your Axe tool caters to that same audience of small, short-lived, composable AI agents.
I really like seeing the movement away from MCP across the various projects. Here the composition of the new with the old (the ol' Unix composability) seems to hum very nicely.

OP, what have you used this on in practice, with success?

Nice. There's another one also written in Go (https://github.com/tbckr/sgpt), but I'll try this one too. I love that open source creates multiple solutions and you can choose the one that fits you best.
The Unix-style framing resonates a lot.

One thing I’ve noticed when experimenting with agent pipelines is that the “single-purpose agent” model tends to make both cost control and reasoning easier. Each agent only gets the context it actually needs, which keeps prompts small and behavior easier to predict.

Where it gets interesting is when the pipeline starts producing artifacts instead of just text — reports, logs, generated files, etc. At that point the workflow starts looking less like a chat session and more like a series of composable steps producing intermediate outputs.

That’s where the Unix analogy feels particularly strong: small tools, small contexts, and explicit data flowing between steps.

Curious if you’ve experimented with workflows where agents produce artifacts (files, reports, etc.) rather than just returning text.
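
The kind of pipeline I mean, sketched with hypothetical agent names and flags (the `axe` invocations here are illustrative, not taken from the project's docs): each step reads stdin, leaves a file behind as an artifact, and passes output downstream.

```shell
# Hypothetical pipeline: each single-purpose agent produces an artifact.
git diff | axe run reviewer > review.md
axe run summarizer < review.md > summary.txt
axe run changelog-writer < summary.txt >> CHANGELOG.md
```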

If I have time I want to try this today because it matches my LLM-based work style, especially when I am using local models: I have command-line tools that help me generate large one-shot prompts that I just paste into an Ollama REPL, then I check back in a while.

It looks like Axe works the same way: fire off a request and later look at the results.
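
The fire-and-forget pattern also works without the REPL; a sketch, assuming a locally pulled model (the model name and file paths are placeholders):

```shell
# Run a one-shot prompt in the background, collect the result later.
ollama run llama3 < big-prompt.txt > result.txt 2> err.log &
```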

This looks really interesting. I'm curious to learn more about security around this project. There's a small section, but I wonder if there's more to be aware of, like prompt injection.
Looks interesting. I agree that chat is not always the right interface for agents, and an LLM-boosted CLI sometimes feels like the right paradigm (especially for dev-related tasks).

How would you say this compares to similar tools like Google's Dotprompt? https://google.github.io/dotprompt/getting-started/

There is no "session" concept?
Is the axe drawing actually a hammer?
I’m having trouble understanding when/where I would use this. Is this a replacement for pi or Codex?
I built Axe because I got tired of every AI tool trying to be a chatbot.

Most frameworks want a long-lived session with a massive context window doing everything at once. That's expensive, slow, and fragile. Good software is small, focused, and composable... AI agents should be too.

Axe treats LLM agents like Unix programs. Each agent is a TOML config with a focused job, such as code reviewer, log analyzer, or commit message writer. You can run them from the CLI, pipe data in, and get results out. You can chain them together with pipes, or trigger them from cron, git hooks, or CI.

What Axe is:

- 12MB binary, two dependencies. No framework, no Python, no Docker (unless you want it)

- Stdin piping: `git diff | axe run reviewer` just works

- Sub-agent delegation: agents can call other agents via tool use, depth-limited

- Persistent memory: if you want, agents can remember across runs without you managing state

- MCP support: Axe can connect any MCP server to your agents

- Built-in tools: web_search and url_fetch out of the box

- Multi-provider: bring what you love to use, whether Anthropic, OpenAI, Ollama, or anything in the models.dev format

- Path-sandboxed file ops: keeps agents locked to a working directory

Written in Go. No daemon, no GUI.
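
To make the agent-as-TOML idea concrete, a minimal reviewer agent looks roughly like this (field names simplified for illustration; see the repo for the actual schema):

```toml
# reviewer.toml - illustrative field names, not the exact schema
name = "reviewer"
provider = "anthropic"
model = "claude-sonnet-4"  # placeholder model id
prompt = """
You are a code reviewer. Read the diff on stdin and list concrete issues.
"""
```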

What would you automate first?

The Unix-philosophy framing resonates — focused, composable, single-purpose agents are genuinely safer architecturally than monolithic long-lived sessions with massive context windows.

That said, composability introduces its own attack surface. When agents chain together via pipes or tool calls, each handoff is a trust boundary. A compromised upstream output becomes a prompt injection vector for the next agent in the chain.

This is one of the patterns we stress-test at audn.ai (https://audn.ai) — we do adversarial testing of AI agents and MCP tool chains. The depth-limited sub-agent delegation you mention is exactly the kind of structure where multi-step drift and argument injection can cause real damage. A malicious intermediate output can poison a downstream agent's context in ways that are really hard to audit after the fact.

The small binary / minimal deps approach is great for reducing supply chain risk. Have you thought about trust boundaries between agents when piping? Would be curious whether there's a signing or validation layer planned between agent handoffs.
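
As a strawman for that validation layer: even a shared-secret HMAC over each handoff would let a downstream agent refuse tampered upstream output. A sketch with standard tools, assuming both sides share a key (key distribution and storage elided):

```shell
# Upstream: sign the agent's output with a shared key before handing it off.
KEY="shared-secret"               # placeholder; load from a real secret store
OUT="upstream agent output"
SIG=$(printf '%s' "$OUT" | openssl dgst -sha256 -hmac "$KEY" | awk '{print $NF}')

# Downstream: recompute the HMAC over what was received and compare.
CHECK=$(printf '%s' "$OUT" | openssl dgst -sha256 -hmac "$KEY" | awk '{print $NF}')
if [ "$SIG" = "$CHECK" ]; then echo "verified"; else echo "rejected"; fi
# prints: verified
```

This only proves the bytes weren't altered in transit, of course; it does nothing about an upstream agent that was itself prompt-injected into emitting malicious-but-signed output.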

12MB for an "AI framework replacement"? That's either brilliant compression or someone's redefining "framework" to mean "toy model that works on my laptop." Show me the benchmarks on actual workloads, not the readme poetry.