Story Detail of id 47200084 | Liveview Hacker News

FuckButtons1 day ago | on: MCP server that reduces Claude Code context consumption by 98%

Yeah, the fact that we have treated context as immutable baffles me, it’s not like humans working memory keeps a perfect history of everything they’ve done over the last hour, it shouldn’t be that complicated to train a secondary model that just runs online compaction, eg: it runs a tool call, the model determines what’s Germaine to the conversion and prunes the rest, or some task gets completed, ok just leave a stub in the context that says completed x, with a tool available to see the details of x if it becomes relevant again.

mksglu1 day ago | parent | next

That's pretty much the approach we took with context-mode. Tool outputs get processed in a sandbox, only a stub summary comes back into context, and the full details stay in a searchable FTS5 index the model can query on demand. Not trained into the model itself, but gets you most of the way there as a plugin today.

FuckButtons19 hours ago | root | parent

This is a partial realization of the idea, but, for a long running agent the proportion of noise increases linearly with the session length, unless you take an appropriately large machete to the problem you’re still going to wind up with sub optimal results.

jerf17 hours ago | root | parent

Yeah, I'd definitely like to be able to edit my context a lot more. And once you consider that you start seeing things in your head like "select this big chunk of context and ask the model to simply that part", or do things like fix the model trying to ingest too many tokens because it dumped a whole file in that it didn't realize was going to be as large as it was. There's about a half-dozen things like that that are immediately obviously useful.

esperent1 day ago | parent | next

Is it because of caching? If the context changes arbitrarily every turn then you would have to throw away the cache.

FuckButtons20 hours ago | root | parent

So use a block based cache and tune the block size to maximize the hit rate? This isn’t rocket science.

wonnage17 hours ago | root | parent

This seems misguided, you have to cache a prefix due to attention.

21 hours ago | parent

{"deleted":true,"id":47202791,"parent":47200084,"time":1772330195,"type":"comment"}

#visit	12,937,556
#session	74,665
#live-session	0