DeepSeek Reasonix: DeepSeek-native coding agent with high caching and low cost

https://esengine.github.io/DeepSeek-Reasonix/

382 | Alifatisk | 10 hours ago | 184 | HN

embedding-shape 9 hours ago | parent | next

I'm not sure you need a "DeepSeek native coding agent" to take advantage of DeepSeek's cache. Yesterday, as the Codex quota usage issue still wasn't solved for me, I wrote a tiny little bridge so I could use DeepSeek V4 Pro via Codex, and it seems almost everything I did was cached, as far as I can tell: https://i.imgur.com/7eKn6wN.png (2026-05-23: Input (Cache hit): 39,123,200 tokens; Input (Cache miss): 1,692,286 tokens). The bridge does nothing special; it just massages the DeepSeek API shape into what Codex expects, with nothing particular about caching at all.

Besides being even better at the caching, I'm not sure what benefits you'd get compared to just firing up OpenCode with the DeepSeek API yourself. It'll similarly do caching for sure, it also "talks directly to api.deepseek.com" if that matters, and you'll get a much more mature harness.
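
For a sense of scale, the two token counts quoted in this comment can be turned into a cache hit rate. This is only arithmetic on the figures from the comment; the variable names are made up for illustration, and no pricing is assumed.

```python
# Token counts quoted in the comment above (2026-05-23 usage figures).
cache_hit_tokens = 39_123_200
cache_miss_tokens = 1_692_286

# Every input token is either a cache hit or a cache miss.
total_input_tokens = cache_hit_tokens + cache_miss_tokens
hit_rate = cache_hit_tokens / total_input_tokens

print(f"total input tokens: {total_input_tokens:,}")
print(f"cache hit rate: {hit_rate:.2%}")
```

On these numbers, roughly 96% of the input tokens were served from cache, through a bridge that does nothing cache-specific.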

skeledrew 8 hours ago | parent | next

Not a fan of that page. The animated typing and the resulting continuous resize of the example keep moving the content beneath it down and up. Such bad UX.

unshavedyak 7 hours ago | parent | next

It's pretty funny, i'm a $200/m Claude subscriber and i've had little need to use anything else. However, the more Claude has been restricting my workflow (notably around the recent IDE/-p usage change), the more i've been wanting to go elsewhere.

I'm concerned since i really want SOTA reasoning, but DeepSeek still has me interested.

declan_roberts 8 hours ago | parent | next

I love the focus on cache hit efficiency. Hats off to the DeepSeek team for creating a great product that maximizes cost efficiency for the user.

schaefer 8 hours ago | parent | next

Okay, I'm curious.

From the FAQ, I see:

>Can I point it at a self-hosted / private DeepSeek endpoint?

>Yes. Since 0.30 we accept non-standard key prefixes for self-hosted DeepSeek endpoints. Just point `baseUrl` at your internal address — the loop, cache strategy, and tool protocol are unchanged.

But my question is: if I use Reasonix to talk to a DeepSeek endpoint through OpenRouter, am I still getting the cache-hit benefits of this agent harness?

mmaunder 8 hours ago | parent | next

Unusable thanks to the top animation pushing the rest of the site down repeatedly as you’re trying to read.

singiamtel 7 hours ago | parent | next

I would've liked benchmarks against other harnesses showing the caching performance.

hirako2000 8 hours ago | parent | next

Good timing given the cost spike across other frontier models.

theanonymousone 8 hours ago | parent | next

Isn't caching a server-side thing? How does the agent affect it, significantly at least?
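
One way to picture how a client can influence a server-side cache: assuming the provider's cache matches on the leading part of each request (an assumption here, not something stated in the thread), the agent controls how much of the previous request it resends unchanged. The sketch below uses hypothetical message strings and a made-up helper, `shared_prefix_len`, to compare an append-only history with one where an earlier message was rewritten.

```python
def shared_prefix_len(prev, curr):
    """Count the leading items that are identical in both lists."""
    n = 0
    for a, b in zip(prev, curr):
        if a != b:
            break
        n += 1
    return n

# Hypothetical conversation history; each string stands in for one message.
turn_1 = ["system prompt", "user: fix the bug", "tool: file contents"]

# Append-only: the next request extends the previous one unchanged.
append_only = turn_1 + ["assistant: here is a patch", "user: run the tests"]

# Rewritten history: an old tool result was replaced before resending.
rewritten = [
    "system prompt",
    "user: fix the bug",
    "tool: [cleared]",
    "assistant: here is a patch",
    "user: run the tests",
]

reusable_append = shared_prefix_len(turn_1, append_only)
reusable_rewrite = shared_prefix_len(turn_1, rewritten)

print("messages reusable when appending:", reusable_append)
print("messages reusable after a rewrite:", reusable_rewrite)
```

Under that prefix-matching assumption, everything after the first changed message would count as a cache miss, which is why an agent's handling of history can matter even though the cache itself lives on the server.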

hebetude 7 hours ago | parent | next

Wow, the UI looks exactly like what I vibe coded yesterday. What a coincidence.

yalogin 7 hours ago | parent | next

Can someone give me an ELI5 version of what this is? It really sounds useful to Claude subscribers.

Is this improving the cache hit and hence overall efficiency of coding workflows?

Does it also let me host a local LLM (DeepSeek)? What are the minimum model requirements for this?

pkulak 7 hours ago | parent | next

Doesn't Pi Agent do exactly this? Assuming "append only" means they do some kind of compaction as well.

quotemstr 7 hours ago | parent | next

> no reordering, no marker-based compaction

Is this really the behavior you want? Yes, doing tool-result clearing and such will blow your cache, but if you do it only occasionally, it's still likely a win. Yes, cache hits are good, but not so good that it's okay to be profligate with context to preserve those precious, precious KVs.
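
The trade-off described here can be put into a toy cost model. Everything in this sketch is an assumption for illustration: the function `total_input_cost`, the relative prices (a cache hit billed at one tenth of a miss), the token counts, and the idea that a compaction turns the whole context into cache misses for that one turn. It is not a model of any particular harness or price list.

```python
def total_input_cost(turns, tokens_per_turn, hit_price, miss_price,
                     compact_every=None, summary_tokens=0):
    """Sum input cost over a session; compaction resets the cached prefix."""
    cached = 0   # tokens already seen by the server as an unchanged prefix
    cost = 0.0
    for turn in range(1, turns + 1):
        if compact_every and turn % compact_every == 0:
            # History is replaced by a summary, so nothing matches the cache.
            cached = 0
            uncached = summary_tokens + tokens_per_turn
        else:
            uncached = tokens_per_turn
        cost += cached * hit_price + uncached * miss_price
        cached += uncached  # this turn's tokens are cached for the next turn
    return cost

HIT, MISS = 0.1, 1.0  # hypothetical relative prices per token

# Short session: 4 turns of 100 tokens, compacting every 2 turns.
short_append = total_input_cost(4, 100, HIT, MISS)
short_compacted = total_input_cost(4, 100, HIT, MISS,
                                   compact_every=2, summary_tokens=50)

# Long session: 200 turns of 2000 tokens, compacting every 50 turns.
long_append = total_input_cost(200, 2000, HIT, MISS)
long_compacted = total_input_cost(200, 2000, HIT, MISS,
                                  compact_every=50, summary_tokens=5000)

print("short:", short_append, short_compacted)
print("long: ", long_append, long_compacted)
```

With these made-up numbers, compaction costs more in the short session but less in the long one, because cached tokens are still billed on every turn and an ever-growing context eventually outweighs the occasional full miss.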

canadiantim 8 hours ago | parent | next

So what's the best low-cost coding agent these days? Kimi 2.6? Qwen's latest closed model? Composer 2.5? DeepSeek?

sergiotapia 8 hours ago | parent | next

What AI model did you use for the website design? This is the second one I've seen with the exact same font and color scheme. Just curious, because Claude models lean towards purples, for example. Thank you!
