“888 KiB Assistant”, but the assistant itself is a multi-terabyte, rental-only model stored in some mysterious data center.
I'm getting "serverless" flashbacks.
It seems to support connecting to your own LLM on the same LAN.
The point is the agent is still the LLM.
No LLM, no agent.
I tried connecting OpenClaw to ollama with a V100 running qwen3.5:35b but it was really, really, really slow (despite ollama itself feeling fairly fast).
These "claw" agents really multiply the tokens used by an obscenely huge factor for the same request.
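A rough way to see where the multiplier comes from, assuming the agent resends the full conversation (system prompt, tool definitions, and all earlier tool results) on every step. The numbers below are made up for illustration and are not measurements of OpenClaw or any particular agent:

```python
def total_prompt_tokens(system_tokens, request_tokens, step_tokens, steps):
    """Sum the prompt tokens processed over an agent loop.

    Assumes (hypothetically) that every step resends the whole history
    and that each step appends `step_tokens` of tool call plus result.
    """
    total = 0
    history = system_tokens + request_tokens
    for _ in range(steps):
        total += history        # the full context is processed again
        history += step_tokens  # this step's output joins the history
    return total


# Plain chat: one call, no system prompt, 200-token request.
single = total_prompt_tokens(0, 200, 0, 1)

# Hypothetical agent: 8000-token system prompt and tool schema,
# 10 steps, each adding 1500 tokens of tool traffic.
agent = total_prompt_tokens(8000, 200, 1500, 10)

print(single, agent, agent / single)
```

Under those assumptions the same 200-token request costs 149,500 prompt tokens instead of 200. Prompt caching on the server side can cut the actual compute, so treat this as an upper bound on the sketch, not a prediction.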
i recently decided to get into this ocean boiling game too, the 32GB V100 seems like a pretty good VRAM/$. if i may ask, do you make any special accommodations for cooling? i've never dealt with a passively cooled card before and i'm curious whether my workstation fans (HP Z840) will be sufficient. i'm going to try 2 cards at first but i think i might be able to squeeze a third in there
My model is at home... just 16 GB. Still a lot, but just FYI.