> It’s really disappointing to see the on-device models being limited to so few devices.

At first I thought it was the usual planned obsolescence. Then I realized it may be a genuine technical limitation. I suspect several of the features need an embedding model running on device in order to work. Embedding models are small compared to LLMs but, depending on their capabilities, could still be what drives the memory requirement.
