Motra, my workout app on iPhone (with great Apple Watch support, including counting reps), uses Apple's foundation models to build a workout.
Not every use case will be a big agentic coding thing that uses millions of tokens.
Some on-device (or on-server) stuff might be small one-shot calls that just use what the OS provides.