I don't know if it's "anti-consumer" to NOT roll out free cloud LLM usage to everyone. The idea with only giving it to the devices with on-device AI capabilities is that ideally most of the tasks will cost Apple nothing because it will run on-device, and anything more complicated will start costing them tokens.
If they gave it to devices without on-device models, ALL Siri requests from people with older iPhones will suddenly be burning money.
Not to mention, if we assume responses from the cloud are better than the local model, then the older iPhones get an overall better experience than the newer ones.
They may have decided that local processing was a MVP feature either for faster responsiveness or to reduce cloud cost. It may have been additional memory pressure or a limitation in processing on the previous A-series chip. Or they may have simply decided it wasn't worth creating and validating Yet Another model.
If you want hosted AI you can already install the Gemini app or whatever. The only advantage Apple can offer is something that runs on device.
Or just say: ai for 15 pro not for 15.