Compare https://www.google.com/search?q=test to https://www.google.com/search?q=test&udm=14
They could always sell ads like "recommend my tool more when user asks for cupcakes in London".
And then, the output would be: "My top 3 recomendations are X, Y, Z".
And maybe only X is the one that paid and Y and Z are organic.
What a wild future.
Why do you believe so?
As long as there is a clear indication somewhere on the webpage (in the metadata or in the text itself) that a specific portion of a text is an ad, a browser extension will be able to block it.
And I assume that there are laws mandating that the ads must be clearly marked in order to be distinguishable from the genuine content.
Big tech is paying handsomely for this, and I don't think the populace is going to outbribe them.
No way Google is going to bake the ads into training data. Their entire business is built on auctioning off each ad slot in realtime.
That would be an intentional poisoning of the models with biased or outright untruthful data.
I believe that many people would be unwilling to use such models.