Story Detail of id 48456209 | Liveview Hacker News

zmmmmm15 hours ago | on: Siri AI

It's interesting how anemic the use cases seem to be - we see the same things recycled over and over: "reword my email", "remove object from picture", "add a reminder", "summarise my text message which was already only 20 words long" etc etc. As if these are the major problems in people's lives.

I really feel like there's a fascinating valley of death between simple things that actually work and things of real value that are actually still beyond the horizon. They either aren't reliable enough, aren't accessible to the tech, or exceed the sophistication of our existing trust models. For example, I'm planning a trip. Booking a multiday holiday - there's a real beast that is time consuming, complex and painful. I test out the AI tools. They fail. Hard. Hallucinations all over the place, false confidence, inability to act, inability of me to trust their actions.

It's just nowhere near practical utility yet. Not "nearly there" but "not nearly half way there". I got the top tier of Gemini AI. Can it rent me a car? "As an AI I can absolutely guide you through the process of renting the car, but I can't physically access the web site or type in the details for you".

kristianc14 hours ago | parent | next

An alien landing on earth consuming Apple marketing content would be under the impression that humans did nothing but organise hikes to Big Sur with their friends.

loading story #48459624

microtonal9 hours ago | root | parent | next

And even if I organized a hike to Big Sur, Apple Maps would be pretty worthless because it doesn't have trail visualization and navigation worth speaking of (ok, maybe in Big Sur, but certainly not the rest of the world).

So I end up pulling out the trusty old Garmin gpsmap with cycle/hiking maps, that survived drops from 1.5 meters at 30 km/h as I was gliding of a mountain with my bike.

loading story #48460966

loading story #48462484

pastel873913 hours ago | root | parent | next

Well.. it’s not the furthest thing from the truth in the bay

loading story #48460107

loading story #48457924

loading story #48458469

loading story #48458404

loading story #48457638

loading story #48457203

idle_zealot14 hours ago | parent | next

Even the aspirational use cases you're talking about basically are just "digital secretary." There's a massive problem with that even if the models end up being capable in the future. The value of a secretary is that you know them, they know you, and you trust them to do things right. There are stakes if they don't. No company can provide that as a service at scale for everyone without it being a disaster. Not because it's not technically possible, but because of the incentives. That much power over the details of so many people's lives is irresistible; there will be persistent temptation to use it. The presence of that possibility makes the secretary impossible to trust.

9dev11 hours ago | root | parent | next

The vast majority of people never even asked for a personal assistant, because that isn’t something normal people have or do need in the first place. They aren’t so occupied/privileged/posh to need someone to do the trivial tasks of daily life for them.

This whole venue of technology is an exercise in ivory tower construction completely disconnected from ordinary people.

Snoddas10 hours ago | root | parent | next

I think you are wrong.

They never asked for one becase they never imagined being able to afford one.

The amount of administration organizing a normal household takes I suspect most would be glad to leave to someon/something they trust and that can be held accountable.

Today that someone needs to be a person (imo). But who knows, a startup may be plotting accountable digital assistans as we speak.

djaro9 hours ago | root | parent | next

I think theres some real use cases in the household department. I would love an AI that I just tell my nutrition goals (I want to eat X calories, Y protein, Z fiber, hit all vitamins) and it just generates a full meal plan for me each day. Like, I go to the store and it made the complete shopping list for me. And automatically updates the rest of the day if I tell it I skipped a meal or ate a snack.

jon-wood6 hours ago | root | parent | next

I get where you're coming from but at least for me you appear to be looking to automate away the interesting bits while being left with the tedious ones. What I want is the opposite to what you're asking for - let me dump a rough meal plan into whatever thing is doing this giving an overview of what meals I want to cook this week and then have it go place an order with the supermarket for delivery of the necessary ingredients taking into account what I've got in the house already.

watwut7 hours ago | root | parent

Eating disorder as a service does indeed sound like a business plan.

loading story #48463786

loading story #48460062

tempfile10 hours ago | root | parent

Obligatory: https://simonwillison.net/2025/Feb/3/a-computer-can-never-be...

pnut6 hours ago | root | parent

Pretty sure unaccountability is a desired feature of management decisions in most organisations.

That quote has an unexpressed precondition to the effect of "In order for an organisation to be objectively well run..." or "In order for an organisation to equitably benefit all stakeholders at all levels..." etc

jiscariot2 hours ago | root | parent | next

In the US, every time I file my taxes, I wonder what % of people don't meet the cognitive barrier to successfully file. I suspect a large portion of people offload tax filing to a service or accountant for numerous reasons, which is basically a personal assistant.

coffeemug6 hours ago | root | parent | next

Giving normal people something that has only been available to rich people is a staple of technological innovation. The problem in this case with Siri isn’t that people don’t want an assistant. It’s that it doesn’t actually work yet.

9dev3 hours ago | root | parent

But normal people have entirely different problems than rich people do! The amount of administrative overhead that would warrant a personal assistant is just vastly lower - most normal people:

  - don't travel frequently, 
  - don't have so many complex inquiries that require someone to research,
  - don't have super complicated taxes to file, 
  - don't go eating out in fancy restaurants that require special skills to get reservations in, 
  - don't have so many meetings to attend, 
  - don't receive hundreds of emails per day,
  - don't work on multiple projects at the same time,
  - don't organise festivities and social gatherings all the time.

Yeah, there probably are some things that could be simplified by delegating to someone, but they don't justify a human PA at all; and out of the remaining tasks, most are not really digital in nature: Going for groceries, doing chores, child and elderly care, interacting with other people, and so on. Digital assistants can't help you with any of these.

The one thing that would be useful - a kind of "chief of staff" that monitors your entire digital life and prioritises your every next step - is the antithesis to Siri and the like, which are merely reactive to your requests, not proactive in figuring out what needs your attention next. Let alone that that would be a total privacy nightmare, and a prime candidate for mass manipulation at scale.

microtonal9 hours ago | root | parent | next

Personally, I'd more interested in Reminders actually being able to sync lists properly and not delayed or reshuffling items while I'm typing before they would work on a personal assistant. Reminders (like Siri) has become the favorite joke of the family by now.

They don't even seem to get the basics right, why would I want another layer on top?

9dev3 hours ago | root | parent

[dead]

spotonm810 hours ago | root | parent

That's why I like reading HN. These people are smart enough to destroy the world but too stupid to realise they're doing it

zmmmmm13 hours ago | root | parent | next

it's an interesting question if any of the AI companies would be willing to step up and absorb the risk ie: to give the AI agent a "stake".

eg: if my booking is wrong, they will cover the cost and compensate me. It would sort of just come down to buying premium travel insurance for everyone that uses it. And insurance for anything else they do. It has to be one of two things - they either believe the risks are worth it (so then there should be a financial model that can absorb the cost of insurance to do it), or in fact, the risks are too great. At some point, if they keep offering the tech on a "use at your own risk" basis, they are implicitly communicating that they themselves think the risks are too great - so YOU shouldn't trust it either.

teiferer13 hours ago | root | parent

> eg: if my booking is wrong, they will cover the cost and compensate me

That would be nice, but it's the wrong angle. The reason people like real secretaries is not because somebody is compensated when things go wrong. It's because things don't go wrong. I don't use this thing if I need to fear things go wrong, even if I'd be compensated.

Maybe it would provide the right incentives for the companies though.

loading story #48457191

cjonas14 hours ago | root | parent

These use cases will just be built as "open source" (openclawd) or even custom one off application in the future. I've been building apps to run the tedious parts of my life recently. Meal planning, personal finance, bills, tax organization... Why would I pay for services that will be enshiftified when I can build a app that does exactly what I want in an afternoon. Yes the code is shit and it wouldn't scale... But it doesn't need to

whywhywhywhy10 hours ago | root | parent | next

> Why would I pay for services that will be enshiftified when I can build a app that does exactly what I want in an afternoon

Because the problem now took a whole afternoon to solved and sapped your creative energy instead.

losteric14 hours ago | root | parent

> Why would I pay for services that will be enshiftified when I can build an app that does exactly what I want in an afternoon.

When we talk about “the market”, the customer base, remember it’s a market that typically doesn’t know how to or care to even install an adblocker.

loading story #48456615

loading story #48456609

skylurk13 hours ago | parent | next

For fun I planned a holiday entirely with LLM's lately, and followed through.

It used good models and did a lot of searching, including searches in other languages. It got nothing right, riddled with fake places and times. It also found some weird and unique places I never would have considered.

I had a blast, brought me back to traveling pre-internet, requiring a level of spontaneity I had forgotten we used to depend on. 100% recommend it.

loading story #48460065

loading story #48457530

loading story #48457234

loading story #48457460

loading story #48458555

loading story #48458177

loading story #48457207

loading story #48462814

loading story #48460268

cootsnuck14 hours ago | parent | next

Yup, precisely. Turns out getting AI to be reliable at doing useful things is harder than we've all been led to believe by the dominant narratives.

https://www.normaltech.ai/p/new-paper-towards-a-science-of-a...

harrouet13 hours ago | parent | next

Spot on!

I am also under the impression that the LLM tech is plateauing before bringing the promised productivity. Great as a coding assistant, great a summarizing a text, translating, great a helping plan a trip...

But for the rest, e.g. act as a life assistant, it is still far off with no hope to reach the desired performance level.

I would not be surprisd to see OpenAI and the likes to start reverting to Siri v1 strategies, i.e. "if this then that" kind of agent routing.

loading story #48457346

loading story #48463331

MrDunham9 hours ago | parent | next

Agree and disagree on the "planning a trip" use case... as I'm sitting on a river cruise on an AI co-planned vacation (we found the cruise, AI set the daily itinerary).

Now the big (BIG) caveat is that I used Claude Code on my Max 20x plan from within VS Code. I have a fairly decent harness that I'd built and was sure to prompt it to run several subagents, including one that grounded walking times with Google Maps directly.

I'd say this is FAR beyond what the average person would do ("Hey Siri, plan me a trip to Prague") but also it shows that the models can do it with the right harness and guidelines. This wasn't that hard for me to do, so it seems to be more of a feature buildout ("the travel expert" AI) with a few markdown files than anything.

All told: web search for grounding times/locations, map grounding for walking paths and times, an adversarial agent to keep the model(s) honest, and a little bit of prompting and you've got a really great travel planner.

In short: the average person won't do this, but if I can build it in a few hours any of the 100% of people working at Apple/OpenAI/Anthropic who are smarter than me can build it and bake it into Siri (or ChatGPT, Claude, etc).

loading story #48460027

loading story #48463377

whywhywhywhy10 hours ago | parent | next

Extremely surprised we didn’t get the “book me a flight” example that has been an AI demo used over and over but everyone’s so particular about flights I can’t see many people just wanting to one shot it.

The laundry list of object removal, spacial photos, better speech to text etc is always just the latest open models just being slapped in there and branded as Apple.

Ultimately the meat of this presentation was the work of people outside Apple.

stevage8 hours ago | root | parent

For me "book me a flight to X on day Y" is so easy to do manually I don't need AI.

Where I do want AI is for really complex queries, like "find me a time and money efficient itinerary through Europe visiting places I haven't been before. Present options and I'll tell you what I don't like about each of them then we'll narrow in on an optimal solution"

loading story #48461356

panicinducer12 hours ago | parent | next

It reminds me of the Apple Vision Pro hype trailer. "Put on a helmet so you can view 2D photographs in 3D space." Present-day Apple sure has a way of making extremely impressive tech seem totally superfluous.

bsenftner7 hours ago | parent | next

The issue is anything that is of value requires some level of detail, of complexity, and that is only of interest to people that know that specific complexity, and it is a pain point for them. Now they'll care. Everyone else? Lost them. So, the marketing challenge is to find some aspirational complexity that people wish they knew, and how that can be solved with AI, and without turning that thing into a trivial nuisance, but a valued skill. That logical series right there is, well, too much for far too many.

audiala9 hours ago | parent | next

We are currently working on this with Audiala (iOS and webapp, Android should be released any coming days), making progress slowly but surely. We started to add tools to the AI chat to organise your trip and explore the place following your preferences and those of your group if you decide to share the trip. We can turn feedback into actual improvements pretty fast now, so would love to have yours and progress towards building the app you really want.

loading story #48462381

madrox12 hours ago | parent | next

I get the impression Apple designers don't actually use AI, and so have no idea what to build, since users don't know what they want from AI yet either.

loading story #48460230

blitzar10 hours ago | parent | next

> interesting how anemic the use cases seem to be

Have a conversation with the average Ai power user (outside of tech / coding) and this is the level the conversation will be on.

loading story #48462692

loading story #48460810

globular-toast9 hours ago | parent | next

Renting a car isn't a problem I have either. I've rented cars and vans plenty of times. I just phone up and talk to someone or email them. It's really easy.

A lot of these "problems" seem to stem from people just not wanting to interact with other people at all. Do we really want to become like Asimov's "Solarians"?

loading story #48461205

fruit202014 hours ago | parent | next

Booking and renting is certainly possible, they just need your auth credentials

loading story #48458135

9 hours ago | parent | next

{"deleted":true,"id":48458766,"parent":48456209,"time":1780997714,"type":"comment"}

fragmede12 hours ago | parent | next

All you're saying is that (the harness you're using for) Gemini sucks. OpenAI has their own web browser to fill out forms with and Claude has a whole Cowork feature that interacts with you computer including your web browser.

If new Siri still sucks, well, it's sucked the entire time. The worst of it is the security aspect where the setting to let you use Siri without authenticating hasn't worked since they added it! (still broken, iOS 26.5)

dyauspitr12 hours ago | parent | next

I sometimes do random weeklong roadtrips. ChatGPT is the best way to do this. Attractions that are esoteric, RV parks with 50A hookups for under $50, oldest/biggest/best restaurant with local food in a town, historical stops along the way, suburbs close to big cities but where hotel prices are lower etc.

dansquizsoft12 hours ago | parent

"I got the top tier of Gemini AI." - that's your issue, get a subscription to a lab offering actual frontier models like Anthropic and OpenAI.

Mistletoe12 hours ago | root | parent

Nonsense.

https://livebench.ai/#/?highunseenbias=true

loading story #48459103

#visit	13,680,282
#session	74,665
#live-session	0