Ask HN: What was your "oh shit" moment with GenAI?
Now it sits in a slack channel, and I watch it doing work, responding to ambiguity, and taking feedback/edits all day. It's unreal. It's literal magic. It saves a HUGE amount of time and gave us a pattern to do more.
This is the real deal. It's not easy to find problems with the right shape, and it's not easy to build agents that fit even when you do... but once it clicks, it clicks.
I use it professionally all the time and could cite technical scenarios where it’s become almost indispensable, but saving me time and money and reducing stress on this mundane stuff… now imagine applying to people’s stressors: job searches, health, big purchases, debt… there’s an opportunity to actually make people’s lives better. After 30 years of hype cycles, I should be wary of techno-optimism. But here I am feeling cautiously optimistic anyway.
For some people that matches their expectation or they don't really have an expectation. While for other people it doesn't match their expectation.
When deepseek again produced an entire web app that somewhat looked alright.
When Gemini could finally produce json was I specified.
The issue is, all LLMs can do. When they do, is boilerplate and code a mediocre coder could produce if they cared to try and insist.
In a way we should praise the ability of these things, but at what (in) efficiency. Code still need to be reviewed as we can't trust these things and context got a limit to entertain the idea of possibly having them fix their own mess.
The amount of masterpiece level art flowing per hour was astounding.
For every one doing a ninja waifu, there were ten doing art from davinci and leonardo crossed with hockney.
it almost gave you art sickness
And software that I can imagine I might want to "make" or have at my fingertips is readily available even though I have a busy schedule with very little free time!
Also, I love feeling like a manager whose direct report actually does what I tell it to. Crazy good feeling.
That was the day I realised the plagiarism potential llms has.
Aka handsome, confident successful, affluent alpha male on a boat, yet looking perfectly like me.
I asked it to write a script that would search for a specific string in footers in a massive series of DOCX files and change them according to some rules. The strings ended up being embedded in cells within an invisible table in the footers, the LLM realized this and switched strategy to a full deep traversal of the underlying XML. It correctly processed like 50 of these files in about 10 minutes, using libraries I wasn't aware of. I had spent an hour being annoyed before trying.
It was an "oh shit" moment for at least that category of work.
Frankly, to an outsider whatever it presents looks legit, but as an expert I recognize its failures, which makes me even more entrenched in the idea to never use it outside my area of expertise.
I have a question for all them believers: If on a hypothetical scenario you, having no medical experience, find yourself and your child on a mountain, 12 hours away from nearest road, and your offspring is having appendicitis (let's assume your recognize this 100%), with a sharp knife and Claude at your disposal - would you risk to operate on your child? Or hurry the fuck down to get him to a hospital? I know I would chose to get him to a hospital, because that would be a better chance for my kid to live than me to operate on my kid with Claude's assistance. I am pretty sure I would kill my kid on that mountain. So yeah, outside my area of expertise I don't trust Claude one bit.
Grok just did these things for me, no questions asked, no ethical judgments. No woke.
Elon really doesn't get enough credit for Grok. People don't want the most powerful reasoning model or "constitutional AI". They just want a model that does what they say. Elon understood that insight (like he usually does) and no one else really did and that's probably why Grok has been growing rapidly over the last two years or so.