Ask HN: What was your "oh shit" moment with GenAI?

Some business users spent ~30 minutes on an internal process, and we prototyped an "Agent" in Slack to take over. At first it didn't work, then it didn't work some more, and eventually it ALMOST worked. Then one day it worked, and the old business process died, never to be revived.

Now it sits in a Slack channel, and I watch it do work, respond to ambiguity, and take feedback/edits all day. It's unreal. It's literal magic. It saves a HUGE amount of time and gave us a pattern to do more.

This is the real deal. It's not easy to find problems with the right shape, and it's not easy to build agents that fit even when you do... but once it clicks, it clicks.

I had received four very different bids for a home repair project. Just wildly different itemization breakdowns, costs, timelines, scopes, even formats. Opus helped me turn them into an apples-to-apples comparison, filled in missing areas with reasonable inferences based on the other bids, provided a nice PDF I printed to review with my partner, and even suggested key questions for follow-up calls. It really clarified the advantages of one of the bids.

I use it professionally all the time and could cite technical scenarios where it’s become almost indispensable, but saving me time and money and reducing stress on this mundane stuff… now imagine applying it to people’s stressors: job searches, health, big purchases, debt… there’s an opportunity to actually make people’s lives better. After 30 years of hype cycles, I should be wary of techno-optimism. But here I am feeling cautiously optimistic anyway.

I started to look at LLMs not as writing code, but rather as predicting what code they would expect someone to write given the context.

For some people that matches their expectation, or they don't really have an expectation, while for other people it doesn't match their expectation.

When DeepSeek found, within minutes, a fix for a bug I couldn't find.

When DeepSeek again produced an entire web app that somewhat looked alright.

When Gemini could finally produce JSON as I specified.

The issue is, all LLMs can do, when they do, is boilerplate and code a mediocre coder could produce if they cared to try and insist.

In a way we should praise the ability of these things, but at what (in)efficiency? Code still needs to be reviewed, as we can't trust these things, and context is too limited to entertain the idea of having them fix their own mess.

The Midjourney public Discord channel.

The amount of masterpiece-level art flowing per hour was astounding.

For every one doing a ninja waifu, there were ten doing art from da Vinci and Leonardo crossed with Hockney.

It almost gave you art sickness.

Maybe my daily work is rather mundane compared to that of most people who frequent HN, but I am able to create, think about, refine, and then go through review cycles at least 2 or 3 times more quickly than I used to.

And software that I can imagine I might want to "make" or have at my fingertips is readily available even though I have a busy schedule with very little free time!

Also, I love feeling like a manager whose direct report actually does what I tell it to. Crazy good feeling.

When I tried, just for fun, to put together an MVP of a fully autonomous business. I wanted to see how far it would go. When I got it generally working to around a 30% level, I stopped, because that was enough to see that people would make a concerted effort to build this for real. HN was not impressed, heh: https://news.ycombinator.com/item?id=44143928
Mine was testing out the Copilot preview in the early days. I was testing how well it knew semi-obscure public codebases. I started filling out the first few lines and got the entire document word for word in tab complete.

That was the day I realised the plagiarism potential LLMs have.

I told the bot I liked Steely Dan, Eagles, Bob Seger, and Roxette and asked it for music recommendations. It replied with Toto. Exasperated, I wrote "Oh, shit, you stupid bot, you don't know ANYTHING about music!"
How could you not instantly see ChatGPT was absolutely revolutionary the first time you tried it? I was absolutely blown away, and I sort of still am.
That it could create mugshots of me better than I could have managed to take.

Aka a handsome, confident, successful, affluent alpha male on a boat, yet looking perfectly like me.

January 2026, when I started using Opus 4.5 and understood that it could do actual useful work beyond coding small snippets.
A couple of years ago now.

I asked it to write a script that would search for a specific string in the footers of a massive series of DOCX files and change them according to some rules. The strings ended up being embedded in cells within an invisible table in the footers; the LLM realized this and switched strategy to a full deep traversal of the underlying XML. It correctly processed like 50 of these files in about 10 minutes, using libraries I wasn't aware of. I had spent an hour being annoyed before trying.

It was an "oh shit" moment for at least that category of work.
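
For anyone curious what that kind of deep traversal might look like, here is a rough sketch using only the Python standard library. It is not the script the LLM wrote. The function names, the simple string-for-string rule, and the `word/footerN.xml` naming pattern are all assumptions made for illustration. It walks every text node in each footer part, so text inside table cells is reached the same way as text in ordinary paragraphs.

```python
import io
import re
import zipfile
import xml.etree.ElementTree as ET
from typing import Tuple

# WordprocessingML namespace used by the text-run elements in a DOCX.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
ET.register_namespace("w", W_NS)

# Assumption: footer parts follow the conventional word/footerN.xml naming.
FOOTER_PART = re.compile(r"^word/footer\d*\.xml$")


def replace_in_footer_xml(xml_bytes: bytes, old: str, new: str) -> Tuple[bytes, int]:
    """Replace `old` with `new` in every <w:t> node, however deeply nested."""
    root = ET.fromstring(xml_bytes)
    hits = 0
    # iter() walks the whole tree, so text inside table cells is included.
    for node in root.iter(f"{{{W_NS}}}t"):
        if node.text and old in node.text:
            hits += node.text.count(old)
            node.text = node.text.replace(old, new)
    return ET.tostring(root, encoding="utf-8", xml_declaration=True), hits


def rewrite_docx_footers(src: bytes, old: str, new: str) -> Tuple[bytes, int]:
    """Return a copy of the DOCX bytes with footer text replaced, plus a hit count."""
    out = io.BytesIO()
    total = 0
    with zipfile.ZipFile(io.BytesIO(src)) as zin, \
            zipfile.ZipFile(out, "w", zipfile.ZIP_DEFLATED) as zout:
        for item in zin.infolist():
            data = zin.read(item.filename)
            if FOOTER_PART.match(item.filename):
                data, hits = replace_in_footer_xml(data, old, new)
                total += hits
            # Non-footer parts are copied through unchanged.
            zout.writestr(item, data)
    return out.getvalue(), total
```

To use it, read a file's bytes, call `rewrite_docx_footers(data, "old text", "new text")`, and write the returned bytes to a new file. Two caveats: a string that Word has split across several runs will not match, and `ElementTree` may rename namespace prefixes other than `w`, which real Word files can be sensitive to, so a real script would probably lean on `lxml` or `python-docx` instead.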

Haven't had one yet. Apparently all I have is "crap, here we go again" whenever Claude gives me a solution to the problem I am presenting to it. Because I understand where it's going and it's full of errors, but those are errors I can avoid. Together we cobble something together in the end, and I do learn something new as well, but it was never "here is my prompt, then Claude delivered the final solution next", like so many commenters here say they've had.

Frankly, to an outsider whatever it presents looks legit, but as an expert I recognize its failures, which makes me even more entrenched in the idea of never using it outside my area of expertise.

I have a question for all you believers: in a hypothetical scenario where you, having no medical experience, find yourself and your child on a mountain, 12 hours away from the nearest road, and your offspring has appendicitis (let's assume you recognize this 100%), with a sharp knife and Claude at your disposal, would you risk operating on your child? Or hurry the fuck down to get him to a hospital? I know I would choose to get him to a hospital, because that would give my kid a better chance to live than me operating on him with Claude's assistance. I am pretty sure I would kill my kid on that mountain. So yeah, outside my area of expertise I don't trust Claude one bit.

My first time using Grok. I'd been so used to using AI models that declined to do things I told them to, like tagging people in a video feed, helping me "optimize" my taxes, or managing my Twitter bot farm.

Grok just did these things for me, no questions asked, no ethical judgments. No woke.

Elon really doesn't get enough credit for Grok. People don't want the most powerful reasoning model or "constitutional AI". They just want a model that does what they say. Elon understood that insight (like he usually does) and no one else really did and that's probably why Grok has been growing rapidly over the last two years or so.