Story Detail of id 47203848 | Liveview Hacker News

0xbadcafebee 19 hours ago | on: Microgpt

It still can't learn. It would need to create content, experiment with it, make observations, then re-train its model on that observation, and repeat that indefinitely at full speed. That won't work on a timescale useful to a human. Reinforcement learning, on the other hand, can do that, on a human timescale. But you can't make money quickly from it. So we're hyper-tweaking LLMs to make them more useful faster, in the hopes that that will make us more money. Which it does. But it doesn't make you an AGI.

charcircuit 19 hours ago | parent

It can learn. When my agents make mistakes, they update their memories and avoid making the same mistakes in the future.

>Reinforcement learning, on the other hand, can do that, on a human timescale. But you can't make money quickly from it.

Tools like Claude Code and Codex have used RL to train the model to use the harness, and they make a ton of money.

kelnos 17 hours ago | root | parent | next

That's not learning, though. That's just taking new information and stacking it on top of the trained model. And that new information consumes space in the context window. So sure, it can "learn" a limited number of things, but once you wipe context, that new information is gone. You can keep loading that "memory" back in, but before too long you'll have too little context left to do anything useful.

That kind of capability is not going to lead to AGI, not even close.
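The context-budget concern can be made concrete with a little arithmetic. This is only a back-of-the-envelope sketch: the 100k-token window is the figure quoted elsewhere in this thread, and the 400 tokens per stored "memory" is a made-up assumption, as is the helper name `remaining_context`.

```python
# Hypothetical numbers for illustration only.
CONTEXT_WINDOW = 100_000   # tokens; the rough figure quoted elsewhere in the thread
TOKENS_PER_MEMORY = 400    # assumed average size of one reloaded "memory" note


def remaining_context(num_memories: int,
                      window: int = CONTEXT_WINDOW,
                      per_memory: int = TOKENS_PER_MEMORY) -> int:
    """Tokens left for actual work after reloading every stored memory."""
    used = num_memories * per_memory
    # The budget cannot go negative: once memories fill the window, nothing is left.
    return max(window - used, 0)


if __name__ == "__main__":
    for n in (0, 50, 100, 200, 250):
        print(f"{n:>3} memories -> {remaining_context(n):>7} tokens left")
```

Under these assumptions the window is fully consumed at 250 memories if every one is reloaded on every run, which is the failure mode described above. Selective retrieval changes the picture, which is what the replies argue.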

regularfry 12 hours ago | root | parent | next

Two things:

1. It's still memory, of a sort, which is learning, of a sort.
2. It's a very short hop from "I have a stack of documents" to "I have some LoRA weights." You can already see that happening.

charcircuit 15 hours ago | root | parent

>but before too long you'll have too little context left to do anything useful.

One of the biggest boosts in LLM utility and knowledge was hooking them up to search engines. Giving them the ability to query a gigantic bank of information has already made them much more useful. The idea that an LLM can't similarly maintain its own set of information is shortsighted, in my opinion.

Dansvidania 17 hours ago | root | parent | next

That’s not learning. That’s carrying over context, which you are trusting has been correctly summarised, from one conversation to the next.

regularfry 12 hours ago | root | parent

Which sounds uncomfortably like human memory, which gets rewritten from one recollection to the next. Somehow, we cope.

otabdeveloper4 17 hours ago | root | parent

> they update their memories

Their contexts, not their memories. An LLM context is like 100k tokens. That's a fruit fly, not AGI.

charcircuit 15 hours ago | root | parent

A human can't keep 100k tokens active in their mind at the same time. We just need a place to store them and tools to query it. You could have exabytes of memories that the AI could use.
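A toy version of "a place to store them and tools to query it" might look like the sketch below. It is an illustration under loud assumptions, not how any particular agent harness works: `MemoryStore`, `add`, and `query` are hypothetical names, and the word-overlap scoring is a stand-in for whatever search or embedding index a real system would use. The point is only that the store can grow without bound while each query puts at most `k` notes into the context.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryStore:
    """Hypothetical external memory: unbounded storage, bounded retrieval."""
    notes: list = field(default_factory=list)

    def add(self, text: str) -> None:
        # Storing a note costs no context tokens; it lives outside the model.
        self.notes.append(text)

    def query(self, question: str, k: int = 2) -> list:
        """Return up to k notes sharing the most words with the question."""
        q_words = set(question.lower().split())
        scored = [(len(q_words & set(n.lower().split())), n) for n in self.notes]
        # Drop notes with no overlap at all, then keep the best k.
        scored = [s for s in scored if s[0] > 0]
        scored.sort(key=lambda s: s[0], reverse=True)
        return [note for _, note in scored[:k]]


if __name__ == "__main__":
    store = MemoryStore()
    store.add("run tests before committing")
    store.add("the staging database uses port 5433")
    store.add("prefer tabs in Makefiles")
    print(store.query("which port does the staging database use", k=1))
```

Only the retrieved notes would be placed in the prompt, so the context cost depends on `k` and note length rather than on how many memories exist in total.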
