Story Detail of id 48482379 | Liveview Hacker News

jamwise22 hours ago | on: πFS

Reminds me of when I tried to use the library of babel as a data compression tool. It led me down a fun rabbit hole and was my first introduction to information theory.

The conclusion being that you basically need the same amount of data to represent the address of your data as the data itself, so it's not really effective at compression, just a fun thought experiment.

The cool part of this in modern times is that LLMs are basically a form of lossy compression that actually achieves the gist of what these tools fail at. Although it is lossy, and requires a massive substrate. This is related to the idea of AI/LLMs being a form of language compression.

ithkuil19 hours ago | parent | next

You'll find this an interesting watch:

Reinventing Entropy Compression is Intelligence Part 1

3blue1brown https://youtu.be/l6DKRf-fAAM?is=ne73FCJ7ErXhzZ-v

nz7 hours ago | root | parent | next

You, and the HN users, `lojban`, `klingon`, `ido`, `brithenig`, `solresol`, `babm`, and `tokipona`, may want to start a club. Amusingly, nobody seems to have registered the `esperanto`, `volapuk`, `interslavic`, `balaibalan`, and `dothraki` usernames.

loading story #48490419

loading story #48491509

loading story #48490130

sam_lowry_13 hours ago | root | parent

Also this article by Ted Chiang as a literary explanation of the connection between intelligence and compression: https://www.newyorker.com/tech/annals-of-technology/chatgpt-...

ainch20 hours ago | parent | next

In some sense, science is the most extreme form of compression - Newtonian mechanics explains an incredible number of phenomena in a few lines of text.

everforward2 hours ago | root | parent | next

It does, but only vaguely unless you already know how it works and can work backwards to Newton's laws. Eg Newtonian mechanics can explain how flying works, but if you don't already know then it's hard to go from Newton's 3 laws to a functional explanation of why planes don't fall out of the sky.

Some of that is also the domain. It's less that science is an extreme form of compression, and more that natural phenomenon are highly compressible. They're a small number of kinds of interactions repeated a bajillion times. How many equations does it take to explain electricity (ignoring equations that are derivatives of ones already included)? I think it's less than 5.

On some level, you could probably reduce all of the Standard Model down to models of atoms, their motion, and the basic subatomic particles (the non-quantum ones). That would explain almost everything that happens on Earth in a very short form, though few people would be able to go from that to explaining how lightning works.

loading story #48494632

mdp202112 hours ago | root | parent

Yes. But /lossful/ compression: (scientific, philosophical etc.) laws compress an abstract narration of events into that tiny, hard, fundamental, predictive detail.

(Then it depends on your concern: "Aagh, the aunt fell!" // "Oh yes, that'd be Newton")

esquivalience11 hours ago | root | parent

> "Aagh, the aunt fell!" // "Oh yes, that'd be Newton"

This is totally lost on me.

user_783210 hours ago | root | parent | next

> This is totally lost on me.

Appears to be lossy then ;)

(Sorry, you have to admit that was too easy to not say)

mdp202110 hours ago | root | parent

Compression minimizes the representation of information.

Laws (scientific, philosophical etc.) as compression represent the common side of classes of events - an abstraction of said events, stripping the irrelevant - irrelevant to some perspective, or irrelevant in a potential Procuste's bed. So, laws are compression, but a so extremely lossful compression that the loss can be relevant.

Brutally, "there may be more to the story of the fall of an elderly than just gravitation" - also in the sense that there are details behind the event.

Laws are compression - yes, with caveats.

On a more scientific, epistemological side: Einstein extended Newton covering more exceptions (reducing the abstraction - reducing the loss).

loading story #48489202

quirino20 hours ago | parent | next

3Blue1Brown just released a viduo about this Intelligence-Compression connection.

https://youtu.be/l6DKRf-fAAM

jamwise20 hours ago | root | parent

The idea was fresh in my mind because I watched this yesterday. Great video, the illustrations and intuition-building of the compressability of information was so good! I'm so grateful for 3Blue1Brown.

janalsncm19 hours ago | parent | next

The level of compression is pretty impressive when you think about it. I wrote a comment a while back which is still true (although bytes should be bits, so in that sense it’s still wrong): https://news.ycombinator.com/item?id=39559969

Back of the envelope calculation for storing valid 4-grams (sequences of four words) is around 10 billion x 14 bits per word = 17 gb for all 10 billion. There are LLMs 100x smaller which can write coherent prose.

frotaur9 hours ago | root | parent

If you combine the LLM probability distribution with arithmetic coding you can actually use them to compress text losslessly. When people reports 'bits per byte', it is actually the compression rate for text.

GPT-2 for instance achieves roughly 1 bit per byte, so it can be used to compress (english) text 8-fold. Modern models are likely much better.

loading story #48488976

loading story #48489143

divbzero13 hours ago | parent | next

> you basically need the same amount of data to represent the address of your data as the data itself

Almost like the other Borges work where “the Cartographers Guilds struck a Map of the Empire whose size was that of the Empire”.

21 hours ago | parent | next

{"deleted":true,"id":48483471,"parent":48482379,"time":1781129685,"type":"comment"}

loading story #48489209

#visit	13,745,396
#session	74,665
#live-session	0