Story Detail of id 47349824 | Liveview Hacker News

andy12_1 day ago | on: Executing programs inside transformers with exponentially faster inference

This seems a really interesting path for interpretability, specially if a big chunk of a model's behavior occurs pseudo-symbolically. This is an idea I had thought about, integrating tools into the main computation path of a model, but I never imagined that it could be done efficiently with just a vanilla transformer.

Truly, attention is all you need (I guess).

#visit	13,092,386
#session	74,665
#live-session	0