Hacker News
If your definition of "competitive" is loose enough, you can write your own Markov chain in an evening. Transformer models rely on a lot of prior art that has to be learned incrementally.
Not that loose lol.
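For reference, the "write your own Markov chain in an evening" claim is plausible: a word-level chain is just a table mapping each word (or word tuple) to the words observed after it, sampled repeatedly. A minimal sketch (corpus and names here are illustrative, not from the thread):

```python
import random
from collections import defaultdict

def build_chain(text, order=1):
    """Map each tuple of `order` consecutive words to the words seen after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        chain[tuple(words[i:i + order])].append(words[i + order])
    return chain

def generate(chain, length=10, seed=0):
    """Walk the chain: start at a random key, repeatedly sample a successor."""
    rng = random.Random(seed)
    out = list(rng.choice(list(chain)))
    key_len = len(next(iter(chain)))
    while len(out) < length:
        successors = chain.get(tuple(out[-key_len:]))
        if not successors:  # dead end: no observed continuation
            break
        out.append(rng.choice(successors))
    return " ".join(out)

corpus = "the cat sat on the mat and the cat ran off the mat"
chain = build_chain(corpus, order=1)
print(generate(chain, length=10))
```

That's the whole trick: no gradients, no attention, just counting and sampling — which is why the bar for "competitive" matters so much in this comparison.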

I’m thinking it’s still Llama / a dense decoder-only transformer.