Fine-tuning an LLM to write docs like it's 1995

159 points | taubek | 13 hours ago | 56 comments | HN

The trick about documentation is depth, not prose. You need context and understanding to write documentation "like in the old days". No amount of LLM trickery will free you from that. Once you have that source material, it's easy to reshape it into an '80s/'90s/'00s doc format.

Negative example: I was looking into the German manual of my Canon EOS R5 II, and it is just fluff. Hundreds of pages, full of white space, telling me about features without actually explaining what they mean. Awful automatic translations. Their manuals used to be good (looking at my EOS 6D). But these days: oh boy.

vintagedave | 11 hours ago

I love old-school docs, and this was a fantastic read. But I couldn't see the three generated doc pages linked anywhere. Did I miss something?

I'd really like to see the Win2K-style docs on REST, for example.

Edit: it was right there, in bold, too. https://gist.github.com/theletterf/0b8ee1112fbd087f3141d0cad...

mock-possum | 11 hours ago

> we’re not there yet, in part because of how much more powerful connected frontier models are

Is that why though? You need a beast of a machine to run a functional local model in my experience.

I think the big part is there’s significant sticker shock to buying capable hardware.

That said,

> weekend. I chose to try fine-tuning on two models, Llama 3.1 8B Instruct and Qwen 2.5 7B Instruct. At their size (around 8B) they run comfortably on a MacBook Air

Perhaps I spoke too soon?

Anyway

> I chose the Microsoft collection as the source of training materials. The collection contains out-of-print docs published between 1977 and 2005: more than 37 million words, covering old systems and SDKs

This strikes me as a very specific brand of "1995" prose, given that the corpus spans about 30 years. It's a cool article though, so maybe that's a forgivably clickbaity title.

spacebacon | 10 hours ago

Now do it without the fine tuning.

https://github.com/space-bacon/SRT

The HF zool4nd3r demo may be useful

anentropic | 9 hours ago

Tip: neither the "30 second TL;DR" nor the intro paragraph above it really explains what it does to anyone unfamiliar with your (possibly novel?) jargon.

holoduke | 10 hours ago

Who is reading docs these days? If there is one thing an LLM is good at, it is reading docs. I never read docs anymore, and I am so happy about it.
