Hacker News new | past | comments | ask | show | jobs | submit
Love it! That's awesome. Thank you for replying. :-)

I've occasionally wanted to put the subtitles from all of my Simpson episodes into an easily searchable format. What do you use to extract subtitles?

I should note it sounds like i am a pirate, but when i rip a DVD i use handbrake to "Burn In" the subtitles straight to the video - since i originally cared about this because of my hearing. This "hey having all these as .vtt/.srt/.ass means i can search my recollection of this media!" came much (much, 20 years) later.

I burn in the subtitles because "streaming media players" nearly universally are awful at handing anything except 100% perfect subtitles - and that's if they bother handling them at all, over the last 20 years. Also my best friend was dating a deaf person, so the impetus for burning in for streaming was because we'd watch movies together in my living room on a rear projection TV via wifi streaming from a WHS "plex-like" server in my room. The device was a western digital something TV.

Yeah I've been ripping movies and TV shows off of discs since the early Netflix days. I'd get 3 DVDs in the mail, rip them with HandBrake and then send them back the next day. Rinse and repeat.

I also own the first 20 seasons of the Simpsons on DVD. I'd own more but they stopped making them. Despite starting it decades ago, they took so long that the DVD (and disk format in general) has become obsolescent for the most part. But I've ripped all 20 seasons on to my network, including the commentary tracks, which I absolutely love.

I don't burn subtitles in though. I've never had a problem using them with vlc (since the early aughts) and then mpv (as of maybe 10 years ago). I don't use anything fancy though for viewing. I just have a handful of Intel NUCs running Archlinux hooked up to each TV in the house. I skipped all the Plex bullshit.

Oh i've never extracted, i use openai-whisper for long content and whisper-diarization for shorter content (<8 minutes or so). As i suspected, ffmpeg claims to handle it: https://trac.ffmpeg.org/wiki/ExtractSubtitles with a note that probably should be used with `-c copy` to ensure a 1:1 copy of the subtitles.

also when i get stuff from a website with yt-dlp for archival i use

```pwsh

$userInput = Read-Host -Prompt '480 video download script enter URL'

Write-Output "URL:`t`t$userInput"

yt-dlp.exe `

-f 'bestvideo[height<=480]+bestaudio/best[height<=480]' `

--write-auto-subs --write-subs `

--fragment-retries infinite `

$userInput

```

Interesting, thanks!