Hacker News new | past | comments | ask | show | jobs | submit
For wayback machine, are those compressed, deduplicated numbers? A semi-popular domain can have millions of results on their CDX api, but with https/https duplicated and about 90% of results are error pages or pages with deliberate garbage / LFI attempts in them.
loading story #41452786