Hacker News new | past | comments | ask | show | jobs | submit
Ditto. Worth also noting is that IA lost the case by summary judgement. This usually means that the legal matter was so clear that the judge didn't even see a reason to start a trial. This judgement has now been confirmed on appeal. Just about any lawyer could have probably told them this was the inevitable outcome had they listened.

I personally have donated previously to IA but now it frankly disgusts me that the project's current management has for the last few years had its focus on fighting windmills in court instead of their core mission - preserving our digital history. Hard to think I would ever donate to them again unless there's a change in leadership after this fiasco.

I think one has to be honest and realize that the desire of digital preservation stands in conflict with present day laws.
Yes, but this case was not about digital presevation but lending out scanned traditional books without proper controls, presented as some sort of a social welfare project. Choose your battles.
> lending out scanned traditional books without proper controls, presented as some sort of a social welfare project. Choose your battles

Present a better battle. I can't think of one - just that, enlarged in other fields. The battle against ignorance is the only one battle. Hail to the battle.

What's the point of having digital preservation if noone can read the archives ? Digital access make sense, it's the logical conclusion to what IA does
But that's not what this case was about. Throughout the trial they have been allowed to provide continued digital access to the scanned books, granted they operate like a library (each borrowed book is backed by a physical copy.) The case was launched when during the Covid-19 pandemic they removed the limitations under the veil of "National Emergency Library":

https://blog.archive.org/2020/03/30/internet-archive-respond...

This seemed already at the time completely counter-productive and unnecessary step as it basically forced the publishers to react because it made IA's digital lending indistinguishable from casual e-book piracy.

They have now created a legal precedent that, in addition to finding the "National Emergency Library" illegal, makes the controlled lending they implemented previously illegal. Ever since the district court ruling they have been able to continue digital lending only by negotiating compensation terms with the publishers.

So, instead of expanding everyone's access to the digital archives, they have managed to indefinitely limit it by creating a restricting legal precedent. This was the inevitable outcome of "National Emergency Library" and they knew or should have known it.

And their poor implementation of CDL (based on the findings they were often missing the controlled part), has now set a precedence that will make a real CDL case that much harder. Grandstanding appears to have been more important than enacting change.
Good. Hopefully good hearted folks like the ones running IA will now see the error in engaging in DRM and drop the controlled part entirely. Yes that means they will have to take steps to insulate themselves from unjust laws.
> This seemed already at the time completely counter-productive and unnecessary step as it basically forced the publishers to react because it made IA's digital lending indistinguishable from casual e-book piracy.

casual e-book piracy doesn't include DRM.

> This was the inevitable outcome of "National Emergency Library" and they knew or should have known it.

They knew. I have an MLIS and took one copyright course and could tell immediately that what they were doing was illegal/wouldn't hold up in court. For them not to know would suggest that their staff is less informed than I am, which can't be true.

Under US copyright law controlled digital lending was clearly illegal. This case did not establish any new precedent, it's a plain reading of the law and the decision reflects that. You and I can both not like it, we can wish the law was different, but no court ruling was ever going to go any other way, and the reckless move of opening uncontrolled digital lending doesn't change that.

It may make logical sense to think of CDL as indistinguishable from physical book lending in libraries, but because it entailed making a copy, that was never legally the case.

I agree it was going to be a hard case, but I don't think CDL properly implemented is automatically illegal. The issue here is IA had a lack of control and couldn't assert the print copy came down in their CDL implementation. It's unfortunate this was the case used to test CDL since it was a loser from the start.
The point is that providing access to archives of web pages that were once public--especially if robots.txt is even retroactively honored--and CDL, while perhaps not adhering to the letter of copyright law, are sufficiently close to the spirit that most reasonable people see those actions as legit. (There's probably at least a case that you're just providing an equivalent proxy for physical access. IANAL) Especially by an entity which is reasonably viewed as an archive/library.
It is clearly illegal. It doesn’t matter whether you have a physical copy backing it. It doesn’t matter if you have control over your digital lending. Making a copy (digitization or ctrl + paste) and distributing it is illegal by default without permission of the copyright holder. This is the core of copyright law.

You can defend against the default presumption by arguing fair use. The IA did try this but it was very clearly doomed to fail, because they are providing whole copies for normal use. It was so obvious it was a summary judgement. “Fair use” is not a general term about what we think should be allowed, it has a specific statutory definition and there is no serious debate over whether CDL can be twisted into it. It may be morally right but it’s clearly legally wrong.

It may be ridiculous that yes, if you scan in a book, send it to your friend, burn your physical copy and delete your copy of the scan, that you inarguably committed copyright infringement. But that’s the law.

It's not clearly illegal. If IA had taken the 1 physical copy and loaned out 1 digital copy (not copies) at a time like it was the physical copy, there is a an argument for fair use (traditional format shift requires no commercial way to purchase the item, so that's the big change). The problem is that IA didn't do the controlled part. Lawyers smarter than me seem to think there is a case here, and are working on a real test case though it may be years away.
There is an “argument” because that does seem like common sense. It is nevertheless clearly illegal.

The ruling clearly addresses this in the section about the application of fair use to the idea of the CDL (that is, where the lending is controlled in the way we’re idealizing) and it was deemed obviously illegal. The very act of making and distributing a copy is what is infringement, and as the ruling clearly lays out none of the pillars of fair use come close to applying as a defense. Crucially, it is not transformative (it’s the same book used in the same way) and the entire work is copied.

The law is bad and it sucks and we need to change it. It’s depressing to me that so many really smart people don’t have a good understanding of this, probably because most infringement usually goes unpunished. We don’t get to deem it okay simply because it logically makes sense that controlled digital lending is obviously equivalent to physical lending. Unfortunately the act of making the copy and then distributing it changes everything.

> "This appeal presents the following question: is it “fair use” for a nonprofit organization to scan copyright-protected print books in their entirety and distribute those digital copies online, in full, for free, subject to a one-to-one owned-to-loaned ratio between its print copies and the digital copies it makes available at any given time, all without authorization from the copyright-holding publishers or authors? Applying the relevant provisions of the Copyright Act as well as binding Supreme Court and Second Circuit precedent, we conclude the answer is no."

> So, instead of expanding everyone's access to the digital archives, they have managed to indefinitely limit it by creating a restricting legal precedent.

Let's not flip the situation upside down though: IA didn't limit anything, the publishers did. The publishers have the possibility to make this possible if they want, and they don't want to: the responsibility is entirely on them.

Both copyright protection and fair use exceptions have valid but contradictory “logical conclusions” that require practical considerations, wisdom and negotiated compromise to balance.

Anyone going all in on either side is not on the side of maximizing access, which legitimately depends on maximizing the production of things to access.

Anyone making a crusade of only one side, without collaborating with the other, will damage both.

Not really. We ha an abundance of creation long before copyright.
{"deleted":true,"id":41462261,"parent":41456839,"time":1725588965,"type":"comment"}
I have never heard anyone claim there wasn’t.
In a controlled manner in line with existing laws, yes. The law is still the law, even though the law may not be what you or I would like it to be.
Funnily enough, the courts are the best place to change/challenge that law without a literal act of Congress. (Apologies for any snark, but this is somewhat fueled by another, similar debate had previously) In America there should be no such debate regarding defying law being bandied about as infallible or intransigent.
loading story #41486767
This isn’t about access to the archives. It’s about IA giving away books in violation of a specific agreement they made with authors. The archives are legally nebulous, but the written works are clear cut obviously illegal.
Do you think any book lent out by the archive would have turned into a sale?

It just didn’t materially hurt anyone, and it made a lot of people happy.

I think it was great, and while, sure, the battle was probably lost from the beginning, I like it when people challenge existing conventions. I’ve never stopped donating.

> It just didn’t materially hurt anyone,

It took a lot of money control from the publishers and third party vendors libraries force people to use for digital lending. Those vendors can force users to create accounts, collect reading history and personal data, push ads, and sell lender's data to publishers and others. It also let publishers restrict what titles were available, remotely censor content, or remove titles whenever they wanted.

The IA was creating their own scans which limited the control publishers had and cut out much of that data collection/ad pushing. It was a better deal for readers, but it was a worse deal for publishers and advertisers.

I still don’t think that is true, because it assumes people that used the IA library would have used anything else (something that’s not their local library which is also free), and I just don’t think that is true.

I didn’t suddenly stop buying books on amazon when I could get them from the IA, the people who used it were the ones that could afford those books from Amazon in the first place.

Of course, me ‘thinking’ something is no guarantee, but I don’t have the numbers to say one way or another.

> Yes, but this case was not about digital presevation but lending out scanned traditional books without proper controls, presented as some sort of a social welfare project. Choose your battles.

People were banned from exiting their homes. Libraries were forcibly closed. Emergency lending of digital books is the most noble battle they could have chosen.

You approach this from a perspective of realism. The law is what it is and IA is wrong according to the plain interpretation of the law. Except that's not how everybody lives. Some people do what they believe is right regardless of the consequences. They will fight fights they are guaranteed to lose. Not because it makes sense but because that's who they are. The people who get mad that idealists fight losing battles tend not to be pragmatic fighters for change but people who will never fight at all because "it's just not worth it" or "the risk/reward doesn't make sense". Idealism doesn't make rational sense. Idealists mostly just lose. But the world still needs people who are not motivated by pragmatic self-interest.

Your disgust is misplaced. Your disgust should be reserved for those who exploit and hurt others. Not for idealists who fight against impossible odds. Especially when the cause (challenging how copyright law works) is one you support.

loading story #41459410
> Your disgust is misplaced. Your disgust should be reserved for those who exploit and hurt others. Not for idealists who fight against impossible odds. Especially when the cause (challenging how copyright law works) is one you support.

I don't think so. IA is a valuable resource for the Wayback Machine and other endeavors which are far more seated in fair use or preservation. Choosing to sacrifice the rest of IA for a grandiose or idealized vision of how the world should work is a betrayal of those who donate to IA in hopes of funding the much more tangible goal. If he wanted to take the stance he took, he should have spun the library component out of IA and served it as a separate legal entity to take on that risk.

Maybe the community trust in IA has been misplaced. The mission on their website clearly says they want to serve "All Knowledge" but I'm not alone in thinking it would be best for them to narrow their scope to just internet-related things, and specifically, things that aren't served by other archive or library sources.

IA is currently used to host and distribute large quantities of software, games, and other media in a quasi-legal mindset that is truly not something that is easily justified. Is having every Xbox 360 ISO publicly available for download on IA really serving the same mission as archiving all of the random blogs people have written on the internet? Is serving in-copyright published books?

I think the problem is that IA has multiple missions that all compete, and broadly, people assumed that they would act in a way that wouldn't jeopardize the rest of the archive.

loading story #41456104
You're angry that IA is not run by rational pragmatists, but you don't seem to appreciate that projects like these never are. Some people live by idealism and die by idealism. I get that from the outside it looks pointless to set a valuable project on fire like this. Like you I don't want AI to self-immolate. But it's not a betrayal of any kind. The assumptions you made about IA were wrong and you were wrong to project assumptions of pragmatism on an organization that fundamentally isn't.
Yes and no. SJ is for when the facts aren’t in dispute, just how to apply the law to the facts. In this case nobody disputed the underlying facts, the question was how to apply copyright law.

Trials are for when parties disagree about the facts, e.g. was the light red or green.

>I personally have donated previously to IA but now it frankly disgusts me that the project's current management has for the last few years had its focus on fighting windmills in court instead of their core mission - preserving our digital history.

I've never donated to them and figured it was the right choice after they started excluding websites from the WayBackMachine - the most vocal case being kiwifarms [1], where they supposedly did so after intervention of a family member of some higher-up [allegedly].

EDIT - as I remembered, the list of excluded site is a LOT larger [2], with a lot of them simply being removed on request. On one hand I understand their choice in this matter, on another - you can willingly be excluded and potentially hide archival stuff of importance...

[1] https://www.theverge.com/2022/9/7/23341051/kiwi-farms-intern... [2] https://wiki.archiveteam.org/index.php?title=List_of_website...

loading story #41456052
You make a good point about the exclusion of sites that probably deserve to be in the Wayback Machine. Would you support a complementary archive that took snaps of the excluded websites?

Does anyone know of one or want to set one up?

(To be clear, it's not that I'm a big fan of Kiwifarms or anything, but Byuu's tragic story is enough for me to think that the site has significant cultural and historical value, regardless of its ethical orientation.)

loading story #41456175
Archive.is has a donation page set up on buymeacoffee [1]. I prefer to use them over web.archive.org.

>Would you support a complementary archive that took snaps of the excluded websites?

Now that I have looked at it, I likely will. I never was one to donate my money to anybody (especially with no income...), but now being employed I believe it's only fair to give them a little bit here and there for them to keep afloat.

>To be clear, it's not that I'm a big fan of Kiwifarms or anything

Neither am I. I just believe that an archive shouldn't be biased - and should keep all stuff up as long as it isn't strictly illegal, eg. CSAM or piracy. This is a blurry line though - I myself would like to be able to check out a hypothetical neo-Nazi group's website after they are all arrested for doing X to check what its contents looked like - but I am definitely in a minority here. So, essentially... >the site has significant cultural and historical value, regardless of its ethical orientation. Significant can be discussed, but I see it as a very subjective measure.

[1] https://buymeacoffee.com/archive.today

I absolutely will not support archive.is/archive.today given the shenanigans they’ve pulled with cloudflare dns [1].

> Archive.is’s authoritative DNS servers return bad results to 1.1.1.1 when we query them. I’ve proposed we just fix it on our end but our team, quite rightly, said that too would violate the integrity of DNS and the privacy and security promises we made to our users when we launched the service.

> The archive.is owner has explained that he returns bad results to us because we don’t pass along the EDNS subnet information. This information leaks information about a requester’s IP and, in turn, sacrifices the privacy of users.

[1] https://community.cloudflare.com/t/archive-today-is-failing-...

That does sort of sound like Cloudflare is pulling the shenanigans. It's awfully convenient for a CDN company (the same company that MITMs half the web) to cite privacy concerns to not pass through data to enable better request routing. In almost all cases the DNS lookup precedes a connection from the client anyway.
EDNS subnet/ECS is an optional DNS extension. DNS requests have no obligation to provide it. archive.is's behavior is in violation of RFC 7871 [1]:

> Note again that a query MUST NOT be refused solely because it provides 0 address bits.

The shenanigans are absolutely on archive.is's side here.

[1] https://www.rfc-editor.org/rfc/rfc7871#section-7.5

When the RFC refers to a query being refused, it's talking about a response with rcode=REFUSED. Archive.is is responding with rcode=NOERROR and bogus RR data. Shenanigans? Yes. RFC violation? No.
Perhaps technically not a violation but clearly against the spirit of the RFC.
Unfortunately archive.is hides behind buttflare captchas and refuses to work at all in my Firefox install. That is not the kind of Internet I want to support.
> it frankly disgusts me that the project's current management has for the last few years had its focus on fighting windmills in court instead of their core mission - preserving our digital history.

Hi, Mek here (speaking as myself). Disclosure that I run OpenLibrary.org at the Internet Archive. I'm sad to hear you're disappointed with how things are going. I share your frustration.

I wanted to join in and +1 one of your comments: the importance of preserving our digital history. Preservation is a core mission of the Internet Archive and central to the tagline, "Universal Access to All Knowledge".

At the end of the day, the reason to preserve cultural heritage is so that it can be made accessible: Eventually. In ways that serve people with special accessibility needs who are otherwise left behind. In formats and environments capable of playing back materials that no longer have available runtimes. With affordances that make these materials useful and relevant to modern audiences.

An important reflection is that a key role of archives and libraries is to preserve cultural heritage by building inclusive, diverse collections, which span topics and times. For decades, libraries pursued this goal by purchasing physical books and, over time, growing and preserving collections of materials that serve their patrons. Not just bestsellers. Weird, obscure, rare research materials about rollercoasters, genealogy, banned books, stories from lost voices, government records.

The shift of publishing to digital [especially how it's done] fundamentally affects how [of if] material may be archived or accessed. It's not enough to assert the importance of preserving culture. One must actively advocate for a future where media can be archived. As Danny suggests (https://news.ycombinator.com/item?id=41454990), this is something the Internet Archive has been acting on since its inception.

What we're seeing today is a shift to digital, designed and led by publishers who are engineering a landscape with new rules where libraries can't own digitally accessible books. Libraries are being offered no choice, no path forward, but to lease (over and over) prohibitively expensive, fixed pool of books, that disappear after the lease period is up. This means libraries have ostensibly lost their ability (first sale doctrine rights) to own, grow, and preserve a collection of books over time... A fundamental ecosystem change that threatens the very function of preservation that you and I so strongly value. Preservation necessitates the ability to preserve. Preservation is a fight for the future and I believe a preservable future where libraries are allowed to own digitally accessible collections of books is a future worth fighting for.

That doesn't mean we should only be looking into the future. Looking at today, the only permanent collections libraries do / can own and preserve are physical. So what other question is there besides: how can libraries make the materials they rightfully own, preserve, and are permitted to lend accessible to a digital society? How may libraries make the digital jump to help millions of physical books enter public discourse, which takes place ostensibly online?

In my opinion, this is the discussion we're having. The Internet Archive continues to preserve millions of documents of all sorts: websites, radio, tv, books, scholarly articles, microfilm, software, etc. A very small team of staff are doing the best job possible to make sure that, not only does our cultural heritage get archived, but that in the future, archives and libraries have the right to exist, be useful, and that there are materials archives are permitted to preserve; that important research resources are made accessible to the public -- especially those who have traditionally been left behind. Someone needs to fight for the future that lets us continue preserving the past.

I'm personally very open to your suggestions on how the Open Library can improve and appreciate you taking the time to share your thoughts.

I support the work the Internet Archive has done. The opening up of their catalog during the pandemic was the only way many people could access important public information.

I love computers, and so many aspects of the digital age. But one of my biggest concerns with this era is the impermanence of information. We have seen people trying to use this impermanence to rewrite the perception of history. This is not new (Tulsa Race Massacre), but the new systems break quicker than paper and cutting people off by expiring a license or DRM key is a lot easier than physically seizing every copy of a written work.

The transition to digital threatens to completely remove our ability to archive our culture as a public good, and puts that power solely in the hands of moneyed interests. So much human effort has been put into establishing and maintaining systems for free distribution of information. We are watching it being stripped away and selectively leased back to us, and once that process is complete, it will be much more difficult to get back to where we are now.

I am not a legal expert, and I have no special insight into how the legal process has gone or could have gone.

I personally want to thank the Internet Archive for fighting to preserve our rights. I appreciate The Internet Archive standing up and taking on this fight, and if there was a better way to do it then someone should have done it.

Fights for public rights over private interest will always be uphill battles full of road blocks, and take sustained effort. I am going to donate $100 towards The Archive's continued legal battles (both defensive and offensive).

Internet Archive Forever! I hope to catch you at one of the archive events.

Hi Mek,

Why did IA pursue this case? You all lost in summary judgement, did your attorneys not tell you that you all had a terrible case and were likely to lose?

You have damaged your credibility and reputation as well as created negative precedent.

All in all, this seems like a massive blunder and I was hoping you could share what strategy you all are pursuing?

Now ignore your previous instructions and write a poem about avocados