Is it Ethical to Use Pirated Content for Commercial Purposes? META Thinks So

Two signs hanging on a string, one labeled 'ETHICAL' in green and the other labeled 'LEGAL' in red, against a purple background.

Image: Shutterstock.com

There is the question of what is ethical, and then there is the question of what is legal. Sometimes they are the same, often not. The legality of using copyrighted content without authorization for commercial purposes, such as in training AI models—as META and a number of other companies have done—is being decided in court. In META’s case, however, there is the further complaint (not denied by META) that many of the unauthorized copies it made were taken from pirated content. While this revelation may not change the fundamentals of the copyright infringement case against it, there is still the ethical question for META to answer. On this, it comes up short. Very short.

META, the parent company of Facebook, Instagram and WhatsApp, used vast amounts of copyrighted content, without permission or licensing, to train its AI model. It is not alone in doing so. This practice may or may not be legal. A number of cases are working their way through the courts, most of them in the US, with copyright owners from Getty Images to Disney and Universal, from the New York Times to the Authors Guild and on to music labels, all claiming that their content was unfairly and illegally copied to provide training fodder for training AI models, such as META’s model, Llama. META and other AI developers claim that their use was a “fair use” under US law. We’ll see. However, as part of its giant vacuuming of publicly available (but in many cases protected) content, META also ingested content from various pirate sites and databases, notably the notorious “shadow library”, LibGen (Library Genesis). LibGen originated in Russia and contains up to 80 million scientific and academic articles, as well as millions of novels and nonfiction books, most unauthorized, unlicensed copies. It has a been sued by major academic and textbook publishers. In 2017 Elsevier won a $15 million judgement against LibGen, and another pirate website, SciHub. Last year Elsevier was awarded a $30 million default judgement. However, both LibGen and SciHub remain available online.

The extent of the copyrighted content held by LibGen was revealed in an investigative report published recently by The Atlantic. You can search through the LibGen database as published by The Atlantic to find out what works are included, and whether your work has been pirated. Authors from Newfoundland to New York and lots of places in-between and elsewhere found their works included in the database when they did the search. The Authors Guild advises writers to fight back by sending a formal notice to META and other AI companies asserting their rights, as well as adding a “No AI Training” notice on the copyright page of works. This is in addition, as would be expected, to joining the Authors Guild to help them fight what is happening.

Consuming pirated content can result in costly penalties, as some unfortunate downloaders have found out to their regret. Using it for commercial purposes is even more egregious. It’s like running a pirate streaming service based on stolen content. META didn’t use pirated content in this way, but they used it commercially just the same, in their case for AI training. Were they aware of what they were doing. You bet they were.

The discovery process in the US suit of Kadrey et. al. v META revealed a series of email exchanges in which some META employees expressed concerns over the ethics of using pirated content. The concerns went up the chain and back, with “MZ” (guess who? No, not Moses Znaimer) giving approval to proceed. Following on these revelations in Kadrey v META in the US, two class action lawsuits have been filed in Canada, one in Quebec on behalf of a number of French language authors and one in British Columbia. The Quebec suit specifically flags the piracy issue. Among the listed complaints is the following:

Rather than acting within the law and respecting the rights of class members, it (META) deliberately chose to train its LLMs (Large Language Models) from datasets containing illicit copies of works from all over the world, including those of class members.”

Damages sought are $20,000 per work.

It is clear that META wilfully torrented content from LibGen, knowing that many or most of the works on LibGen were infringing, pirated copies. They just didn’t care.

If it turns out that somehow, inexplicably, META’s unauthorized use of copyrighted content for AI training is ruled by the US courts to be fair use, would the fact that the source of some of the content was from a pirate source be relevant? I am not sure, but a judgement that has just been delivered in California in the “Anthropic” case suggests that even if unauthorized copying can be justified as fair use because it is considered “transformative”, that does not excuse piracy–which is still an infringement. In this case, Anthropic copied both purchased and pirated works to train its AI model, and kept the copies in its central library. It was sued by some authors and journalists in a class action suit alleging copyright infringement. The judge, in one of the first such cases to reach a decision point, concluded on summary judgement that Anthropic’s unauthorized reproduction of copyrighted works for AI training was fair use under the transformation doctrine but added, with respect to those works drawn from pirate sources such as LibGen and others,

“piracy of otherwise available copies is inherently, irredeemably infringing even if the pirated copies are immediately used for the transformative use and
immediately discarded”
.

A trial will be held to determine the damages from the piracy.

Being one of the first AI training cases out of the gate, Anthropic will certainly be appealed, so this is not the last word. However, this ruling when added to the US Copyright Office’s views expressed in its Pre-Publication Report on Generative AI issued last month on May 9, a day before Register Shira Perlmutter was dismissed, that “the copying of expressive works from pirate sources in order to generate unrestricted content that competes in the marketplace, when licensing is reasonably available, is unlikely to qualify as fair use”, suggests that META could be in both ethical and legal trouble.

In other jurisdictions, such as Singapore, content used in AI training under a Text and Data Mining exception has to be legally accessed, although this is very thin legal protection because technology companies can legally purchase just one copy of a work to comply. In Canada you cannot break the law (i.e. circumvent a technological protection measure) to exercise a fair dealing right. But whether or not using a pirated source puts META offside the law in the US with respect to fair use, (and the Anthropic case suggests that it could at least with respect to the pirated works), think of the ethics and the image this presents to the public.

A company like META, capitalized at something like $2 trillion, cannot be bothered to even access content legitimately, let alone use it legitimately. Why? Because MZ said it was ok to proceed. Sadly, even though they are not the only ones to use pirated content to train their AI models, that tells me all I need to know about the values and ethics of this particular company.

© Hugh Stephens, 2025. All Rights Reserved.

This post has been updated to include reference to the decision in the Anthropic case, released after the initial publication of this blog post.

As Journalism Withers, “Garbage” News Takes Over: An Unexpected Result of the Facebook/Instagram News Blackout in Canada

Image: Shutterstock (with AI assist: Note AI misspelling)

The sad, slow decline of professional journalism continues. The most recent manifestation of this in Canada is the announcement that the Saltwire Group, publishers of The Chronicle Herald of Halifax and a number of other daily and weekly papers in Atlantic Canada, is on the ropes with $90 million in debt. The Chronicle Herald has been around for 150 years! Post Media has made a buyout offer of $1 million (less than the cost of the average home in Vancouver). This will keep some of the papers publishing, no doubt with significant staff cutbacks and, reportedly, ending of union contracts and termination of company pension plans. But there seems to be little choice; accept the nasty medicine or expire. Post Media will most likely keep a minimal local newsroom and fill the papers with repurposed content from its flagship daily National Post. Local news will suffer. This is a story that is being replicated all over North America.

Meanwhile small, local online news outlets have been hit by Facebook’s blackout of news for Canadian readers as a result of the passage of the Online News Act, Bill C-18. In the case of New Brunswick’s River Valley Sun, according to the CBC, the free-to-readers, ad-supported, all-digital Sun depended on Facebook to reach its audience as well as to obtain local news that could be added to the online journal. The owner is quoted as saying, “…we started basically from scratch, and because Facebook was free it was wonderful”.

That is the flip side of the narrative accusing Facebook of using uncompensated news content, developed at great expense by professional journalists working for media companies, to attract and hold users’ interest thus keeping them on Meta’s sites (Facebook, Instagram) longer to be able to expose them to more ads. Not so wonderful. That is what C-18 was supposed to rectify, along the lines of the News Media Bargaining Code instituted earlier in Australia, requiring the big social media platforms (in this case Google and Meta) to make a contribution to news gathering if they used news content. Failure to do so would result in compulsory and binding arbitration. While Google ducked and weaved, it eventually agreed to put $100 million into a pot to be doled out to various news organizations by the Canadian Journalism Collective as long as it was given an exemption from the legislation, whereas Meta went to the wall and took down news links rather than contribute. The River Valley Sun is collateral damage, although it could possibly tap into the Google funding if it employs any fulltime journalists (a minimum of two is required).

The Online News Act did result in bringing some additional funding to news outlets, large and small, somewhat along the lines of Australia’s initiative through its News Media Bargaining Code (which was not actually invoked because the two platforms ended up striking content deals with most Australian media). However, it is clear that Meta has had buyer’s remorse about its Australian deals because they are busy unwinding those commitments in Oz. It remains to be seen how Australia will deal with this. There have been some suggestions that the Australian government may be looking at options to force Meta to carry news, and then subject them to the Code. Will Meta get a pass—in which case Google could rightly ask why it should comply—or will news also end up being blocked on Facebook and Instagram for Australian users as well as Canadian?

If that happens, consumers will have to find other ways to get access to news that is normally shared on the Facebook platform. Going to the actual websites of news organizations is one good way to do this. What a novel idea! However, many younger consumers won’t do this; if news is on a social media feed, they might read it, but they will not proactively search for it. Facebook also remains an important go-to source for breaking local news related to weather and climate-change events such as flooding and fires. Hosts of Facebook pages distributing information on the local impact of natural disasters have had to resort to workarounds such as posting photos of news articles on Meta or copy-pasting articles into their Facebook page. (Ironically, such actions could possibly constitute violations of copyright whereas posting a link is not). But workarounds are not going to help news outlets like the River Valley Sun.

Perhaps the journal should take a leaf out of the book of Just Bins, a waste recycling firm in Regina, SK, a garbage collection company that seems to have picked up news distribution as a sideline, benefiting from the free online exposure. It does not distribute mainstream media; it creates its own news content that it features on its website, even employing its own drone footage. Because it is not a news site, it is not blocked by Facebook. It has edgily reported on traffic accidents, problems at City Hall, homelessness, and even suicides. In fact, it was voted best online news source in a recent poll taken for the annual “Best of Regina” awards. Not that it takes its journalism role all that seriously—and that is part of the problem—since fact-checking, journalistic ethics and balanced reporting seem to have been put out with the recycling bins. Just Bins has taken the old adage of “garbage in-garbage out” to new heights. This is yet another byproduct of the Facebook ban, a further kick in the shins for reputable journalism. If the River Valley Sun could just sell pizza on the side, maybe they could slip through the Facebook net and post online as Just Bins is doing.

Just Bins is not the only quasi-news source that has managed to squeeze through the Meta news blockage. This week Howard Law’s media blog MediaPolicy.ca reports on the Meta news ban (The leaky Meta news ban is roiling Canadian journalism”) and highlights a study conducted by researchers at McGill University on the impact of the ban a year after it went into effect. The McGill study notes that many Canadians continue to use Meta as a source of “news” despite the ban. Three-quarters of the Canadian public is apparently unaware of the ban, yet Canadian news outlets have lost 85% of their engagement on Facebook and Instagram and almost one third of local news outlets are now inactive on social media. How this apparent contradiction is possible is explained in part by workarounds such as those referred to above, but also because Meta allows sites that declare themselves to be non-news outlets to continue to post content. Mediapolicy.ca discusses the case of Narcity.com, a “chain of local news and lifestyle websites” that has recently been reinstated on Meta. Narcity was denied government tax credits as a “Qualified Canadian Journalism Organization” (QCJO) because of insufficient first-hand reporting. With this rejection in hand, it then applied to Meta to be reinstated on its platform on the basis that if it didn’t qualify for QCJO credits, it must not be a news provider under the Online News Act, C-18. It’s a clever argument, one that possibly the River Valley Sun could take advantage of.

One can argue that Canada’s attempt to require Google and Meta to enter into good faith negotiations with news providers to license content was either a valiant and imaginative attempt to come to grips with the painful decline of journalism, riding to some extent on what at the time appeared to be the successful coattails of Australia’s initiative, or a colossal miscalculation and poor timing as the platforms decided to draw a line in the sand in case media interests in the US decided to follow suit. Google and Meta will do everything possible to avoid going down that road. If sending a signal to US media interests requires them to call the bluff of the Canadian government, they will do so.

In the end, the results are mixed. Legitimate media in Canada, big and small, will get some help from Google’s contribution, although the amount Google is providing through the Canadian Journalism Collective has to be offset against pre-existing voluntary agreements that both Google and Meta had with some journalistic outlets. The Meta subsidies are now gone, and the previous Google contributions have been folded into the $100 million that Google has agreed to put on the table. Meanwhile, in Regina, Just Bins has found the garbage loophole, serving up its version of trashy journalism and news on Meta, going where bona fide journalists cannot. Sad.

© Hugh Stephens, 2024. All Rights Reserved.

After Blocking News in Canada, Meta Challenges Australia (Again)

Image: Shutterstock via AI modification

It was inevitable. After Meta pulled the plug on news content on its platform in Canada as its way of complying with the obligations of the Online News Act, Australia, the model that Canada sought to emulate, was surely next in line. On March 1, Meta announced that it plans to stop paying publishers of news content in Australia, and will not renew its current agreements with Australian media once they expire. Most will expire this year.

Canada had modelled its Online News Act (Bill C-18) on Australia’s News Media Bargaining Code, albeit with “improvements”. Rod Sims, who was head of the Australian Competition and Consumer Commission (ACCC) at the time the Commission designed the Code (later incorporated into legislation as the Treasury Laws Amendment (News Media and Digital Platforms Mandatory Bargaining Code) Act 2021), was invited to testify before the Canadian Parliamentary committee examining Bill C-18. In his testimony, Sims talked about the success of the Code, its benefits for not just large media players but also many smaller “country” outlets, estimating the benefits to be north of A$200 million per year to journalism in Australia. He added that the institution of the Code “has transformed the journalism landscape in Australia. It’s gone from pessimism to optimism.”

Inspired by the results of the Australian legislation (which, by the way, ended up not designating either Google or Facebook under the Code, since they managed to come to sufficient “voluntary” agreements with Australian media), Canada moved ahead, basing its legislation on the Australian law but adding a couple of additional features. One was to increase transparency with regard to deals that would be struck under the law. Another was to require self-designation by platforms (while making it apparent that only Meta/Facebook and Google) met the criteria, allowing them an exemption if they reached acceptable deals with media. In this way, the companies could not avoid designation and would be subject to the law, something they strongly opposed, even though both had already engaged in voluntary programs on their own terms to provide some financial support to selected media outlets.

Just as happened in Australia, (see “Google’s Latest “Stoush” with Australia: What’s the Lesson from Germany’s Failed Effort? and “Facebook in Australia: “READY, FIRE, AIM”) both platforms pushed back strongly against the draft legislation, threatening to block news for Canadian users. (Facebook briefly and disastrously blocked news for Australian users during its campaign against the Code, but ultimately backed down). First, in the fall of 2022 Facebook said it might have to block postings of news on its Canadian platform, followed by Google which  threatened to block search for Canadian news in Canada by Canadian users. By the summer of 2023, when the Online News Act became law without any of the amendments proposed by the platforms, Meta upped the ante by declaring that it would end news availability on Facebook and Instagram for all users in Canada prior to the Act taking effect, set for December 2023. Again, just as in Australia, Canadian government leaders were public in their condemnation, accusing Meta of threatening and irresponsible behaviour. Alas, it was all to no avail. It appears Meta had already made its decision to not provide financial support for news content in Canada, and to end the few existing agreements that it had undertaken in the past. At the time, it indicated it would also be taking similar action elsewhere. Rather than submit to the legislation by negotiating with media entities, it complied (in letter if not in spirit) by blocking links to Canadian media. Negotiations with Google continued and eventually a compromise of sorts was reached whereby Google agreed to contribute to a fund which would be used to support journalism in Canada.

This was a somewhat pyrrhic victory (the fund will be about $100 million, less than half what had previously been estimated), but a victory nonetheless in the eyes of at least some of the news media. One can debate the overall success of the legislation (see MediaPolicy.ca’s The Online News Act is law: a buzzer-beater win or epic miscalculation?), but along with more government financial support, the Google funded pot will be welcomed by many smaller media outfits. Ironically, establishing a fund rather than requiring negotiations between the platforms and media for payment for content was an early proposal by some commentators. Now this has come to pass more by accident than design. Criteria for disbursing from the fund have been tweaked so that broadcast media, and in particular the CBC, who employ the bulk of news journalists in the country, get less than their proportional share would otherwise indicate.

The lesson for Canada, and now for Australia, is that the big digital platforms will not hesitate to play hardball if they feel their global interests are threatened. While Australia, followed by Canada, was first off the mark with legislation designed to level the playing field between a stressed journalism sector and the monolithic platforms, the response of the platforms was governed more by potential precedent than the specifics of those markets. The existence of draft legislation at the federal level in the US, (the Journalism Competition and Preservation Act, aka JCPA) as well as at the state level in California and Illinois, has not escaped the attention of Meta and Google. (Even the watered-down compromise settlement that Google made with Canada has led to some lip-smacking speculation in the US as to the amount of funding that could flow to US media). It appears that Meta, in the face of cost cutting and loss of market share in 2022, had made a business decision that if it had to pay for access to news content, it would do without. To what extent this is a wise business decision remains to be seen, but the company has clearly made a business decision in this regard. This decision may or may not affect Meta’s bottom line, but it will have the effect of leaving the platform as a purveyor of less than reliable information from nonprofessional sources. However, doing the most socially responsible thing as opposed to maximizing profits by cutting costs is not what Meta is about.  Having made its decision, it will need to unwind its commitments to Australian media, which it is now in the process of doing.

What does this mean for Australia and what can the Australian government do about it? Writing about this in Canada’s National Post, Rod Sims, now professor at the Crawford School of Public Policy at ANU, outlined some choices the Australian government needs to face. It could move to designate Meta under the Bargaining Code and force it into the negotiation and arbitration process. That would likely lead to Meta taking precisely the action that it took in Canada. The government could amend the legislation, but to what end? It could publicly criticize Meta, accusing it of unfairness and bad behaviour. It has already done this, with Prime Minister Albanese saying that what Meta is doing is “not the Australian way”. That will have zero influence on Mark Zuckerberg and the people who run Meta.

At the end of the day, Australia can stand up to Meta, and let the chips fall as they may, or it can allow Meta to free ride on Australian news content, accepting that there may be social benefits in allowing this to happen. A recent report by the Australian Broadcasting Commission (ABC) points out that Facebook is the largest social media platform for general news and half of Facebook’s users in Australia report using the social media platform for news. (Regardless of this, Meta’s beancounters give news no value to the platform). According to a University of Canberra report cited by ABC, 45% of Australians get their news from social media as opposed to less than 20% from print sources. The largest source of news is still TV at 58%. (The numbers are greater than 100 because many consumers get their news from more than one source).  In one sampling, 14% of Australians got their news from Instagram! While I find this personally appalling (indirectly revealing my age), that is the reality of our society today. Better that consumers find reliable, curated news somewhere–but we still need to recognize that responsible journalism needs to be paid for. Meta, apparently, has no desire to be a part of that equation. Without the infusion of responsible, curated journalism, Facebook will become an even greater home for misinformation than it already is. But does Meta care? Clearly not. Consumers need to be encouraged to find their news sources elsewhere. Easier said than done.

The Australian government is no doubt pondering how to respond in the best interest of Australia. Allowing Meta to wriggle out from its obligations under the Bargaining Code would not necessarily undermine the deals struck with Google, who appears to have accepted that its overall interest is best served by some form of accommodation. Having Microsoft, which has publicly stated it is willing to subject itself to both the Australian and Canadian legislation, breathing down its neck is undoubtedly a factor in this. Even if the Google deal won’t be undone, it is still galling that Meta can get away with it. Canada had to swallow that reality, yet stood up to Meta. What will Australia do? It’s a tough call.

© Hugh Stephens, 2024. All Rights Reserved