Digital Platforms and News Content: Australia Takes Off the Gloves

Image: Shutterstock

Canada infamously tried to take a leaf from Australia’s book in dealing with large internet platforms, like Google and Meta, that benefit from news media content without paying for it. In 2023, Canada introduced the Online News Act (Bill C-18), a Canadian version of Australia’s News Media Bargaining Code. The Australian approach, first introduced through legislation in late 2021, was initially very successful. Rod Sims, the author of the Code from his then position as Commissioner of the Australian Competition and Consumer Commission (ACCC), testified before the Canadian House of Commons Committee studying C-18, pointing to the success of the initiative in generating some AUD200 million in financial support for Australian media annually. Although there had been pushback by both Google and Meta in Australia, both eventually came onside, especially after the spectacular flop of Meta’s news blackout campaign. The threat of being designated under the Australian code was enough to get the two platforms to negotiate agreements with most Australian media in the form of funding to support journalistic output. As a result of the agreements reached, neither platform was designated under the Code and thus was not subject to “final offer” arbitration imposed by the ACCC.

Canada thought it would “improve” on the Australian precedent by making the process somewhat more transparent in terms of funding offers, and by requiring the platforms to self-designate. Whether it was the tweaked Canadian legislation or, more likely, a reappraisal of the value and cost of the agreements (particularly when it became apparent that the Australian precedent was likely to be followed elsewhere, with Canada being the first out of the gate), both platforms dug in. Meta in particular refused to engage with the Canadian process and declared that it would “comply” with the legislation by removing all links to Canadian media. That is not what the Government of Canada or Canadian media had in mind when Bill C-18 was introduced. Meta has held that line, although the extent to which it is fully complying is under review by the regulator, the CRTC. Various workarounds to allow news content to appear on Facebook have been employed by both Facebook users and some news providers, and META seems willling to turn a blind eye. Why that doesn’t trigger the Online News Act requirement to reach funding agreements with news content providers is a question that cries out for a response. (CRTC take note: We are waiting).

Google was slightly more amenable to striking a deal with the Government of Canada, agreeing that in return for exemption from the legislation, it would contribute $100 million (CAD) annually for five years (adjusted to inflation) to a fund that would provide support to qualified Canadian media enterprises. The $100 million subsumes existing contributions Google was already making to some Canadian journalism programs, so the net result is not $100 million in new money. Google has now begun to disburse this funding through the Canadian Journalism Collective (CJC), an entity established by what could legitimately be called “non mainstream media”, i.e. many small digital startups. The CJC was selected by Google as the executing agency for its funding, thus snubbing the organization representing the major media enterprises, News Media Canada. There are likely to be disputes over whether some of the “little guys” actually qualify as bona fide journalists. The more mouths there are to feed, the less there is for each supplicant and the big players are not happy to see the Google revenue stream diluted.

Meanwhile, back in Oz, Meta has announced that once they expire it will not renew the media agreements it reached back in 2022. Many will end this year. It appears Meta has decided it will adopt a consistent global position by insisting that news media content provides it with no value. Zero. None. And therefore it will not pay a cent. In part, this is to head off similar moves in the US where news media providers would like to bring in an arrangement similar to that instituted in Australia, or Canada. A separate initiative in California ended up with an outcome close to the one in Canada, with Google reluctantly agreeing to contribute funding to local journalism while Meta walked away. The Australian government has seen where this is heading, and it is not happy. It is taking the gloves off.

The Albanese government has announced it will be taking measures to require that any internet company that refuses to negotiate with publishers or removes news from its platform will be forced to pay, regardless. This is the big stick to counter META. What happens next is a consultation process, beginning now, to determine how what is being called a “news bargaining incentive” will actually be applied, retroactive to January 1. All digital platforms with annual revenues of AUD250 million annually will likely be subject to it. This will expand the net to include ByteDance (Tik Tok) and Microsoft (Bing, LinkedIn) as well as META and Google. Google has already said it will carry on and will renew the deals it signed with Australian media, allowing it to be exempted under the Code. META is not backing down.

The so-called “incentive” will take the form of a discounted penalty or fine. The initial proposal is that companies that sign deals amounting to 90% or more of the total of the fine that would otherwise be levied will be exempt. In other words, find ways to strike deals that in the end will save you money. Simply refusing to carry news content, as META has done in Canada, will not let a designated platform off the hook, a significant variation from the Canadian legislation, which many have criticized as being flawed. Rod Sims, now back in academia, fully supports the new incentive initiative. It appears the only way META can avoid payment is by closing its business in Australia. Given META’s track record, this might even be a card it is prepared to play. One can expect it to pull out all the stops to oppose the “incentive”, from legal challenges to threatening a pullout to seeking to invoke the support of the Trump Administration.

What position will the Trump Administration adopt? Donald Trump certainly has no love for the news media, as evidenced by his current and threatened lawsuits against US media outlets for providing coverage he doesn’t like. On the other hand, NewsCorp, which has strong holdings in Australia, has in recent years built its reputation and business model on catering to Trump’s vanity and desires. Trump also is not fan of Facebook, but Mark Zuckerberg has smelled the coffee and has donated a $1 million to Trump’s inauguration, after having kissed the ring by dining with the President-elect at Mar-a-Lago. So in the end, who knows where the US government will be on this question? All I can say to the Australian government’s expressed intention to deal head-on with META’s scorched-earth tactics is “good on ya, mate”. I wish Canada had the gumption to do the same.

© Hugh Stephens, 2025. All Rights Reserved.

Looking Back at 2024: It’s All About AI and Copyright (And a Few Other Things)

Image: Shutterstock

A retrospective on the year now coming to a close is what one expects this time of year, so I will try not to disappoint. However, when I look back at the copyright developments I wrote about in 2024, the dominant issues that jump out are AI, AI and AI. You can’t read or think about copyright without Artificial Intelligence, or to be more correct, Generative Artificial Intelligence (GAI), occupying most of the space despite many other issues on the copyright agenda. The mantra of “AI, AI and AI”, as in “Location, Location and Location” is apt because there are at least three important copyright dimensions related to AI; training of AI models; copyright protection for outputs generated by AI; and infringement of copyright by works created with or by AI. Of the three, the use of copyrighted content for AI training is the most salient.

Last year in my year-ender, I also discussed AI and the numerous lawsuits that were emerging as rightsholders pushed back on having their content vacuumed up by AI developers to train their algorithms. Those lawsuits have only multiplied. At last count, there are more that 30 cases in the US, ranging from big media vs big AI (New York Times v OpenAI/Microsoft) to class action suits brought by artists and authors, as well as litigation in the UK, EU, and now in Canada (see here and here). That is just on the input side.

In terms of output, i.e. whether works produced by an AI can be copyrighted, there are a couple of interesting cases in the US where applications for copyright registration have been refused by the US Copyright Office (USCO) because of a lack of human creativity. A couple of months ago, I discussed two such high profile cases, one brought by Stephen Thaler, and the other by Jason Allen. To date the USCO is not budging, although it is undertaking an extensive study of the issue. Part 1 of its study, on digital replicas, was published in July of this year. The next section on copyrightability is expected to be published in January with the issues of ingestion for training and licensing in Q1 2025.

While the USCO has to date denied applications for copyright registration of AI-generated works, the Canadian copyright office (CIPO-Canadian Intellectual Property Office) has been caught up in a problem of its own making. This is because Canadian copyright registration is granted automatically, so long as tombstone data and the prescribed fee is provided. The work for which registration is sought is not examined. As a result, copyright certificates have been issued to works created by AI, notwithstanding the general presumption that copyright protection is only accorded to human created work (although this is not explicitly stated in the Act). In July a legal challenge was launched against copyright registrant Ankit Sahni, who successfully registered a work with CIPO claiming an AI as co-author. The case was brought by the Canadian Internet Policy and Public Interest Clinic (CIPPIC) at the University of Ottawa, as I wrote about here. (Canadian Copyright Registration and AI-Created Works: It’s Time to Close the Loophole).

While the courts in the US, UK, Canada and elsewhere are grappling with various issues related to AI and copyright, governments are studying the issue.

In Australia, the Select Committee on Adopting Artificial Intelligence issued its final report in November. While the report was wide-ranging, three of its recommendations related to copyright;

engagement with the creative Industry to address unauthorized use of their works by AI developers and tech companies,

transparency in Training Data by requiring AI developers to disclose the use of copyrighted works in training datasets and ensure proper licensing and payment for these works, and

remuneration for AI Outputs, with an appropriate mechanism to be determined through further consultation

These are important principles, but how they will be implemented in practice remains to be determined.

In Canada, a consultation on AI and copyright was launched late in 2023 with submissions to be received by January 15, 2024. The Canadian cultural community put forth three key demands;

No weakening of copyright protection for works currently protected (i.e. no exception for text and data mining to use copyrighted works without authorization to train AI systems)

Copyright must continue to protect only works created by humans (AI generated works should not qualify)

AI developers should be required to be transparent and disclose what works have been ingested as part of the training process (transparency and disclosure).

Submissions to the consultation were published in mid-year but since then there has been no apparent action. Given the current political crisis facing the Trudeau government, none is expected in the near term although the issue will inevitably have to be addressed after the general election in 2025.

While the EU has already established some parameters dealing with use of copyrighted materials for AI training, the new UK Labour government is taking another run at the issue after various proposals in Britain to find a modus vivendi between the AI and content industries under the Tories went nowhere. The current UK discussion paper on Copyright and Artificial Intelligence, which seems excessively tilted in favour of the AI industry, has aroused plenty of controversy. While it says some of the right things, such as proclaiming that one of the objectives of the consultation is to “support…right holders’ control of their content and ability to be remunerated for its use” the thrust of the paper is to find ways to encourage the AI industry to undertake more research in the UK by establishing a more permissive regime with respect to use of copyrighted content. It is based on three self-declared principles; (notice how these things always seem to come in threes?);

Control: Right holders should have control over, and be able to license and seek remuneration for, the use of their content by AI models

Access: AI developers should be able to access and use large volumes of online content to train their models easily, lawfully and without infringing copyright, and

Transparency: The copyright framework should be clear and make sense to its users, with greater transparency about works used to train AI models, and their outputs.

These three objectives then lead to what is clearly the preferred solution;

“A data mining exception which allows right holders to reserve their rights, underpinned by supporting measures on transparency”

Fine in principle, but the devil is always in the detail and the details in this case revolve around transparency (how detailed, what form, what about content already taken?) and, in particular, reservation of rights, aka “opting out”. This is easy to proclaim in principle but difficult to do in practice. British creators are up in arms, led by artists such as Paul McCartney, and supported by the creative industries in the US. The British composer Ed Newton-Rex has penned a brilliant satire explaining how AI development in the UK will work if current proposal is enacted. The problem with an opt-out solution is essentially twofold; it doesn’t deal with content already absorbed by AI developers and it would be cumbersome if not impossible for many rightsholders to use.

Other governments have addressed the issue in different ways. Singapore has taken a very loose approach toward copyright protection, putting its thumb firmly on the scale in favour of AI developers. It is currently considering additional proposals that would strip even more protection from rights-holders, who are pushing back strongly. Japan had been widely and incorrectly reported to have been on the same path, resulting in a welcome clarification this year from the Agency for Cultural Affairs regarding the limits of Japan’s text and data mining (TDM) exception.

While AI dominated the copyright agenda in 2024, there were other issues relating to copyright and copyright industries that I wrote about. The ongoing question of payment for news content by large digital platforms continued to play out in different ways. In Canada, the struggle between the government and US tech giants Google and META was finally “resolved” (after a fashion) at the end of last year. Google agreed to “voluntarily” pay $100 million annually into a fund for Canadian journalism in return for being exempted from the Online News Act (ONA) while META called the government’s bluff by blocking Canadian news providers from its platform thus, in theory, avoiding being subject to the ONA. However, META has a very subjective interpretation as to what is Canadian news content, allowing some news providers to post to it, while many users have found workarounds, as documented by McGill’s Media Ecosystem Observatory. While the CRTC investigated, the issue is still unresolved.

Meanwhile in Australia, it seems that META intends to go down the same road of blocking news, announcing it will not renew the content deals it initially signed with Australian media in response to Australia’s News Media Bargaining Code, the model upon which Canada’s legislation was based. Unlike in Canada, the Australian government is planning a robust response. (More on this in a future blog post). Finally, on the same topic, California (which was threatening to introduce its own version of legislation to require digital platforms to compensate news content providers) emerged with an outcome very similar to that reached in Canada, with Google offering up some funding (although proportionally less than in Canada) while META appears to have walked away.

Controlled Digital Lending (CDL) was another copyright issue finally settled in 2024 (in the US). The Internet Archive, after losing a lawsuit brought against it by a consortium of publishers who argued that the digital copying of their works constituted copyright infringement, notwithstanding the Archive’s theory that they were simply lending a digital version of a legally obtained physical work held by them (or someone else associated with them), lost its appeal. In December, the deadline for further appeals expired, thus effectively ending this saga. Whether Canadian university libraries, some of whom are avid devotees of CDL, will take note remains to be seen.

The issue of circumventing a TPM (“Technological Protection Measure”), commonly referred to as a “digital lock” and often represented by a password allowing access to content behind a paywall, was also front and centre this year in Canada. In the case of Blacklock’s Reporter v Attorney General for Canada, the Federal Court found that an employee of Parks Canada, who shared a single subscription to Blacklock’s with a number of other employees by providing them with the password did not infringe Blacklock’s copyright since the employee did not circumvent (in the meaning of the law) the TPM and the purpose of the sharing was for “research“, which is a specified fair dealing purpose. Blacklock’s is a digital research service that sells access to its content and protects its content with a paywall, as is common for many online content providers, like magazines and newspapers.

Despite the hoo-ha of anti-copyright commentators asserting the Court had found that “digital lock rules do not trump fair dealing“, it was equally clear the Court had ruled that fair dealing does not trump digital locks (TPMs). The Court did not undermine the protection afforded to businesses to protect their content through use of TPMs. Rather, it determined that sharing a licitly obtained password did not constitute circumvention as outlined in the Act, as I explained here. (Fair Dealing, Passwords and Technological Protection Measures (TPMs) in Canada: Federal Court Confirms Fair Dealing Does Not Trump TPMs (Digital Lock Rules). Although the Court did not legitimize circumvention of a TPM for fair dealing purposes, contrary to claims stating the opposite, its acceptance of password sharing is an outcome that legal experts have disagreed with, (as do I for what it is worth). The law is very clear that fair dealing cannot be used as a pretext or a defence against violation of the anti-circumvention provisions of the Copyright Act. The decision now under appeal by Blacklock’s.

Finally, the last copyright point of note for 2024 is that this year marked the bicentenary of the introduction of the first copyright legislation in Canada, in the Assembly of Lower Canada, in 1824. It also marked the centenary of the entry in force of the first truly Canadian Copyright Act on January 1, 1924. This two hundred years of domestic copyright history is worth celebrating. The first legislation was introduced “for the Encouragement of Learning” so that more local school texts would be written and printed. Given the current standoff between the secondary and post-secondary educational establishment and Canadian authors and their copyright collective over license payments for use of copyrighted works in teaching, one wonders whether we have really learned anything about the role copyright plays in our society. (Copyright and Education in Canada: Have We Learned Nothing in the Past Two Centuries? (From the “Encouragement of Learning” to the “Great Education Free Ride”).

Leaving that question with you to ponder, gentle Reader, is probably a good way to end this look back over the past 12 months. Stay tuned for more commentary on copyright developments in 2025.

© Hugh Stephens, 2024. All Rights Reserved.

Tech Platforms and News Media: California Cuts a Deal with Google, While Meta Walks Free

Image: Shutterstock (AI assisted)

The ongoing financial tug-of-war between large tech/social media platforms and news media outlets, with governments trying to play a mediator/arbitrator role, has taken another turn with the announcement that California has cut a deal with Google, similar in principle to the one reached in Canada at the end of last year. When Google was fighting being subjected to regulation in Canada under the Online News Act, Bill C-18, it was glancing over its shoulder at impending developments on its own home turf, California. The California Journalism Preservation Act would have instituted a regime similar to that originally proposed in both Australia and Canada by requiring Google to negotiate agreements with news content providers, compensating them for Google’s use of news content links.

The California bill is similar to draft US federal legislation introduced, but not adopted, in the 2022 session of the US Congress. That legislation would have given news media enterprises an exemption from anti-trust laws allowing them to negotiate jointly with the tech platforms while specifying requirements regarding the negotiations, including arbitration in certain circumstances. (That draft US federal legislation made it difficult for Google to enlist the support of the US Government in opposing similar legislation in Canada, as I wrote about here). Google didn’t like the draft legislation in the US one bit and employed various measures to try to stop it. In California it threatened to drop news from Search, (as it did in Canada) and, similar to its tactics in Canada, it trialed blocking links to news sites for some users.

As most readers will know, and as I have written about on several occasions, Google played hardball in Australia and Canada, but in the end came to an accommodation of sorts. In Australia it reached unspecified content deals with the majority of Australian media players. In Canada, in return for avoiding designation under the Online News Act, Google agreed to contribute CAD $100 million (about US$75 million) annually to a journalism fund to support news media outlets, both print and broadcast. While the result was not what the government or the news media had originally expected, it was certainly better than nothing. It was not all “new money”, however, as Google had previously signed some voluntary content deals in a vain attempt to head off the legislation. This pre-existing funding will now be rolled into the $100 million fund. The voluntary deals will be wound up, thus putting an effective cap on Google’s contribution. Its tactics in Canada demonstrate that Google will do just about anything to avoid being made to share revenues for linking to the content of others, seeing this as an existential threat to its business and operating model. The same applies in California.

In return for withdrawal of California’s original legislation—which would have subjected links to compensation—Google has agreed to ante up approximately US$55 million over 5 years to be put into a fund to be managed by the Graduate School of Journalism at the University of California, contingent on the California legislature also contributing $70 million to the fund over the same period of time. That outcome is by no means guaranteed as the funding must be approved by the state legislature each year. In addition, according to the New York Times, Google has undertaken to provide approximately $60 million over 5 years to an AI Innovation Accelerator Fund, plus maintaining its existing $10 million annual support payments for journalism. That is pretty small change for Google. California’s population, at roughly 39 million, is almost exactly the same as that of Canada although California’s GDP is roughly twice as large. Taking that into consideration, maybe Canada did not fare so badly in getting Google to contribute $100 million (CAD) annually.

While Google and the California legislators who brokered the deal were touting its benefits, it received a mixed reaction from some publishers and outright condemnation from unions representing journalists and staff working at news outlets, who called it a “shakedown”. Meanwhile, Meta (Facebook) appears to have walked away scot-free. Meta was one of the targets of the original California legislation, as it was in Australia and Canada. Meta has claimed it gets no value from news links and in Canada “complied” with the Online News Act by blocking news links, thus eliminating any obligation to pay for news content. The result has been perverse, with non-news outlets, like a garbage disposal company in Saskatchewan being allowed to post news to Facebook because technically it is not a news provider, while legitimate professional journalism outlets are blocked. While Meta initially reached some content deals in Australia after its blockage of news content blew up in its face, it is now busy terminating them as renewals come up. The Australian government will have to decide whether to bite its tongue and do nothing or call Meta’s bluff. If they do, there is every likelihood that Meta will block news for Australian users as it has done in Canada.

There are still voices in California calling for ways to “encourage” Meta to make a financial contribution to news media, and the Google offer is not a completely “done deal” although it has support from the Governor of the state. With the California domino having fallen, government attempts to bring Google to heel seem to have sputtered out, at least for now. The Journalism Competition and Preservation Act in the US Congress appears unlikely to go anywhere. The UK is still trying to figure out its approach, as MediaPolicy.ca reported last week. Google and Meta appear to have successfully stared down sovereign governments at the national and state level. While Meta appears to have completely closed its wallet, Google has negotiated payments it can live with as part of the cost of doing business. And given the scale of its business, the amounts it is throwing on the table are exactly that, a business cost that is a small price to pay for its dominant quasi-monopolistic grip on the online advertising market which, at its most basic level, is a vehicle to provide access to content generated by others.

© Hugh Stephens, 2024. All Rights Reserved.

As Journalism Withers, “Garbage” News Takes Over: An Unexpected Result of the Facebook/Instagram News Blackout in Canada

Image: Shutterstock (with AI assist: Note AI misspelling)

The sad, slow decline of professional journalism continues. The most recent manifestation of this in Canada is the announcement that the Saltwire Group, publishers of The Chronicle Herald of Halifax and a number of other daily and weekly papers in Atlantic Canada, is on the ropes with $90 million in debt. The Chronicle Herald has been around for 150 years! Post Media has made a buyout offer of $1 million (less than the cost of the average home in Vancouver). This will keep some of the papers publishing, no doubt with significant staff cutbacks and, reportedly, ending of union contracts and termination of company pension plans. But there seems to be little choice; accept the nasty medicine or expire. Post Media will most likely keep a minimal local newsroom and fill the papers with repurposed content from its flagship daily National Post. Local news will suffer. This is a story that is being replicated all over North America.

Meanwhile small, local online news outlets have been hit by Facebook’s blackout of news for Canadian readers as a result of the passage of the Online News Act, Bill C-18. In the case of New Brunswick’s River Valley Sun, according to the CBC, the free-to-readers, ad-supported, all-digital Sun depended on Facebook to reach its audience as well as to obtain local news that could be added to the online journal. The owner is quoted as saying, “…we started basically from scratch, and because Facebook was free it was wonderful”.

That is the flip side of the narrative accusing Facebook of using uncompensated news content, developed at great expense by professional journalists working for media companies, to attract and hold users’ interest thus keeping them on Meta’s sites (Facebook, Instagram) longer to be able to expose them to more ads. Not so wonderful. That is what C-18 was supposed to rectify, along the lines of the News Media Bargaining Code instituted earlier in Australia, requiring the big social media platforms (in this case Google and Meta) to make a contribution to news gathering if they used news content. Failure to do so would result in compulsory and binding arbitration. While Google ducked and weaved, it eventually agreed to put $100 million into a pot to be doled out to various news organizations by the Canadian Journalism Collective as long as it was given an exemption from the legislation, whereas Meta went to the wall and took down news links rather than contribute. The River Valley Sun is collateral damage, although it could possibly tap into the Google funding if it employs any fulltime journalists (a minimum of two is required).

The Online News Act did result in bringing some additional funding to news outlets, large and small, somewhat along the lines of Australia’s initiative through its News Media Bargaining Code (which was not actually invoked because the two platforms ended up striking content deals with most Australian media). However, it is clear that Meta has had buyer’s remorse about its Australian deals because they are busy unwinding those commitments in Oz. It remains to be seen how Australia will deal with this. There have been some suggestions that the Australian government may be looking at options to force Meta to carry news, and then subject them to the Code. Will Meta get a pass—in which case Google could rightly ask why it should comply—or will news also end up being blocked on Facebook and Instagram for Australian users as well as Canadian?

If that happens, consumers will have to find other ways to get access to news that is normally shared on the Facebook platform. Going to the actual websites of news organizations is one good way to do this. What a novel idea! However, many younger consumers won’t do this; if news is on a social media feed, they might read it, but they will not proactively search for it. Facebook also remains an important go-to source for breaking local news related to weather and climate-change events such as flooding and fires. Hosts of Facebook pages distributing information on the local impact of natural disasters have had to resort to workarounds such as posting photos of news articles on Meta or copy-pasting articles into their Facebook page. (Ironically, such actions could possibly constitute violations of copyright whereas posting a link is not). But workarounds are not going to help news outlets like the River Valley Sun.

Perhaps the journal should take a leaf out of the book of Just Bins, a waste recycling firm in Regina, SK, a garbage collection company that seems to have picked up news distribution as a sideline, benefiting from the free online exposure. It does not distribute mainstream media; it creates its own news content that it features on its website, even employing its own drone footage. Because it is not a news site, it is not blocked by Facebook. It has edgily reported on traffic accidents, problems at City Hall, homelessness, and even suicides. In fact, it was voted best online news source in a recent poll taken for the annual “Best of Regina” awards. Not that it takes its journalism role all that seriously—and that is part of the problem—since fact-checking, journalistic ethics and balanced reporting seem to have been put out with the recycling bins. Just Bins has taken the old adage of “garbage in-garbage out” to new heights. This is yet another byproduct of the Facebook ban, a further kick in the shins for reputable journalism. If the River Valley Sun could just sell pizza on the side, maybe they could slip through the Facebook net and post online as Just Bins is doing.

Just Bins is not the only quasi-news source that has managed to squeeze through the Meta news blockage. This week Howard Law’s media blog MediaPolicy.ca reports on the Meta news ban (The leaky Meta news ban is roiling Canadian journalism”) and highlights a study conducted by researchers at McGill University on the impact of the ban a year after it went into effect. The McGill study notes that many Canadians continue to use Meta as a source of “news” despite the ban. Three-quarters of the Canadian public is apparently unaware of the ban, yet Canadian news outlets have lost 85% of their engagement on Facebook and Instagram and almost one third of local news outlets are now inactive on social media. How this apparent contradiction is possible is explained in part by workarounds such as those referred to above, but also because Meta allows sites that declare themselves to be non-news outlets to continue to post content. Mediapolicy.ca discusses the case of Narcity.com, a “chain of local news and lifestyle websites” that has recently been reinstated on Meta. Narcity was denied government tax credits as a “Qualified Canadian Journalism Organization” (QCJO) because of insufficient first-hand reporting. With this rejection in hand, it then applied to Meta to be reinstated on its platform on the basis that if it didn’t qualify for QCJO credits, it must not be a news provider under the Online News Act, C-18. It’s a clever argument, one that possibly the River Valley Sun could take advantage of.

One can argue that Canada’s attempt to require Google and Meta to enter into good faith negotiations with news providers to license content was either a valiant and imaginative attempt to come to grips with the painful decline of journalism, riding to some extent on what at the time appeared to be the successful coattails of Australia’s initiative, or a colossal miscalculation and poor timing as the platforms decided to draw a line in the sand in case media interests in the US decided to follow suit. Google and Meta will do everything possible to avoid going down that road. If sending a signal to US media interests requires them to call the bluff of the Canadian government, they will do so.

In the end, the results are mixed. Legitimate media in Canada, big and small, will get some help from Google’s contribution, although the amount Google is providing through the Canadian Journalism Collective has to be offset against pre-existing voluntary agreements that both Google and Meta had with some journalistic outlets. The Meta subsidies are now gone, and the previous Google contributions have been folded into the $100 million that Google has agreed to put on the table. Meanwhile, in Regina, Just Bins has found the garbage loophole, serving up its version of trashy journalism and news on Meta, going where bona fide journalists cannot. Sad.

© Hugh Stephens, 2024. All Rights Reserved.

It Took Glue on Pizza to Spotlight Google’s AI Problem

Image: Shutterstock (with AI assist)

Google, the “indispensable” search engine relied on by millions for accurate and reliable search, has done it again, stepping smack into the pile of steaming excrement waiting for it in the middle of the road. Its most recent ill-starred foray into AI generated search has yielded some hilarious results, lighting up the blogosphere and making Google the butt of many jokes. After flubbing the public launch of its first AI enabled service, Bard, back in early 2023 when the AI driven search function produced the wrong results for a simple question about the James Webb Space Telescope, overnight wiping $100 million off Google’s valuation, Google’s new Gemini “AI Overview” service scored another own goal with its “hallucinatory” responses to questions like how to ensure cheese will stay on pizza (add glue) or how many rocks a day should a human eat. (Only one, in case you were wondering). It also informed users that Barack Obama was the first Muslim President of the United States.

When it comes to AI, “hallucinations” refer to incorrect or misleading results resulting from lack of training data, biased or selective training data, or incorrect assumptions made by the model. Hallucinations leading to trademark dilution was one of accusations levelled against OpenAI and Microsoft by the New York Times in its landmark copyright infringement case that is still working its way through the courts. In this case, the AI algorithm incorrectly attributed the false information to the Times, thus undermining its journalistic credibility, and diluting its trademark, or so the argument goes.

Apparently, the source of the pizza glue misinformation was an old tongue-in-cheek post on Reddit. I guess an AI algorithm has no sense of humour and can’t tell sarcasm from reality. It also gives credibility to conspiracy theories and blatantly false information, such as the Barack Obama example. Normally a search on any subject turns up a variety of sources on Google, some clearly more authoritative than others. Searchers can weigh a Wikipedia entry against a Reddit post against information from a government website or reputable academic institution. Even a plain old tendentious website put up by an advocacy organization can be probed and the bona fides of the source checked out. That is becoming more difficult, or at least less obvious, with the AI generated search summary provided by Google’s AI Overview.

If the search topic falls within AI Overview’s purview (and at the moment, not all do), viewers will see a summary of the information requested drawn from sources chosen by the algorithm. The algorithm decides how much information is drawn from any given site, and which sites are chosen. Users have the option of clicking through to access these and other sites that are displayed (below the annoying sponsored listings). However, many consumers, looking for a quick information fix, will not bother to do this and thus risk taking the AI summary as gospel. If you are being advised to mix glue into your pizza topping, you can probably figure out that something is haywire, but if the summary is only slightly wrong, or is on a subject that you are not familiar with, watch out. A good example was provided by the website Plagiarism Today. It asked Google five questions about copyright in the US. Its conclusion regarding the responses provided by AI Overview? Decidedly mixed. One A, one B, one C, one D and one resounding F.

The accuracy of the summary obviously depends on the sources of information chosen by the algorithm, and the emphasis it chooses to put on information from any given source. Unfortunately, in many instances it does not seem to prefer credible and authoritative sources, but instead goes for those that are popular. That is one of the basic problems of AI generally—quantity over quality, popularity over facts. (By the way, this account of how AI Overview works is based on reading about it from US sources since it is not yet available in Canada, which may be a good thing since I have read various US posts explaining that Overview is impossible to disable and very hard to turn off). Google intends to cram it down your throat whether you want it or not.

Of course, Google assures everyone that Overview is a “good thing” and the early kinks will be ironed out. Many websites are not happy with the new interface that will now exist between themselves and the consumer. They lose traffic when users simply read the AI summary and move on, not visiting the source website. Google used to boast that it had a symbiotic relationship with content providers because it facilitated, and even drove, eyeballs to the sites. No longer. It has appropriated–without permission–content from independent sites to feed AI Overview in the same way that the LLM (Large Language Model) AI developers have scooped up content, including copyrighted content, from rightsholders, without permission, licence or payment to enable their AI training. It is one thing to link to third party content, which requires a visit to the actual site to access the content; it is quite another to freely copy from it and mix it, sometimes inaccurately or inappropriately, with content from other websites that may not be reliable or acceptable sources of information.

Google clearly controls what goes into AI Overview and has said that it will apply more filters. If it can screen out sources of sarcasm and parody, it clearly has the capacity to install other filters that could differentiate trustworthy information from garbage. This might require Google to license the use of this curated information (Horrors! Google having to pay for the information of others that it so freely uses!). Licensing has already begun for content used for generative AI training. News Corp has just signed a licensing deal with OpenAI, as has the AP and the Financial Times. Licensing is at the heart of the dispute that OpenAI is facing with the New York Times.

Licensing presupposes knowledge of what inputs are being used, a requirement now enshrined in EU law which requires that AI developers maintain an inventory of works used for training purposes (transparency). This will allow rightsholders to opt out or negotiate a licensing solution unless the copying meets the text and data mining exception (i.e. for research by research and cultural organizations).

However, Silicon Valley has variously proclaimed that (a) it is impossible to track all the information ingested during AI training; (b) it would bankrupt the industry should they have to pay for content (c) they need to use copyrighted content because there is not enough current public domain information available (d) it is not feasible to filter out or identify specific works amongst the millions of datapoints that it ingests (e) everything that it does is fair use anyway (f) all of the above. Google’s embarrassment, and its apparent ability to finetune AI Overview, demonstrates that it is clearly feasible to filter out certain works and types of content. It is the will to do so generally that is lacking. Meanwhile, the number of lawsuits brought by rightsholders against AI developers continues to multiply.

By making itself the object of social media ridicule, and then admitting it can address the problem, Google has actually done us all a favour by highlighting the “garbage in; garbage out” problem. Not all copyrighted material is responsible or accurate but a good chunk of it is, such as professional journalism and academic journals. Access to that material is essential to provide credible results. And that material needs to be paid for, on terms set by the content owners.

The solution is not to stop the development of generative AI; for one thing, that won’t happen. It is to corral it, improve it and make it more trustworthy, if necessary with penalties if it is not. The penalties could be imposed by the market (i.e. Google search is not reliable so I will go elsewhere) or, in certain cases, by regulation. Licensing of accurate, credible information to drive search will inevitably distinguish the fake from the real and dubious from the trustworthy. This is what any credible search engine seeks. It is market gold.

Google, open your bulging wallet and start licensing content that will make us want to continue to try you first to get reliable information. Right now, through your clumsy rollout of AI supported search, you are rapidly losing that trust. It is also not acceptable to plagiarize someone else’s content, mix it with garbage from some other source, and serve it up on a platter to consumers on the pretext that this is the definitive answer. The result, as we have seen, is gluey pizza.

(c) Hugh Stephens, 2024. All rights reserved.