PCMag editors select and review products independently. If you buy through affiliate links, we may earn commissions, which help support our testing.

Novelists Sue Nvidia Over Alleged AI Copyright Violations

Three book authors want to take Nvidia to court, alleging that the US tech giant's AI NeMo Megatron copies and repurposes their work without compensation or consent.

 & Kate Irwin Reporter

Our team tests, rates, and reviews more than 1,500 products each year to help you make better buying decisions and get more from technology.

Our Expert
LOOK INSIDE PC LABS HOW WE TEST
65 EXPERTS
43 YEARS
41,500+ REVIEWS
(Credit: Jakub Porzycki / NurPhoto via Getty Images)

Nvidia is facing a potential class-action lawsuit over its NeMo Megatron AI model. Three novelists sued the company for alleged copyright infringement, arguing that Nvidia has used their work to train its model and has therefore violated their books' copyright protections.

The authors argue that Nvidia's NeMo Megatron-GPT, first released in September 2022, copies and draws from their books "without consent, without credit, and without compensation."

"During training, the LLM copies and ingests each textual work in the training dataset and extracts protected expression from it," the complaint reads.

The lawsuit states that Nvidia's NeMo Megatron large language model (LLM) was trained on EleutherAI's dataset, dubbed "The Pile," which consists of 800GB of data including 108GB worth of books. The Pile's books component is also referred to as "Books3," which is reportedly made up of more than 196,000 books on "Bibliotik" and includes those of the authors who filed the lawsuit.

Bibliotik is a login-gated "shadow library" of copyrighted books, and Books3 creator Shawn Presser has previously confirmed that Bibliotik's entire library was used to make the AI dataset. In October 2023, Books3 was subsequently removed from AI data site Hugging Face over copyright concerns, but NeMo continues to use this dataset, the lawsuit argues.

The suit's plaintiffs—novelists Abdi Nazemian, Brian Keene, and Stewart O'Nan—are seeking damages and requesting a class-action lawsuit so that all other authors whose work was included in the Books3 dataset can join the suit against Nvidia.

Reached for comment, an Nvidia spokesperson told PCMag via email: "We respect the rights of all content creators and believe we created NeMo in full compliance with copyright law."

This lawsuit comes as AI tech firms see their stock prices soar to all-time highs while artists grow increasingly frustrated that their names and work are being used to train AI models without their permission and without being paid.

Nvidia isn't the only tech firm with an AI model that's being accused of copyright infringement, though. The New York Times's copyright lawsuit against OpenAI and Microsoft over ChatGPT is ongoing. And in the past week, multiple artists have raised concerns that generative AI image tool Midjourney is using their unique styles to create outputs that pull from their bodies of work without their consent, calling it "dehumanizing" and disrespectful.

Editors' Note: This story has been updated to include comment from Nvidia.

About Our Expert

Kate Irwin

Kate Irwin

Reporter

I’m a reporter for PCMag covering tech news early in the morning. Prior to joining PCMag, I was a producer and reporter at Decrypt and launched its gaming vertical, GG. I have previously written for Input, Game Rant, Dot Esports, and other places, covering a range of gaming, tech, crypto, and entertainment news.

I’ve been a PC gamer since The Sims (yes, the original) in the CD-ROM days. I still think about my first-gen pink iPod mini, which, looking back, was not so mini. In 2020, I finally built my own custom Windows PC for gaming with a 3090 graphics card, but I also regularly use Mac and iOS devices. As a reporter, I’m passionate about documenting the wide world of tech and how it affects our daily lives.

My Areas of Expertise

  • Microsoft
  • Google
  • Artificial intelligence 
  • Cybersecurity
  • Video games are a big one. I specialize in shooters (Apex Legends, Fortnite, Overwatch) but I occasionally test out other genres as well, especially indie games or cozy games (The Sims series, Animal Crossing). 
  • The business and tech that powers video games
  • Cryptocurrency and blockchain technology
  • Social media platforms, including Meta’s apps, X/Twitter, Telegram, TikTok, etc.
  • Tech regulation

The Technology I Use

  • MSI gaming laptops
  • Nvidia graphics cards
  • AMD CPUs
  • MacBook Pro and Air laptops
  • An iPhone from 2019 (though I’m thinking about getting a “dumb phone” like the Light Phone)
  • Nintendo Switch
  • PlayStation 5
  • Freewrite Traveler 
  • At home: Sonos speakers (we have them all over the house), Philips Hue + Ring security products

Read full bio