Internet and e-mail policy and practice
including Notes on Internet E-mail



07 May 2023

Can large language models use the contents of your web site?
Copyright Law
Large language models (LLMs) like GPT-4 and its front end ChatGPT work by ingesting gigantic amounts of text from the Internet to train a model, and then responding to prompts with text generated from that model. Depending on whom you ask, this is either one step (or maybe no steps) away from Artificial General Intelligence or, as Ted Chiang wrote in the New Yorker, "ChatGPT Is a Blurry JPEG of the Web." While I have my opinions about that, here I'm considering the relationship under copyright law between the input text and the output text. Keeping in mind that I am not a lawyer, and that no court has yet decided an LLM case, let's take a look.



  posted at: 13:17
Stable link is https://jl.ly/Copyright_Law/llmcopy.html

© 2005-2020 John R. Levine.
CAN SPAM address harvesting notice: the operator of this website will not give, sell, or otherwise transfer addresses maintained by this website to any other party for the purposes of initiating, or enabling others to initiate, electronic mail messages.