Internet and e-mail policy and practice
including Notes on Internet E-mail


2026
Months
MarApr
May Jun
Jul Aug
Sep Oct
Nov Dec

Click the comments link on any story to see comments or add your own.


Subscribe to this blog


RSS feed


Home

17 Mar 2026

AI scrapebot update Internet
A long time ago I set up a toy web farm, which turned out to be very popular with web spiders, particularly the ones from AI companies. To help their training process, rather than just pages of links, it now has paragraphs of training text.

See more ...


  posted at: 13:48 :: permanent link to this entry :: 0 comments
Stable link is https://jl.ly/Internet/scrapeup.html

Topics


My other sites

Who is this guy?

Airline ticket info

Taughannock Networks

Other blogs

CAUCE
How Harassment Shaped the Internet
50 days ago

Related sites

Coalition Against Unsolicited Commercial E-mail

Network Abuse Clearinghouse

My Mastodon feed



© 2005-2024 John R. Levine.
CAN SPAM address harvesting notice: the operator of this website will not give, sell, or otherwise transfer addresses maintained by this website to any other party for the purposes of initiating, or enabling others to initiate, electronic mail messages.