beSpacific · @bespacific
1106 followers · 2068 posts · Server newsie.social

The has blocked ’s , meaning that OpenAI can’t use content from the publication to train its AI models. If you check the NYT’s robots.txt page, you can see that the NYT disallows , the crawler that OpenAI introduced earlier this month. Based on the ’s , it appears NYT blocked the crawler as early as August 17th. theverge.com/2023/8/21/2384070

#newyorktimes #openai #WebCrawler #gptbot #internetarchive #waybackmachine #copyright #legalresearch

Last updated 1 year ago

: Well, I have only been here for three days, but it feels very much like 1994 all over again: That first hesitant email, those first adventurous clicks on pre- internet search engines. Remember grappling with these?







Let's be patient, progress may be happening before our eyes, again!

#mastodon #google #WebCrawler #lycos #AltaVista #Excite #Dogpile #AskJeeves #JumpStation

Last updated 2 years ago