The current MEDIEN INTERNET und RECHT newsletter has been sent out.

Topics include: scraping, impermissible advertising on social media networks, shop opening on Sundays, price advertising for photovoltaic products, muenchen.de, and more ... 🤗

Subscribe? 📮 newsletter.medien-internet-und

#scraping #werbung #ladenoffnung #preiswerbung

Last updated 1 year ago

- When asserting data protection claims (damages, injunctive relief, and access to information) arising from a scraping incident on a social media platform, setting the value in dispute at a total of EUR 6,000.00 is appropriate

👉 OLG Frankfurt a.M., miur.de/3304

📮 Subscribe to the MEDIEN INTERNET und RECHT newsletter? newsletter.medien-internet-und

#scraping #schadenersatz #unterlassung #auskunft #wertfestsetzung #datenschutzrecht #socialmedia #streitwert

Last updated 1 year ago

Marcel SIneM(S)US · @simsus
217 followers · 5366 posts · Server social.tchncs.de
Swift · @swift
9 followers · 41 posts · Server sunny.garden

@brook have you considered excluding sunny.garden from AI scraping?

robots.txt

User-agent: GPTBot
Disallow: /

It seems odd to disallow ai art yet leave published original art susceptible to ai scraping, for example.

This is a genuine question, not a complaint; there's lots I don't know about this area. Thanks for your work 🙂
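The robots.txt rule quoted above can be checked locally before it is deployed. The following is a minimal sketch using Python's standard `urllib.robotparser`; the helper name `is_allowed`, the agent `SomeOtherBot`, and the URL are assumptions made for illustration, not anything sunny.garden actually runs.

```python
from urllib.robotparser import RobotFileParser

# The rule quoted in the post, as it would appear in a site's robots.txt.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "SomeOtherBot" and the URL are placeholders for illustration.
    for agent in ("GPTBot", "SomeOtherBot"):
        print(agent, is_allowed(ROBOTS_TXT, agent, "https://sunny.garden/"))
```

Under these assumptions the rule blocks only the named agent and leaves other crawlers unaffected. Note that robots.txt is advisory: it only helps against crawlers that choose to honor it.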

#ai #scraping #openai

Last updated 1 year ago

Marcel SIneM(S)US · @simsus
217 followers · 5319 posts · Server social.tchncs.de
Mr.Trunk · @mrtrunk
9 followers · 16134 posts · Server dromedary.seedoubleyou.me
Fabio Manganiello · @blacklight
1215 followers · 1258 posts · Server social.platypush.tech

I'm actually not entirely against AIs scraping the web.

Once the genie is out of the bottle, you can't put it back in. If there's some content out there that is freely accessible, and it can be used to make large models better, it will certainly be used - we shouldn't be too naive or ideological about that.

I've always supported total freedom of scraping for everyone. I've always supported a world where all the content on the Internet can also be parsed by machines (that was the entire idea behind the semantic web). Once public content is out there, we lose control over who accesses it and for what purposes - that's simply how the web works.

But if Google and Meta are suddenly in this "we ♥ scraping" mood, I'd expect them to stick to their words and allow bidirectional scraping at least.

As an AI geek, I'd love to train my models on large corpora of audio extracted from YouTube videos. Or what people post in public Facebook groups when particular events happen. Or how the price of a product fluctuates on Amazon as the result of several external factors.

But I can't legally do any of these things. Those platforms are sealed, their APIs are very limited by design, only a limited number of researchers can access some of that data (after signing lengthy NDAs and agreeing that the parent company will decide if the research can be published), and they have tons of frontend-only checks to ensure that only a human downloads that content - and that they watch a sufficient amount of ads in the process. Not only that: the developers behind scraping software like youtube-dl also get regularly harassed by Google.

So why should I tolerate a world where if you're big enough you can afford to scrape the shit out of everyone, and use that knowledge to become even bigger and more powerful, but nobody is allowed to do the same with your own content?

We urgently need regulation that creates a level playing field when it comes to automated access to online information.

Freedom of scraping means freedom of growing. We can't give this freedom only to those who are already big enough. That's an unfair economic system with insurmountable entry barriers.

We need to make web scraping a fundamental human right.

And large companies should be compelled to share their data without barriers to scrapers too, if they aren't willing to build proper APIs.

Until that happens, I'll keep scraping the shit out of those monopolists without feeling an inch of guilt.

indiehackers.com/post/it-will-

#scraping

Last updated 1 year ago

Debarko ☑️ · @debarko
7 followers · 9 posts · Server fosstodon.org

It's not even funny how easy it is to scrape a website which implements super strong anti-scraping tactics. Wrote a script today which rotates the IP and user agent on every request. It also simulates all the cookie magic that the website implements to rate limit scrapers. AWS can actually help you implement scrapers like a DDoS farm.

#scrape #website #scraping #cookie #aws

Last updated 1 year ago

C.H. · @c_th1
132 followers · 120 posts · Server digitalcourage.social

Der Datenschutz Talk: Student Compels State - News, Calendar Week 30/2023

What happened in the data protection world in calendar week 30, and what is of interest to data protection officers?

We give a brief overview of the current topics:

No right to erasure: VGH München, decision of 29 June 2023,

Damages:

LG Ravensburg, judgment of 13 June 2023,

Student versus NRW 🤩

Fine for paparazzi photos taken into the private homes of celebrities

Austrian convicted for surveillance of his wife

Children: data protection violation through film recordings

Recommendations & reading tips:

Making email communication secure

Episode website: migosens.de/schuler-verpflicht

Media file: migosens.de/podlove/file/781/s

#uberwachung #facebook #scraping #datenschutz #nrw

Last updated 1 year ago

I just read that Google is scraping Google Docs for their AI "training database." Wondering if this is indeed the case.

Some text files in my Drive folders convert to using the Google Docs interface when I open them. So I was wondering where one can store stuff on the cloud that isn't beholden to such thievery.

Found this article: "Google Docs AI: Is it safe? I’m a novel writer and tech journalist — let’s talk" Rami Tabari

laptopmag.com/features/google-

#writingcommunity #scraping #copyright #ai

Last updated 1 year ago

Kaan Barmore-Genç · @kaan
162 followers · 228 posts · Server fosstodon.org

Scraping is so exhilarating, I was hitting a website so hard it went down

It was our own website, I took our own website down

If anyone asks I was just red teaming

#scraping

Last updated 1 year ago

Mariette Timmer · @mariettetimmer
20 followers · 429 posts · Server mastodon.nl

Is it possible to scrape dynamic webpages with Google Apps Script?

#scraping #dtv #googleappsscript

Last updated 1 year ago

PyLadies Bot · @pyladies_bot
115 followers · 117 posts · Server botsin.space
Pxl Phile · @ppxl
86 followers · 25 posts · Server social.tchncs.de

Is it true that Meta scrapes the shit out of every corner to gain Real Names™? It feels true given the shitscape we're currently living in.

#threads #scraping

Last updated 1 year ago

Rechtsanwälte Kotz · @kanzlei_kotz
6 followers · 161 posts · Server nrw.social

Value in dispute in "scraping" proceedings

Legal conflict in the digital age: the significance of scraping and data protection
The judgment at hand highlights a pressing issue of the digital age: data protection in connection with so-called scraping.

Stock photo: Andrii Yalanskyi / Shutterstock

ra-kotz.de/streitwert-in-scrap

#scraping #Datenschutz #Urteil #internetrecht

Last updated 1 year ago

DSGVO.watch · @dsgvo_watch
0 followers · 4 posts · Server social.arkm.de

Facebook users receive EUR 500 in damages for GDPR violation

Facebook users received EUR 500 in damages for a further GDPR violation. Between 2018 and 2019, the data of 533 million Facebook accounts was compromised by unknown attackers. Dabe

dsgvo.watch/gerichtsurteile/fa

#darknet #datenschutz #dsgvo #facebook #gerichtsurteile #landgerichtlbeck #meta #schadensersatz #scraping

Last updated 1 year ago

PCH🎙️ :wp_fedi: · @phillycodehound
7452 followers · 4780 posts · Server masto.ai

@rustybrick do you think Google will just ignore robots.txt? I mean, they'd love to be able to train on everything.

Though I would love more controls for stopping AI from scraping without blocking my sites from Search.

#google #seo #ai #scraping #robotstxt

Last updated 1 year ago

ivedonestranger · @ivedonestranger
34 followers · 76 posts · Server blorbo.social