#ai-scraping

from Digiday
4 days ago

Creators brace for AI bots scraping their work

As AI bot traffic grows, content creators are taking steps to protect their intellectual property from being scraped against their will. The publishing industry has spent the past year battling against the encroachment of AI tech, with companies like The New York Times and Ziff Davis suing AI platforms for scraping their copyrighted content and using it to train large language models.
Intellectual property law
from Fast Company
2 weeks ago

AI scraping is inevitable. Can publishers turn it into revenue?

A good place to start finding answers is the most recent State of the Bots report from AI startup TollBit. For publishers that are feeling the heat of AI, it attaches real numbers to the presence of AI in the media ecosystem and how quickly it's growing. And while the rise of AI bots is a worrisome trend to those in the content business, it may also be an opportunity.
Artificial intelligence
Information security
from WIRED
2 weeks ago

Matthew Prince Wants AI Companies to Pay for Their Sins

Cloudflare lets websites block unauthorized AI scraping or charge for access through a pay-per-crawl model, while it revises its content-moderation stance and continues to protect sites from abuse and DDoS attacks.
E-Commerce
from Digiday
1 month ago

Amazon quietly blocks AI bots from Meta, Google, Huawei and more

Amazon is blocking AI companies' web crawlers via robots.txt to prevent scraping of its e-commerce data and protect its marketplace and ad business.
E-Commerce
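
For readers unfamiliar with the mechanism named in the item above, here is a minimal sketch of how a well-behaved crawler consults robots.txt before fetching a page. The rules, the bot name `ExampleAIBot`, and the `example.com` URLs are hypothetical and are not taken from Amazon's actual file; the helper `can_fetch` is an illustrative name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: the bot name and paths are
# illustrative assumptions, not any real site's rules.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def can_fetch(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The named bot is disallowed everywhere.
    print(can_fetch("ExampleAIBot", "https://example.com/products/1"))
    # Other agents fall back to the wildcard rules.
    print(can_fetch("SomeBrowser", "https://example.com/products/1"))
    print(can_fetch("SomeBrowser", "https://example.com/private/x"))
```

Note that robots.txt is advisory: it only stops crawlers that choose to check it, which is why several of the items on this page concern enforcement beyond it.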
from ZDNET
1 month ago

How web scraping actually works - and why AI changes everything

Web scraping powers the pricing, SEO, security, AI, and research industries.
Digital life
from Slate Magazine
1 month ago

We Are Witnessing the Death of the Internet As We Know It

Reddit is blocking Internet Archive from saving most of its new webpages to protect user data and comply with platform policies.
from The Verge
1 month ago

Cloudflare says Perplexity's AI bots are 'stealth crawling' blocked sites

Cloudflare claims that Perplexity conceals its crawling identity to circumvent website restrictions, raising concerns about unauthorized content scraping across various sites.
Privacy professionals