highplainsdem

(63,112 posts)
Mon Aug 4, 2025, 12:40 PM

Perplexity accused of scraping websites that explicitly blocked AI scraping

Source: TechCrunch

AI startup Perplexity is crawling and scraping content from websites that have explicitly indicated they don’t want to be scraped, according to internet infrastructure provider Cloudflare.

On Monday, Cloudflare published research saying it observed the AI startup ignore blocks and hide its crawling and scraping activities. The network infrastructure giant accused Perplexity of obscuring its identity when trying to scrape web pages “in an attempt to circumvent the website’s preferences,” Cloudflare’s researchers wrote.

-snip-

Perplexity appears to be willingly circumventing these blocks by changing its bots’ “user agent,” meaning a signal that identifies a website visitor by their device and version type; as well as changing their autonomous system networks, or ASN, essentially a number that identifies large networks on the internet, according to Cloudflare.

“This activity was observed across tens of thousands of domains and millions of requests per day. We were able to fingerprint this crawler using a combination of machine learning and network signals,” read Cloudflare’s post.

-snip-

Read more: https://techcrunch.com/2025/08/04/perplexity-accused-of-scraping-websites-that-explicitly-blocked-ai-scraping/



Cloudflare also said Perplexity has been using "a generic browser intended to impersonate Google Chrome on macOS."

Very crooked company. But then, I don't know of any generative AI company that isn't based on theft and deceit.

The AI bots are doing terrible damage to the internet. Including here at DU. As EarlG explained last week

https://www.democraticunderground.com/101316061

the downtime and update then were at least partly about the bot problem, especially AI scrapers.

Quanto Magnus

(1,379 posts)
1. Lie and steal
Mon Aug 4, 2025, 12:48 PM

this is how they all act....

Lie about the product
Steal material from others
Lie about the theft
Lie some more...

customerserviceguy

(25,406 posts)
2. It doesn't really make any difference
Mon Aug 4, 2025, 03:03 PM

what laws or regulations we make, some geeks are always going to try to get around them, any way possible.

Karasu

(2,081 posts)
3. While true, that is not an excuse to never regulate the AI industry. Until something is done, they see this as a blank
Mon Aug 4, 2025, 03:30 PM

check to do whatever the fuck they want, whenever they want, however they want.

It is beyond absurd that something this world-altering has gone completely unchecked for as long as it already has.

customerserviceguy

(25,406 posts)
4. Pass laws and regulations
Mon Aug 4, 2025, 03:32 PM

if it makes you feel better, but don't be under any illusion that you've solved a problem.

Karasu

(2,081 posts)
5. I agree it won't solve the problem. But I also think it's better than ACTIVELY enabling them through inaction, certainly
Mon Aug 4, 2025, 03:39 PM

in the case of the AI industry.
