
erronis

(22,272 posts)
2. As long as anybody can browse the web and collect data this risk is everywhere.
Wed Dec 3, 2025, 05:53 PM

It's not just the large search engines and AI consumers. Spiders/harvesters/whatevers have existed for decades and have retrieved content that should never have been exposed. I ran a few such engines myself to pull data from US government sites (open platforms), and I know that many organizations harvest far afield, ignoring niceties such as "robots.txt".

Yes, we could try to punish these "too big to fail" companies, but we know that won't happen. The cats are out of the bag, and after-the-fact punishment or old-fashioned legalese won't put them back.
