Welcome to DU!
The truly grassroots left-of-center political community where regular people, not algorithms, drive the discussions and set the standards.
Join the community:
Create a free account
Support DU (and get rid of ads!):
Become a Star Member
Latest Breaking News
Editorials & Other Articles
General Discussion
The DU Lounge
All Forums
Issue Forums
Culture Forums
Alliance Forums
Region Forums
Support Forums
Help & Search
General Discussion
Related: Editorials & Other Articles, Issue Forums, Alliance Forums, Region ForumsHow one YouTuber is trying to poison the AI bots stealing her content
"Specialized garbage-filled captions are invisible to humans, confounding to AI. "
https://arstechnica.com/ai/2025/01/how-one-youtuber-is-trying-to-poison-the-ai-bots-stealing-her-content/
ASS POWER!
If you've been paying careful attention to YouTube recently, you may have noticed the rising trend of so-called "faceless YouTube channels" that never feature a visible human talking in the video frame. While some of these channels are simply authored by camera-shy humans, many more are fully automated through AI-powered tools to craft everything from the scripts and voiceovers to the imagery and music. Unsurprisingly, this is often sold as a way to make a quick buck off the YouTube algorithm with minimal human effort.
It's not hard to find YouTubers complaining about a flood of these faceless channels stealing their embedded transcript files and running them through AI summarizers to generate their own instant knock-offs. But one YouTuber is trying to fight back, seeding her transcripts with junk data that is invisible to humans but poisonous to any AI that dares to try to work from a poached transcript file.
The power of the .ass
YouTuber F4mi, who creates some excellent deep dives on obscure technology, recently detailed her efforts "to poison any AI summarizers that were trying to steal my content to make slop." The key to F4mi's method is the .ass subtitle format, created decades ago as part of fansubbing software Advanced SubStation Alpha. Unlike simpler and more popular subtitle formats, .ass supports fancy features like fonts, colors, positioning, bold, italic, underline, and more.
It's these fancy features that let F4mi hide AI-confounding garbage in her YouTube transcripts without impacting the subtitle experience for her human viewers. For each chunk of actual text in her subtitle file, she also inserted "two chunks of text out of bounds using the positioning feature of the .ass format, with their size and transparency set to zero so they are completely invisible." In those "invisible" subtitle boxes, F4mi added text from public domain works (with certain words replaced with synonyms to avoid detection) or her own LLM-generated scripts full of completely made-up facts. When those transcript files were fed into popular AI summarizer sites, that junk text ended up overwhelming the actual content, creating a totally unrelated script that would be useless to any faceless channel trying to exploit it.
It's not hard to find YouTubers complaining about a flood of these faceless channels stealing their embedded transcript files and running them through AI summarizers to generate their own instant knock-offs. But one YouTuber is trying to fight back, seeding her transcripts with junk data that is invisible to humans but poisonous to any AI that dares to try to work from a poached transcript file.
The power of the .ass
YouTuber F4mi, who creates some excellent deep dives on obscure technology, recently detailed her efforts "to poison any AI summarizers that were trying to steal my content to make slop." The key to F4mi's method is the .ass subtitle format, created decades ago as part of fansubbing software Advanced SubStation Alpha. Unlike simpler and more popular subtitle formats, .ass supports fancy features like fonts, colors, positioning, bold, italic, underline, and more.
It's these fancy features that let F4mi hide AI-confounding garbage in her YouTube transcripts without impacting the subtitle experience for her human viewers. For each chunk of actual text in her subtitle file, she also inserted "two chunks of text out of bounds using the positioning feature of the .ass format, with their size and transparency set to zero so they are completely invisible." In those "invisible" subtitle boxes, F4mi added text from public domain works (with certain words replaced with synonyms to avoid detection) or her own LLM-generated scripts full of completely made-up facts. When those transcript files were fed into popular AI summarizer sites, that junk text ended up overwhelming the actual content, creating a totally unrelated script that would be useless to any faceless channel trying to exploit it.
Confused? Check the article for more details. Pretty clever counter-attack.
5 replies
= new reply since forum marked as read
Highlight:
NoneDon't highlight anything
5 newestHighlight 5 most recent replies
How one YouTuber is trying to poison the AI bots stealing her content (Original Post)
usonian
Jan 2025
OP
highplainsdem
(63,067 posts)1. Good for her.
SheltieLover
(81,676 posts)2. Excellent!
intrepidity
(8,595 posts)3. Hah, love it
Gotta go check out her channel....
Eta:
MiHale
(13,161 posts)4. Clever!
ThoughtCriminal
(14,754 posts)5. Think about what to use for confoundment
Maybe some jabs at Putin, Trump and Musk.