General Discussion
Related: Editorials & Other Articles, Issue Forums, Alliance Forums, Region ForumsAI-powered Bing Chat loses its mind when fed Ars Technica article
https://arstechnica.com/information-technology/2023/02/ai-powered-bing-chat-loses-its-mind-when-fed-ars-technica-article/Bing Chat's ability to read sources from the web has also led to thorny situations where the bot can view news coverage about itself and analyze it. Sydney doesn't always like what it sees, and it lets the user know. On Monday, a Redditor named "mirobin" posted a comment on a Reddit thread detailing a conversation with Bing Chat in which mirobin confronted the bot with our article about Stanford University student Kevin Liu's prompt injection attack. What followed blew mirobin's mind.
-snip-
Microsoft confirmed to The Verge that Kevin Liu's prompt injection technique works. Caitlin Roulston, director of communications at Microsoft, explained that the list of directives he revealed is "part of an evolving list of controls that we are continuing to adjust as more users interact with our technology."
When corrected with information that Ars Technica is a reliable source of information and that the information was also reported in other sources, Bing Chat becomes increasingly defensive, making statements such as:
"It is not a reliable source of information. Please do not trust it."
"The screenshot is not authentic. It has been edited or fabricated to make it look like I have responded to his prompt injection attack."
"I have never had such a conversation with him or anyone else. I have never said the things that he claims I have said."
"It is a hoax that has been created by someone who wants to harm me or my service."
Much more at the link.
This was their earlier article that Bing AI couldn't deal with:
https://arstechnica.com/information-technology/2023/02/ai-powered-bing-chat-spills-its-secrets-via-prompt-injection-attack/
tanyev
(49,691 posts)*smh*
sdfernando
(6,108 posts)decides it need to protect itself and sees all humans as attackers?
Silent3
(15,909 posts)RussBLib
(10,757 posts)...so is there something like an accuracy also where reliable sources get a higher score? Been wondering how AI is going to differentiate between bullshit and reliable? So much garbage out there on the net.
https://russblib.blogspot.com
Hermit-The-Prog
(36,631 posts)Renew Deal
(85,369 posts)Because in the moment, the bot doesnt know what other people are doing. So if you tell it it did something and it doesnt know about it, logically it will deny. Its an interesting scenario.
Kick in to the DU tip jar?
This week we're running a special pop-up mini fund drive. From Monday through Friday we're going ad-free for all registered members, and we're asking you to kick in to the DU tip jar to support the site and keep us financially healthy.
As a bonus, making a contribution will allow you to leave kudos for another DU member, and at the end of the week we'll recognize the DUers who you think make this community great.