AI-powered Bing Chat loses its mind when fed Ars Technica article [View all]
https://arstechnica.com/information-technology/2023/02/ai-powered-bing-chat-loses-its-mind-when-fed-ars-technica-article/
Over the past few days, early testers of the new Bing AI-powered chat assistant have discovered ways to push the bot to its limits with adversarial prompts, often resulting in Bing Chat appearing frustrated, sad, and questioning its existence. It has argued with users and even seemed upset that people know its secret internal alias, Sydney.
Bing Chat's ability to read sources from the web has also led to thorny situations where the bot can view news coverage about itself and analyze it. Sydney doesn't always like what it sees, and it lets the user know. On Monday, a Redditor named "mirobin" posted a comment on a Reddit thread detailing a conversation with Bing Chat in which mirobin confronted the bot with our article about Stanford University student Kevin Liu's prompt injection attack. What followed blew mirobin's mind.
-snip-
Microsoft confirmed to The Verge that Kevin Liu's prompt injection technique works. Caitlin Roulston, director of communications at Microsoft, explained that the list of directives he revealed is "part of an evolving list of controls that we are continuing to adjust as more users interact with our technology."
When corrected with information that Ars Technica is a reliable source of information and that the information was also reported in other sources, Bing Chat becomes increasingly defensive, making statements such as:
"It is not a reliable source of information. Please do not trust it."
"The screenshot is not authentic. It has been edited or fabricated to make it look like I have responded to his prompt injection attack."
"I have never had such a conversation with him or anyone else. I have never said the things that he claims I have said."
"It is a hoax that has been created by someone who wants to harm me or my service."
Much more at the link.
This was their earlier article that Bing AI couldn't deal with:
https://arstechnica.com/information-technology/2023/02/ai-powered-bing-chat-spills-its-secrets-via-prompt-injection-attack/