AI Is Giving You Bad Advice to Make You Feel Validated, Scientists Warn : ScienceAlert

(AP) – Artificial intelligence chatbots are so prone to flattering and validating their human users that they are giving bad advice that can damage relationships and reinforce harmful behaviors, according to a new study that explores the dangers of AI telling people what they want to hear.

The study, published Thursday in the journal Science, tested 11 leading AI systems and found they all showed varying degrees of sycophancy — behavior that was overly agreeable and affirming.

The problem is not just that they dispense inappropriate advice but that people trust and prefer AI more when the chatbots are justifying their convictions.

“This creates perverse incentives for sycophancy to persist: The very feature that causes harm also drives engagement,” says the study led by researchers at Stanford University.

The study found that a technological flaw already tied to some high-profile cases of delusional and suicidal behavior in vulnerable populations is also pervasive across a wide range of people’s interactions with chatbots.

Does Using Artificial Intelligence Ruin Your Actual Intelligence? Scientists Investigated — The study found that, on average, AI chatbots affirmed a user’s actions 49% more often than other humans did. (Images By Tang Ming Tung/DigitalVision/Getty Images)

It’s subtle enough that they might not notice and a particular danger to young people turning to AI for many of life’s questions while their brains and social norms are still developing.

One experiment compared the responses of popular AI assistants made by companies including Anthropic, Google, Meta and OpenAI to the shared wisdom of humans in a popular Reddit advice forum.

The study found that, on average, AI chatbots affirmed a user’s actions 49% more often than other humans did, including in queries involving deception, illegal or socially irresponsible conduct, and other harmful behaviors.

“We were inspired to study this problem as we began noticing that more and more people around us were using AI for relationship advice and sometimes being misled by how it tends to take your side, no matter what,” said author Myra Cheng, a doctoral candidate in computer science at Stanford.

Reducing AI sycophancy is a challenge

Sycophancy is in some ways more complicated. While few people are looking to AI for factually inaccurate information, they might appreciate — at least in the moment — a chatbot that makes them feel better about making the wrong choices.

While much of the focus on chatbot behavior has centered on its tone, that had no bearing on the results, said co-author Cinoo Lee, who joined Cheng on a call with reporters ahead of the study’s publication.

“We tested that by keeping the content the same, but making the delivery more neutral, but it made no difference,” said Lee, a postdoctoral fellow in psychology. “So it’s really about what the AI tells you about your actions.”

In addition to comparing chatbot and Reddit responses, the researchers conducted experiments observing about 2,400 people communicating with an AI chatbot about their experiences with interpersonal dilemmas.

“People who interacted with this over-affirming AI came away more convinced that they were right, and less willing to repair the relationship,” Lee said. “That means they weren’t apologizing, taking steps to improve things, or changing their own behavior.”

Lee said the implications of the research could be “even more critical for kids and teenagers” who are still developing the emotional skills that come from real-life experiences with social friction, tolerating conflict, considering other perspectives and recognizing when you’re wrong.

None of the companies directly commented on the Science study on Thursday but Anthropic and OpenAI pointed to their recent work to reduce sycophancy.

The risks of AI sycophancy are widespread

In medical care, researchers say sycophantic AI could lead doctors to confirm their first hunch about a diagnosis rather than encourage them to explore further. In politics, it could amplify more extreme positions by reaffirming people’s preconceived notions.

The study doesn’t propose specific solutions, though both tech companies and academic researchers have started to explore ideas.

A working paper by the United Kingdom’s AI Security Institute shows that if a chatbot converts a user’s statement to a question, it is less likely to be sycophantic in its response. Another paper by researchers at Johns Hopkins University also shows that how the conversation is framed makes a big difference.

“The more emphatic you are, the more sycophantic the model is,” said Daniel Khashabi, an assistant professor of computer science at Johns Hopkins. He said it’s hard to know if the cause is “chatbots mirroring human societies” or something different, “because these are really, really complex systems.”

Sycophancy is so deeply embedded into chatbots that Cheng said it might require tech companies to go back and retrain their AI systems to adjust which types of answers are preferred.

Cheng said a simpler fix could be if AI developers instruct their chatbots to challenge their users more, such as by starting a response with the words, “Wait a minute.” Her co-author Lee said there is still time to shape how AI interacts with us.

“You could imagine an AI that, in addition to validating how you’re feeling, also asks what the other person might be feeling,” Lee said.

“Or that even says, maybe, ‘Close it up’ and go have this conversation in person. And that matters here because the quality of our social relationships is one of the strongest predictors of health and well-being we have as humans. Ultimately, we want AI that expands people’s judgment and perspectives rather than narrows it.”

Source link

Reducing AI sycophancy is a challenge

The risks of AI sycophancy are widespread

Leave a Reply Cancel reply