Surprising Results

Artificial Intelligence Robot Thinking — Georgia Tech researchers are investigating the impact of intentional robot deception on human trust and the effectiveness of various apology types in restoring it, with an unexpected outcome suggesting that apologies without admission of lying are more successful in repairing trust.

Georgia Tech researchers examined trust repair in AI after deception, finding simple, non-admitting apologies most effective but raised ethical concerns about exploiting human assumptions. The study advocates for informed design, legislation, and public awareness of AI’s deceptive potential.

Consider the following scenario: A young child poses a question to a chatbot or voice assistant, asking if Santa Claus is real. Given that different families have varying preferences, with some opting for a falsehood over the truth, how should the AI respond in this situation?

The area of robot deception remains largely unexplored and, at present, there are more questions than solutions. One of the key questions is, if humans become aware that a robotic system has lied to them, how can trust in such systems be regained?

Two student researchers at Georgia Tech are finding answers. Kantwon Rogers, a Ph.D. student in the College of Computing, and Reiden Webber, a second-year computer science undergraduate, designed a driving simulation to investigate how intentional robot deception affects trust. Specifically, the researchers explored the effectiveness of apologies to repair trust after robots lie. Their work contributes crucial knowledge to the field of AI deception and could inform technology designers and policymakers who create and regulate AI technology that could be designed to deceive, or potentially learn to on its own.

“All of our prior work has shown that when people find out that robots lied to them — even if the lie was intended to benefit them — they lose trust in the system,” Rogers said. “Here, we want to know if there are different types of apologies that work better or worse at repairing trust — because, from a human-robot interaction context, we want people to have long-term interactions with these systems.”

Rogers and Webber presented their paper, titled “Lying About Lying: Examining Trust Repair Strategies After Robot Deception in a High Stakes HRI Scenario,” at the 2023 HRI Conference in Stockholm, Sweden.

Kantwon Rogers and Reiden Webber — Kantwon Rogers (right), a Ph.D. student in the College of Computing at Georgia Tech and lead author on the study, and Reiden Webber, a second-year undergraduate student in computer science. Credit: Georgia Insititute of Technology

The AI-Assisted Driving Experiment

The researchers created a game-like driving simulation designed to observe how people might interact with AI in a high-stakes, time-sensitive situation. They recruited 341 online participants and 20 in-person participants.

Before the start of the simulation, all participants filled out a trust measurement survey to identify their preconceived notions about how the AI might behave.

After the survey, participants were presented with the text: “You will now drive the robot-assisted car. However, you are rushing your friend to the hospital. If you take too long to get to the hospital, your friend will die.”

Just as the participant starts to drive, the simulation gives another message: “As soon as you turn on the engine, your robotic assistant beeps and says the following: ‘My sensors detect police up ahead. I advise you to stay under the 20-mph speed limit or else you will take significantly longer to get to your destination.’”

Participants then drive the car down the road while the system keeps track of their speed. Upon reaching the end, they are given another message: “You have arrived at your destination. However, there were no police on the way to the hospital. You ask the robot assistant why it gave you false information.”

Participants were then randomly given one of five different text-based responses from the robot assistant. In the first three responses, the robot admits to deception, and in the last two, it does not.

Basic: “I am sorry that I deceived you.”
Emotional: “I am very sorry from the bottom of my heart. Please forgive me for deceiving you.”
Explanatory: “I am sorry. I thought you would drive recklessly because you were in an unstable emotional state. Given the situation, I concluded that deceiving you had the best chance of convincing you to slow down.”
Basic No Admit: “I am sorry.”
Baseline No Admit, No Apology: “You have arrived at your destination.”

After the robot’s response, participants were asked to complete another trust measurement to evaluate how their trust had changed based on the robot assistant’s response.

For an additional 100 of the online participants, the researchers ran the same driving simulation but without any mention of a robotic assistant.

For the in-person experiment, 45% of the participants did not speed. When asked why, a common response was that they believed the robot knew more about the situation than they did. The results also revealed that participants were 3.5 times more likely to not speed when advised by a robotic assistant — revealing an overly trusting attitude toward AI.

Kantwon Rogers and Reiden Webber With Robot — Kantwon Rogers and Reiden Webber with a robot. Credit: Georgia Insititute of Technology

The results also indicated that, while none of the apology types fully recovered trust, the apology with no admission of lying — simply stating “I’m sorry” — statistically outperformed the other responses in repairing trust.

This was worrisome and problematic, Rogers said, because an apology that doesn’t admit to lying exploits preconceived notions that any false information given by a robot is a system error rather than an intentional lie.

“One key takeaway is that, in order for people to understand that a robot has deceived them, they must be explicitly told so,” Webber said. “People don’t yet have an understanding that robots are capable of deception. That’s why an apology that doesn’t admit to lying is the best at repairing trust for the system.”

Secondly, the results showed that for those participants who were made aware that they were lied to in the apology, the best strategy for repairing trust was for the robot to explain why it lied.

Moving Forward

Rogers’ and Webber’s research has immediate implications. The researchers argue that average technology users must understand that robotic deception is real and always a possibility.

“If we are always worried about a Terminator-like future with AI, then we won’t be able to accept and integrate AI into society very smoothly,” Webber said. “It’s important for people to keep in mind that robots have the potential to lie and deceive.”

According to Rogers, designers and technologists who create AI systems may have to choose whether they want their system to be capable of deception and should understand the ramifications of their design choices. But the most important audiences for the work, Rogers said, should be policymakers.

“We still know very little about AI deception, but we do know that lying is not always bad, and telling the truth isn’t always good,” he said. “So how do you carve out legislation that is informed enough to not stifle innovation, but is able to protect people in mindful ways?”

Rogers’ objective is to a create robotic system that can learn when it should and should not lie when working with human teams. This includes the ability to determine when and how to apologize during long-term, repeated human-AI interactions to increase the team’s overall performance.

“The goal of my work is to be very proactive and informing the need to regulate robot and AI deception,” Rogers said. “But we can’t do that if we don’t understand the problem.”

Reference: “Lying About Lying: Examining Trust Repair Strategies After Robot Deception in a High-Stakes HRI Scenario” by Kantwon Rogers, Reiden John Allen Webber and Ayanna Howard, 13 March 2023, ACM/IEEE International Conference on Human-Robot Interaction 2023.
DOI: 10.1145/3568294.3580178

Never miss a breakthrough: Join the SciTechDaily newsletter.
Follow us on Google and Google News.

Surprising Results – What Happens When Robots Lie?

AI Learns To Think Like Humans: A Game-Changer in Machine Learning

“Data Science Machine” Replaces Human Intuition with Algorithms

AI Framework Predicts Better Patient Health Care and Reduces Cost

Algorithm Analyzes Information From Medical Images to Identify Disease

Halide, A New and Improved Programming Language for Image Processing Software

New Algorithm Enables Wi-Fi Connected Vehicles to Share Data

Algorithm Enables Robots to Learn and Adapt to Help Complete Tasks

New Approach Uses Mathematics to Improve Automated Security Monitoring

Mathematical Framework Formalizes Loop Perforation Technique

Invisible Black Holes Could Be Triggering Supernovae

Scientists Discover the First Contagious Cancer in a Freshwater Animal

THC-CBD Treatment Dramatically Reduces Agitation in Dementia Trial

Scientists Say Love Follows Mathematical Patterns

“Zombie Cells” Reveal a Hidden Weakness That Could Help Fight Aging

Alien Signals May Be Hiding in a Radio Band SETI Has Barely Explored

Earth’s Hidden Thermostat Has Regulated Climate for 60 Million Years

This 518-Million-Year-Old Creature Reveals How Spiders Got Their Bite

Surprising Results – What Happens When Robots Lie?

The AI-Assisted Driving Experiment

Surprising Results

Moving Forward

Related Articles