AI vs MD: ChatGPT Outperforms Physicians in Providing High-Quality, Empathetic Healthcare Advice

Robot AI Chatbot Concept — A study in *JAMA Internal Medicine* suggests that AI assistants like ChatGPT could significantly improve healthcare. Using real-world health questions, the study found that healthcare professionals preferred AI responses to those of physicians 79% of the time, citing higher quality and empathy. While not a replacement for doctors, AI could be integrated into health systems to enhance patient care, potentially reducing physician burnout and improving overall healthcare delivery.

While AI won’t replace your doctor, a new JAMA Internal Medicine paper suggests physicians working together with technologies like ChatGPT may revolutionize medicine.

There has been widespread speculation about how advances in artificial intelligence (AI) assistants like ChatGPT could be used in medicine.

A new study published today (April 28, 2023) in JAMA Internal Medicine led by Dr. John W. Ayers from the Qualcomm Institute within the University of California, San Diego (UCSD) provides an early glimpse into the role that AI assistants could play in medicine. The study compared written responses from physicians and those from ChatGPT to real-world health questions. A panel of licensed healthcare professionals preferred ChatGPT’s responses 79% of the time and rated ChatGPT’s responses as higher quality and more empathetic.

“The opportunities for improving healthcare with AI are massive,” said Ayers, who is also vice chief of innovation in the UCSD School of Medicine Division of Infectious Disease and Global Public Health. “AI-augmented care is the future of medicine.”

Is ChatGPT Ready for Healthcare?

In the new study, the research team set out to answer the question: Can ChatGPT respond accurately to questions patients send to their doctors? If yes, AI models could be integrated into health systems to improve physician responses to questions sent by patients and ease the ever-increasing burden on physicians.

“ChatGPT might be able to pass a medical licensing exam,” said study co-author Dr. Davey Smith, a physician-scientist, co-director of the UCSD Altman Clinical and Translational Research Institute and professor at the UCSD School of Medicine, “but directly answering patient questions accurately and empathetically is a different ballgame.”

“The COVID-19 pandemic accelerated virtual healthcare adoption,” added study co-author Dr. Eric Leas, a Qualcomm Institute affiliate and assistant professor in the UCSD Herbert Wertheim School of Public Health and Human Longevity Science. “While this made accessing care easier for patients, physicians are burdened by a barrage of electronic patient messages seeking medical advice that have contributed to record-breaking levels of physician burnout.”

Designing a Study to Test ChatGPT in a Healthcare Setting

To obtain a large and diverse sample of healthcare questions and physician answers that did not contain identifiable personal information, the team turned to social media where millions of patients publicly post medical questions to which doctors respond: Reddit’s AskDocs.

r/AskDocs is a subreddit with approximately 452,000 members who post medical questions and verified healthcare professionals submit answers. While anyone can respond to a question, moderators verify healthcare professionals’ credentials and responses display the respondent’s level of credentials. The result is a large and diverse set of patient medical questions and accompanying answers from licensed medical professionals.

While some may wonder if question-answer exchanges on social media are a fair test, team members noted that the exchanges were reflective of their clinical experience.

The team randomly sampled 195 exchanges from AskDocs where a verified physician responded to a public question. The team provided the original question to ChatGPT and asked it to author a response. A panel of three licensed healthcare professionals assessed each question and the corresponding responses and were blinded to whether the response originated from a physician or ChatGPT. They compared responses based on information quality and empathy, noting which one they preferred.

The panel of healthcare professional evaluators preferred ChatGPT responses to physician responses 79% of the time.

“ChatGPT messages responded with nuanced and accurate information that often addressed more aspects of the patient’s questions than physician responses,” said Jessica Kelley, a nurse practitioner with San Diego firm Human Longevity and study co-author.

Additionally, ChatGPT responses were rated significantly higher in quality than physician responses: good or very good quality responses were 3.6 times higher for ChatGPT than physicians (physicians 22.1% versus ChatGPT 78.5%). The responses were also more empathic: empathetic or very empathetic responses were 9.8 times higher for ChatGPT than for physicians (physicians 4.6% versus ChatGPT 45.1%).

“I never imagined saying this,” added Dr. Aaron Goodman, an associate clinical professor at UCSD School of Medicine and study coauthor, “but ChatGPT is a prescription I’d like to give to my inbox. The tool will transform the way I support my patients.”

Harnessing AI Assistants for Patient Messages

“While our study pitted ChatGPT against physicians, the ultimate solution isn’t throwing your doctor out altogether,” said Dr. Adam Poliak, an assistant professor of Computer Science at Bryn Mawr College and study co-author. “Instead, a physician harnessing ChatGPT is the answer for better and empathetic care.”

“Our study is among the first to show how AI assistants can potentially solve real-world healthcare delivery problems,” said Dr. Christopher Longhurst, Chief Medical Officer and Chief Digital Officer at UC San Diego Health. “These results suggest that tools like ChatGPT can efficiently draft high quality, personalized medical advice for review by clinicians, and we are beginning that process at UCSD Health.”

Dr. Mike Hogarth, a physician-bioinformatician, co-director of the Altman Clinical and Translational Research Institute at UCSD, professor in the UC San Diego School of Medicine and study co-author, added, “It is important that integrating AI assistants into healthcare messaging be done in the context of a randomized controlled trial to judge how the use of AI assistants impact outcomes for both physicians and patients.”

In addition to improving workflow, investments into AI assistant messaging could impact patient health and physician performance.

Dr. Mark Dredze, the John C Malone Associate Professor of Computer Science at Johns Hopkins and study co-author, noted: “We could use these technologies to train doctors in patient-centered communication, eliminate health disparities suffered by minority populations who often seek healthcare via messaging, build new medical safety systems, and assist doctors by delivering higher quality and more efficient care.”

Reference: “Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum” by John W. Ayers, PhD, MA; Adam Poliak, PhD; Mark Dredze, PhD; Eric C. Leas, PhD, MPH; Zechariah Zhu, BS; Jessica B. Kelley, MSN; Dennis J. Faix, MD; Aaron M. Goodman, MD; Christopher A. Longhurst, MD, MS; Michael Hogarth, MD; Davey M. Smith, MD, MAS, 28 April 2023, JAMA Internal Medicine.
DOI: 10.1001/jamainternmed.2023.1838

In addition to Ayers, Poliak, Dredze, Leas, Kelley, Goodman, Longhurst, Hogarth and Smith, authors of the JAMA Internal Medicine paper, “Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum,” are Zechariah Zhu of UCSD and Dr. Dennis J. Faix of the Naval Health Research Center.

Never miss a breakthrough: Join the SciTechDaily newsletter.
Follow us on Google and Google News.

2 Comments

Earlier the Better on April 28, 2023 2:35 pm
In Healthcare and then in Arts?
Can AI create an acceptable song in English along with Music and then Same Song in an Asian Language along with Music acceptable to them also? That is a song after song OR A line of Song followed by A line of Song in the next language? All languages of the world can be mixed up and it will be amazing to one and all, only if both parties love the song creations !
Earlier the Better on April 28, 2023 2:36 pm
Can AI create an acceptable song in English along with Music and then Same Song in an Asian Language along with Music acceptable to them also? That is a song after song OR A line of Song followed by A line of Song in the next language? All languages of the world can be mixed up and it will be amazing to one and all, only if both parties love the song creations !

AI vs MD: ChatGPT Outperforms Physicians in Providing High-Quality, Empathetic Healthcare Advice

New Tool Detects ChatGPT-Generated Academic Text With 99% Accuracy

Cancer and AI – Can ChatGPT Be Trusted?

Humans Reign Supreme: ChatGPT Falls Short on Accounting Exams

New Study: ChatGPT Can Influence Users’ Moral Judgments

ChatGPT Generative AI: USC Experts With Key Information You Should Know

The Rise of Artificial Intelligence: ChatGPT’s Stunning Results on the US Medical Licensing Exam

Highly-Efficient New Neuromorphic Chip for AI on the Edge

Futuristic AI-Based Computing Devices: Physicists Simulate Artificial Brain Networks With New Quantum Materials

New Artificial Neuron Device Runs Neural Network Computations Using 100 to 1000 Times Less Energy

2 Comments

Hair Loss May Be an Overlooked Side Effect of Ozempic and Mounjaro

Scientists Develop a Groundbreaking Chip That Operates at Brain-Like Speed

Viagra May Block a Hidden Pathway Cancer Cells Need To Spread

How Brown Tree Snakes Became an Unstoppable Invasive Species

Earth’s First Complex Organisms May Hold Clues to Life Beyond Our Planet

A 25,000-Year-Old Ritual Tradition Was Hidden Beneath This Cave Floor

Plant Compounds in Berries, Tea, and Cocoa May Support Healthy Brain Aging

Scientists Turn an Old Antibiotic Back Into a Superbug Killer

AI vs MD: ChatGPT Outperforms Physicians in Providing High-Quality, Empathetic Healthcare Advice

Related Articles

2 Comments