Close Menu
    Facebook X (Twitter) Instagram
    SciTechDaily
    • Biology
    • Chemistry
    • Earth
    • Health
    • Physics
    • Science
    • Space
    • Technology
    Facebook X (Twitter) Pinterest YouTube RSS
    SciTechDaily
    Home»Health»The Future of Medicine? ChatGPT Shows “Impressive” Accuracy in Clinical Decision Making
    Health

    The Future of Medicine? ChatGPT Shows “Impressive” Accuracy in Clinical Decision Making

    By Mass General BrighamAugust 24, 2023No Comments5 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn WhatsApp Email Reddit
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email Reddit
    Artificial Intelligence Laptop Data
    A recent study found that ChatGPT demonstrated a 72% accuracy in clinical decision-making across all medical specialties, with performance akin to a medical school graduate. The research suggests the potential of LLMs in augmenting medical practices but emphasizes the need for more research before clinical integration.

    Researchers from Mass General Brigham determined that ChatGPT achieved an accuracy rate of almost 72% across all medical specialties and phases of clinical care, and 77 percent accuracy in making final diagnoses.

    Researchers from Mass General Brigham have conducted a study which reveals that ChatGPT demonstrated an accuracy rate of approximately 72% in overall clinical decision-making processes, ranging from suggesting potential diagnoses to finalizing diagnoses and determining care management strategies. This expansive language model-based AI chatbot exhibited consistent performance in both primary care and emergency medical environments across diverse medical fields. The findings were recently published in the Journal of Medical Internet Research.

    “Our paper comprehensively assesses decision support via ChatGPT from the very beginning of working with a patient through the entire care scenario, from differential diagnosis all the way through testing, diagnosis, and management,” said corresponding author Marc Succi, MD, associate chair of innovation and commercialization and strategic innovation leader at Mass General Brigham and executive director of the MESH Incubator.

    Comparing ChatGPT to Medical Professionals

    “No real benchmarks exist, but we estimate this performance to be at the level of someone who has just graduated from medical school, such as an intern or resident. This tells us that LLMs, in general, have the potential to be an augmenting tool for the practice of medicine and support clinical decision-making with impressive accuracy.”

    Changes in artificial intelligence technology are occurring at a fast pace and transforming many industries, including health care. However the capacity of LLMs to assist in the full scope of clinical care has not yet been studied. In this comprehensive, cross-specialty study of how LLMs could be used in clinical advisement and decision-making, Succi and his team tested the hypothesis that ChatGPT would be able to work through an entire clinical encounter with a patient and recommend a diagnostic workup, decide the clinical management course, and ultimately make the final diagnosis.

    The study was done by pasting successive portions of 36 standardized, published clinical vignettes into ChatGPT. The tool first was asked to come up with a set of possible, or differential, diagnoses based on the patient’s initial information, which included age, gender, symptoms, and whether the case was an emergency. ChatGPT was then given additional pieces of information and asked to make management decisions as well as give a final diagnosis—simulating the entire process of seeing a real patient. The team compared ChatGPT’s accuracy on differential diagnosis, diagnostic testing, final diagnosis, and management in a structured blinded process, awarding points for correct answers and using linear regressions to assess the relationship between ChatGPT’s performance and the vignette’s demographic information.

    The researchers found that overall, ChatGPT was about 72 percent accurate and that it was best in making a final diagnosis, where it was 77 percent accurate. It was lowest-performing in making differential diagnoses, where it was only 60 percent accurate. And it was only 68 percent accurate in clinical management decisions, such as figuring out what medications to treat the patient with after arriving at the correct diagnosis. Other notable findings from the study included that ChatGPT’s answers did not show gender bias and that its overall performance was steady across both primary and emergency care.

    The Challenge of Differential Diagnosis

    “ChatGPT struggled with differential diagnosis, which is the meat and potatoes of medicine when a physician has to figure out what to do,” said Succi. “That is important because it tells us where physicians are truly experts and adding the most value—in the early stages of patient care with little presenting information, when a list of possible diagnoses is needed.”

    The authors note that before tools like ChatGPT can be considered for integration into clinical care, more benchmark research and regulatory guidance is needed. Next, Succi’s team is looking at whether AI tools can improve patient care and outcomes in hospitals’ resource-constrained areas.

    AI’s Potential to Improve Healthcare

    The emergence of artificial intelligence tools in health has been groundbreaking and has the potential to positively reshape the continuum of care. Mass General Brigham, as one of the nation’s top integrated academic health systems and largest innovation enterprises, is leading the way in conducting rigorous research on new and emerging technologies to inform the responsible incorporation of AI into care delivery, workforce support, and administrative processes.

    “Mass General Brigham sees great promise for LLMs to help improve care delivery and clinician experience,” said co-author Adam Landman, MD, MS, MIS, MHS, chief information officer and senior vice president of digital at Mass General Brigham. “We are currently evaluating LLM solutions that assist with clinical documentation and draft responses to patient messages with a focus on understanding their accuracy, reliability, safety, and equity. Rigorous studies like this one are needed before we integrate LLM tools into clinical care.”

    Reference: “Assessing the Utility of ChatGPT Throughout the Entire Clinical Workflow: Development and Usability Study” by Arya Rao, Michael Pang, John Kim, Meghana Kamineni, Winston Lie, Anoop K Prasad, Adam Landman, Keith Dreyer and Marc D Succi, 22 August 2023, Journal of Medical Internet Research.
    DOI: 10.2196/48659

    The study was funded by the National Institute of General Medical Sciences.

    Never miss a breakthrough: Join the SciTechDaily newsletter.
    Follow us on Google and Google News.

    Artificial Intelligence ChatGPT Machine Learning Mass General Brigham Medicine
    Share. Facebook Twitter Pinterest LinkedIn Email Reddit

    Related Articles

    AI Outperforms Humans in Creating Cancer Treatments – But Do Doctors Trust It?

    AI Uses Timing and Weather Data to Accurately Predict Cardiac Arrest Risk

    300+ COVID-19 Machine Learning Models Have Been Developed – None Is Suitable for Detecting or Diagnosing

    Artificial Intelligence System Calculates Suicide Attempt Risk – Here’s How It Performed

    MIT Develops Machine-Learning Approach to Finding New Treatment Options for COVID-19

    MIT Mirai: Robust Artificial Intelligence Tools To Predict Future Cancer

    Artificial Intelligence Predicts Drug Combinations That Kill Cancer Cells More Effectively

    How Computer Science and AI Can Help Fight COVID-19 — “We Have the Potential to Alter the Course of This Global Pandemic”

    Artificial Intelligence Uses “Self-Learning” to Make Cancer Treatment Less Toxic

    Leave A Reply Cancel Reply

    • Facebook
    • Twitter
    • Pinterest
    • YouTube

    Don't Miss a Discovery

    Subscribe for the Latest in Science & Tech!

    Trending News

    Monster Storms on Jupiter Unleash Lightning Beyond Anything on Earth

    Scientists Create “Liquid Gears” That Spin Without Touching

    The Simple Habit That Could Help Prevent Cancer

    Millions Take These IBS Drugs, But a New Study Finds Serious Risks

    Scientists Unlock Hidden Secrets of 2,300-Year-Old Mummies Using Cutting-Edge CT Scanner

    Bread Might Be Making You Gain Weight Even Without Eating More Calories

    Scientists Discover Massive Magma Reservoir Beneath Tuscany

    Europe’s Most Active Volcano Just Got Stranger – Here’s Why Scientists Are Rethinking It

    Follow SciTechDaily
    • Facebook
    • Twitter
    • YouTube
    • Pinterest
    • Newsletter
    • RSS
    SciTech News
    • Biology News
    • Chemistry News
    • Earth News
    • Health News
    • Physics News
    • Science News
    • Space News
    • Technology News
    Recent Posts
    • Scientists Were Wrong About This Strange “Rule-Breaking” Particle
    • Webb Space Telescope Uncovers Unexpected Ice Clouds on a Jupiter-Like World
    • 289-Million-Year-Old Reptile Mummy Reveals Origin of Human Breathing System
    • New Brain Discovery Challenges Long-Held Theory of Teenage Brain Development
    • Sharks Are Overheating as Warming Oceans Push Them to the Edge
    Copyright © 1998 - 2026 SciTechDaily. All Rights Reserved.
    • Science News
    • About
    • Contact
    • Editorial Board
    • Privacy Policy
    • Terms of Use

    Type above and press Enter to search. Press Esc to cancel.