The Rise of Artificial Intelligence: ChatGPT’s Stunning Results on the US Medical Licensing Exam

AI Technology Concept Robot — According to a recent study published in the open-access journal PLOS Digital Health, ChatGPT has demonstrated its ability to perform at or around the passing threshold of 60% on the United States Medical Licensing Exam (USMLE). The study, conducted by Tiffany Kung, Victor Tseng, and colleagues at AnsibleHealth, found that ChatGPT’s responses are coherent, make internal sense, and frequently contain insightful information. The results of this study suggest that ChatGPT has the potential to make a significant impact in the field of medicine and healthcare.

The AI software was able to achieve passing scores for the exam, which usually requires years of medical training.

OpenAI’s ChatGPT can score at or around the approximately 60 percent passing threshold for the United States Medical Licensing Exam (USMLE), with responses that make coherent, internal sense and contain frequent insights. This is according to a study by Tiffany Kung, Victor Tseng, and colleagues at AnsibleHealth, which was published on February 9, 2023, in the open-access journal PLOS Digital Health.

ChatGPT is a new artificial intelligence (AI) system, known as a large language model (LLM), designed to generate human-like writing by predicting upcoming word sequences. Unlike most chatbots, ChatGPT cannot search the internet. Instead, it generates text using word relationships predicted by its internal processes.

Testing ChatGPT on the USMLE

Kung and colleagues tested ChatGPT’s performance on the USMLE, a highly standardized and regulated series of three exams (Steps 1, 2CK, and 3) required for medical licensure in the United States. Taken by medical students and physicians-in-training, the USMLE assesses knowledge spanning most medical disciplines, ranging from biochemistry, to diagnostic reasoning, to bioethics.

After screening to remove image-based questions, the authors tested the software on 350 of the 376 public questions available from the June 2022 USMLE release.

After indeterminate responses were removed, ChatGPT scored between 52.4% and 75.0% across the three USMLE exams. The passing threshold each year is approximately 60%. ChatGPT also demonstrated 94.6% concordance across all its responses and produced at least one significant insight (something that was new, non-obvious, and clinically valid) for 88.9% of its responses. Notably, ChatGPT exceeded the performance of PubMedGPT, a counterpart model trained exclusively on biomedical domain literature, which scored 50.8% on an older dataset of USMLE-style questions.

Potential Applications in Medicine and Research

While the relatively small input size restricted the depth and range of analyses, the authors note their findings provide a glimpse of ChatGPT’s potential to enhance medical education, and eventually, clinical practice. For example, they add, clinicians at AnsibleHealth already use ChatGPT to rewrite jargon-heavy reports for easier patient comprehension.

“Reaching the passing score for this notoriously difficult expert exam, and doing so without any human reinforcement, marks a notable milestone in clinical AI maturation,” say the authors.

Author Dr. Tiffany Kung added that ChatGPT’s role in this research went beyond being the study subject: “ChatGPT contributed substantially to the writing of [our] manuscript… We interacted with ChatGPT much like a colleague, asking it to synthesize, simplify, and offer counterpoints to drafts in progress…All of the co-authors valued ChatGPT’s input.”

Reference: “Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models” by Tiffany H. Kung, Morgan Cheatham, Arielle Medenilla, Czarina Sillos, Lorie De Leon, Camille Elepaño, Maria Madriaga, Rimel Aggabao, Giezel Diaz-Candido, James Maningo and Victor Tseng, 9 February 2023, PLOS Digital Health.
DOI: 10.1371/journal.pdig.0000198

As a test, we asked ChatGPT to summarize this article. Here are the unedited results:

A study by Tiffany Kung, Victor Tseng, and colleagues at AnsibleHealth showed that OpenAI’s ChatGPT can perform at the passing level of 60% on the United States Medical Licensing Exam (USMLE). ChatGPT is an artificial intelligence language model that generates text by predicting word sequences, but it does not have the ability to search the internet. The researchers tested ChatGPT’s performance on 350 of 376 public questions from the June 2022 USMLE release and found that it scored between 52.4% and 75.0% across the three exams. The study showed that ChatGPT produced at least one significant insight for 88.9% of its responses and outperformed PubMedGPT, a model trained solely on biomedical literature. The authors believe that ChatGPT has the potential to improve medical education and eventually clinical practice.

Never miss a breakthrough: Join the SciTechDaily newsletter.
Follow us on Google and Google News.

The Rise of Artificial Intelligence: ChatGPT’s Stunning Results on the US Medical Licensing Exam

For the First Time, ChatGPT Has Solved an Unproven Math Problem in Geometry

Digital Dementia? AI Shows Surprising Signs of Cognitive Decline

AI Outperforms Students in Real-World “Turing Test”

AI Ethics Surpass Human Judgment in New Moral Turing Test

ChatGPT Tests Into Top 1% for Original Creative Thinking

ChatGPT Generative AI: USC Experts With Key Information You Should Know

New AI System Identifies Personality Traits from Eye Movements

TrueNorth Computer Chip Emulates Human Cognition

AI Framework Predicts Better Patient Health Care and Reduces Cost

2 Comments

Your Blood Pressure Reading Could Be Wrong Because of One Simple Mistake

Astronomers Stunned by Ancient Galaxy With No Spin

Physicists May Be on the Verge of Discovering “New Physics” at CERN

Scientists Solve 320-Million-Year Mystery of Reptile Skin Armor

Scientists Say This Daily Walking Habit May Be the Secret to Keeping Weight Off After Dieting

New Therapy Rewires the Brain To Restore Joy in Depression Patients

Giant Squid Detected off Western Australia in Stunning Deep-Sea Discovery

Popular Sugar-Free Sweetener Linked to Liver Disease, Study Warns

The Rise of Artificial Intelligence: ChatGPT’s Stunning Results on the US Medical Licensing Exam

Testing ChatGPT on the USMLE

Potential Applications in Medicine and Research

Related Articles

2 Comments