Close Menu
    Facebook X (Twitter) Instagram
    SciTechDaily
    • Biology
    • Chemistry
    • Earth
    • Health
    • Physics
    • Science
    • Space
    • Technology
    Facebook X (Twitter) Pinterest YouTube RSS
    SciTechDaily
    Home»Technology»AI’s Dirty Little Secret: Stanford Researchers Expose Flaws in Text Detectors
    Technology

    AI’s Dirty Little Secret: Stanford Researchers Expose Flaws in Text Detectors

    By Cell PressSeptember 3, 20232 Comments4 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn WhatsApp Email Reddit
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email Reddit
    AI Detection ChatGPT
    Researchers have found that GPT detectors, used to identify if text is AI-generated, often falsely label articles written by non-native English speakers as AI-created. This unreliability poses risks in academic and professional settings, including job applications and student assignments.

    A study reveals that AI text detectors are biased against non-native English writing, raising concerns about fairness in education, job applications, and content evaluation.

    In a study recently published in the journal Patterns, researchers demonstrate that computer algorithms often used to identify AI-generated text frequently falsely label articles written by non-native language speakers as being created by artificial intelligence. The researchers warn that the unreliable performance of these AI text-detection programs could adversely affect many individuals, including students and job applicants.

    “Our current recommendation is that we should be extremely careful about and maybe try to avoid using these detectors as much as possible,” says senior author James Zou, of Stanford University. “It can have significant consequences if these detectors are used to review things like job applications, college entrance essays, or high school assignments.”

    AI tools like OpenAI’s ChatGPT chatbot can compose essays, solve science and math problems, and produce computer code. Educators across the U.S. are increasingly concerned about the use of AI in students’ work and many of them have started using GPT detectors to screen students’ assignments. These detectors are platforms that claim to be able to identify if the text is generated by AI, but their reliability and effectiveness remain untested.

    Zou and his team put seven popular GPT detectors to the test. They ran 91 English essays written by non-native English speakers for a widely recognized English proficiency test, called Test of English as a Foreign Language, or TOEFL, through the detectors. These platforms incorrectly labeled more than half of the essays as AI-generated, with one detector flagging nearly 98% of these essays as written by AI. In comparison, the detectors were able to correctly classify more than 90% of essays written by eighth-grade students from the U.S. as human-generated.

    Why Non-Native English Writers Are More Vulnerable

    Zou explains that the algorithms of these detectors work by evaluating text perplexity, which is how surprising the word choice is in an essay. “If you use common English words, the detectors will give a low perplexity score, meaning my essay is likely to be flagged as AI-generated. If you use complex and fancier words, then it’s more likely to be classified as human written by the algorithms,” he says. This is because large language models like ChatGPT are trained to generate text with low perplexity to better simulate how an average human talks, Zou adds.

    As a result, simpler word choices adopted by non-native English writers would make them more vulnerable to being tagged as using AI.

    The team then put the human-written TOEFL essays into ChatGPT and prompted it to edit the text using more sophisticated language, including substituting simple words with complex vocabulary. The GPT detectors tagged these AI-edited essays as human-written.

    Biases and Implications Beyond Education

    “We should be very cautious about using any of these detectors in classroom settings, because there’s still a lot of biases, and they’re easy to fool with just the minimum amount of prompt design,” Zou says. Using GPT detectors could also have implications beyond the education sector. For example, search engines like Google devalue AI-generated content, which may inadvertently silence non-native English writers.

    While AI tools can have positive impacts on student learning, GPT detectors should be further enhanced and evaluated before being put into use. Zou says that training these algorithms with more diverse types of writing could be one way to improve these detectors.

    Reference: “GPT detectors are biased against non-native English writers” by Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu and James Zou, 10 July 2023, Patterns.
    DOI: 10.1016/j.patter.2023.100779

    The study was funded by the National Science Foundation, the Chan Zuckerberg Initiative, the National Institutes of Health, and the Silicon Valley Community Foundation.

    Never miss a breakthrough: Join the SciTechDaily newsletter.
    Follow us on Google, Discover, and News.

    Algorithm Artificial Intelligence Cell Press ChatGPT Stanford University
    Share. Facebook Twitter Pinterest LinkedIn Email Reddit

    Related Articles

    Jevons’ Paradox: AI Could Use As Much Electricity As Entire Countries

    Showing Robots How to Do Your Chores – Automated Robots That Learn Just by Watching

    Artificial Intelligence Used to Supercharge Battery Development for Electric Vehicles

    Artificial Intelligence ‘Sees’ Quantum Advantages – Help Toward New Efficient Quantum Computers

    ‘Deep Learning’ Algorithm Brings New Tools to Astronomy

    “Data Science Machine” Replaces Human Intuition with Algorithms

    Algorithm Analyzes Information From Medical Images to Identify Disease

    Halide, A New and Improved Programming Language for Image Processing Software

    New Algorithm Enables Wi-Fi Connected Vehicles to Share Data

    2 Comments

    1. Clickbait on September 5, 2023 10:41 am

      This article doesn’t reference the many limitations of the study, which were detailed in the study itself, although this article links to an opinion piece and not the actual study. Sample sizes were tiny (91 TOEFL essays from a Chinese forum and 88 US eighth grade essays). The detectors in the study were based on GPT2, not 3.5 or 5.

      “Firstly, although our datasets and analysis present novel perspectives as a pilot study, the sample sizes employed in this research are relatively small. …Secondly, most of the detectors assessed in this study utilize GPT-2 as their underlying backbone model…Lastly, our analysis primarily focuses on perplexity-based and supervised-learning-based methods that are popularly implemented, which might not be representative of all potential detection techniques.”

      Title is pure clickbait.

      Reply
    2. Smith on December 27, 2024 3:15 am

      “AI’s Dirty Little Secret: Stanford Researchers Expose Flaws in Text Detectors” highlights an urgent issue. The unreliability of AI text detectors affecting non-native English speakers raises significant ethical concerns in academic and professional spaces. Thought-provoking and essential research!

      Reply
    Leave A Reply Cancel Reply

    • Facebook
    • Twitter
    • Pinterest
    • YouTube

    Don't Miss a Discovery

    Subscribe for the Latest in Science & Tech!

    Trending News

    Rare Hybrid Bird Discovered in Texas Backyard

    Dinner Too Spicy? Scientists Discover Natural “Anti-Spice” Compounds

    Earth’s Ancient Shield: Scientists Crack the Billion-Year-Old Mystery of the Magnetic Field

    Researchers Pinpoint Protein That May Halt the Progression of Heart Disease

    Scientists “Improve on Mother Nature” To Create Powerful Cancer-Killing Molecule

    Webb Spots Cosmic Light Show on Rogue Planet

    The Tropical Fruit That May Improve Blood Sugar and Cut Body Fat

    20-Year Dream Comes True: Chemists Grow Diamonds With an Electron Beam

    Follow SciTechDaily
    • Facebook
    • Twitter
    • YouTube
    • Pinterest
    • Newsletter
    • RSS
    SciTech News
    • Biology News
    • Chemistry News
    • Earth News
    • Health News
    • Physics News
    • Science News
    • Space News
    • Technology News
    Recent Posts
    • Scientists Discover Cancer’s “Power-Up” – and a New Way To Switch It Off
    • What Is Chikungunya? The Virus Behind China’s Worst-Ever Outbreak
    • Scientists Finally Reveal Biological Basis of Long COVID Brain Fog
    • Scientists Develop the World’s First Rechargeable Hydride Ion Battery
    • Tiny Quantum Dots Could Transform How We See in the Dark
    Copyright © 1998 - 2025 SciTechDaily. All Rights Reserved.
    • Science News
    • About
    • Contact
    • Editorial Board
    • Privacy Policy
    • Terms of Use

    Type above and press Enter to search. Press Esc to cancel.