
    New Research Reveals the Brain Learns Differently Than We Thought

    By Sainsbury Wellcome Centre, June 1, 2025
    Scientists at UCL have uncovered a second brain learning system that explains how habits form, offering insights into addiction, compulsions, and Parkinson’s disease. Credit: Shutterstock

    Research provides new insights into how the brain forms habits and explains why they can be so difficult to break.

    Neuroscientists at the Sainsbury Wellcome Centre (SWC) at UCL have discovered that the brain uses two distinct systems to learn through trial and error. This is the first time a second learning system has been identified, offering new insight into how habits are formed.

    The discovery could provide a scientific foundation for developing strategies to treat conditions linked to habitual behavior, such as addictions and compulsions. Published in Nature, the study, conducted in mice, may also lead to new therapeutic approaches for Parkinson’s disease.

    “Essentially, we have found a mechanism that we think is responsible for habits. Once you have developed a preference for a certain action, then you can bypass your value-based system and just rely on your default policy of what you’ve done in the past. This might then allow you to free up cognitive resources to make value-based decisions about something else,” explained Dr Marcus Stephenson-Jones, Group Leader at SWC and lead author of the study.

    Dorsomedial Striatum and Tail of the Striatum
    Image shows the two regions of the brain that were inactivated during the task – the dorsomedial striatum (DMS) and the tail of the striatum (TS). Credit: Hernando Martinez Vergara

    Discovery of action prediction error

    The researchers identified a new type of dopamine signal in the brain that functions differently from the one previously known. Dopamine was already understood to generate reward prediction errors (RPE), which tell the brain whether an outcome is better or worse than expected.

    In this study, the scientists discovered a second dopamine signal, called action prediction error (APE), which tracks how often an action is repeated. Together, RPE and APE give animals two distinct ways to learn: by choosing the most rewarding option or by repeating the most frequently chosen one.
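
    One way to make the distinction concrete is a toy learner that keeps two tables, one updated by RPE and one by APE. The sketch below is illustrative only: the delta-rule form of both updates, the learning rates `alpha` and `beta`, and the class name `DualLearner` are assumptions made for this example, not the model published in the study.

```python
class DualLearner:
    """Toy learner with a value table (RPE) and a habit table (APE)."""

    def __init__(self, n_actions, alpha=0.1, beta=0.1):
        self.alpha = alpha  # value learning rate (assumed)
        self.beta = beta    # habit learning rate (assumed)
        self.values = [0.0] * n_actions  # expected reward per action
        self.habits = [0.0] * n_actions  # how often each action is taken

    def update(self, action, reward):
        # Reward prediction error: was the outcome better or worse than expected?
        rpe = reward - self.values[action]
        self.values[action] += self.alpha * rpe

        # Action prediction error: was each action taken more or less than expected?
        apes = []
        for a in range(len(self.habits)):
            taken = 1.0 if a == action else 0.0
            ape = taken - self.habits[a]
            self.habits[a] += self.beta * ape
            apes.append(ape)
        return rpe, apes


learner = DualLearner(n_actions=2)
for _ in range(50):
    learner.update(action=0, reward=0.0)  # same choice every time, never rewarded
print(learner.values, learner.habits)
```

    In this sketch, repeating one option raises its habit strength even though no reward ever arrives, while its value stays at zero. That is the sense in which a frequency-based signal can be independent of value.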

    Fluorescent Brain Images Showing Recording Sites in the Tail of the Striatum (TS) and Ventral Striatum (VS)
    Fluorescent images showing the locations in the brain that the scientists recorded from – the tail of the striatum (TS) and ventral striatum (VS). Credit: Francesca Greenstreet

    “Imagine going to your local sandwich shop. The first time you go, you might take your time choosing a sandwich and, depending on which you pick, you may or may not like it. But if you go back to the shop on many occasions, you no longer spend time wondering which sandwich to select and instead start picking one you like by default. We think it is the APE dopamine signal in the brain that is allowing you to store this default policy,” explained Dr Stephenson-Jones.

    Simplifying memory through repetition

    The newly discovered learning system offers a simpler way for the brain to store information, without needing to constantly compare the value of different choices. This efficiency may allow the brain to handle multiple tasks at once. For instance, after learning how to drive, you can hold a conversation while driving. Your default system manages the routine driving tasks, while your value-based system focuses on the conversation.

    Earlier research found that the dopamine neurons involved in learning are located in three parts of the midbrain: the ventral tegmental area, the substantia nigra pars compacta, and the substantia nigra pars lateralis. While some studies showed that these neurons play a role in processing reward, others found that about half of them are linked to movement, though the purpose of this connection remained unclear.

    Value Based vs Frequency Based
    Flow diagram showing how reward prediction error leads to choosing highest value option and action prediction error leads to choosing most common option. Credit: Sainsbury Wellcome Centre

    RPE neurons project to all areas of the striatum apart from one, the tail of the striatum, whereas the movement-specific neurons project to all areas apart from the nucleus accumbens. This means that the nucleus accumbens exclusively signals reward, and the tail of the striatum exclusively signals movement.

    Dopamine release linked to movement

    By investigating the tail of the striatum, the team was able to isolate the movement neurons and discover their function. To do this, the researchers used an auditory discrimination task in mice, originally developed by scientists at Cold Spring Harbor Laboratory. Co-first authors Dr Francesca Greenstreet, Dr Hernando Martinez Vergara and Dr Yvonne Johansson used a genetically encoded dopamine sensor, which showed that dopamine release in this area was related to movement rather than reward.

    RPE and APE Coding Dopamine Neuron Projections
    Reward and action prediction error coding dopamine neurons project to distinct areas of the striatum to reinforce different types of associations. Credit: Sainsbury Wellcome Centre

    “When we lesioned the tail of the striatum, we found a very characteristic pattern. We observed that lesioned mice and control mice initially learn in the same way, but once they get to about 60-70% performance, i.e. when they develop a preference (for example, for a high tone go left, for a low tone, go right), then the control mice rapidly learn and develop expert performance, whereas the lesioned mice only continue to learn in a linear fashion. This is because the lesioned mice can only use RPE, whereas the control mice have two learning systems, RPE and APE, which contribute to the choice,” explained Dr Stephenson-Jones.
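
    The lesion result can be pictured with a toy choice rule in which the two systems add their votes before a softmax. This is a sketch under stated assumptions, not the authors' model: the function name `choice_probabilities`, the weights `w_value` and `w_habit`, and the example numbers are all hypothetical, and a lesion is modelled simply as setting `w_habit` to zero.

```python
import math


def choice_probabilities(values, habits, w_value=3.0, w_habit=3.0):
    """Softmax over a weighted sum of value (RPE-trained) and habit (APE-trained) terms."""
    logits = [w_value * v + w_habit * h for v, h in zip(values, habits)]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical state of a mouse that has started to prefer option 0.
values = [0.7, 0.3]
habits = [0.8, 0.2]

control = choice_probabilities(values, habits)                # both systems vote
lesioned = choice_probabilities(values, habits, w_habit=0.0)  # value system only
print(control[0], lesioned[0])
```

    With these made-up numbers, the control agent picks the preferred option more reliably than the lesioned one, because the habit term sharpens a preference that the value term alone expresses more weakly.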

    APE dominates in late-stage learning

    To further understand this, the team silenced the tail of the striatum in expert mice and found that this had a catastrophic effect on their performance in the task. This showed that while in early learning animals form a preference using the value-based system based on RPE, in late learning they switch to exclusively using APE in the tail of the striatum to store these stable associations and drive their choice. The team also used extensive computational modelling, led by Dr Claudia Clopath, to understand how the two systems, RPE and APE, learn together.

    Illustration of Value Based and Frequency Based Decision Making Driven by Dual Dopamine Signals in Mice
    Dual dopaminergic teaching signals are used to learn value-based or frequency-based decision-making strategies. Reward prediction errors are used to update the value of options allowing animals to choose the most valuable option. Action prediction errors are used to update how frequently an option has been chosen allowing animals to choose the most common option. Credit: Sainsbury Wellcome Centre

    These findings hint at why it is so hard to break bad habits and why replacing an action with something else may be the best strategy. If you replace an action consistently enough, such as chewing on nicotine gum instead of smoking, the APE system may be able to take over and form a new habit on top of the other one.
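
    The replacement idea can be illustrated with a small calculation. The sketch below assumes a simple frequency-tracking update in which the habit strength of the action taken moves toward 1 and that of the action not taken moves toward 0. The function name `repetitions_to_overtake`, the learning rate `beta`, and the starting strengths are hypothetical and are not taken from the study.

```python
def repetitions_to_overtake(old_habit, new_habit, beta=0.1, max_steps=10_000):
    """Count repeats of the replacement action until its habit strength is larger."""
    for step in range(1, max_steps + 1):
        old_habit += beta * (0.0 - old_habit)  # old action not taken this time
        new_habit += beta * (1.0 - new_habit)  # replacement action taken this time
        if new_habit > old_habit:
            return step
    return None  # never overtook within max_steps


# Made-up starting point: a strong old habit and a weak replacement.
print(repetitions_to_overtake(0.95, 0.05))  # 7 with these made-up numbers
```

    In this toy version, a slower learning rate means more consistent repetitions are needed before the replacement becomes the default.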

    Targeting the brain’s habit system

    “Now that we know this second learning system exists in the brain, we have a scientific basis for developing new strategies to break bad habits. Up until now, most research on addictions and compulsions has focused on the nucleus accumbens. Our research has opened up a new place to look in the brain for potential therapeutic targets,” commented Dr Stephenson-Jones.

    This research also has potential implications for Parkinson’s, which is known to be caused by the death of midbrain dopamine neurons, specifically in the substantia nigra pars compacta. The cells that have been shown to die are movement-related dopamine neurons, which may be responsible for coding APE. This may explain why people with Parkinson’s have difficulty with habitual behaviors such as walking but not with more flexible behaviors such as ice skating.

    Dr Marcus Stephenson-Jones (left) and Dr Francesca Greenstreet (right) in the lab at SWC. Credit: Sainsbury Wellcome Centre

    “Suddenly, we now have a theory for paradoxical movement in Parkinson’s. The movement related neurons that die are the ones that drive habitual behavior. And so, movement that uses the habitual system is compromised, but movement that uses your value-based flexible system is fine. This gives us a new place to look in the brain and a new way of thinking about Parkinson’s,” concluded Dr Stephenson-Jones.

    The research team is now testing whether APE is really needed for habits. They are also exploring what exactly is being learned in each system and how the two work together.

    Reference: “Dopaminergic action prediction errors serve as a value-free teaching signal” by Francesca Greenstreet, Hernando Martinez Vergara, Yvonne Johansson, Sthitapranjya Pati, Laura Schwarz, Stephen C. Lenzi, Jesse P. Geerts, Matthew Wisdom, Alina Gubanova, Lars B. Rollik, Jasvin Kaur, Theodore Moskovitz, Joseph Cohen, Emmett Thompson, Troy W. Margrie, Claudia Clopath and Marcus Stephenson-Jones, 14 May 2025, Nature.
    DOI: 10.1038/s41586-025-09008-9

    This research was funded by an EMBO Long-Term Fellowship (ALTF 827-2018), a Swedish Research Council International Postdoc Grant (2020-06365), the Sainsbury Wellcome Centre Core Grant from the Gatsby Charitable Foundation and Wellcome (219627/Z/19/Z), the Sainsbury Wellcome Centre PhD Programme, and a European Research Council Starting Grant (#557533).



