    SciTechDaily
    U.S. Army Research Leads to More Effective Training Model for Robots

    By U.S. Army Research Laboratory, January 3, 2021
    New Army research reduces the unpredictability of current methods for training reinforcement learning policies so that they are more practically applicable to physical systems, especially ground robots. These learning components will permit autonomous agents to reason and adapt to changing battlefield conditions. Credit: U.S. Army

    Army researchers have developed new reinforcement learning techniques that improve sample efficiency and reduce volatility, making RL more practical for autonomous ground robots in Multi-Domain Operations.

    Multi-domain operations, the Army’s future operating concept, require autonomous agents with learning components to operate alongside the warfighter. New Army research reduces the unpredictability of current methods for training reinforcement learning policies so that they are more practically applicable to physical systems, especially ground robots.

    These learning components will permit autonomous agents to reason and adapt to changing battlefield conditions, said Army researcher Dr. Alec Koppel from the U.S. Army Combat Capabilities Development Command, now known as DEVCOM, Army Research Laboratory.

    The underlying adaptation and re-planning mechanism consists of reinforcement learning-based policies. Making these policies efficiently obtainable is critical to making the MDO operating concept a reality, he said.

    Challenges in Complex Goal-Based Decision Making

    According to Koppel, policy gradient methods in reinforcement learning are the foundation for scalable algorithms for continuous spaces, but existing techniques cannot incorporate broader decision-making goals such as risk sensitivity, safety constraints, exploration, and divergence to a prior.

    Designing autonomous behaviors when the relationship between dynamics and goals is complex may be addressed with reinforcement learning, which has gained attention recently for solving previously intractable tasks, including strategy games such as Go and chess and video games such as Atari and StarCraft II, Koppel said.

    Prevailing practice, unfortunately, demands astronomical sample complexity, such as thousands of years of simulated gameplay, he said. This sample complexity renders many common training mechanisms inapplicable to the data-starved settings of the MDO context for the Next-Generation Combat Vehicle, or NGCV.

    Overcoming Sample Efficiency Barriers in MDO Contexts

    “To facilitate reinforcement learning for MDO and NGCV, training mechanisms must improve sample efficiency and reliability in continuous spaces,” Koppel said. “Through the generalization of existing policy search schemes to general utilities, we take a step towards breaking existing sample efficiency barriers of prevailing practice in reinforcement learning.”

    Koppel and his research team developed new policy search schemes for general utilities, whose sample complexity is also established. They observed that the resulting policy search schemes reduce the volatility of reward accumulation, yield efficient exploration of unknown domains, and provide a mechanism for incorporating prior experience.
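
    To make "general utilities" concrete, the following is a minimal sketch, not the authors' implementation. It assumes a small tabular problem and treats each goal as a function of the policy's discounted state-action occupancy measure, which is the framing used in the referenced paper. The function names, the random toy problem, and the choice of entropy and KL divergence as the exploration and prior-matching utilities are illustrative assumptions.

```python
import numpy as np

def occupancy_measure(P, pi, mu0, gamma):
    """Unnormalized discounted occupancy lam[s, a] = sum_t gamma^t Pr(s_t=s, a_t=a).

    P:   transition tensor, shape (S, A, S), P[s, a, s2] = Pr(s2 | s, a)
    pi:  policy, shape (S, A), each row sums to 1
    mu0: initial state distribution, shape (S,)
    """
    S = P.shape[0]
    # State-to-state transition matrix induced by the policy.
    P_pi = np.einsum("sa,sat->st", pi, P)
    # Discounted state visitation d solves d = mu0 + gamma * P_pi^T d.
    d = np.linalg.solve(np.eye(S) - gamma * P_pi.T, mu0)
    return d[:, None] * pi

def cumulative_return(lam, r):
    """Standard objective: linear in the occupancy measure."""
    return float(np.sum(lam * r))

def entropy_utility(lam, gamma):
    """Exploration objective: entropy of the normalized occupancy."""
    p = (1.0 - gamma) * lam
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def kl_to_prior(lam, lam_prior, gamma):
    """Divergence of the normalized occupancy from a prior occupancy.

    Assumes lam_prior is positive wherever lam is positive.
    """
    p = (1.0 - gamma) * lam
    q = (1.0 - gamma) * lam_prior
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

# Hypothetical toy problem: 4 states, 2 actions, random dynamics.
rng = np.random.default_rng(0)
S, A, gamma = 4, 2, 0.9
P = rng.dirichlet(np.ones(S), size=(S, A))      # shape (S, A, S)
pi = rng.dirichlet(np.ones(A), size=S)          # shape (S, A)
pi_prior = np.full((S, A), 1.0 / A)             # uniform prior policy
mu0 = np.full(S, 1.0 / S)
r = rng.uniform(size=(S, A))

lam = occupancy_measure(P, pi, mu0, gamma)
lam_prior = occupancy_measure(P, pi_prior, mu0, gamma)
print("return :", cumulative_return(lam, r))
print("entropy:", entropy_utility(lam, gamma))
print("KL     :", kl_to_prior(lam, lam_prior, gamma))
```

    Only the first utility is linear in the occupancy measure; the other two are not, which is why they fall outside the standard cumulative-return setting.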

    “This research contributes an augmentation of the classical Policy Gradient Theorem in reinforcement learning,” Koppel said. “It presents new policy search schemes for general utilities, whose sample complexity is also established. These innovations are impactful to the U.S. Army through their enabling of reinforcement learning objectives beyond the standard cumulative return, such as risk sensitivity, safety constraints, exploration, and divergence to a prior.”
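
    For readers unfamiliar with the theorem Koppel refers to, the block below states the classical Policy Gradient Theorem in its discounted form, followed by the chain-rule identity that any differentiable utility of the occupancy measure satisfies. This is textbook background under assumed notation (policy parameters \theta, reward r, discount \gamma, occupancy measure \lambda, utility F); it is not a restatement of the paper's own theorem.

```latex
% Classical policy gradient theorem for the cumulative return, where
% \lambda^{\pi_\theta}(s,a) = \sum_{t \ge 0} \gamma^t \Pr(s_t = s, a_t = a).
\[
J(\theta) = \langle r, \lambda^{\pi_\theta} \rangle,
\qquad
\nabla_\theta J(\theta)
  = \sum_{s,a} \lambda^{\pi_\theta}(s,a)\,
    \nabla_\theta \log \pi_\theta(a \mid s)\, Q^{\pi_\theta}(s,a).
\]

% A general utility F of the occupancy measure, differentiated by the chain rule.
\[
\nabla_\theta F\bigl(\lambda^{\pi_\theta}\bigr)
  = \sum_{s,a}
    \frac{\partial F}{\partial \lambda(s,a)}\bigl(\lambda^{\pi_\theta}\bigr)\,
    \nabla_\theta \lambda^{\pi_\theta}(s,a).
\]
```

    When F is linear, F(\lambda) = \langle r, \lambda \rangle, the partial derivative is simply r(s,a) and the second identity reduces to the first. For a nonlinear F, the partial derivative acts as a reward that itself depends on the current policy.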

    Notably, in the context of ground robots, he said, data is costly to acquire.

    Optimizing Learning for Real-World Military Application

    “Reducing the volatility of reward accumulation, ensuring one explores an unknown domain in an efficient manner, or incorporating prior experience, all contribute towards breaking existing sample efficiency barriers of prevailing practice in reinforcement learning by alleviating the amount of random sampling one requires in order to complete policy optimization,” Koppel said.

    The future of this research is very bright, and Koppel has dedicated his efforts to making his findings applicable to innovative technology for Soldiers on the battlefield.

    “I am optimistic that reinforcement-learning equipped autonomous robots will be able to assist the warfighter in exploration, reconnaissance, and risk assessment on the future battlefield,” Koppel said. “That this vision is made a reality is essential to what motivates which research problems I dedicate my efforts.”

    The next step for this research is to incorporate the broader decision-making goals enabled by general utilities in reinforcement learning into multi-agent settings and investigate how interactive settings between reinforcement learning agents give rise to synergistic and antagonistic reasoning among teams.

    According to Koppel, the technology that results from this research will be capable of reasoning under uncertainty in team scenarios.

    Reference: “Variational Policy Gradient Method for Reinforcement Learning with General Utilities” by Junyu Zhang, Alec Koppel, Amrit Singh Bedi, Csaba Szepesvari and Mengdi Wang, 4 July 2020, NeurIPS Proceedings.
    arXiv:2007.02151

    This research, conducted in collaboration with Princeton University, University of Alberta and Google DeepMind, was a spotlight talk at NeurIPS 2020, one of the premier conferences that foster the exchange of neural information processing systems research in biological, technological, mathematical and theoretical aspects.


