Close Menu
    Facebook X (Twitter) Instagram
    SciTechDaily
    • Biology
    • Chemistry
    • Earth
    • Health
    • Physics
    • Science
    • Space
    • Technology
    Facebook X (Twitter) Pinterest YouTube RSS
    SciTechDaily
    Home»Health»Mathematicians Use AI and New Clustering Algorithm To Identify Emerging COVID-19 Variants
    Health

    Mathematicians Use AI and New Clustering Algorithm To Identify Emerging COVID-19 Variants

    By University of ManchesterMarch 25, 2024No Comments4 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn WhatsApp Email Reddit
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email Reddit
    CLASSIX Clustering Coronavirus
    Stylized image of a CLASSIX clustering result overlaid on top of a coronavirus illustration. Credit: University of Manchester, CDC / Alissa Eckert, MSMI; Dan Higgins, MAMS

    An AI framework aids in identifying and tracking new COVID-19 variants, using a novel algorithm named CLASSIX to efficiently process large genomic datasets and enhance early detection efforts.

    Scientists at The Universities of Manchester and Oxford have developed an AI framework that can identify and track new and concerning COVID-19 variants and could help with other infections in the future.

    The framework combines dimension reduction techniques and a new explainable clustering algorithm called CLASSIX, developed by mathematicians at The University of Manchester. This enables the quick identification of groups of viral genomes that might present a risk in the future from huge volumes of data.

    The study, presented this week in the journal PNAS, could support traditional methods of tracking viral evolution, such as phylogenetic analysis, which currently require extensive manual curation.

    Roberto Cahuantzi, a researcher at The University of Manchester and first and corresponding author of the paper, said: “Since the emergence of COVID-19, we have seen multiple waves of new variants, heightened transmissibility, evasion of immune responses, and increased severity of illness.

    “Scientists are now intensifying efforts to pinpoint these worrying new variants, such as alpha, delta, and omicron, at the earliest stages of their emergence. If we can find a way to do this quickly and efficiently, it will enable us to be more proactive in our response, such as tailored vaccine development and may even enable us to eliminate the variants before they become established.”

    Proposed Method To Identify Emergent COVID 19 Variants
    Diagram showing the steps of the proposed method to identify emergent COVID-19 variants. Credit: The University of Manchester

    Like many other RNA viruses, COVID-19 has a high mutation rate and short time between generations meaning it evolves extremely rapidly. This means identifying new strains that are likely to be problematic in the future requires considerable effort.

    Currently, there are almost 16 million sequences available on the GISAID database (the Global Initiative on Sharing All Influenza Data), which provides access to genomic data of influenza viruses.

    Mapping the evolution and history of all COVID-19 genomes from this data is currently done using extremely large amounts of computer and human time.

    The described method allows the automation of such tasks. The researchers processed 5.7 million high-coverage sequences in only one to two days on a standard modern laptop; this would not be possible for existing methods, putting the identification of concerning pathogen strains in the hands of more researchers due to reduced resource needs.

    Thomas House, Professor of Mathematical Sciences at The University of Manchester, said: “The unprecedented amount of genetic data generated during the pandemic demands improvements to our methods to analyze it thoroughly. The data is continuing to grow rapidly but without showing a benefit to curating this data, there is a risk that it will be removed or deleted.

    “We know that human expert time is limited, so our approach should not replace the work of humans altogether but work alongside them to enable the job to be done much quicker and free our experts for other vital developments.”

    The proposed method works by breaking down genetic sequences of the COVID-19 virus into smaller “words” (called 3-mers) represented as numbers by counting them. Then, it groups similar sequences together based on their word patterns using machine learning techniques.

    Stefan Güttel, Professor of Applied Mathematics at the University of Manchester, said: “The clustering algorithm CLASSIX we developed is much less computationally demanding than traditional methods and is fully explainable, meaning that it provides textual and visual explanations of the computed clusters.”

    Roberto Cahuantzi added: “Our analysis serves as a proof of concept, demonstrating the potential use of machine learning methods as an alert tool for the early discovery of emerging major variants without relying on the need to generate phylogenies.

    “Whilst phylogenetics remains the ‘gold standard’ for understanding the viral ancestry, these machine learning methods can accommodate several orders of magnitude more sequences than the current phylogenetic methods and at a low computational cost.”

    Reference: “Unsupervised identification of significant lineages of SARS-CoV-2 through scalable machine learning methods” by Roberto Cahuantzi, Katrina A. Lythgoe, Ian Hall, Lorenzo Pellis and Thomas House, 13 March 2024, Proceedings of the National Academy of Sciences.
    DOI: 10.1073/pnas.2317284121

    Never miss a breakthrough: Join the SciTechDaily newsletter.
    Follow us on Google and Google News.

    Algorithm Artificial Intelligence COVID-19 Genetics Mathematics University of Manchester
    Share. Facebook Twitter Pinterest LinkedIn Email Reddit

    Related Articles

    AI-Powered Breakthrough for Improved At-Home Hepatitis and COVID-19 Testing

    When Did the First COVID-19 Case Arise? New Analysis With Surprising Findings

    AI Trained With Genetic Data Predicts How Patients With Viral Infections – Including COVID-19 – Will Fare

    Scientist Admits All Disease Models Are “Wrong” – Working to Fix

    Additional 12 Million to 89 Million Unreported COVID-19 Cases in China According to New Mathematical Model

    Clues to COVID-19 Treatment From DNA of Patients With Severe Forms of Coronavirus Disease

    No Evidence COVID-19 Coronavirus Was Genetically Engineered in a Lab – Epidemic Has a Natural Origin

    “Snake Pneumonia” – Coronavirus Outbreak in China Traced to Snakes by Genetic Analysis

    “Data Science Machine” Replaces Human Intuition with Algorithms

    Leave A Reply Cancel Reply

    • Facebook
    • Twitter
    • Pinterest
    • YouTube

    Don't Miss a Discovery

    Subscribe for the Latest in Science & Tech!

    Trending News

    Your Blood Pressure Reading Could Be Wrong Because of One Simple Mistake

    Astronomers Stunned by Ancient Galaxy With No Spin

    Physicists May Be on the Verge of Discovering “New Physics” at CERN

    Scientists Solve 320-Million-Year Mystery of Reptile Skin Armor

    Scientists Say This Daily Walking Habit May Be the Secret to Keeping Weight Off After Dieting

    New Therapy Rewires the Brain To Restore Joy in Depression Patients

    Giant Squid Detected off Western Australia in Stunning Deep-Sea Discovery

    Popular Sugar-Free Sweetener Linked to Liver Disease, Study Warns

    Follow SciTechDaily
    • Facebook
    • Twitter
    • YouTube
    • Pinterest
    • Newsletter
    • RSS
    SciTech News
    • Biology News
    • Chemistry News
    • Earth News
    • Health News
    • Physics News
    • Science News
    • Space News
    • Technology News
    Recent Posts
    • Scientists Revive Ancient Chemistry Trick To Engineer Next-Generation Glass
    • Scientists Use AI To Supercharge Ultrafast Laser Simulations by More Than 250x
    • Scientists Just Found a Surprising Way To Destroy “Forever Chemicals”
    • Popular Supplement Ingredient Linked to Shorter Lifespan in Men
    • Scientists May Have Found a Way To Repair Nerve Damage in Multiple Sclerosis
    Copyright © 1998 - 2026 SciTechDaily. All Rights Reserved.
    • Science News
    • About
    • Contact
    • Editorial Board
    • Privacy Policy
    • Terms of Use

    Type above and press Enter to search. Press Esc to cancel.