Close Menu
    Facebook X (Twitter) Instagram
    SciTechDaily
    • Biology
    • Chemistry
    • Earth
    • Health
    • Physics
    • Science
    • Space
    • Technology
    Facebook X (Twitter) Pinterest YouTube RSS
    SciTechDaily
    Home»Biology»A Software Swiss Army Knife for Genomic Data
    Biology

    A Software Swiss Army Knife for Genomic Data

    By California Institute of TechnologyApril 17, 2021No Comments4 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn WhatsApp Email Reddit
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email Reddit
    Swiss Army Knife for Genomic Data
    A new software tool processes large genomic datasets in approximately 30 minutes on an average laptop, offering versatile applications for various biological needs and enhancing the reproducibility of scientific studies. Credit: Caltech

    A new open-source tool allows researchers to process vast single-cell gene expression data in minutes on a laptop, boosting speed, scalability, and reproducibility for studies like COVID-19 and the Human Cell Atlas.

    A good way to find out what a cell is doing—whether it is growing out of control as in cancers, or is under the control of an invading virus, or is simply going about the routine business of a healthy cell—is to look at its gene expression. Though a vast majority of cells in an organism all contain the same genes, how those genes are expressed is what gives rise to different cell types—the difference between a muscle cell and a neuron, for example.

    In the last decade, technologies to measure gene expression in individual cells have revolutionized biology. No longer do biologists need to average out gene expression over many cells within tissues; now they can detect which genes are active in each cell at any time.

    Computational power has struggled to keep up with this explosion of data, however. For example, a single experiment can look at 100,000 cells and measure information from hundreds of thousands of transcripts (fragments of RNA produced when a gene is active), resulting in tens of billions of sequenced fragments. Genomic data from single-cell sequencing can take up terabytes of space and take hours or days to process on large computing servers.

    Breakthrough Software Cuts Processing Time

    Now, a new software tool enables the processing of large sets of genomic data in about 30 minutes, using the computing power of an average laptop. Like a Swiss Army knife, the tool can be used in myriad ways for different biological needs, and will help ensure the reproducibility of scientific studies.

    The tool, which is available online and open for anyone to use, now is being adapted by another research team to study the SARS-CoV-2 virus in samples collected from screening tests.

    The research was conducted as a collaboration between the laboratory of Lior Pachter (BS ’94), Bren Professor of Computational Biology and Computing and Mathematical Sciences, and Páll Melsted, professor of computer science at the University of Iceland. Melsted is a co-first author along with graduate student Sina Booeshaghi (MS ’19). A paper describing the research appears in the journal Nature Biotechnology on April 1, 2021.

    “There are many examples of different groups using different technologies to study the same tissues, for example, the brain,” says Booeshaghi. “Processing all of this data with the same engine—our technique—facilitates integrating the data. Our tool is fast, efficient, and allows for easy reprocessing, which is very important for consistency and reproducibility in science.”

    Developing this complex software tool “in-house” was important for it to actually address potential users’ concerns, because the potential users were right there in the lab.

    Diverse Team Drives Practical Software Design

    “The interdisciplinarity of our team was crucial to conceiving of and executing this project,” says Pachter. “There are people in the lab who are computer scientists, biologists, engineers. Sina is in the mechanical engineering department and brings the perspective of his design background and engineering; Páll has a strong background in theoretical computer science and software engineering.”

    The ease-of-use, low cost, and modularity of these tools will enable consistent and reproducible preprocessing of genomic data for large consortiums such as the Human Cell Atlas and the Brain Initiative Cell Census Network.

    Reference: “Modular, efficient and constant-memory single-cell RNA-seq preprocessing” by Páll Melsted, A. Sina Booeshaghi, Lauren Liu, Fan Gao, Lambda Lu, Kyung Hoi (Joseph) Min, Eduardo da Veiga Beltrame, Kristján Eldjárn Hjörleifsson, Jase Gehring and Lior Pachter, 1 April 2021, Nature Biotechnology.
    DOI: 10.1038/s41587-021-00870-2

    The paper is titled “Modular, fast, and constant-memory pre-processing of single-cell RNA-seq data.” In addition to Melsted, Booeshaghi, and Pachter, additional co-authors are undergraduate Lauren Liu, bioinformatics director Fan Gao, graduate student Lambda Lu, former undergraduate Joseph Min (BS ’20), graduate student Eduardo da Veiga Beltrame, former graduate student Kristján Eldjárn Hjörleifsson, and postdoctoral scholar Jase Gehring. Funding was provided by the Beckman Institute Caltech Bioinformatics Resource Center and the National Institutes of Health.

    Never miss a breakthrough: Join the SciTechDaily newsletter.
    Follow us on Google and Google News.

    Biotechnology California Institute of Technology Cell Biology Genetics
    Share. Facebook Twitter Pinterest LinkedIn Email Reddit

    Related Articles

    Genetic Copycatchers Detect Efficient and Precise CRISPR Editing in a Living Organism

    New Era in Coral Biology Research: Scientists Have Cultured the First Stable Coral Cell Lines

    New Research Reveals Survival Mechanism for Cells Under Stress

    NIRVANA: Fast, Portable Test Can Diagnose COVID-19 and Track Variants

    Heart Recovery After Heart Attack Mapped in Great Detail

    Chromosomes Actually Look Far Different Than the Pictures From High School Textbooks

    New Insights Into Human Development From “Monster Tumors”

    New Genetic Systems Created by Biologists to Neutralize Gene Drives

    CRISPR-HOT: New Genetic Tool Can Label Specific Genes and Cells

    Leave A Reply Cancel Reply

    • Facebook
    • Twitter
    • Pinterest
    • YouTube

    Don't Miss a Discovery

    Subscribe for the Latest in Science & Tech!

    Trending News

    Massive Study Warns Marijuana Use in Teens Is Linked to Serious Mental Illness

    Scientists Discover a Completely Unexpected Way T Cells Kill Cancer

    Scientists Just Found the Solar System’s Original “Planet Factory”

    Study Warns Widely Used Food Preservatives Linked to High Blood Pressure and Heart Disease

    New Treatment Could Reverse Osteoarthritis Within Weeks

    Physicists Have Measured “Negative Time” in Bizarre Quantum Experiment

    The Deadly Tapeworm Spreading Across America Has Reached the Pacific Northwest

    Could Low Vitamin D Be Making Your Pain Worse?

    Follow SciTechDaily
    • Facebook
    • Twitter
    • YouTube
    • Pinterest
    • Newsletter
    • RSS
    SciTech News
    • Biology News
    • Chemistry News
    • Earth News
    • Health News
    • Physics News
    • Science News
    • Space News
    • Technology News
    Recent Posts
    • Stanford’s Revolutionary New Microscope Reveals Living Cells in Stunning Detail
    • Scientists Discover a Sea Slug Smaller Than a Sesame Seed in Taiwan
    • Wasp Colonies Explode Into Violence After Losing Their Queen
    • Antarctica Suddenly Became Far More Sensitive to Climate Change 1 Million Years Ago
    • A Hidden Arctic Ocean Crisis Is Unfolding Beneath the Melting Ice
    Copyright © 1998 - 2026 SciTechDaily. All Rights Reserved.
    • Science News
    • About
    • Contact
    • Editorial Board
    • Privacy Policy
    • Terms of Use

    Type above and press Enter to search. Press Esc to cancel.