Close Menu
    Facebook X (Twitter) Instagram
    SciTechDaily
    • Biology
    • Chemistry
    • Earth
    • Health
    • Physics
    • Science
    • Space
    • Technology
    Facebook X (Twitter) Pinterest YouTube RSS
    SciTechDaily
    Home»Technology»MIT Researchers Create a Tool for Predicting the Future
    Technology

    MIT Researchers Create a Tool for Predicting the Future

    By Adam Zewe, Massachusetts Institute of TechnologyMarch 28, 20229 Comments7 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn WhatsApp Email Reddit
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email Reddit
    Big Data Predictive Analysis AI Concept
    MIT researchers created tspDB, a system that integrates prediction functionality into a time-series database, enabling nonexperts to generate predictions in seconds through a simplified interface.

    Researchers design a user-friendly interface that helps nonexperts make forecasts using data collected over time.

    Whether someone is trying to predict tomorrow’s weather, forecast future stock prices, identify missed opportunities for sales in retail, or estimate a patient’s risk of developing a disease, they will likely need to interpret time-series data, which are a collection of observations recorded over time.

    Making predictions using time-series data typically requires several data-processing steps and the use of complex machine-learning algorithms, which have such a steep learning curve they aren’t readily accessible to nonexperts.

    To make these powerful tools more user-friendly, MIT researchers developed a system that directly integrates prediction functionality on top of an existing time-series database. Their simplified interface, which they call tspDB (time series predict database), does all the complex modeling behind the scenes so a nonexpert can easily generate a prediction in only a few seconds.

    Tool for Predicting the Future
    MIT researchers created a tool that enables people to make highly accurate predictions using multiple time-series data with just a few keystrokes. The powerful algorithm at the heart of their tool can transform multiple time series into a tensor, which is a multi-dimensional array of numbers (pictured). Credit: Figure courtesy of the researchers and edited by MIT News

    The new system is more accurate and more efficient than state-of-the-art deep learning methods when performing two tasks: predicting future values and filling in missing data points.

    One reason tspDB is so successful is that it incorporates a novel time-series-prediction algorithm, explains electrical engineering and computer science (EECS) graduate student Abdullah Alomar, an author of a recent research paper in which he and his co-authors describe the algorithm. This algorithm is especially effective at making predictions on multivariate time-series data, which are data that have more than one time-dependent variable. In a weather database, for instance, temperature, dew point, and cloud cover each depend on their past values.

    The algorithm also estimates the volatility of a multivariate time series to provide the user with a confidence level for its predictions.

    “Even as the time-series data becomes more and more complex, this algorithm can effectively capture any time-series structure out there. It feels like we have found the right lens to look at the model complexity of time-series data,” says senior author Devavrat Shah, the Andrew and Erna Viterbi Professor in EECS and a member of the Institute for Data, Systems, and Society and of the Laboratory for Information and Decision Systems.

    Joining Alomar and Shah on the paper is lead author Anish Agrawal, a former EECS graduate student who is currently a postdoc at the Simons Institute at the University of California at Berkeley. The research will be presented at the ACM SIGMETRICS conference.

    Adapting a New Algorithm

    Shah and his collaborators have been working on the problem of interpreting time-series data for years, adapting different algorithms and integrating them into tspDB as they built the interface.

    About four years ago, they learned about a particularly powerful classical algorithm, called singular spectrum analysis (SSA), that imputes and forecasts single time series. Imputation is the process of replacing missing values or correcting past values. While this algorithm required manual parameter selection, the researchers suspected it could enable their interface to make effective predictions using time series data. In earlier work, they removed this need to manually intervene for algorithmic implementation.

    The algorithm for single time series transformed it into a matrix and utilized matrix estimation procedures. The key intellectual challenge was how to adapt it to utilize multiple time series. After a few years of struggle, they realized the answer was something very simple: “Stack” the matrices for each individual time series, treat it as a one big matrix, and then apply the single time-series algorithm on it.

    This utilizes information across multiple time series naturally — both across the time series and across time, which they describe in their new paper.

    This recent publication also discusses interesting alternatives, where instead of transforming the multivariate time series into a big matrix, it is viewed as a three-dimensional tensor. A tensor is a multi-dimensional array, or grid, of numbers. This established a promising connection between the classical field of time series analysis and the growing field of tensor estimation, Alomar says.

    “The variant of mSSA that we introduced actually captures all of that beautifully. So, not only does it provide the most likely estimation, but a time-varying confidence interval, as well,” Shah says.

    The Simpler, the Better

    They tested the adapted mSSA against other state-of-the-art algorithms, including deep-learning methods, on real-world time-series datasets with inputs drawn from the electricity grid, traffic patterns, and financial markets.

    Their algorithm outperformed all the others on imputation and it outperformed all but one of the other algorithms when it came to forecasting future values. The researchers also demonstrated that their tweaked version of mSSA can be applied to any kind of time-series data.

    “One reason I think this works so well is that the model captures a lot of time series dynamics, but at the end of the day, it is still a simple model. When you are working with something simple like this, instead of a neural network that can easily overfit the data, you can actually perform better,” Alomar says.

    The impressive performance of mSSA is what makes tspDB so effective, Shah explains. Now, their goal is to make this algorithm accessible to everyone.

    Once a user installs tspDB on top of an existing database, they can run a prediction query with just a few keystrokes in about 0.9 milliseconds, as compared to 0.5 milliseconds for a standard search query. The confidence intervals are also designed to help nonexperts to make a more informed decision by incorporating the degree of uncertainty of the predictions into their decision making.

    For instance, the system could enable a nonexpert to predict future stock prices with high accuracy in just a few minutes, even if the time-series dataset contains missing values.

    Now that the researchers have shown why mSSA works so well, they are targeting new algorithms that can be incorporated into tspDB. One of these algorithms utilizes the same model to automatically enable change point detection, so if the user believes their time series will change its behavior at some point, the system will automatically detect that change and incorporate that into its predictions.

    They also want to continue gathering feedback from current tspDB users to see how they can improve the system’s functionality and user-friendliness, Shah says.

    “Our interest at the highest level is to make tspDB a success in the form of a broadly utilizable, open-source system. Time-series data are very important, and this is a beautiful concept of actually building prediction functionalities directly into the database. It has never been done before, and so we want to make sure the world uses it,” he says.

    “This work is very interesting for a number of reasons. It provides a practical variant of mSSA which requires no hand tuning, they provide the first known analysis of mSSA, and the authors demonstrate the real-world value of their algorithm by being competitive with or out-performing several known algorithms for imputations and predictions in (multivariate) time series for several real-world data sets,” says Vishal Misra, a professor of computer science at Columbia University who was not involved with this research. “At the heart of it all is the beautiful modeling work where they cleverly exploit correlations across time (within a time series) and space (across time series) to create a low-rank spatiotemporal factor representation of a multivariate time series. Importantly this model connects the field of time series analysis to that of the rapidly evolving topic of tensor completion, and I expect a lot of follow-on research spurred by this paper.”

    Reference: “On Multivariate Singular Spectrum Analysis and its Variants” by Anish Agarwal, Abdullah Alomar and Devavrat Shah, 13 February 2021, Computer Science > Machine Learning.
    arXiv:2006.13448

    Never miss a breakthrough: Join the SciTechDaily newsletter.
    Follow us on Google and Google News.

    Algorithm Artificial Intelligence Machine Learning MIT
    Share. Facebook Twitter Pinterest LinkedIn Email Reddit

    Related Articles

    AI Made Easy: Create Cutting-Edge Solutions, No Coding Needed

    When Algorithms Deliver: The AI Revolution in Logistics

    AI Revolutionizes Complex Problem-Solving in Logistics and Beyond

    Mastering Uncertainty: An Effective Approach to Training Machines for Real-World Situations

    CausalSim: MIT’s New Tool for Accurately Simulating Complex Systems

    Computing for Ocean Environments: Bio-Inspired Underwater Devices & Swarming Algorithms for Robotic Vehicles

    Showing Robots How to Do Your Chores – Automated Robots That Learn Just by Watching

    Halide, A New and Improved Programming Language for Image Processing Software

    New Algorithm Enables Wi-Fi Connected Vehicles to Share Data

    9 Comments

    1. Natalia on March 31, 2022 9:05 am

      It is possible to predict the event using the tools of complex dynamic of news model.

      Reply
    2. MELVIN HILDRETH III on April 1, 2022 6:41 am

      Great article it says it’s for people to use but no link ? Bummer

      Reply
      • Mike O'Neill on April 2, 2022 12:00 am

        Apologies for the lack of a link. This has been corrected at the top of the article. Here is the link as well:
        https://tspdb.mit.edu/

        Reply
    3. Synoptic 0 on April 3, 2022 1:49 am

      If it produces forecasts like the weather forecasts I get on my phone… nice web page toy, it d need a JavaScript version.

      Reply
    4. xABBAAA on April 3, 2022 10:39 am

      … no need for that USA will push, push till one starts to defend and then it is … well, we all know the game now…

      Reply
      • xABBAAA on April 4, 2022 7:35 am

        … next time when the picture is going to be sent into the space, instead of a person helping the one that has gone over the cliff, one should use the one in which one person is pushing another one over the cliff… or even better a small cartoon in which it is detail explanation of ways human kind does its things… it is a nice rock, little green ones, just drop by… it is nice and sunny…

        Reply
    5. polaris1983 on April 7, 2022 6:02 am

      mmm, sound wave harvesting energy quatrain prediction says teacher to ai saying build it. 😉

      Reply
    6. Roy on January 11, 2023 9:56 pm

      What was the program that overcame it? It caught my attention that they commit that I surpassed many except one, which was it? Sorry, my English is not very good

      Reply
    7. Jeff on November 10, 2024 3:24 pm

      Well until u start to using it the u’l see, how not useful it is. I read somewhere that NN can learn almost everything, but no one is saying to u that u struggle with accuracy a lot, thus more than 70% of business fail to incorporate AI solutions. Why u think is that so ? Software is underdeveloped and almost nothing is easy to use ! Those generative NN’s are so hard to use. PhD’s need 30+ years to be successful in use of NN’s and provide useful results. I’l be dead in 30+ years so my NN’s wont perform ever. To start to use those tools in CLI (open source soft almost all in CLI for that purpose) u need to at least 5+ years of coding training not to mention to be very good at math. When u start into this field u will discover how slow Python really is (its a sequence language, not true parallel processing one), fighting with it to use parallel processing power all the time. Good luck with that.
      Creating neural networks is easy, put it for job purpose is not and this is where developers fail on data mining software and other NN solutions and this is where user is failing to to get desired performance out of it.

      Reply
    Leave A Reply Cancel Reply

    • Facebook
    • Twitter
    • Pinterest
    • YouTube

    Don't Miss a Discovery

    Subscribe for the Latest in Science & Tech!

    Trending News

    5 Simple Ways To Remember More and Forget Less

    The Atomic Gap That Could Cost the Semiconductor Industry Billions

    Researchers Finally Solve 50-Year-Old Blood Group Mystery

    Scientists Discover “Molecular Switch” That Fuels Alzheimer’s Brain Inflammation

    Trees Emit Tiny Lightning Flashes During Storms and Scientists Finally Prove It

    Pomegranate Compound Could Help Protect Against Heart Disease

    Your Blood Test Might Already Show Alzheimer’s Risk

    Scientists Were Wrong About This Strange “Rule-Breaking” Particle

    Follow SciTechDaily
    • Facebook
    • Twitter
    • YouTube
    • Pinterest
    • Newsletter
    • RSS
    SciTech News
    • Biology News
    • Chemistry News
    • Earth News
    • Health News
    • Physics News
    • Science News
    • Space News
    • Technology News
    Recent Posts
    • Harvard Breakthrough Brings Powerful UV Light Sources Onto a Chip
    • This Strange Quantum “Dance” Could Rewrite Superconductivity
    • Scientists Make Breakthrough in Turning Plastic Trash Into Clean Fuel Using Sunlight
    • Scientists Complete Largest 3D Map of the Universe to Probe Dark Energy
    • Hidden Parasite Found in Popular Portuguese Lake Raises Health Concerns
    Copyright © 1998 - 2026 SciTechDaily. All Rights Reserved.
    • Science News
    • About
    • Contact
    • Editorial Board
    • Privacy Policy
    • Terms of Use

    Type above and press Enter to search. Press Esc to cancel.