
Machine learning tool will help answer fundamental questions about the universe.
A new AI-driven tool allows scientists to analyze vast amounts of LIGO data without human input. It identifies environmental noise sources, such as earthquakes and ocean waves, that interfere with gravitational wave detection, paving the way for improved signal clarity.
Harnessing Machine Learning for Gravitational Wave Data
Scientists at the University of California, Riverside, have developed a new machine learning tool that makes it easier to identify patterns and reduce noise in the vast, complex datasets generated by the LIGO gravitational wave observatory.
At a recent IEEE big-data workshop, the UCR team presented their research on an unsupervised machine-learning approach designed to detect patterns in LIGO’s auxiliary channel data. This method has potential applications beyond LIGO, including large-scale particle accelerator experiments and complex industrial systems.

LIGO: A Gateway to the Universe’s Mysteries
LIGO, the Laser Interferometer Gravitational-Wave Observatory, detects gravitational waves — ripples in spacetime caused by the movement of massive objects. It was the first facility to observe these waves from merging black holes, confirming a key prediction of Einstein’s Theory of Relativity. LIGO consists of two 4-km-long interferometers, located in Hanford, Washington, and Livingston, Louisiana. These observatories use high-powered laser beams to detect gravitational waves, providing new insights into black holes, cosmology, and the most extreme states of matter in the universe.
Both LIGO detectors record thousands of data streams, or channels, which come from environmental sensors placed throughout the sites. These sensors monitor conditions that could affect the observatory’s ability to detect gravitational waves accurately.
A Machine Learning Breakthrough in Noise Detection
“The machine learning approach we developed in close collaboration with LIGO commissioners and stakeholders identifies patterns in data entirely on its own,” said Jonathan Richardson, an assistant professor of physics and astronomy who leads the UCR LIGO group. “We find that it recovers the environmental ‘states’ known to the operators at the LIGO detector sites extremely well, with no human input at all. This opens the door to a powerful new experimental tool we can use to help localize noise couplings and directly guide future improvements to the detectors.”
Richardson explained that the LIGO detectors are extremely sensitive to any type of external disturbance. Ground motion and any type of vibrational motion — from the wind to ocean waves striking the coast of Greenland or the Pacific — can affect the sensitivity of the experiment and the data quality, resulting in “glitches” or periods of increased noise bursts, he said.
Identifying Earthquakes, Microseisms, and More
“Monitoring the environmental conditions is continuously done at the sites,” he said. “LIGO has more than 100,000 auxiliary channels with seismometers and accelerometers sensing the environment where the interferometers are located. The tool we developed can identify different environmental states of interest, such as earthquakes, microseisms, and anthropogenic noise, across a number of carefully selected and curated sensing channels.”
Vagelis Papalexakis, an associate professor of computer science and engineering who holds the Ross Family Chair in Computer Science, presented the team’s paper, titled “Multivariate Time Series Clustering for Environmental State Characterization of Ground-Based Gravitational-Wave Detectors,” at the IEEE’s 5th International Workshop on Big Data & AI Tools, Models, and Use Cases for Innovative Scientific Discovery that took place last month in Washington, D.C.
Letting the Machine Find the Patterns
“The way our machine learning approach works is that we take a model tasked with identifying patterns in a dataset and we let the model find patterns on its own,” Papalexakis said. “The tool was able to identify the same patterns that very closely correspond to the physically meaningful environmental states that are already known to human operators and commissioners at the LIGO sites.”
Papalexakis added that the team had worked with the LIGO Scientific Collaboration to secure the release of a very large dataset that pertains to the analysis reported in the research paper. This data release allows the research community to not only validate the team’s results but also develop new algorithms that seek to identify patterns in the data.
“We have identified a fascinating link between external environmental noise and the presence of certain types of glitches that corrupt the quality of the data,” Papalexakis said. “This discovery has the potential to help eliminate or prevent the occurrence of such noise.”
The team organized and worked through all the LIGO channels for about a year. Richardson noted that the data release was a major undertaking.
Opening Doors for Machine Learning Research
“Our team spearheaded this release on behalf of the whole LIGO Scientific Collaboration, which has about 3,200 members,” he said. “This is the first of these particular types of datasets and we think it’s going to have a large impact in the machine learning and the computer science community.”
Richardson explained that the tool the team developed can take information from signals from numerous heterogeneous sensors that measure different disturbances around the LIGO sites. The tool can distill the information into a single state, he said, that can then be used to search for time series associations of when noise problems occurred in the LIGO detectors and correlate them with the sites’ environmental states at those times.
Aiming for Actionable Changes in LIGO Detectors
“If you can identify the patterns, you can make physical changes to the detector — replace components, for example,” he said. “The hope is that our tool can shed light on physical noise coupling pathways that allow for actionable experimental changes to be made to the LIGO detectors. Our long-term goal is for this tool to be used to detect new associations and new forms of environmental states associated with unknown noise problems in the interferometers.”
Pooyan Goodarzi, a doctoral student working with Richardson and a co-author on the paper, emphasized the importance of releasing the dataset publicly.
“Typically, such data tend to be proprietary,” he said. “We managed, nonetheless, to release a large-scale dataset that we hope results in more interdisciplinary research in data science and machine learning.”
Reference: “Multivariate Time Series Clustering for Environmental State Characterization of Ground-Based Gravitational-Wave Detectors” by Rutuja Gurav, Isaac Kelly, Pooyan Goodarzi, Anamaria Effler, Barry Barish, Evangelos Papalexakis, Jonathan Richardson, 13 December 2024, Computer Science > Machine Learning.
arXiv:2412.09832
The team’s research was supported by a grant from the National Science Foundation awarded through a special program, Advancing Discovery with AI-Powered Tools, focused on applying artificial intelligence/machine learning to address problems in the physical sciences.
Richardson, Papalexakis, and Goodarzi were joined in the research by Rutuja Gurav, a doctoral student working with Papalexakis; Isaac Kelly, a summer undergraduate REU student; Anamaria Effler of the LIGO Livingston Observatory; and Barry Barish, a UCR distinguished professor in physics and astronomy.
Never miss a breakthrough: Join the SciTechDaily newsletter.
Follow us on Google and Google News.
2 Comments
Perhaps they could train the AI to recognize patterns and signals of interest in the discarded “noise trash” that would give insight about movements on and within the Earth.
Couple that with all the seismic data used to visualize the inner layers of the planet, and I would be interested in all the insights we could learn.