
AI could accelerate the hunt for new physics, but sometimes it knows too much to see what’s right in front of it.
Artificial intelligence could make it much cheaper and faster to search for new laws of physics, according to a new study published in the Journal of Cosmology and Astroparticle Physics (JCAP). But the research also points to an unexpected downside. In some situations, AI can become so dependent on its previous training that it struggles to recognize genuinely new phenomena.
AI has become an important tool in cosmology, helping researchers analyze enormous amounts of data about the universe. Yet investigating ideas that go beyond the current standard cosmological model, known as ΛCDM, remains an extremely expensive computational challenge.
While ΛCDM successfully explains many observed features of the universe, including its expansion and the large-scale distribution of galaxies, scientists do not believe it tells the whole story. Recent observations suggest that phenomena such as massive neutrinos, modified gravity, and evolving dark energy could reveal physics that lies beyond the current model.
Exploring these possibilities requires researchers to generate vast numbers of detailed simulations of virtual universes, each based on different physical assumptions. Producing these simulations often demands enormous computing power and time.
Transfer Learning Offers a Faster Route
The researchers investigated whether a machine learning approach called transfer learning could reduce that burden.
Transfer learning allows an AI system to apply knowledge gained from one task to help it learn another task more efficiently. Rather than starting from scratch, the AI builds on what it has already learned.
For this study, the team first trained a neural network using simulations based on ΛCDM. This initial training process, known as pretraining, gave the AI a foundation before it was exposed to more complex cosmological models that include possible new physics.
“It’s basically a shortcut,” explains Adrian Bayer a cosmologist at the Flatiron Institute and Princeton University, co-author of the study. “Usually people train the AI directly on the most computationally expensive simulations. What we do instead is first use simpler and less expensive ΛCDM simulations to give the AI an idea of what’s happening, and only afterward move to the more complex models.”
Bayer compares the process to learning from textbooks. “You first read a basic book to get an idea of the knowledge,” says Bayer, “and then move to the really complicated book.”
According to Veena Krishnaraj, an undergraduate student at Princeton University and the paper’s first author, this approach prevents the AI from having to “digest everything at once.”
The strategy proved highly effective. In some cases, transfer learning reduced the number of costly simulations required by more than a factor of ten.
When Prior Knowledge Becomes a Problem
The study also revealed a less obvious challenge known as negative transfer.
Using Bayer’s textbook analogy, imagine a medical student learning from introductory materials and later encountering a rare disease that resembles a common illness. Existing knowledge is usually helpful, but it can sometimes lead to the wrong conclusion.
A similar problem can arise in AI systems. Certain signals produced by new physics can look very similar to patterns the AI already learned from the standard cosmological model. When that happens, the AI may interpret the new information through the lens of its earlier training, making it more difficult to recognize something truly different.
The researchers saw this effect while studying simulations that included massive neutrinos. Some of the observable consequences of neutrino mass closely resemble changes associated with an existing ΛCDM parameter called σ8, which measures how strongly matter clusters throughout the universe.
Because the two effects can appear so similar, the pretrained neural network initially had trouble telling them apart.
“The negative transfer is not random. It is driven by underlying physical degeneracies in the model,” says Krishnaraj. In other words, different physical parameters can create nearly identical observable signatures, making it difficult for the AI to correctly separate them. “So this is something we need to be aware of and try to mitigate,” she concludes.
Promise and Risks for Future Cosmology
The findings illustrate both the benefits and potential pitfalls of applying foundation model strategies to physics. These approaches are conceptually similar to the techniques used in modern generative AI systems and large language models.
As the authors note in the paper, pretraining can speed up inference, “but may also hinder learning new physics.”
So far, the method has only been tested using simulations. However, the researchers believe it provides an important foundation for future applications involving real astronomical observations.
That could become increasingly valuable as next-generation cosmological surveys begin producing unprecedented volumes of high-precision data about the universe. If used carefully, transfer learning could help scientists analyze that information far more efficiently while continuing the search for physics beyond the Standard Model.
The paper, “Transfer Learning Beyond the Standard Model,” by Veena Krishnaraj, Adrian E. Bayer, Christian Kragh Jespersen, and Peter Melchior, is now available in JSTAT.
Never miss a breakthrough: Join the SciTechDaily newsletter.
Follow us on Google and Google News.
1 Comment
The AI needs to be programmed mostly to identify anomalies, not to recognize already known theories.