Google Scientists Discovered 380,000 New Materials Using Artificial Intelligence

Robot Holding Futuristic Crystal

The Materials Project, an open-access database for new materials, is revolutionizing how researchers discover and develop materials for future technologies, with Google DeepMind contributing 400,000 new compounds. This synergy of AI, supercomputing, and experimental data speeds up the creation of materials for applications like renewable energy, efficient electronics, and environmental solutions. (Artist’s concept). Credit:

The expansion of the open-access resource is instrumental for scientists in developing novel materials for future technologies.

New advancements in technology frequently necessitate the development of novel materials – and thanks to supercomputers and advanced simulations, researchers can bypass the time-consuming and often inefficient process of trial-and-error.

The Materials Project, an open-access database founded at the Department of Energy’s Lawrence Berkeley National Laboratory (Berkeley Lab) in 2011, computes the properties of both known and predicted materials. Researchers can focus on promising materials for future technologies – think lighter alloys that improve fuel economy in cars, more efficient solar cells to boost renewable energy, or faster transistors for the next generation of computers. 

Now, Google DeepMind – Google’s artificial intelligence lab – is contributing nearly 400,000 new compounds to the Materials Project, expanding the amount of information researchers can draw upon. The dataset includes how the atoms of a material are arranged (the crystal structure) and how stable it is (formation energy). 

The Materials Project can visualize the atomic structure of materials. This compound (Ba₆Nb₇O₂₁) is one of the new materials calculated by GNoME. It contains barium (blue), niobium (white), and oxygen (green). Credit: Materials Project/Berkeley Lab

“We have to create new materials if we are going to address the global environmental and climate challenges,” said Kristin Persson, the founder and director of the Materials Project at Berkeley Lab and a professor at UC Berkeley. “With innovation in materials, we can potentially develop recyclable plastics, harness waste energy, make better batteries, and build cheaper solar panels that last longer, among many other things.”

The Role of GNoME in Material Discovery

To generate the new data, Google DeepMind developed a deep learning tool called Graph Networks for Materials Exploration, or GNoME. Researchers trained GNoME using workflows and data that were developed over a decade by the Materials Project, and improved the GNoME algorithm through active learning. GNoME researchers ultimately produced 2.2 million crystal structures, including 380,000 that they are adding to the Materials Project and predict are stable, making them potentially useful in future technologies. The new results from Google DeepMind were recently published in the journal Nature.

A Lab

Robots guided by artificial intelligence created more than 40 new materials predicted by the Materials Project. Data from GNoME was used as an additional check on whether those predicted materials would be stable. Credit: Marilyn Sargent/Berkeley Lab

Some of the computations from GNoME were used alongside data from the Materials Project to test A-Lab, a facility at Berkeley Lab where artificial intelligence guides robots in making new materials. A-Lab’s first paper, also published in Nature, showed that the autonomous lab can quickly discover novel materials with minimal human input. 

Over 17 days of independent operation, A-Lab successfully produced 41 new compounds out of an attempted 58 – a rate of more than two new materials per day. For comparison, it can take a human researcher months of guesswork and experimentation to create one new material, if they ever reach the desired material at all. 

To make the novel compounds predicted by the Materials Project, A-Lab’s AI created new recipes by combing through scientific papers and using active learning to make adjustments. Data from the Materials Project and GNoME were used to evaluate the materials’ predicted stability.

Grid of Crystal Structures From Materials Project

The Materials Project at Berkeley Lab gives researchers access to crucial information on diverse materials. This image shows the structures of 12 compounds in the Materials Project database. Credit: Jenny Nuss/Berkeley Lab

“We had this staggering 71% success rate, and we already have a few ways to improve it,” said Gerd Ceder, the principal investigator for A-Lab and a scientist at Berkeley Lab and UC Berkeley. “We’ve shown that combining the theory and data side with automation has incredible results. We can make and test materials faster than ever before, and adding more data points to the Materials Project means we can make even smarter choices.” 

The Impact and Future of the Materials Project

The Materials Project is the most widely used open-access repository of information on inorganic materials in the world. The database holds millions of properties on hundreds of thousands of structures and molecules, information primarily processed at Berkeley Lab’s National Energy Research Science Computing Center. More than 400,000 people are registered as users of the site and, on average, more than four papers citing the Materials Project are published every day. The contribution from Google DeepMind is the biggest addition of structure-stability data from a group since the Materials Project began.

“We hope that the GNoME project will drive forward research into inorganic crystals,” said Ekin Dogus Cubuk, lead of Google DeepMind’s Materials Discovery team. “External researchers have already verified more than 736 of GNoME’s new materials through concurrent, independent physical experiments, demonstrating that our model’s discoveries can be realized in laboratories.”

This one-minute timelapse shows how people around the world use the Materials Project over the course of four hours. The data dashboard shows a rolling one-hour window of worldwide Materials Project activity, including the number of requests, the country of users, and the most commonly queried material properties. Credit: Patrick Huck/Berkeley Lab

The Materials Project is now processing the compounds from Google DeepMind and adding them into the online database. The new data will be freely available to researchers, and also feed into projects such as A-Lab that partner with the Materials Project.

“I’m really excited that people are using the work we’ve done to produce an unprecedented amount of materials information,” said Persson, who is also the director of Berkeley Lab’s Molecular Foundry. “This is what I set out to do with the Materials Project: To not only make the data that I produced free and available to accelerate materials design for the world, but also to teach the world what computations can do for you. They can scan large spaces for new compounds and properties more efficiently and rapidly than experiments alone can.”


Many of the calculations for the Materials Project are performed on supercomputers at Berkeley Lab’s National Energy Research Scientific Computing Center. Credit: Thor Swift/Berkeley Lab

By following promising leads from data in the Materials Project over the past decade, researchers have experimentally confirmed useful properties in new materials across several areas. Some show potential for use:

  • in carbon capture (pulling carbon dioxide from the atmosphere)
  • as photocatalysts (materials that speed up chemical reactions in response to light and could be used to break down pollutants or generate hydrogen)
  • as thermoelectrics (materials that could help harness waste heat and turn it into electrical power)
  • as transparent conductors (which might be useful in solar cells, touch screens, or LEDs) 

Of course, finding these prospective materials is only one of many steps to solving some of humanity’s big technology challenges.

“Making a material is not for the faint of heart,” Persson said. “It takes a long time to take a material from computation to commercialization. It has to have the right properties, work within devices, be able to scale, and have the right cost efficiency and performance. The goal with the Materials Project and facilities like A-Lab is to harness data, enable data-driven exploration, and ultimately give companies more viable shots on goal.”

Reference: “An autonomous laboratory for the accelerated synthesis of novel materials” by Nathan J. Szymanski, Bernardus Rendy, Yuxing Fei, Rishi E. Kumar, Tanjin He, David Milsted, Matthew J. McDermott, Max Gallant, Ekin Dogus Cubuk, Amil Merchant, Haegyeom Kim, Anubhav Jain, Christopher J. Bartel, Kristin Persson, Yan Zeng and Gerbrand Ceder, 29 November 2023, Nature.
DOI: 10.1038/s41586-023-06734-w

7 Comments on "Google Scientists Discovered 380,000 New Materials Using Artificial Intelligence"

  1. Advancement comes easy, but more materialisation means more chemical emission in atmosphere with new challenges to quality of biological life. In other words, materialisation at the risk of harmful environment for future generations.

  2. This is shameless nonsensical propaganda. Prof. Robert Palgrave of University of College London already debunked Ceder and his wife (former postdoc and loyal conspirator) on Twitter (X) and elsewhere. These computers are not discovering anything. They are finding old forgotten compounds that don’t do much.

    • Apparently the reviewers and editor(s) of Nature don’t share your opinion. Considering that you are using the ‘handle’ of Darth Ceder, I have to wonder if this is about some personal issues with Ger(bran)d Ceder. Links are allowed here. How about a link to a source that supports your assertions?

  3. Don’t believe everything you read Clyde. Nature has had many retractions this past year. Here is another Nature article itself calling out the Nature editors for that paper.

    And if pay walls stop you, go here

    I can answer technic details since I’m a subject matter expert.

    • I don’t believe everything I read. That’s why I asked.

    • Thank you for the links. Neither were available to me in their entirety, but I saw enough to strongly suggest that the LBNL is claiming more than is justified by the work.

      Unfortunately, that seems to be all too common lately, which is one of the reasons I comment so frequently, particularly in the area of climatology, where LBNL is also active. These press releases probably leave the laymen readers with the impression that the quality of the work is much higher than it is. The give away, if the reader is paying attention, us that there are contradictory conclusions that are often released nearly simultaneously. Yet, they are usually presented as though there is zero margin of error and no doubt.

      It seems that workers in most disciplines are unfamiliar with Chamberlain’s “Method of Multiple Working Hypotheses” and that most researchers implicitly assume that there is only one process involved in complex systems.

  4. P.s. if you have ever met him, you’d know why I went with Darth Ceder. He likes this name, and picked it himself to contain his ego.

Leave a comment

Email address is optional. If provided, your email will not be published or shared.