A schematic of the multi-fidelity graph network approach to predicting the properties of new materials. Image: Materials Virtual Lab at UC San Diego.
A schematic of the multi-fidelity graph network approach to predicting the properties of new materials. Image: Materials Virtual Lab at UC San Diego.

Advances in energy technologies, healthcare, semiconductors and food production all have one thing in common: they rely on developing new materials – new combinations of atoms – with specific properties that allow them to perform a needed function. In the not-too-distant past, the only way to know what properties a material had was by performing experimental measurements or by using very expensive computations.

More recently, scientists have been using machine-learning algorithms to rapidly predict the properties that certain arrangements of atoms should have. The challenge with this approach is that it requires a lot of highly accurate data to train the model, which often does not exist.

Now, by combining large amounts of low-fidelity data with smaller quantities of high-fidelity data, nanoengineers from the Materials Virtual Lab at the University of California (UC) San Diego have developed a new machine-learning method to predict the properties of materials with more accuracy than existing models. Crucially, their approach is also the first to be able to predict the properties of disordered materials – those with atomic sites that can be occupied by more than one element, or can be vacant. The researchers report their novel multi-fidelity graph networks approach in a paper in Nature Computational Science.

"When you are designing a new material, one of the key things you want to know is if the material is likely to be stable, and what kind of properties it has," said Shyue Ping Ong, a professor of nanoengineering at the UC San Diego Jacobs School of Engineering and the paper's corresponding author. "The fundamental problem is that valuable accurate data, such as experimental measurements, is difficult to come by, even though we have large databases of less accurate computed properties. Here, we try to get the best of both worlds – combine the large low-fidelity data and the smaller high-fidelity data to improve the models' accuracy in high value predictions."

While other multi-fidelity approaches exist, these methods do not scale well or are limited to only two fidelities of data. They are not as accurate or dynamic as this new multi-fidelity graph network approach, which can work with an unlimited number of data fidelities and can be scaled up very quickly.

In this study, the nanoengineers looked specifically at materials' band gaps – a property that determines electrical conductivity, the color of the material, solar cell efficiency, and more – as a proof-of-concept. Compared with a traditional single-fidelity approach, their multi-fidelity graph networks led to a 22–45% decrease in the mean absolute errors of experimental band-gap predictions. The researchers also showed that their approach can accurately predict high-fidelity molecular energies as well.

"There is no fundamental limitation as to what properties this can be applied to," said Ong. "The question is which kind of properties we have data on." In the near term, Ong's team plans to use this new method to develop better materials for energy storage, photovoltaic cells and semiconductor devices.

While predicting the properties of ordered materials, the team made another serendipitous discovery. In the graph deep-learning model they use, atomic attributes are represented as a learned length-16 embedding vector. By interpolating these learned embedding vectors, the researchers found they could also create a predictive model for disordered materials, which have atomic sites that can be occupied by more than one element or can be vacant at times, making them harder to study using traditional methods.

"While the bulk of computational and machine learning works have focused on ordered materials, disordered compounds actually form the majority of known materials," said Chi Chen, an assistant project scientist in Ong's lab, and first author of the paper. "Using this approach, multi-fidelity graph network models can reproduce trends in the band gaps in disordered materials to good accuracy."

This opens the door to much faster and more accurate design of new materials to meet key societal needs.

"What we show in this work is you can actually adapt a machine-learning algorithm to predict the properties of disordered materials. In other words, now we are able to do materials discovery and prediction across the entire space of both ordered and disordered materials rather than just ordered materials," said Ong. "As far as we know, that is a first."

This story is adapted from material from the University of California, San Diego, with editorial changes made by Materials Today. The views expressed in this article do not necessarily represent those of Elsevier. Link to original source.