S-4-31 Prediction of Tc Upper Limits for Superconducting Materials Assisted by Machine Learning

Prediction of Tc Upper Limits for Superconducting Materials Assisted by Machine Learning

Haiyou Huang1*, Yang Liu1, Yan Zhang1, Hongyuan Feng1, Ning Chen1, Yang Li2, Kui Jin3, Dezhen Xue4, Yanjing Su1

1 University of Science and Technology Beijing, Beijing, 100083, China.

2 Department of Engineering Science and Materials, University of Puerto Rico, Mayaguez, Puerto Rico 00681-9000, USA.

3 Institute of Physics, Chinese Academy of Sciences, Beijing, 100190, China.

4 State Key Laboratory for Mechanical Behavior of Materials, Xi'an Jiaotong University, Xi'an 710049, China.

 

ABSTRACT: Finding room temperature superconductors is a dream of generations. However, in reality, finding a superconductor is easy, finding a high temperature superconductor is hard. We are so longing for a convenient way to predict the critical temperature (Tc) of the superconductors. Owing that there are multiple factors affecting the superconductivity, we need a holistic perspective to learn the variation law of Tc. So, the problem of the superconductivity is to find the pattern in high dimensional space of the factors affecting Tc, is exactly calling for the help of the machine learning (ML). Usually, when taking about predict superconductors, we are thinking of two goals: one is to foresee a material superconduct or not, the other is to predict the Tc of each chemical composition.

However, for the ML, those two goals are less feasible. It is hard to know whether a material superconduct or not before knowing its magnetism, which is also unpredictable for now. On the other hand, the need of materials scientists is to predict brand new materials, instead of composition optimization. There is no need to predict the Tc of each composition, the ML should be used to predict the Tc maxima (Tcmax) of various kinds of materials. Hence, owing that our goal is to predict the Tcmax of new materials, the generalization ability of the ML models is of great significance.

Recently, encouraging progresses have been made in several ML studies on the Tc of superconductors. So far, the predictive score on the test data is quite good. However, their performance of those ML models on predicting unknown materials is not examined. We noticed that the previous studies, the Tc and the Tcmax were often confused with each other. The models were trained with the data of Tc, but prediction is on the Tcmax. Actually, they are two properties having very different meanings. Owing that the data distribution of the Tc is obviously imbalanced, the models trained with the Tc have poor generalization ability, which makes it not suitable for the prediction of the Tcmax. When predicting the Tcmax, the generalization ability of the models is the key to obtain reliable results.

In this work, we were looking for a strategy to get reliable ML models predicting the Tcmax of superconducting materials. We categorized the data in the superconductor database, obtaining ~1000 kinds of superconducting materials. In each kind of superconducting materials, we choose the highest Tc as the Tcmax of this kind of superconducting materials. Meanwhile, we designed a novel feature set, according to the orbital attributes of the valence electrons. These features may carry the information about the origin of the superconductivity, which lead to better interpretability of the ML models. The ML models trained with the dataset of the Tcmax, and the features derived from the valence electrons, yield good results, which help us find a new universal empirical law of Tcmax. It infers a necessary but insufficient condition of the existence of high Tcmax, providing an effective guidance for the designation of superior superconductors.

 

Keywords: superconductor;critical temperature;machine learning

Brief Introduction of Speaker
Haiyou Huang

Haiyou Huang has completed his PhD from Department of Material Physics and Chemistry, School of Material Science and Engineering, University of Science and Technology Beijing in 2007, and Postdoctoral studies from the Hong Kong University of Science and Technology in 2009. Now, he is associate professor and master supervisor of University of Science and Technology Beijing. He has published 3 books and more than 60 papers in reputed journals such as npj computational materials, Applied Physics Letters、APL Materials、Scripta Materials, Materials Design, and has applied 13 Chinese invention patents. In 2017, he won the second prize of Natural Science Award of the Ministry of Education. His main research directions include: MGE database and big-data technology; data driven research and development of advanced materials; shape memory alloy, etc.