Literature-ML-Validation Automation in the Ecosystem of an HEA Database
The ULtrahigh TEmperature Refractory Alloys (ULTERA) database, developed under ARPA-E's ULTIMATE program, focuses on high entropy alloys (HEAs). Its main purpose is to automate the integration of data from literature extraction (both manual and via natural language processing), generative modeling of hypothetical HEAs, predictive modeling, and experimental or computational validation. It also connects a wide range of data sources, including manual collection by researchers, external open databases, and contributions from our industry partners. The data are merged in real time, fully automatically, in the cloud, so that every project component operates on the best available dataset. Thus, at any given time, generative modeling starts from the best available dataset, and experiments and simulations can be run on the most promising candidate materials.
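As a rough illustration of the merging idea described above, the sketch below combines records from several sources and keeps, for each alloy and property, the entry from the most trusted source. It is a minimal example under stated assumptions, not ULTERA's actual schema or pipeline: the `Record` fields, the `SOURCE_PRIORITY` ranking, the composition format, and the numeric values are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical trust ranking (higher wins); not taken from ULTERA.
SOURCE_PRIORITY = {
    "generative_model": 0,
    "predictive_model": 1,
    "literature": 2,
    "experiment": 3,
}


@dataclass(frozen=True)
class Record:
    composition: str  # assumed format: space-separated terms, e.g. "Mo25 Nb25"
    prop: str         # property name, e.g. "hardness"
    value: float
    source: str       # must be a key of SOURCE_PRIORITY


def normalize(composition: str) -> str:
    """Sort the terms so equivalent compositions compare equal."""
    return " ".join(sorted(composition.split()))


def merge(*batches):
    """Keep one record per (composition, property), preferring trusted sources."""
    best = {}
    for batch in batches:
        for rec in batch:
            key = (normalize(rec.composition), rec.prop)
            current = best.get(key)
            if (current is None
                    or SOURCE_PRIORITY[rec.source] > SOURCE_PRIORITY[current.source]):
                best[key] = rec
    return best


# Placeholder values for illustration only; they are not measured data.
predicted = [Record("Nb25 Mo25", "hardness", 1.0, "predictive_model")]
measured = [Record("Mo25 Nb25", "hardness", 2.0, "experiment")]
merged = merge(predicted, measured)
```

Here the two records describe the same composition written in a different order, so they collapse into one entry and the experimental value replaces the predicted one. A production system would also need unit handling, provenance tracking, and conflict resolution between sources of equal rank.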