Hostname: page-component-6766d58669-fx4k7 Total loading time: 0 Render date: 2026-05-20T18:04:47.900Z Has data issue: false hasContentIssue false

Materials science with large-scale data and informatics: Unlocking new opportunities

Published online by Cambridge University Press:  04 May 2016

Joanne Hill
Affiliation:
Citrine Informatics, USA; jo@citrine.io
Gregory Mulholland
Affiliation:
Citrine Informatics, USA; greg@citrine.io
Kristin Persson
Affiliation:
Lawrence Berkeley National Laboratory, USA; kapersson@lbl.gov
Ram Seshadri
Affiliation:
University of California, Santa Barbara, USA; seshadri@mrl.ucsb.edu
Chris Wolverton
Affiliation:
Northwestern University, USA; c-wolverton@northwestern.edu
Bryce Meredig
Affiliation:
Citrine Informatics, USA; bryce@citrine.io

Abstract

Universal access to abundant scientific data, and the software to analyze the data at scale, could fundamentally transform the field of materials science. Today, the materials community faces serious challenges to bringing about this data-accelerated research paradigm, including diversity of research areas within materials, lack of data standards, and missing incentives for sharing, among others. Nonetheless, the landscape is rapidly changing in ways that should benefit the entire materials research enterprise. We provide an overview of the current state of the materials data and informatics landscape, highlighting a few selected efforts that make more data freely available and useful to materials researchers.

Information

Type
Research Article
Copyright
Copyright © Materials Research Society 2016 
Figure 0

Figure 1. Flowchart illustrating only a fraction of the tremendous variety in how materials researchers generate, manipulate, write about, distribute, and discover data. Note: USB, universal serial bus; PC, personal computer.

Figure 1

Figure 2. A partial list of the hundreds of organizations that all play important roles in the generation, distribution, and storage of materials data. Note: AFLOW, Automatic-FLOW for Materials Discovery; APS, American Physical Society; ACS, American Chemical Society; MRS, Materials Research Society; TMS, The Minerals, Metals and Materials Society; ASM, American Society for Metals; NIMS, National Institute for Materials Science; ICSD, Inorganic Crystal Structure Database; AFLOWLIB, Repository associated with the AFLOW project.

Figure 2

Table I. A list of some notable materials-data resources.

Figure 3

Figure 3. Access statistics (page views/month) for the Open Quantum Materials Database (OQMD) paper after it appeared on the Citrination platform. The spike in August 2015 resulted from Google indexing the OQMD data set on Citrination.

Figure 4

Figure 4. Screenshot of a battery data-mining resource,72 displaying the average potential of the electrode as a function of the calculated discharge capacity after the 25th cycle. Such visualizations make it easier for materials scientists to identify important patterns in very large data sets. The symbol size is proportional to the percentage capacity that is retained after 25 cycles (larger is better), and the color indicates the crystal structure type of the active electrode material. Hovering the mouse over data points allows metadata to be read, and clicking on the points takes the reader to the original literature.73 For thermoelectrics, see Reference 71. Note: CB, carbon black; PVDF&DMP, poly(vinylidene fluoride) & dimethyl phthalate; EC, ethylene carbonate; DMC, dimethyl carbonate; C, charge rate.

Figure 5

Figure 5. An Ashby plot auto-generated using structured data accessed via the Citrination API. Programmatic data access allows for the construction of key materials data visualizations, such as Ashby plots, without tens of hours of manual data entry. Note: AISI, American Iron and Steel Institute; GFRP, glass-fiber-reinforced plastic; CFRP, carbon-fiber-reinforced plastic.