Abstract
The current generation of large language models (LLMs), such as ChatGPT, has limited chemical knowledge. Recently, it has been shown that these LLMs can learn and predict chemical properties through fine-tuning. In this work, we explore the potential and limitations of this approach. We studied the performance of fine-tuning GPT-J-6B, a public-domain model of the GPT family, on a range of different chemical questions. We find that in most, if not all, cases this approach outperforms the benchmark (random guessing) for a simple classification problem. Depending on the size of the dataset and the type of question, more sophisticated problems can also be addressed. The most important conclusions of this work are that, for all datasets considered, conversion into an LLM fine-tuning training set is straightforward, and that fine-tuning with even relatively small datasets yields predictive models. These results suggest that the systematic use of LLMs to guide experiments and simulations could become a powerful technique in research studies, significantly reducing the number of unnecessary experiments or computations.
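To illustrate the claim that converting a dataset into an LLM fine-tuning training set is straightforward, the sketch below turns labeled data points into prompt/completion pairs in the JSONL format commonly used by fine-tuning APIs. The dataset, property name, and stop-sequence tokens (`###`, `@@@`) here are illustrative assumptions, not taken from the paper.

```python
import json

def to_finetune_example(material, property_name, label):
    """Convert one labeled data point into a prompt/completion pair
    (the JSONL record format used by common LLM fine-tuning APIs).
    The '###' and '@@@' separators are arbitrary stop markers."""
    prompt = f"What is the {property_name} of {material}?###"
    completion = f" {label}@@@"
    return {"prompt": prompt, "completion": completion}

# Hypothetical toy dataset for a binary classification question.
dataset = [
    ("MOF-5", "high"),
    ("ZIF-8", "low"),
]

# One JSON object per line, ready to upload as a fine-tuning file.
jsonl = "\n".join(
    json.dumps(to_finetune_example(m, "CO2 adsorption class", y))
    for m, y in dataset
)
print(jsonl)
```

Each record pairs a natural-language question with the target label, so the fine-tuned model learns to answer the classification question directly.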
Supplementary materials
Title
Supporting Information
Description
Detailed report of all case studies considered in this work.