Abstract
Rapid and sustained growth in the field of immuno-oncology has resulted in expansion of available scientific literature. Gaining valuable insights and establishing deep and often hidden meaningful connections in such a large body of work is the need of the hour. In this report we summarize our findings from a novel Natural Language Programming (NLP)-based approach on a large dataset of >350K scientific publications in immuno-oncology research spanning across two decades (2000-2022) retrieved from the CAS Content Collection. Our analysis led to identification of >300 emerging concepts across major categories such as therapeutic targets, biomarkers, therapies, and types of cancer. We present a “Trend Landscape Map” of emerging concepts in immuno-oncology possessing layers of intricacies – at first glance providing information for the >300 identified concepts arranged hierarchically across 8 major categories and at a deeper level providing detailed quantitative metrics of growth over the last three years (2020-2022). While concepts such as immune checkpoint inhibitors (ICIs), antibody-drug conjugates (ADCs) and chimeric antigenic receptors (CARs) continue to be important in immuno-oncology, their growth over the last three years have been modest. On the other hand, concepts including protein targets such as TROP2, nectin-4, and gasdermins display rapid increase in scientific publications over 2020-2022 while their absolute number of publications remain low potentially indicative of early emergence. Finally, guided by our trend landscape analysis, we performed substance data analysis leveraging data from >3.2 million substances from the CAS Registry and identified potential higher commercial interest in protein/peptide sequences rather than small molecules in cancer immunotherapy as seen with respect to patent publications. It is our hope that our subject matter experts' knowledge-guided big data analysis approach based on the corpus of robustly CAS indexed data provides a comprehensive picture of immuno-oncology as it stands today with the trend landscape map serving as a valuable resource to researchers in this field.
Supplementary materials
Title
Emerging Targets and Therapeutics in Immuno-Oncology Landscape: Insights from Natural Language Processing Analysis
Description
Methods and supplement figures
Actions



![Author ORCID: We display the ORCID iD icon alongside authors names on our website to acknowledge that the ORCiD has been authenticated when entered by the user. To view the users ORCiD record click the icon. [opens in a new tab]](https://www.cambridge.org/engage/assets/public/coe/logo/orcid.png)