Relay catalysis connects multiple catalytic reactions so that the product of one step becomes the reactant of the next. This approach can improve efficiency, increase selectivity, and reduce energy use. However, designing such multi-step pathways is often slow and challenging. Chemists need to search through a lot of scattered papers, compare different conditions, and check whether each step works well together in practice.
A New AI-Assisted Framework
In a study published in the journal National Science Review, Jun Cheng and Ye Wang from Xiamen University, together with Jeff Z. Pan from the University of Edinburgh, developed a new AI-driven method to make this process faster and more reliable. Their method combines large language models (LLMs) with a custom-built catalysis knowledge graph called Cat-KG. The goal is to recommend useful relay catalysis pathways that are easily accessible, easy to understand, and clearly linked to the original research sources.
How It Works
The team built Cat-KG by using LLMs to extract key reaction data from over 15,000 published catalysis papers. This includes information on reactants, products, catalysts, reaction conditions, and performance. All data are cleaned, organized, and stored in a graph database, where each reaction is linked to its source article. This makes it easy for chemists to trace back every recommended step to the original literature.
To find suitable pathways, the system uses graph-based searches combined with chemistry-informed filtering rules. These rules help identify reaction sequences that are not only possible but also practical—ensuring, for example, that the temperature or gas used in one step doesn't conflict with the next. The filtered results are then summarized by the LLM in plain language and chemical equations, making them easy to understand and evaluate.

Workflow diagram of the construction of the Cat-KG and application for relay catalysis. The initial phase involves the selection of a catalytic schema and the acquisition of data from catalysis-centric literature to establish the Cat-KG (blue), followed by the recommendation of a reaction pathway (purple). Shared steps are marked with the corresponding color.
Results and Impact
For several important target molecules—such as ethylene, ethanol, and 2,5-furandicarboxylic acid—the system successfully found relay pathways that match those already proven in the lab. It also suggested 20 new and unreported pathways for further experimental study. Most of these results can be generated within minutes.
Why It Matters
Unlike black-box AI models, this method is transparent, explainable, and traceable. Every recommended pathway comes with supporting data and literature links, helping chemists evaluate the suggestions before starting experiments. The system is flexible and can be updated with better LLMs or applied to new areas like photocatalysis and electrocatalysis. The team also plans to improve the model by learning from expert feedback in future versions.
The Cat-KG constructed in this study is publicly accessible and supports catalytic reaction queries at: https://ai4ec.ac.cn/apps/chembrain
Looking Forward
Currently, this work mainly focuses on selecting each reaction step individually. Future research will place greater emphasis on handling more complex interactions between steps, such as considering coupling effects between catalysts, assessing catalyst stability under practical reaction conditions, and taking into account economic and operational feasibility, to make the entire catalytic process run more smoothly under real-world conditions.
About the Research Team
Jun Cheng – State Key Laboratory of Physical Chemistry of Solid Surface, iChEM, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; Tan Kah Kee Innovation Laboratory (IKKEM), Laboratory of AI for Electrochemistry (AI4EC lab), Xiamen 361005, China; Institute of Artificial Intelligence, Xiamen University, Xiamen 361005, China.
For more information, please visit the team's websites: https://www.cheng-group.net