Fast prediction of distances between synthetic routes with deep learning

Genheden, Samuel and Engkvist, Ola and Bjerrum, Esben (2022) Fast prediction of distances between synthetic routes with deep learning. Machine Learning: Science and Technology, 3 (1). 015018. ISSN 2632-2153

[thumbnail of Genheden_2022_Mach._Learn.__Sci._Technol._3_015018.pdf] Text
Genheden_2022_Mach._Learn.__Sci._Technol._3_015018.pdf - Published Version

Download (1MB)

Abstract

We expand the recent work on clustering of synthetic routes and train a deep learning model to predict the distances between arbitrary routes. The model is based on a long short-term memory representation of a synthetic route and is trained as a twin network to reproduce the tree edit distance (TED) between two routes. The machine learning approach is approximately two orders of magnitude faster than the TED approach and enables clustering many more routes from a retrosynthesis route prediction. The clusters have a high degree of similarity to the clusters given by the TED-based approach and are accordingly intuitive and explainable. We provide the developed model as open-source.

Item Type: Article
Subjects: STM Open Library > Multidisciplinary
Depositing User: Unnamed user with email support@stmopenlibrary.com
Date Deposited: 11 Jul 2023 04:07
Last Modified: 18 Oct 2025 03:49
URI: http://catalog.article4pub.com/id/eprint/1882

Actions (login required)

View Item
View Item