Substructure search with SMARTS query against ChEMBLDB #rdkit #razi #pychembldb

Recently I often use razi for making structure search because it is very easy to integrate many workflow written in python.

Today I would like to show how to perform substructure search with SMARTS query in ChEMBL. Because I’m modifying pychembldb to integrate razi for enabling structure search in pychembldb.

To perform substructure seach with SMARTS in razi, SMARTS string should be convert query object with mol_from_smarts method.

So I wrote sample code which search MCS from given compound list and perform search with the MCS. Code is shown below.

Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

https://github.com/iwatobipen/playground/blob/master/substrcut_seach_razi.ipynb

Modified version of pychembldb seems work fine. But sometime it takes long time for retrieving response, it depends on query smarts.

I would like to know way to improve the search speed if there is. Any suggestions comments are greatly appreciated ;)

Advertisement

Published by iwatobipen

I'm medicinal chemist in mid size of pharmaceutical company. I love chemoinfo, cording, organic synthesis, my family.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.

%d bloggers like this: