Make predictive models with small data and visualize it #Chemoinformatics

I enjoyed chemoinformatics conference held in Kumamoto in this week. The first day of the conference, I could hear about very interesting lecture. That was very basic data handling and visualization tutorial but useful for newbie of chemoinformatics. I would like to reproduce the code example, so I tried it. First, visualize training data. ItContinue reading “Make predictive models with small data and visualize it #Chemoinformatics”


standardization of tautomers #RDKit

One of the hot topic of new version of RDKit is an integration of MolVS which is tool for molecular standardization. Molecular standardization is important for not only chemist but also chemoinformatist. Because tautomer shows different representation of molecule and it will be affect accuracy of QSAR models. I wrote molecular standardization tools named ‘MolVS’Continue reading “standardization of tautomers #RDKit”

New finger print calculation method in RDKit #RDKit

It’s a good news for RDKitters! New version of rdkit is released and it can be installed with Anaconda! There are many implementations and enhancements. You can find details of that from URL below. One of interesting feature is a fingerprint bit information rendering function. There is nice blog post about that. AndContinue reading “New finger print calculation method in RDKit #RDKit”

(new?) medchem tool box for compound synthesis

This mini perspective shows recent progress of the direct C-H alkylation with Alkyl Sulfinates. There are many heterocyclic moieties such as pyridine, pyrrole etc. in drug like molecules. The C-H alkylation reaction of heterocycles is useful but difficult to conduct it under mild reaction conditions. So mild and universal reaction is very attractive forContinue reading “(new?) medchem tool box for compound synthesis”

Draw similarity network #RDKit #Cyjupyter

Recently Kei Ono who is developer of cytoscape developed cyjupyter. It seems attractive for me because the library can draw network diagram on jupyter notebook. There are many network structured data in chemoinformatics. For example molecule, molecular similarity map and MMP etc… I used the library to draw similarity map of molecules today. IContinue reading “Draw similarity network #RDKit #Cyjupyter”

New fingerprint/MinHash FingerPrint #RDKit #Chemoinformatics

Recently I found an article that describe new method for fast fingerprint calculation. You can read the article from chemrxiv, URL is below. They used MinHash method. MinHash method is the way to estimate jaccard similarity very efficiently. The authors developed MHFP (MinHash Fingerprint) and compared the performance with ECFP4. ”’ ? MinHash ?Continue reading “New fingerprint/MinHash FingerPrint #RDKit #Chemoinformatics”