This section is about data-mining. With data in this context I am referring to chemical structures and associated data from (mainly) public databases. In general I use the workflow creator “Knime” and combine it as required with Java, SQL and different APIs. These days I am also using more and more Python scripting. Many machine learning and especially AI methods are often done in Python which allows for more flexibility than Knime (for better or worse). Of course, Python can be incorporated into Knime if need be…
I am sharing, as time goes by, some practical (parts) of workflows here on my page which you may study, copy/use, whatever, though a “developed by A. Minidis, Pharmakarma”, even if only in tiny 6pt font, would be appreciated. Maybe even a mail that it helped you. Or a mail if you have improvement suggestions?
Most of these workflows could in principle also be converted to one of the other multitudes of platforms out there, such as e.g. Pipeline Pilot, or other script based languages such as R, if that is more to your liking.
Other code written in Python may be found on my GitHub with appropriate licensing.
You will find details in the Blog section, here are some direct links (to some older entries):
- Knime & External Tool for OCR of structures
- SpotRM+ & batch mode usage
- Part 3: What disease should I …. ? Knime workflows (and Part 1 & 2 linked within)
- Sweet, another publication! Machine Learning in Reaction predictions!
(this refers to a publication which includes Knime workflows)