Learning and Charting Chemical Space with Strings and Graphs: Challenges and Opportunities for AI and Machine Learning
Informatics methods and computers have not yet become as pervasive in chemistry as they have in physics and biology. Drawing analogies from bioinformatics, key ingredients for progress in chemoinformatics are the availability of large, annotated databases of compounds and reactions, data structures and algorithms to efficiently search these databases, and computational methods to predict the physical, chemical, and biological properties of new compounds and reactions. We will describe how graph-based methods play a key role in the development of:
- a large public database of compounds and reactions (ChemDB) and the underlying algorithms and representations;
- machine learning kernel methods to predict molecular properties; and
- the applications of these methods to drug screening/design problems and the identification of new drug leads against a major disease.
Author: Pierre Baldi, University Of California