Drug discovery is a long, costly process with high failure rates. Early preclinical stages, such as the hit-to-lead process, are critical in directing drug development. Recent AI advancements have shown the potential to streamline early drug discovery processes by improving efficiency, precision, and innovation.
Here are some examples of how AI is being used in hit identification
Data Processing and Analysis
AI can manage and analyze large biological datasets, like genomics or proteomics, enabling efficient data organization and retrieval and providing essential disease insights.
For example, Transcripta Bio uses AI to analyze gene expression data across 200 million experiments, building their Drug-Gene Atlas to help discover new therapeutic candidates. By leveraging AI and machine learning models, they accelerate the analysis of transcriptomic data, which is crucial in understanding drug responses and discovering new drugs that modulate gene expression. Their AI modeling suite can virtually screen billions of compounds for therapeutic benefits, focusing on gene modulation with high speed and accuracy.
Molecule Generation and Biophysical Property Prediction
AI enhances drug discovery through de novo drug design and property prediction. Generative models like NVIDIA MolMIM create new molecules from scratch, while other, predictive models assess these molecules' biological activity, toxicity, metabolism, and target binding affinity. This dual approach enables the generation and rapid optimization of drug candidates, improving efficacy and safety while streamlining drug development.
Atomwise, a member of the NVIDIA Inception program for startups, is a pioneering company in AI-driven drug discovery. Their AtomNet platform uses deep learning to predict the binding of small molecules to protein targets, effectively performing bioactivity prediction and toxicity estimation. Atomwise has partnered with several major pharmaceutical companies, demonstrating the industry's confidence in their AI-driven approach to predictive modeling.
Virtual Screening and Docking
AI accelerates Virtual Screening by evaluating molecules for similarity to known actives (ligand-based) or predicting how well compounds bind to targets (structure-based). Binding affinity prediction further ranks promising candidates based on interaction strength.
Accenture is tailoring the generative AI-powered NVIDIA Blueprints for drug discovery in collaboration with pharmaceutical partners. This approach enhances molecular generation steps by incorporating industry-specific requirements, optimizing binding affinity and pharmacokinetic properties like absorption, distribution, metabolism, and excretion (ADME).
Target Identification, Target Validation, and Structure Prediction
AI helps identify potential drug targets through omics data analysis and predicts protein structures, aiding structure-based drug design.
For example, one of the most transformative advancements in protein structure prediction, AlphaFold2, has revolutionized how proteins are understood, reaching atomic-level accuracy for many proteins. AlphaFold’s predictions are now used extensively to accelerate drug discovery by providing insights into protein interactions with other molecules. The AlphaFold2 database, developed in collaboration with EMBL-EBI, offers free access to predicted structures of nearly all known proteins, contributing to faster target identification for diseases such as cancer and Alzheimer’s.
Molecular Dynamics (MD) Simulations
AI in molecular dynamics simulations enhances the prediction of molecular movements and interactions by efficiently analyzing large datasets. Complementary to wet-lab synthesis and experimentation, MD can provide accurate insights into complex systems like protein folding, drug binding, and material behavior under various conditions.
Iambic Therapeutics, a startup dedicated to AI-driven drug discovery, incorporates quantum mechanics into MD simulations to enhance drug discovery efforts targeting cancer proteins. Their method is a hybrid of deep learning and physics-based simulations, which significantly reduces the data requirements while increasing the precision of predictions for small molecules and their binding interactions.
Image Analysis and Phenotypic Screening
AI processes imaging data in high-content screening, detecting phenotypic changes caused by drug treatment and histopathology analysis, and assesses disease progression or treatment responses from tissue samples.
Recursion, a clinical-stage biotechnology company, leverages AI-driven high-content imaging to explore phenotypic changes at a massive scale. Their platform integrates AI and machine learning to analyze millions of cellular images, identifying phenotypic changes in response to drug compounds. Their goal is to map all possible cellular behaviors, creating a detailed dataset that speeds up the discovery of new drugs.
Synthetic Route Prediction
AI is used to identify efficient pathways for creating molecules and predict reaction outcomes to forecast yields and the feasibility of chemical reactions.
The open-source tool AiZynthFinder is widely used for retrosynthetic planning. It uses Monte Carlo Tree Search (MCTS) and neural networks to break down molecules into purchasable precursors and predict efficient synthetic routes.