Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer

Herráiz-Gil, Sara; Nygren-Jiménez, Elisa; Acosta-Alonso, Diana N.; León, Carlos; Guerrero-Aspizua, Sara

doi:10.3390/app15052798

Open AccessReview

Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer

by

Sara Herráiz-Gil

^1,2,3,4,

Elisa Nygren-Jiménez

^5,6,7,8,

Diana N. Acosta-Alonso

¹,

Carlos León

^1,2,3,4

and

Sara Guerrero-Aspizua

^1,2,3,4,*

¹

Department of Bioengineering, Carlos III University, UC3M-IISFJD-CIEMAT-CIBERER, Av. de la Universidad 30, Leganés, 28911 Madrid, Spain

²

Network Research on Rare Diseases (CIBERER), U714, C/Melchor Fernández Almagro, 3, 28029 Madrid, Spain

³

Health Research Institute, Jiménez Díaz Foundation University Hospital (IIS-FJD), Av. de los Reyes Católicos, 2, 28040 Madrid, Spain

⁴

Centre for Energy, Environment and Technology Research (CIEMAT), Av. Complutense 40, 28040 Madrid, Spain

⁵

BioFab i3D Lab-Biofabrication and 3D (Bio)printing Laboratory, Department of Human Anatomy and Embryology, Faculty of Medicine, University of Granada, 18016 Granada, Spain

⁶

Excellence Research Unit “Modeling Nature” (MNat), University of Granada, 18016 Granada, Spain

⁷

Instituto de Investigación Biosanitaria ibs.GRANADA, University of Granada, 18016 Granada, Spain

⁸

Biopathology and Regenerative Medicine Institute (IBIMER), Centre for Biomedical Research (CIBM), University of Granada, 18016 Granada, Spain

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(5), 2798; https://doi.org/10.3390/app15052798

Submission received: 16 February 2025 / Revised: 28 February 2025 / Accepted: 3 March 2025 / Published: 5 March 2025

(This article belongs to the Special Issue Robotics, IoT and AI Technologies in Bioengineering)

Download

Browse Figures

Versions Notes

Abstract

:

Drug discovery and development remains a complex and time-consuming process, often hindered by high costs and low success rates. In the big data era, artificial intelligence (AI) has emerged as a promising tool to accelerate and optimize these processes, particularly in the field of oncology. This review explores the application of AI-based methods for drug repurposing and natural product-inspired drug design in cancer, focusing on their potential to address the challenges and limitations of traditional drug discovery approaches. We delve into various AI-based approaches (machine learning, deep learning, and others) that are currently being employed for these purposes, and the role of experimental techniques in these approaches. By systematically reviewing the literature, we aim to provide a comprehensive overview of the current state of AI-assisted cancer drug discovery workflows, highlighting AI’s contributions to accelerating drug development, reducing costs, and improving therapeutic outcomes. This review also discusses the challenges and opportunities associated with the integration of AI into the drug discovery pipeline, such as data quality, interpretability, and ethical considerations.

Keywords:

drug repurposing; artificial intelligence; machine learning; cancer

1. Introduction

The drug discovery and development pipeline remains a complex and resource-intensive process that requires significant time and financial investment. Typically, the journey from the identification of a lead compound to obtaining market authorization spans a lengthy period of 10 to 15 years. This extensive timeline underscores the considerable technical, regulatory, and economic obstacles that must be navigated throughout the process. Financial investments in this pipeline often exceed USD 1 billion, reflecting the high costs associated with research, development, and regulatory compliance [1]. These challenges are not only financial, but also involve overcoming numerous scientific and logistical hurdles, which can delay progress and increase costs.

Despite notable advances in technology and methodology, the drug development process continues to face significant impediments, particularly in the form of high clinical trial failure rates. These failures are primarily attributed to ongoing concerns regarding the safety and efficacy of compounds. While efforts to optimize the physicochemical properties of compounds have yielded some benefits, ensuring the safety and efficacy of novel therapeutics remains a critical challenge that the industry must address [2]. This challenge is further exacerbated in the context of complex diseases such as cancer. Cancer is characterized by diverse genetic and molecular heterogeneity, which complicates the development of effective treatments. The variability in genetic profiles and molecular pathways among different cancer types and even among patients with the same type of cancer adds layers of complexity to the drug development process [3].

Consequently, given the severity and high global incidence of cancer, there is an urgent need to prioritize the search for new strategies to optimize drug discovery. This priority is driven by the necessity to develop more effective and targeted therapies that can address the unique challenges posed by cancer’s heterogeneity. The integration of innovative technologies and approaches, such as artificial intelligence and machine learning, into the drug discovery process holds promise for overcoming some of these challenges by enabling more precise targeting of therapies and improving the efficiency of the development pipeline.

In this context, artificial intelligence (AI) has emerged as a transformative tool with great potential to significantly impact drug discovery. Through techniques such as machine learning (including convolutional neural networks [CNNs] and recurrent neural networks [RNNs]), deep learning, and evolutionary algorithms, AI makes it possible to analyze extensive biomedical data, predict molecular interactions, and optimize drug development with unparalleled accuracy, leading to significant reductions in time and costs [4]. These technologies also facilitate the identification of potential drug candidates, the prediction of their effects, and the tailoring of treatments to individual patients, enabling personalized medicine [5] (Figure 1).

For example, by using dynamic computational models of the pathophysiological processes derived from drugs, it has been possible to achieve personalized immunotherapy in cancer patients.

AI also plays a fundamental role in the analysis of omics data mining, particularly in the context of integrating genomic information from next-generation sequencing of thousands of individuals with clinical data and predictive computational models. This integration has been demonstrated to represent a significant advancement in the automation of the research and identification of novel therapeutic targets. Omics technologies also facilitate drug repositioning by identifying new therapeutic uses for existing drugs, accelerating the process and reducing attrition rates in development and clinical trials. This is primarily based on analyzing large datasets to identify patterns and relationships between drugs and diseases through computational modeling [6]. Consequently, multi-omics technologies, in conjunction with machine learning (ML) and deep learning (DL), have facilitated substantial progress in data integration. This is attributable to their substantial processing capacity for both linear and non-linear data, thereby establishing a linkage between genomic alterations and specific therapeutic options for patients. However, integration tools have not demonstrated significant performance since they use data generated from multiple databases, which has not allowed for standardized databases, but it is estimated that these challenges can be solved with AI approaches and standardization [7,8]. Furthermore, advances in AI are not only helping to develop more effective targeted therapies but also to predict patient response. For example, AI-based algorithms can identify subgroups of patients likely to respond to specific immunotherapies or combination treatments [4]. AI also plays an increasingly significant role in optimizing polypharmacology strategies and predicting drug side effects. Machine learning models can predict potential side effects even for new substances by analyzing databases of adverse events and drug–protein interactions [9,10]. AI-driven methods pave the way for designing multi-target medications, maximizing treatment effectiveness and reducing side effects [11]. Both strategies contribute to efficient drug development and reduce late-stage clinical trial failures by integrating diverse datasets (Figure 1).

In the domain of cancer research, AI plays a critical role in identifying biomarkers and therapeutic targets. Deep learning and machine learning models facilitate the analysis of thousands of data points, such as genomic and transcriptomic data, to identify key mutations and molecular alterations driving tumor growth [6]. Integrating these data with clinical information (patient history) can provide insights into the efficacy of different treatments for specific genetic profiles. The field has been revolutionized by tools such as AlphaFold, which can predict protein structures with a high degree of accuracy, thus providing essential information for the design of targeted drugs [12].

The integration of AI with multi-omics technologies signifies a paradigm shift in precision oncology. Integrative multi-omics data methods utilize genomic, transcriptomic, epigenomic, and proteomic information to identify crucial interactions between genes, proteins, and molecular pathways [13]. This integration facilitates the design of combined therapies specific to tumor subtypes and accelerates the identification of optimal drug combinations using neural networks and generative approaches, thereby optimizing therapeutic efficacy and minimizing side effects [13,14]. In addition, advances in deep learning have proven effective in predicting the binding affinity between drugs and proteins, a crucial step in the drug discovery and development process for cancer [15].

Complementing these technologies, reinforcement learning in oncology has made it possible to optimize treatment pathways by dynamically adjusting therapies according to the patient’s individual response. This approach holds particular promise in complex treatments such as immunotherapies and target therapies, where the management of multiple clinical variables poses a significant challenge [16].

On the other hand, the increasing interest in the discovery of drugs derived from natural products is also driving the use of computational approaches. AI-based algorithms have improved the identification and optimization of bioactive compounds from natural product databases, enabling the development of innovative strategies in oncology [17].

In summary, the confluence of drug development challenges, advances in AI, and the urgent need for effective cancer treatments underscores the importance of integrating advanced technologies in this field. This review examines the current state and potential impact of AI in cancer drug repurposing and design, highlighting the opportunities and challenges of this technology in modern medicine (Figure 2).

2. AI Techniques for Drug Repurposing and De Novo Drug Design

The use of artificial intelligence techniques has recently become a fundamental part of the process of both drug repurposing and de novo drug design. These techniques have a wide range of applications in drug development and discovery such as virtual screening, toxicity monitoring of the drug, drug efficacy, or dosage prediction, drug repurposing, and drug–target interaction predictions [18]. The selection of AI-based tools depends on the specific application and can significantly impact the results. Machine learning, deep learning, and knowledge graph-based tools are commonly used for drug repurposing, while generative models and reinforcement learning are more suitable for de novo drug design. Other machine learning techniques such as quantum computing have also been used for the prediction of ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties [19], but these applications fall out of the scope of this review. In this sense, although different AI approaches have been used in all stages of drug development, some of them are more effective. Neural networks and classical machine learning are effective for target identification due to their ability to analyze drug–target and protein–protein interaction networks, identify novel binding sites, and predict protein structures. Natural language processing (NLP) and large language models can be very useful in mining data in order to develop clinical trials or monitor adverse effects. Furthermore, multimodal AI models improve drug efficacy prediction by integrating diverse data sources, leading to more comprehensive and accurate insights. Some of this information can be chemical structure data, omics data, biomedical texts, histopathology data, medical imaging data, or electronic health records. Recently, Qiu et al. provided a very thorough list of AI-based methods for drug discovery [20].

2.1. Machine and Deep Learning Techniques

Some of the most common machine learning algorithms used for drug repurposing include k-nearest neighbor (kNN), random forest (RF), or support vector machine (SVM). Furthermore, with the development of deep learning, and thanks to its ability to automatically extract features from raw data, the impact of AI-based approaches in drug repurposing has increased significantly. The use of deep learning algorithms such as artificial neural networks (ANNs), CNNs, and long short-term memory (LSTM) has revolutionized the field. One of the best examples in this field is AlphaFold2, a commonly known, neural network-based model technology that predicts the 3D conformation of proteins, even those without known structures, and therefore offers emerging opportunities for structure-based drug discovery [12].

General machine learning approaches in drug repurposing can be classified according to the type of datasets used. As an example, structural information related to proteins or small molecules has been used for the development of structure-based virtual screening methods, such as molecular docking [21] or, together with binding activity data, ligand-based chemometric modeling methods such as quantitative structure–activity relationship (QSAR) modeling [22]. Other types of data commonly used in machine learning-based drug repurposing approaches include cell phenotype, employed for example through the CellProfiler software (v.4.2.8) [23]. Also, the use of transcriptomic data allows us to uncover novel drug mechanisms. Among this, the use of the L1000 assay data, an improvement over the old Connectivity Map (CMap), stands out, covering information on the variation in gene expression in response to more than 42,000 perturbing agents (including drugs) [24]. Finally, data related to electronic health records, and even social media, are also very helpful to develop deep learning approaches based on natural language processing (NLP) and other algorithms [25].

2.2. Knowledge Graph-Based AI Techniques

The use of graph-based approaches relies on the knowledge of drug–target interactions from databases. The generation and analysis of these networks has been crucial in many recent drug repurposing discoveries [26]. For example, Zeng et al. developed a network-based deep learning model (DeepDR) for the integration of networks, such as drug–disease and drug–target networks, that enabled them to learn high-level drug features through a random walk approach [27].

Another useful strategy involves the integration of genome wide association studies (GWASs) and multi-omics data within network-based AI approaches [20]. As an example, the network topology-based deep learning framework (NETTAG) uses an interpretable AI model to identify disease-associated genes based on multi-omics data and integrate them within the human protein–protein interaction (PPI) network in order to predict repurposable drugs [28].

2.3. Generative AI Models

Generative artificial intelligence models are progressing rapidly, and their use in novel drug discovery presents advancements faster than anybody can recall. Although these models are performing quite well in small-scale and controlled laboratory settings, doubts remain about their performance in more complex conditions [29]. In this sense, the implementation of large language models (LLMs), such as the commonly known ChatGPT, a generative pre-trained transformer technology developed by OpenAI, has recently made this type of tool more approachable and amenable to biomedical researchers [30]. Several generative AI-based tools, such as ChemSpaceAL (v.2.0.1) [31], GraphGPT (v.0.3.1) [32], PEtrans (v.1.0) [33], or DrugChat (v.5.1.4) [34], have been recently developed for different aspects of de novo drug design, such as molecular generation, the selection of contextual molecular features, or the determination of novel targetable sites in proteins.

2.4. Reinforcement Methods

Reinforcement learning methods are AI algorithms used to resolve decision problems with a dynamic approach, combining ANNs with deep reinforcement learning architectures. These methods are based on the analysis and estimation of the statistical relationship between all the possible actions and their outcomes. Afterwards, the algorithm tries to determine the most desirable outcome. The implementation of these approaches has been used for the optimization of the design of chemical libraries for screening and de novo drug design with the ReLEASE software (v.1.0) (Reinforcement Learning for Structural Evolution) [35]. In general, in reinforcement methods, a multilayer ANN serves as the generative model, using inputs like SMILES (structural codification of molecules) or molecular graphs. The model is then trained with data from known bioactive molecules by means of iterative learning and decision-making steps, and finally constructs new outputs and tests their response for optimization. In this sense, they act as a virtual agent that modifies molecules to optimize their properties under the guidance of the neural network [36]. An example of this approach is the development of reinforcement method models for the generation of analogues to a query structure or the generation of compounds predicted to be active against a biological target [37].

3. AI-Guided Applications in Cancer Drug Discovery

This section presents recent projects that use AI-guided approaches for cancer drug development and design. Table 1 summarizes the key aspects of these studies, including the computational methodology, AI model/algorithm, cancer type, and main findings. The studies are categorized based on the AI techniques described in Section 2.

AI in computational biology has significantly advanced cancer therapies by providing innovative tools to address the challenges of traditional drug development. Researchers have developed bioinformatics tools using various AI algorithms, such as DeepPurpose, to tackle drug repositioning in chondrosarcoma [38,40,42]. Most of these approaches require the use of huge amounts of diverse data (specifically, information about disease pathway, proteomic metabolism, drug–disease interaction, gene expression, etc.). The case study of familial melanoma conducted by Esteban-Medina et al. [39] exemplifies the importance of integrating diverse data sources and methodologies. The implementation of a machine learning model allowed for the contextualization of protein drug targets in terms of the functional landscape of the disease and for the identification of candidate drugs for the treatment of melanoma. Similarly, pathway-level analysis and disease-specific modeling further enhance AI-driven drug discovery efforts [41,42]. DRviaSPCN introduces a novel approach by analyzing subpathway (SP) crosstalk networks, capturing the intricate interactions between tumor pathways critical for cancer progression and therapy resistance. Validated through breast cancer datasets, DRviaSPCN demonstrated very good predictive performance, identifying FDA-approved drugs and novel candidates [41].

AI-driven insights into molecular mechanisms reveal new potential therapeutic targets. Different AI approaches are used throughout drug development, but some are more effective than others. For example, neural networks and classical machine learning are effective for target identification due to their ability to analyze drug–target and protein–protein interaction networks, identify novel binding sites, and predict protein structures. In the field of neuroscience, these methods are used to identify new therapeutic targets for neurodegenerative diseases such as Alzheimer’s and Parkinson’s. For example, machine learning models can analyze large gene expression and neuroimaging datasets to identify genes and pathways involved in these diseases. Many of these targets lack known ligands or structural data. Advances in deep generative modeling and machine learning in de novo oncology drug design are providing solutions to this challenge [44,45]. For instance, in a recent paper by Ren et al., they designed and synthesized molecules based on AlphaFold-predicted structures for cyclin-dependent kinase 20 (CDK20). This enzyme had previously been identified in silico as a novel target against hepatocellular carcinoma (HCC). This workflow resulted in a potent molecule with selective anti-proliferative activity against HCC cell lines overexpressing CDK20. Surprisingly, the development process of the de novo compound required only 30 days and involved the synthesis of only seven compounds [43]. These results highlight the efficiency and precision of AI-powered platforms in accelerating early-stage drug discovery, particularly for targets lacking experimental structural data.

4. Integration of AI with Experimental Techniques

AI has become an indispensable tool in the field of drug discovery, offering advanced methods to enhance experimental workflows by increasing efficiency, precision, and data integration.

4.1. AI-Guided High-Throughput Screening (HTS)

Traditionally, HTS involves testing thousands of compounds to identify potential therapeutic candidates, a resource-intensive endeavor. AI-powered models, such as CNNs, optimize this process by analyzing chemical libraries and predicting compound activity, thereby prioritizing candidates with the highest therapeutic potential. For example, the already mentioned AlphaFold algorithm has significantly advanced protein structure prediction, allowing researchers to identify molecular interactions that were previously elusive [12]. This has enabled the development of more targeted and cost-effective screening strategies, revolutionizing the early stages of drug development.

4.2. AI-Assisted Drug Synthesis and Optimization

In addition to screening, AI also plays a critical role in drug synthesis and optimization. Traditional methods of synthesizing complex molecules often rely on iterative and time-consuming experimental approaches. AI-driven retrosynthesis tools, such as Chemprop and MoleculeNet, leverage reaction databases to predict chemical pathways and propose efficient synthesis routes [46,47]. These tools not only streamline the development of new compounds but also minimize costs and reduce the time needed to bring potential drugs to preclinical testing. By automating the prediction of reaction outcomes and toxicity, these models ensure a higher success rate for synthesized molecules, accelerating the transition from lab to clinic.

4.3. AI-Driven In Vitro and In Vivo Testing

AI’s contributions extend to in vitro and in vivo testing, where predictive models guide experimental designs and reduce reliance on labor-intensive methods. Generative adversarial networks (GANs), for instance, simulate cellular responses to novel compounds, offering preliminary insights into efficacy and safety before traditional testing begins [48]. Furthermore, multi-omics AI models integrate genomic, transcriptomic, and proteomic data to predict how specific drug candidates might interact with patient-specific biological pathways. These integrative approaches not only enhance the precision of experimental assays but also facilitate the development of personalized treatment strategies.

4.4. AI for Data Integration in Preclinical Studies

Lastly, AI has proven invaluable for integrating diverse datasets into preclinical studies. By analyzing imaging data, omics datasets, and physiological measurements simultaneously, AI models can identify biomarkers and therapeutic targets that might otherwise remain undetected. For example, deep learning algorithms have been employed to analyze histological images in conjunction with gene expression profiles, uncovering novel insights into tumor microenvironments and drug interactions [49]. This holistic approach enhances the translational potential of preclinical research, paving the way for more effective clinical applications.

5. Challenges and Opportunities

While the integration of artificial intelligence (AI) in drug discovery offers transformative potential, realizing its full benefits requires addressing several key challenges (Figure 3).

5.1. Data Quality and Quantity

A primary concern is the quality and quantity of available data for training AI models. Biomedical datasets often suffer from inconsistencies, noise, and biases, which can compromise predictive accuracy. The proprietary nature of much pharmaceutical data further restricts access for researchers, hindering data sharing and standardization. To overcome these issues, solutions include enhanced data curation, the establishment of standardized dataset preparation protocols, and the use of synthetic data to augment real-world datasets. Initiatives like the Therapeutics Data Commons (TDC) are pivotal in establishing benchmarks for data quality and accessibility, effectively bridging the gap between research institutions and industry [50]. By improving data quality, researchers can enhance the reliability of AI models, leading to more accurate predictions and better therapeutic outcomes [50].

5.2. Interpretability of AI Models

The “black box” nature of many machine learning and deep learning models poses another significant hurdle. The lack of transparency in how these models arrive at their predictions creates challenges in regulatory and clinical settings where trust is paramount. In order to ensure that the predictions of AI models are clinically reliable and understandable to clinicians, explainable AI (XAI) frameworks, such as SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-Agnostic Explanations), are being developed [51,52]. These tools provide insights into the features driving model predictions, thereby building confidence among stakeholders and facilitating regulatory approval. By enhancing model interpretability, researchers can ensure that AI-driven decisions are more transparent and justifiable, which is crucial for gaining acceptance in clinical practice. However, these XAI methods also have their own limitations, including uncertainty in the calculation and distribution of the score estimates, which may lead to generalization of results. Also, LIME struggles with nonlinear dependencies, while SHAP has problems with feature dependencies, as permutation values assume independence. A quantitative comparison between AI models and traditional methodologies in terms of diagnostic and therapeutic efficacy is an active research area, as highlighted in recent studies [53].

Finally, neither method is capable of inferring causality, which limits their applications. All of these limitations need to be overcome in the near future to improve the interpretability of AI models [54].

5.3. Ethical Considerations

The use of AI in drug discovery raises ethical considerations regarding data privacy, algorithmic biases, and equitable access to AI-driven therapies. Biases in training datasets can lead to inequitable treatment predictions, disproportionately affecting under-represented populations. For instance, in drug repurposing, sex-specific differences in genetics, hormones, and metabolism often result in male- or female-biased drug responses and adverse events. Computational approaches are being developed to identify sex-inclusive drug candidates, improving therapeutic options and reducing adverse effects for all patients [55]. Addressing these concerns, an emerging number of members of the scientific community are applying the FAIR (Findable, Accessible, Interoperable, and Reusable) principles. This approach reduces bias in biomedicine by promoting ethical, transparent, and interoperable data management. Biomedical researchers can access diverse, FAIRified datasets, minimizing bias from localized data [56,57]. In 2021, the FDA received 132 regulatory submissions for drugs that included the use of AI in their discovery and development, resulting in a 10-fold increase over the previous year. This federal agency recognizes the transformative potential of AI in drug development and emphasizes rigorous evaluation through well-controlled clinical studies to ensure that the benefits outweigh the risks. Regulatory oversight requires deep technical expertise to review AI-assisted applications, such as target selection or intervention strategies. Maintaining a workforce with technical expertise is vital to provide timely and effective guidance and support innovation in this evolving field [58]. The European Medicines Agency (EMA) has developed an artificial intelligence (AI) workplan for the European medicines regulatory network to guide the use of AI in medicine regulation in Europe up to 2028 [59]. Regulatory frameworks must evolve to include guidelines for AI in healthcare, ensuring that ethical standards are maintained [60]. By prioritizing ethical considerations, the industry can foster trust and ensure that AI technologies benefit all patients equitably.

5.4. Collaboration Across Sectors, Future Trends, and Emerging Opportunities

In addition to the aforementioned challenges, collaboration between academia, industry, and regulatory bodies represents a significant opportunity for advancing AI-driven drug discovery. Cross-sector partnerships can facilitate the sharing of data, tools, and expertise, fostering innovation and reducing barriers to implementation. By working together, stakeholders can address challenges collaboratively and leverage AI’s capabilities to achieve unprecedented levels of efficiency, accuracy, and impact in drug discovery. This collaborative approach not only accelerates the pace of discovery but also ensures that the resulting therapies are safe, effective, and accessible to patients worldwide.

Cross-sector collaboration is a key driver for advancing AI-driven drug discovery. Numerous initiatives exemplify the power of partnerships between academia, industry, and regulatory agencies. For example, major pharmaceutical companies such as Pfizer, Novartis, and AstraZeneca have formed strategic alliances with AI technology companies, leading to the development of innovative tools and platforms for drug discovery. These collaborations have significantly accelerated target identification, drug design, and clinical trial optimization. Additionally, renowned research centers such as the Broad Institute and the National Cancer Institute have established AI research centers dedicated to drug discovery. These centers foster interdisciplinary collaboration, bringing together experts in biology, chemistry, and computer science to develop new AI methodologies and applications. Additionally, public–private initiatives such as the Alliance for Drug Discovery Innovation (ADDI) facilitate collaboration between academia, industry, and regulatory agencies, providing funding, resources, and expertise to drive the development of AI technologies.

Despite these challenges, the opportunities presented by artificial intelligence (AI) in drug discovery are vast and transformative. Emerging technologies, such as multimodal AI, are at the forefront of this revolution, integrating diverse data types—from clinical records to genomic data—into cohesive analytical frameworks. This integration is crucial as it allows for a more comprehensive understanding of the biological systems involved in drug interactions and disease mechanisms. These advancements are driving significant progress in precision medicine, enabling the development of highly targeted therapies that are tailored to the unique genetic and molecular profiles of individual patients. This personalized approach not only enhances treatment efficacy but also minimizes adverse effects, thereby improving overall patient outcomes.

As AI-driven drug discovery advances, ensuring the reliability and clinical applicability of AI models remains a key priority. Experimental validation plays a crucial role in this process, incorporating preclinical testing, real patient data comparisons, and ongoing monitoring to assess model performance. Preclinical studies allow researchers to evaluate AI-generated predictions in controlled settings before transitioning to real-world clinical applications [61]. Furthermore, comparisons with patient data help confirm that AI models provide meaningful and actionable insights [62]. To maintain long-term effectiveness, continuous monitoring systems are being explored to oversee AI performance post-deployment, ensuring adaptability to evolving clinical practices and patient demographics [61,62,63]. These validation strategies are essential for bridging the gap between theoretical AI models and practical, clinically relevant applications in drug discovery.

Furthermore, quantum computing, although still in its infancy, holds immense potential in addressing complex molecular problems that traditional computational methods struggle to solve [64]. By accelerating simulations and optimizing chemical structures, quantum computing could revolutionize de novo drug design, allowing researchers to explore vast chemical spaces and identify promising drug candidates more efficiently. This capability could significantly shorten the drug development timeline and reduce costs, making it a game-changer in the pharmaceutical industry.

Collaboration between academia, industry, and regulatory bodies represents another significant opportunity for advancing AI-driven drug discovery. High-performance computing and cloud computing play a critical role in AI frameworks for drug discovery. These technologies facilitate the processing of large genomic, proteomic, and chemical datasets, as well as the execution of complex computational simulations. For example, cloud computing enables researchers to access scalable and affordable computational resources, which accelerates the development and deployment of AI models. Cross-sector partnerships can facilitate the sharing of data, tools, and expertise, fostering innovation and reducing the barriers to implementation. Such collaborations can lead to the establishment of standardized protocols and best practices that enhance the reliability and reproducibility of AI applications in drug discovery. By addressing challenges collaboratively and leveraging AI’s capabilities, the field of drug discovery can achieve unprecedented levels of efficiency, accuracy, and impact. This collaborative approach not only accelerates the pace of discovery but also ensures that the resulting therapies are safe, effective, and accessible to patients worldwide.

In summary, the integration of AI into drug discovery processes, coupled with advancements in quantum computing and collaborative efforts across sectors, allows the pharmaceutical industry to overcome existing challenges and unlock new therapeutic possibilities. As these technologies continue to evolve, they promise to reshape the landscape of drug development, ultimately leading to more effective treatments and improved health outcomes for patients globally.

Funding

This study was supported in part by grants from the Spanish Ministry of Science and Innovation and the European Regional Development fund (PID2020-119792RB-I00), the Institute of Health Carlos III (RD21/0001/0022, Spanish Network of Advanced Therapies, TERAV-ISCIII), and the Fundación Mutua Madrileña (project: FMM-AP16030-2024). SHG is supported by a UC3M–PhD research training scholarship (PIPF).

Acknowledgments

This paper is dedicated to the mentors who teach us that constant curiosity is the road to happiness. Specifically, we dedicate this paper to the memory of Claudio J. Conti, whose groundbreaking research in the field of cancer and unforgettable personality have inspired generations of scientists and/or disciples.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hughes, J.P.; Rees, S.; Kalindjian, S.B.; Philpott, K.L. Principles of early drug discovery. Br. J. Pharmacol. 2011, 162, 1239–1249. [Google Scholar] [CrossRef] [PubMed]
Waring, M.J.; Arrowsmith, J.; Leach, A.R.; Leeson, P.D.; Mandrell, S.; Owen, R.M.; Pairaudeau, G.; Pennie, W.D.; Pickett, S.D.; Wang, J.; et al. An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat. Rev. Drug Discov. 2015, 14, 475–486. [Google Scholar] [CrossRef] [PubMed]
Vasan, N.; Baselga, J.; Hyman, D.M. A view on drug resistance in cancer. Nature 2019, 575, 299–309. [Google Scholar] [CrossRef]
Garg, P.; Singhal, G.; Kulkarni, P.; Horne, D.; Salgia, R.; Singhal, S.S. Artificial Intelligence-Driven Computational Approaches in the Development of Anticancer Drugs. Cancers 2024, 16, 3884. [Google Scholar] [CrossRef] [PubMed]
Wang, L.; Song, Y.; Wang, H.; Zhang, X.; Wang, M.; He, J.; Li, S.; Zhang, L.; Li, K.; Cao, L. Advances of Artificial Intelligence in Anti-Cancer Drug Design: A Review of the Past Decade. Pharmaceuticals 2023, 16, 253. [Google Scholar] [CrossRef]
Tanoli, Z.; Vaha-Koskela, M.; Aittokallio, T. Artificial intelligence, machine learning, and drug repurposing in cancer. Expert. Opin. Drug Discov. 2021, 16, 977–989. [Google Scholar] [CrossRef]
Cai, Z.; Poulos, R.C.; Liu, J.; Zhong, Q. Machine learning for multi-omics data integration in cancer. iScience 2022, 25, 103798. [Google Scholar] [CrossRef]
Chen, C.; Wang, J.; Pan, D.; Wang, X.; Xu, Y.; Yan, J.; Wang, L.; Yang, X.; Yang, M.; Liu, G.P. Applications of multi-omics analysis in human diseases. MedComm (2020) 2023, 4, e315. [Google Scholar] [CrossRef]
Xu, X.; Yue, L.; Li, B.; Liu, Y.; Wang, Y.; Zhang, W.; Wang, L. DSGAT: Predicting frequencies of drug side effects by graph attention networks. Brief. Bioinform. 2022, 23, bbab586. [Google Scholar] [CrossRef]
Tatonetti, N.P.; Liu, T.; Altman, R.B. Predicting drug side-effects by chemical systems biology. Genome Biol. 2009, 10, 238. [Google Scholar] [CrossRef]
Cichonska, A.; Ravikumar, B.; Rahman, R. AI for targeted polypharmacology: The next frontier in drug discovery. Curr. Opin. Struct. Biol. 2024, 84, 102771. [Google Scholar] [CrossRef] [PubMed]
Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Zidek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
Baião, A.R.; Cai, Z.; Poulos, R.C.; Robinson, P.J.; Reddel, R.R.; Zhong, Q.; Vinga, S.; Gonçalves, E. A technical review of multi-omics data integration methods: From classical statistical to deep generative approaches. arXiv 2025, arXiv:2501.17729. [Google Scholar]
He, X.; Liu, X.; Zuo, F.; Shi, H.; Jing, J. Artificial intelligence-based multi-omics analysis fuels cancer precision medicine. Semin. Cancer Biol. 2023, 88, 187–200. [Google Scholar] [CrossRef]
Ozturk, H.; Ozgur, A.; Ozkirimli, E. DeepDTA: Deep drug-target binding affinity prediction. Bioinformatics 2018, 34, i821–i829. [Google Scholar] [CrossRef]
Ekundayo, N.F. Reinforcement learning in treatment pathway optimization: A case study in oncology. Int. J. Sci. Res. Arch. 2024, 13, 2187–2205. [Google Scholar] [CrossRef]
Duan, F.; Duan, C.; Xu, H.; Zhao, X.; Sukhbaatar, O.; Gao, J.; Zhang, M.; Zhang, W.; Gu, Y. AI-driven drug discovery from natural products. Adv. Agrochem 2024, 3, 185–187. [Google Scholar] [CrossRef]
Ahmed, F.; Soomro, A.M.; Chethikkattuveli Salih, A.R.; Samantasinghar, A.; Asif, A.; Kang, I.S.; Choi, K.H. A comprehensive review of artificial intelligence and network based approaches to drug repurposing in Covid-19. Biomed. Pharmacother. 2022, 153, 113350. [Google Scholar] [CrossRef]
Bhatia, A.S.; Saggi, M.K.; Kais, S. Quantum Machine Learning Predicting ADME-Tox Properties in Drug Discovery. J. Chem. Inf. Model. 2023, 63, 6476–6486. [Google Scholar] [CrossRef]
Qiu, Y.; Cheng, F. Artificial intelligence for drug discovery and development in Alzheimer’s disease. Curr. Opin. Struct. Biol. 2024, 85, 102776. [Google Scholar] [CrossRef]
Kumar, V.; Krishna, S.; Siddiqi, M.I. Virtual screening strategies: Recent advances in the identification and design of anti-cancer agents. Methods 2015, 71, 64–70. [Google Scholar] [CrossRef] [PubMed]
Alam, S.; Khan, F. 3D-QSAR studies on Maslinic acid analogs for Anticancer activity against Breast Cancer cell line MCF-7. Sci. Rep. 2017, 7, 6019. [Google Scholar] [CrossRef]
Carpenter, A.E.; Jones, T.R.; Lamprecht, M.R.; Clarke, C.; Kang, I.H.; Friman, O.; Guertin, D.A.; Chang, J.H.; Lindquist, R.A.; Moffat, J.; et al. CellProfiler: Image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006, 7, R100. [Google Scholar] [CrossRef]
Subramanian, A.; Narayan, R.; Corsello, S.M.; Peck, D.D.; Natoli, T.E.; Lu, X.; Gould, J.; Davis, J.F.; Tubelli, A.A.; Asiedu, J.K.; et al. A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Cell 2017, 171, 1437–1452.e17. [Google Scholar] [CrossRef] [PubMed]
Issa, N.T.; Stathias, V.; Schurer, S.; Dakshanamurthy, S. Machine and deep learning approaches for cancer drug repurposing. Semin. Cancer Biol. 2021, 68, 132–142. [Google Scholar] [CrossRef]
Somolinos, F.J.; León, C.; Guerrero-Aspizua, S. Drug Repurposing Using Biological Networks. Processes 2021, 9, 1057. [Google Scholar] [CrossRef]
Zeng, X.; Zhu, S.; Liu, X.; Zhou, Y.; Nussinov, R.; Cheng, F. deepDR: A network-based deep learning approach to in silico drug repositioning. Bioinformatics 2019, 35, 5191–5198. [Google Scholar] [CrossRef] [PubMed]
Xu, J.; Mao, C.; Hou, Y.; Luo, Y.; Binder, J.L.; Zhou, Y.; Bekris, L.M.; Shin, J.; Hu, M.; Wang, F.; et al. Interpretable deep learning translation of GWAS and multi-omics findings to identify pathobiology and drug repurposing in Alzheimer’s disease. Cell Rep. 2022, 41, 111717. [Google Scholar] [CrossRef]
Chakraborty, C.; Bhattacharya, M.; Pal, S.; Islam, M.A. Generative AI in drug discovery and development: The next revolution of drug discovery and development would be directed by generative AI. Ann. Med. Surg. 2024, 86, 6340–6343. [Google Scholar] [CrossRef]
Mesko, B. The ChatGPT (Generative Artificial Intelligence) Revolution Has Made Artificial Intelligence Approachable for Medical Professionals. J. Med. Internet Res. 2023, 25, e48392. [Google Scholar] [CrossRef]
Kyro, G.W.; Morgunov, A.; Brent, R.I.; Batista, V.S. ChemSpaceAL: An Efficient Active Learning Methodology Applied to Protein-Specific Molecular Generation. arXiv 2023, arXiv:2309.05853. [Google Scholar]
Lu, H.; Wei, Z.; Wang, X.; Zhang, K.; Liu, H. GraphGPT: A Graph Enhanced Generative Pretrained Transformer for Conditioned Molecular Generation. Int. J. Mol. Sci. 2023, 24, 16761. [Google Scholar] [CrossRef] [PubMed]
Wang, X.; Gao, C.; Han, P.; Li, X.; Chen, W.; Rodriguez Paton, A.; Wang, S.; Zheng, P. PETrans: De Novo Drug Design with Protein-Specific Encoding Based on Transfer Learning. Int. J. Mol. Sci. 2023, 24, 1146. [Google Scholar] [CrossRef]
Liang, Y.; Zhang, R.; Zhang, L.; Xie, P. DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs. arXiv 2023, arXiv:2309.03907. [Google Scholar]
Popova, M.; Isayev, O.; Tropsha, A. Deep reinforcement learning for de novo drug design. Sci. Adv. 2018, 4, eaap7885. [Google Scholar] [CrossRef] [PubMed]
Mouchlis, V.D.; Afantitis, A.; Serra, A.; Fratello, M.; Papadiamantis, A.G.; Aidinis, V.; Lynch, I.; Greco, D.; Melagraki, G. Advances in de Novo Drug Design: From Conventional to Machine Learning Methods. Int. J. Mol. Sci. 2021, 22, 1676. [Google Scholar] [CrossRef] [PubMed]
Olivecrona, M.; Blaschke, T.; Engkvist, O.; Chen, H. Molecular de-novo design through deep reinforcement learning. J. Cheminform 2017, 9, 48. [Google Scholar] [CrossRef]
Li, J.; Shi, M.; Chen, Z.; Pan, Y. DeepPurpose-based drug discovery in chondrosarcoma. Chin. J. Plast. Reconstr. Surg. 2022, 4, 158–165. [Google Scholar] [CrossRef]
Esteban-Medina, M.; de la Oliva Roque, V.M.; Herraiz-Gil, S.; Pena-Chilet, M.; Dopazo, J.; Loucera, C. drexml: A command line tool and Python package for drug repurposing. Comput. Struct. Biotechnol. J. 2024, 23, 1129–1143. [Google Scholar] [CrossRef]
Huang, Y.; Dong, D.; Zhang, W.; Wang, R.; Lin, Y.C.; Zuo, H.; Huang, H.Y.; Huang, H.D. DrugRepoBank: A comprehensive database and discovery platform for accelerating drug repositioning. Database 2024, 2024, baae051. [Google Scholar] [CrossRef]
Wu, J.; Li, X.; Wang, Q.; Han, J. DRviaSPCN: A software package for drug repurposing in cancer via a subpathway crosstalk network. Bioinformatics 2022, 38, 4975–4977. [Google Scholar] [CrossRef] [PubMed]
Wu, J.; Li, J.; He, Y.; Huang, J.; Zhao, X.; Pan, B.; Wang, Y.; Cheng, L.; Han, J. DrugSim2DR: Systematic prediction of drug functional similarities in the context of specific disease for drug repurposing. Gigascience 2022, 12, giad104. [Google Scholar] [CrossRef]
Ren, F.; Ding, X.; Zheng, M.; Korzinkin, M.; Cai, X.; Zhu, W.; Mantsyzov, A.; Aliper, A.; Aladinskiy, V.; Cao, Z.; et al. AlphaFold accelerates artificial intelligence powered drug discovery: Efficient discovery of a novel CDK20 small molecule inhibitor. Chem. Sci. 2023, 14, 1443–1452. [Google Scholar] [CrossRef] [PubMed]
Born, J.; Manica, M.; Oskooei, A.; Cadow, J.; Markert, G.; Rodriguez Martinez, M. PaccMann(RL): De novo generation of hit-like anticancer molecules from transcriptomic data via reinforcement learning. iScience 2021, 24, 102269. [Google Scholar] [CrossRef] [PubMed]
Grisoni, F.; Neuhaus, C.S.; Hishinuma, M.; Gabernet, G.; Hiss, J.A.; Kotera, M.; Schneider, G. De novo design of anticancer peptides by ensemble artificial neural networks. J. Mol. Model. 2019, 25, 112. [Google Scholar] [CrossRef]
Wu, Z.; Ramsundar, B.; Feinberg, E.N.; Gomes, J.; Geniesse, C.; Pappu, A.S.; Leswing, K.; Pande, V. MoleculeNet: A benchmark for molecular machine learning. Chem. Sci. 2018, 9, 513–530. [Google Scholar] [CrossRef]
Krenn, M.; Pollice, R.; Guo, S.Y.; Aldeghi, M.; Cervera-Lierta, A.; Friederich, P.; Dos Passos Gomes, G.; Hase, F.; Jinich, A.; Nigam, A.; et al. On scientific understanding with artificial intelligence. Nat. Rev. Phys. 2022, 4, 761–769. [Google Scholar] [CrossRef]
Kadurin, A.; Nikolenko, S.; Khrabrov, K.; Aliper, A.; Zhavoronkov, A. druGAN: An Advanced Generative Adversarial Autoencoder Model for de Novo Generation of New Molecules with Desired Molecular Properties in Silico. Mol. Pharm. 2017, 14, 3098–3104. [Google Scholar] [CrossRef]
Ching, T.; Himmelstein, D.S.; Beaulieu-Jones, B.K.; Kalinin, A.A.; Do, B.T.; Way, G.P.; Ferrero, E.; Agapow, P.M.; Zietz, M.; Hoffman, M.M.; et al. Opportunities and obstacles for deep learning in biology and medicine. J. R. Soc. Interface 2018, 15, 20170387. [Google Scholar] [CrossRef]
Bonner, S.; Barrett, I.P.; Ye, C.; Swiers, R.; Engkvist, O.; Bender, A.; Hoyt, C.T.; Hamilton, W.L. A review of biomedical datasets relating to drug discovery: A knowledge graph perspective. Brief. Bioinform. 2022, 23, bbac404. [Google Scholar] [CrossRef]
Kirboga, K.K.; Abbasi, S.; Kucuksille, E.U. Explainability and white box in drug discovery. Chem. Biol. Drug Des. 2023, 102, 217–233. [Google Scholar] [CrossRef]
Ladbury, C.; Zarinshenas, R.; Semwal, H.; Tam, A.; Vaidehi, N.; Rodin, A.S.; Liu, A.; Glaser, S.; Salgia, R.; Amini, A. Utilization of model-agnostic explainable artificial intelligence frameworks in oncology: A narrative review. Transl. Cancer Res. 2022, 11, 3853–3868. [Google Scholar] [CrossRef] [PubMed]
Laganà, F.; Prattico, D.; De Carlo, D.; Oliva, G.; Pullano, S.A.; Calcagno, S. Engineering Biomedical Problems to Detect Carcinomas: A Tomographic Impedance Approach. Eng 2024, 5, 1594–1614. [Google Scholar] [CrossRef]
Salih, A.M.; Raisi-Estabragh, Z.; Galazzo, I.B.; Radeva, P.; Petersen, S.E.; Lekadir, K.; Menegaz, G. A perspective on explainable artificial intelligence methods: SHAP and LIME. Adv. Intell. Syst. 2025, 7, 2400304. [Google Scholar] [CrossRef]
Fisher, J.L.; Jones, E.F.; Flanary, V.L.; Williams, A.S.; Ramsey, E.J.; Lasseigne, B.N. Considerations and challenges for sex-aware drug repurposing. Biol. Sex. Differ. 2022, 13, 13. [Google Scholar] [CrossRef]
Parra-Calderon, C.L.; Sanz, F.; McIntosh, L.D. The Challenge of the Effective Implementation of FAIR Principles in Biomedical Research. Methods Inf. Med. 2020, 59, 117–118. [Google Scholar] [CrossRef]
FAIR4Health Project. Available online: https://www.fair4health.eu/en/project (accessed on 27 February 2025).
Warraich, H.J.; Tazbaz, T.; Califf, R.M. FDA Perspective on the Regulation of Artificial Intelligence in Health Care and Biomedicine. JAMA 2025, 333, 241–247. [Google Scholar] [CrossRef]
Artificial intelligence, EMA. Available online: https://www.ema.europa.eu/en/about-us/how-we-work/big-data/artificial-intelligence (accessed on 27 February 2025).
Nene, L.; Flepisi, B.T.; Brand, S.J.; Basson, C.; Balmith, M. Evolution of Drug Development and Regulatory Affairs: The Demonstrated Power of Artificial Intelligence. Clin. Ther. 2024, 46, e6–e14. [Google Scholar] [CrossRef]
Uema, R.; Hayashi, Y.; Kizu, T.; Igura, T.; Ogiyama, H.; Yamada, T.; Takeda, R.; Nagai, K.; Inoue, T.; Yamamoto, M.; et al. A novel artificial intelligence-based endoscopic ultrasonography diagnostic system for diagnosing the invasion depth of early gastric cancer. J. Gastroenterol. 2024, 59, 543–555. [Google Scholar] [CrossRef]
Chang, Y.J.; Hung, K.C.; Wang, L.K.; Yu, C.H.; Chen, C.K.; Tay, H.T.; Wang, J.J.; Liu, C.F. A Real-Time Artificial Intelligence-Assisted System to Predict Weaning from Ventilator Immediately after Lung Resection Surgery. Int. J. Environ. Res. Public Health 2021, 18, 2713. [Google Scholar] [CrossRef]
Feng, J.; Phillips, R.V.; Malenica, I.; Bishara, A.; Hubbard, A.E.; Celi, L.A.; Pirracchio, R. Clinical artificial intelligence quality improvement: Towards continual monitoring and updating of AI algorithms in healthcare. npj Digit. Med. 2022, 5, 66. [Google Scholar] [CrossRef] [PubMed]
Biamonte, J.; Wittek, P.; Pancotti, N.; Rebentrost, P.; Wiebe, N.; Lloyd, S. Quantum machine learning. Nature 2017, 549, 195–202. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The evolving landscape of AI-assisted drug discovery. The integration of AI-driven approaches can potentially revolutionize each stage of the process, leading to accelerated timelines, reduced costs, and improved success rates. NIAID Visual & Medical Arts. 14/1/2025. Some icons are from NIAID BIOART Source (https://bioart.niaid.nih.gov/).

Figure 2. The role of AI in the drug discovery workflow for repurposing and de novo designing. Some icons are from NIAID Visual & Medical Arts. 14/1/2025. DNA (124); Pill (#407); Three Quarters Human Anatomy (519). Some icons are from NIAID BIOART Source (https://bioart.niaid.nih.gov/). #. # = number code of icon.

Figure 3. Challenges, solutions, and future trends of AI-driven drug repurposing. Some icons are from Bioicons (https://bioicons.com/); some icons are from SERVIER MEDICAL ART (https://smart.servier.com/).

Table 1. AI-guided studies in cancer drug discovery.

	Case Study	Computational Approach	Model/ Algorithm	Relevant Results
AI drug repurposing	Chondrosarcoma (CS) [38]	Knowledge + network-based methods 1. Genetic data of disease: pubmed2ensembl. 2. Drug–gene interaction: Drug Gene Interaction Database (DGIdb). 3. Drug–target information: DeepPurpose.	Deep learning-based algorithm	A total of 25 candidate drugs were identified. Among the listed drugs, there are drugs that have been approved for various solid tumors and have been applied to patients with CS: everolimus, paclitaxel, sirolimus, 2-methoxyestradiol, and sunitinib.
	Familiar Melanoma [39]	Knowledge + network-based methods 1. Genetic data of disease: databases + disease knowledge. 2. Disease Mechanistic Map: HiPathia + Genotype-Tissue Expression Project. 3. Drug–target information: Drexml.	Explainable machine learning model	A total of 78 candidate drugs correspond to currently approved chemotherapeutic agents used to treat various types of cancer. Paclitaxel, docetaxel, moxetumomab, and ruxolitinib are drugs that target specific melanogenesis circuits.
	Liver and lung cancers [40]	Similarity-based, artificial intelligence-based, signature-based, and network-based methods 1. Integrating heterogeneous data (drugs, targets, diseases, side effects and pathways) from databases and the literature. 2. Drug–target information: DrugRepoBank.	Artificial intelligence model	AI-predicted a CYP3A4 target for sildenafil repositioning in the treatment of liver cancer. The drug candidate verteporfin may influence lung cancer by modulating the Hippo signaling pathway and insulin secretion.
	Breast cancer [41]	Network-based method 1. Genetic data of disease: breast cancer gene expression profiles from GEO database. 2. Drug–disease interaction: DRviaSPCN.	Random walk with restart algorithm	Four of ten candidate drugs have been demonstrated to be associated with breast cancer: azacitidine, valproic acid, doxorubicin, and exemestane.
	Breast and lung cancers [42]	Network-based method 1. Genetic data of disease: breast cancer and lung cancer gene expression profiles from GEO database. 2. Drug–disease interaction: DrugSim2DR.	Random walk with restart algorithm	Five potential anti-breast cancer drugs were identified: fluoxymesterone, gestrinone, pyrazole, fomepizole, and medroxyprogesterone acetate. Fluoxymesterone has received approval for breast cancer treatment. Of nine candidate drugs, methotrexate and pemetrexed have been approved for the treatment of lung cancer.
AI de novo drug design	Hepatocellular carcinoma [43]	Structure-based drug design of novel targets 1. Target selection: PandaOmics. 2. Determination of putative binding sites: Chemistry42. 3. Generation of novel hits targeting CDK20 inhibitor: AlphaFold.	Deep learning-based algorithm	A novel therapeutic target was identified from a pool of dark targets (without experimental structure) that were predicted using AlphaFold (v.2.3.0). ISM042-2-048 generated a compound that showed good CDK20 inhibitory activity.
	Carcinoma and neuroblastoma [44]	Reinforcement learning approach 1.Genetic data of disease: carcinoma and neuroblastoma gene expression profiles. 2. Generation of anticancer hit molecules: PaccMann^RL.	Deep learning-based algorithm	The generated compounds exhibited similar physicochemical properties to real cancer drugs.
	Breast and lung cancers [45]	Counter-propagation artificial neural networks (CPANNs) 1. Two peptide datasets targeting breast and lung cancer cells were assembled and curated manually from CancerPPD. 2. Training CPANN model to classify peptides according to their activity. 3. Library class generation with 1000 presumed alpha-helical peptides sequences with the amino acid distribution of alpha-helical anticancer peptides (ACPs): modlAMP. 4. Evaluation and ranking of the activity of de novo designed peptides from the library: CPANNs. 5. Selection of candidate peptides with anticancer activity to in vitro assays.	Deep learning-based algorithm	From a total of 1000 de novo designs, 6 peptides showed anticancer activity in vitro, including 5 against both MCF7 and A549 cell lines.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Herráiz-Gil, S.; Nygren-Jiménez, E.; Acosta-Alonso, D.N.; León, C.; Guerrero-Aspizua, S. Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer. Appl. Sci. 2025, 15, 2798. https://doi.org/10.3390/app15052798

AMA Style

Herráiz-Gil S, Nygren-Jiménez E, Acosta-Alonso DN, León C, Guerrero-Aspizua S. Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer. Applied Sciences. 2025; 15(5):2798. https://doi.org/10.3390/app15052798

Chicago/Turabian Style

Herráiz-Gil, Sara, Elisa Nygren-Jiménez, Diana N. Acosta-Alonso, Carlos León, and Sara Guerrero-Aspizua. 2025. "Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer" Applied Sciences 15, no. 5: 2798. https://doi.org/10.3390/app15052798

APA Style

Herráiz-Gil, S., Nygren-Jiménez, E., Acosta-Alonso, D. N., León, C., & Guerrero-Aspizua, S. (2025). Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer. Applied Sciences, 15(5), 2798. https://doi.org/10.3390/app15052798

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Artificial Intelligence-Based Methods for Drug Repurposing and Development in Cancer

Abstract

1. Introduction

2. AI Techniques for Drug Repurposing and De Novo Drug Design

2.1. Machine and Deep Learning Techniques

2.2. Knowledge Graph-Based AI Techniques

2.3. Generative AI Models

2.4. Reinforcement Methods

3. AI-Guided Applications in Cancer Drug Discovery

4. Integration of AI with Experimental Techniques

4.1. AI-Guided High-Throughput Screening (HTS)

4.2. AI-Assisted Drug Synthesis and Optimization

4.3. AI-Driven In Vitro and In Vivo Testing

4.4. AI for Data Integration in Preclinical Studies

5. Challenges and Opportunities

5.1. Data Quality and Quantity

5.2. Interpretability of AI Models

5.3. Ethical Considerations

5.4. Collaboration Across Sectors, Future Trends, and Emerging Opportunities

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI