Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Communications Chemistry
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. communications chemistry
  3. articles
  4. article
Benchmarking deep learning models for predicting anticancer drug potency (IC50) with insights for medicinal chemists
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 29 January 2026

Benchmarking deep learning models for predicting anticancer drug potency (IC50) with insights for medicinal chemists

  • Udbhas Garai  ORCID: orcid.org/0000-0001-7207-50181 na1,
  • Aditya S. Pal2 na1,
  • Koyel Ghosh3,
  • Deepak B. Salunke4 &
  • …
  • Utpal Garain2,5 

Communications Chemistry , Article number:  (2026) Cite this article

  • 705 Accesses

  • 1 Altmetric

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Cheminformatics
  • Medicinal chemistry

Abstract

Potency (IC50) prediction of small molecules is pivotal for anticancer drug development. This study benchmarked five deep learning (DL) models for IC50 prediction—DeepCDR, DrugCell, PaccMann, Precily, and tCNN—against a simple mean-based Baseline using standardized GDSC datasets and recently published anticancer compounds. To ensure practicality, conventional error metrics were supplemented with percentage error, log error, three-sigma limit, and a newly proposed Experimental Variability-Aware Prediction Accuracy statistic. The models performed well on randomly split data and unseen cell lines but showed sharply reduced accuracy for unseen compounds. Though all DL models exhibited similar performance trends, DeepCDR, DrugCell, and tCNN held a slight edge in most testing scenarios. Interestingly, several DL algorithms could not significantly outperform the Baseline model in many tests. Assessing prediction error against physicochemical and biological properties of compounds and cell lines revealed weak correlation, highlighting an underexplored aspect of model performance. A user-friendly web server (https://nlplab1.isical.ac.in/ic50.php) was also developed for IC50 prediction of new compounds against cancer cell lines.

Similar content being viewed by others

Deriving general structure–activity/selectivity relationship patterns for different subfamilies of cyclin-dependent kinase inhibitors using machine learning methods

Article Open access 03 July 2024

An interpretable deep learning framework for genome-informed precision oncology

Article 11 July 2024

Interpretable deep recommender system model for prediction of kinase inhibitor efficacy across cancer cell lines

Article Open access 06 August 2021

Data availability

The data supporting the findings of this study are available within the article and its Supplementary Information file. Supplementary Data 1 contains data related to EVAPA derivation, unseen anticancer compounds from literature, and five-fold cross-validation results (only prediction data from the fold with the best EVAPA for each model and split included). The numerical source data of main figures are available in Supplementary Data 2. All other data are available from the corresponding author upon reasonable request.

Code availability

This work mainly uses author-published code for the DL models.

References

  1. Bray, F. et al. Global cancer statistics 2022: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 74, 229–263 (2024).

    Google Scholar 

  2. Fatima, I. et al. Breakthroughs in ai and multi-omics for cancer drug discovery: a review. Eur. J. Med. Chem. 280, 116925 (2024).

    Google Scholar 

  3. Zhang, L., Shi, L., Soars, S. M., Kamps, J. & Yin, H. Discovery of novel small-molecule inhibitors of nf-κb signaling with antiinflammatory and anticancer properties. J. Med. Chem. 61, 5881–5899 (2018).

    Google Scholar 

  4. Michelini, E., Cevenini, L., Mezzanotte, L., Coppa, A. & Roda, A. Cell-based assays: fuelling drug discovery. Anal. Bioanal. Chem. 398, 227–238 (2010).

    Google Scholar 

  5. Lee, J. A. & Berg, E. L. Neoclassic drug discovery: the case for lead generation using phenotypic and functional approaches. J. Biomol. Screen. 18, 1143–1155 (2013).

    Google Scholar 

  6. Cava, C. & Castiglioni, I. Integration of molecular docking and in vitro studies: a powerful approach for drug discovery in breast cancer. Appl. Sci. 10, 6981 (2020).

    Google Scholar 

  7. Stevens, J. A. et al. Molecular dynamics simulation of an entire cell. Front. Chem. 11, 1106495 (2023).

    Google Scholar 

  8. Smith, J. S., Roitberg, A. E. & Isayev, O. Transforming computational drug discovery with machine learning and AI. ACS Med. Chem. Lett. 9, 1065–1069 (2018).

    Google Scholar 

  9. Tropsha, A. Best practices for qsar model development, validation, and exploitation. Mol. Inform. 29, 476–488 (2010).

    Google Scholar 

  10. https://www.cancerrxgene.org/compound/Foretinib/2040/overview/ic50 (2025).

  11. Zhu, H. Big data and artificial intelligence modeling for drug discovery. Annu. Rev. of Pharmacol. Toxicol. 60, 573–589 (2020).

    Google Scholar 

  12. Cai, C. et al. Transfer learning for drug discovery. J. Med. Chem. 63, 8683–8694 (2020).

    Google Scholar 

  13. Badwan, B. A. et al. Machine learning approaches to predict drug efficacy and toxicity in oncology. Cell Rep. Methods 3, 100413 (2023).

    Google Scholar 

  14. Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006).

    Google Scholar 

  15. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).

    Google Scholar 

  16. Yang, W. et al. Genomics of drug sensitivity in cancer (gdsc): a resource for therapeutic biomarker discovery in cancer cells. Nucleic. Acids Res. 41, D955–D961 (2012).

    Google Scholar 

  17. Barretina, J. et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012).

    Google Scholar 

  18. https://www.cancer.gov/tcga. The Cancer Genome Atlas (TCGA).

  19. Chen, J. & Zhang, L. A survey and systematic assessment of computational methods for drug response prediction. Brief. Bioinformatics 22, 232–246 (2021).

    Google Scholar 

  20. Park, A. et al. A comprehensive evaluation of regression-based drug responsiveness prediction models, using cell viability inhibitory concentrations (IC50 values). Bioinformatics 38, 2810–2817 (2022).

    Google Scholar 

  21. Costello, J. C. et al. A community effort to assess and improve drug sensitivity prediction algorithms. Nat. Biotechnol. 32, 1202–1212 (2014).

    Google Scholar 

  22. Liu, Q., Hu, Z., Jiang, R. & Zhou, M. Deepcdr: a hybrid graph convolutional network for predicting cancer drug response. Bioinformatics 36, i911–i918 (2020).

    Google Scholar 

  23. Kuenzi, B. M. et al. Predicting drug response and synergy using a deep learning model of human cancer cells. Cancer Cell 38, 672–684 (2020).

    Google Scholar 

  24. Oskooei, A. et al. PaccMann: prediction of anticancer compound sensitivity with multi-modal attention-based neural networks. In Proc. Workshop on Machine Learning for Molecules and Materials, 32nd Conference on Neural Information Processing Systems (NIPS, 2018).

  25. Chawla, S. et al. Gene expression based inference of cancer drug sensitivity. Nat. Commun. 13, 5680 (2022).

    Google Scholar 

  26. Liu, P., Li, H., Li, S. & Leung, K.-S. Improving prediction of phenotypic drug response on cancer cell lines using deep convolutional network. BMC Bioinformatics 20, 1–14 (2019).

    Google Scholar 

  27. Damiani, E., Solorio, J. A., Doyle, A. P. & Wallace, H. M. How reliable are in vitro IC50 values? values vary with cytotoxicity assays in human glioblastoma cells. Toxicol. Lett. 302, 28–34 (2019).

    Google Scholar 

  28. He, Y. et al. The changing 50% inhibitory concentration (IC50) of cisplatin: a pilot study on the artifacts of the mtt assay and the precise measurement of density-dependent chemoresistance in ovarian cancer. Oncotarget 7, 70803 (2016).

    Google Scholar 

  29. Brooks, E. A. et al. Applicability of drug response metrics for cancer studies using biomaterials. Philos. Trans. R. Soc. Lond. B Biol. Sci. 374, 20180226 (2019).

    Google Scholar 

  30. Li, Y. et al. Discovery, synthesis, and evaluation of novel dual inhibitors of a vascular endothelial growth factor receptor and poly (adp-ribose) polymerase for BRCA wild-type breast cancer therapy. J. Med. Chem. 66, 12069–12100 (2023).

    Google Scholar 

  31. Zhang, L. et al. Design, synthesis, and anti-cancer evaluation of novel cyclic phosphate prodrug of gemcitabine. J. Med. Chem. 66, 4150–4166 (2023).

    Google Scholar 

  32. Wang, Y., Lv, Z., Chen, F., Wang, X. & Gou, S. Conjugates derived from lapatinib derivatives with cancer cell stemness inhibitors effectively reversed drug resistance in triple-negative breast cancer. J. Med. Chem. 64, 12877–12892 (2021).

    Google Scholar 

  33. Tang, Z. et al. Novel covalent probe selectively targeting glutathione peroxidase 4 in vivo: Potential applications in pancreatic cancer therapy. J. Med. Chem. 67, 1872–1887 (2024).

    Google Scholar 

  34. Gu, H. et al. Discovery of a highly potent and selective myof inhibitor with improved water solubility for the treatment of gastric cancer. J. Med. Chem. 66, 16917–16938 (2023).

    Google Scholar 

  35. Fang, W. et al. Discovery of the first-in-class rorγ covalent inhibitors for treatment of castration-resistant prostate cancer. J. Med. Chem. 67, 1481–1499 (2024).

    Google Scholar 

  36. Li, Y. et al. Synthesis and anti-tumor activity of nitrogen-containing derivatives of the natural product diphyllin. Eur. J. Med. Chem. 243, 114708 (2022).

    Google Scholar 

  37. Litvinova, V. A. et al. Naphthoindole-2-carboxamides as a lipophilic chemotype of hetarene-anthraquinones potent against p-gp resistant tumor cells. Eur. J. Med. Chem. 281, 117013 (2025).

    Google Scholar 

  38. Li, S.-S. et al. Design, synthesis, and biological evaluation of novel benzimidazole derivatives as anti-cervical cancer agents through PI3K/Akt/mTOR pathway and tubulin inhibition. Eur. J. Med. Chem. 271, 116425 (2024).

    Google Scholar 

  39. Borude, A. S., Deshmukh, S. R., Tiwari, S. V., Kumar, S. H. & Thopate, S. R. Design and synthesis of novel thiazolo[5,4-b]pyridine derivatives as potent and selective egfr-tk inhibitors targeting resistance mutations in non-small cell lung cancer. Eur. J. Med. Chem. 276, 116727 (2024).

    Google Scholar 

  40. Benjamin, D. J. et al. Redefine statistical significance. Nat. Hum. Behav. 2, 6–10 (2018).

    Google Scholar 

  41. Shi, Y., Xu, W. & Hu, P. Out of distribution learning in bioinformatics: advancements and challenges. Brief. Bioinformatics 26, bbaf294 (2025).

    Google Scholar 

  42. Pirie, R. et al. An analysis of the physicochemical properties of oral drugs from 2000 to 2022. RSC. Med. Chem. 15, 3125–3132 (2024).

    Google Scholar 

  43. Sterling, T. & Irwin, J. J. Zinc 15–ligand discovery for everyone. J. Chem. Inf. Model. 55, 2324–2337 (2015).

    Google Scholar 

  44. Scarpino, A., Ferenczy, G. G. & Keseru, G. M. Comparative evaluation of covalent docking tools. J. Chem. Inf. Model. 58, 1441–1458 (2018).

    Google Scholar 

  45. Pukelsheim, F. The three sigma rule. Am. Stat. 48, 88–91 (1994).

    Google Scholar 

  46. Johnson, S. R. et al. A simple strategy for mitigating the effect of data variability on the identification of active chemotypes from high-throughput screening data. SLAS Discov. 12, 276–284 (2007).

    Google Scholar 

  47. Niepel, M. et al. A multi-center study on the reproducibility of drug-response assays in mammalian cell lines. Cell Syst. 9, 35–48.e5 (2019).

    Google Scholar 

  48. Haibe-Kains, B. et al. Inconsistency in large pharmacogenomic studies. Nature 504, 389–393 (2013).

    Google Scholar 

  49. Kalliokoski, T., Kramer, C., Vulpetti, A. & Gedeck, P. Comparability of mixed IC50 data–a statistical analysis. PLoS ONE 8, e61007 (2013).

    Google Scholar 

  50. Das, K. R. & Imon, A. A brief review of tests for normality. Am. J. Theor. Appl. Stat. 5, 5–12 (2016).

    Google Scholar 

  51. Gordon, S. The Normal Distribution 1 edn (Sydney: Mathematics Learning Centre, University of Sydney, 2006)

  52. Mikolov, T., Chen, K., Corrado, G. & Dean, J. Distributed representations of words and phrases and their compositionality. In Proc. 26th Annual Conference on Neural Information Processing Systems (NIPS) (Curran Associates Inc., 2013).

  53. Nguyen, E. et al. Hyenadna: long-range genomic sequence modeling at single nucleotide resolution. In Proc. Advances in Neural Information Processing Systems, Oh, A. et al. (eds) 43177–43201 https://proceedings.neurips.cc/paper_files/paper/2023/file/86ab6927ee4ae9bde4247793c46797c7-Paper-Conference.pdf (Curran Associates, Inc., 2023).

Download references

Acknowledgements

The authors sincerely thank the Master’s students at the Indian Statistical Institute—Siva Kumar Lakkoju, Sneha Tiwari, and Abhishek Bale—for their coding and testing contributions to this research. We are grateful to Debasrija Mondal for assistance with the illustration of Fig. 1. We appreciate and express gratitude for the comments and suggestions of the anonymous reviewers, which have greatly helped improve the work.

Author information

Author notes
  1. These authors contributed equally: Udbhas Garai, Aditya S. Pal.

Authors and Affiliations

  1. Department of Chemical Sciences, Indian Institute of Science Education and Research Mohali, SAS Nagar (Mohali), Punjab, India

    Udbhas Garai

  2. Computer Vision & Pattern Recognition (CVPR) Unit, Indian Statistical Institute, Kolkata, West Bengal, India

    Aditya S. Pal & Utpal Garain

  3. School of Computing, College of Engineering and Technology, SRM Institute of Science & Technology, Kattankulathur, Tamil Nadu, India

    Koyel Ghosh

  4. Department of Medicinal Chemistry, National Institute of Pharmaceutical Education and Research (NIPER), SAS Nagar, Mohali, Punjab, India

    Deepak B. Salunke

  5. Centre for Artificial Intelligence and Machine Learning (CAIML), Indian Statistical Institute, Kolkata, West Bengal, India

    Utpal Garain

Authors
  1. Udbhas Garai
    View author publications

    Search author on:PubMed Google Scholar

  2. Aditya S. Pal
    View author publications

    Search author on:PubMed Google Scholar

  3. Koyel Ghosh
    View author publications

    Search author on:PubMed Google Scholar

  4. Deepak B. Salunke
    View author publications

    Search author on:PubMed Google Scholar

  5. Utpal Garain
    View author publications

    Search author on:PubMed Google Scholar

Contributions

The first author U.Garai conceived the concept, designed the experiments, collated data, provided domain knowledge, analysed the results, critically reviewed the research, and took the lead in writing the manuscript. A.S.P. managed all computations, implemented the deep learning models, and developed the web server. K.G. assisted with computations, managed run-wise results, and contributed to the analysis. D.B.S. provided domain-specific support and critically reviewed the research from the perspective of medicinal chemistry. The corresponding author, U.Garain supervised and managed the entire research, including the experiments and manuscript preparation. The first two authors, U.Garai and A.S.P., contributed equally to this work.

Corresponding author

Correspondence to Utpal Garain.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Chemistry thanks Martin Vogt and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Material

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Garai, U., Pal, A.S., Ghosh, K. et al. Benchmarking deep learning models for predicting anticancer drug potency (IC50) with insights for medicinal chemists. Commun Chem (2026). https://doi.org/10.1038/s42004-026-01916-9

Download citation

  • Received: 01 July 2025

  • Accepted: 20 January 2026

  • Published: 29 January 2026

  • DOI: https://doi.org/10.1038/s42004-026-01916-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Collections
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Journal Information
  • Open Access Fees and Funding
  • Journal Metrics
  • Editors
  • Editorial Board
  • Calls for Papers
  • Referees
  • Editorial Values Statement
  • Editorial policies
  • Contact

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Communications Chemistry (Commun Chem)

ISSN 2399-3669 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research