DeepEGFR a graph neural network for bioactivity classification ofEGFR inhibitors

Aijaz Ahmad Malik (Corresponding / Lead Author), Costerwell Khyriem, Sven Hauns, Imran Khan, Frederico Garcia Pinto, Azzat Al-Sadi, Rasheed Mohammad (Corresponding / Lead Author), Van Dinh Tran, Rolf Backofen, Nelson Soares, Mohammed Uddin, Omer S. Alkhnbashi (Corresponding / Lead Author)

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Epidermal Growth Factor Receptor (EGFR) plays a critical role in the development of several cancers.Thus, modulation/inhibition of EGFR activity is an appealing target of developing novel cancertherapeutics. With the advent of modern machine learning technologies, it is now possible to simulateinteractions with high precision between EGFR and small molecules to predict inhibitory/ modulatoryactivity at an unprecedented scale. In this work, we propose a novel machine-learning method to fastand precise classification of small compounds that are active, intermediate or inactive in inhibiting/modulating EGFR activity. We developed DeepEGFR, a novel multi-class graph neural network(GNN) model, to classify compounds into Active, Inactive, and Intermediate functional categories.DeepEGFR leverages complementary molecular representations, combining SMILES strings andmolecular fingerprint matrices (Klekota-Roth and PubChem) to capture both structural and property-based features of compounds. The model constructs an advanced molecular graph representing atomtype, formal charge, bond type, and bond order, through nodes and edges. DeepEGFR achievedsuperior performance compared to baseline machine learning algorithms (e.g., SVM, Random Forest,ANN), with approximately 94% F1-scores across training and test datasets for all activity classes. Toensure interpretability, the top 20 features identified by DeepEGFR were validated against the fivekey characteristics of FDA-approved EGFR inhibitors (Afatinib, Gefitinib, Osimertinib, Dacomitinib,Erlotinib), confirming the biological relevance of the features. Moreover, DeepEGFR successfullyidentified 300 underexplored EGFR-targeting compounds, demonstrating its potential to acceleratethe discovery of therapeutic agents. These results highlight the effectiveness of graph neural networksin advancing molecular activity classification, setting a potential new benchmark for EGFR inhibitorprediction. These findings demonstrate the DeepEGFR’s ability to highlight the promising EGFRinhibitors, that have received limited prior investigation, thereby supporting its role in facilitating therational development of targeted therapies for precision oncology.
    Original languageEnglish
    Article number38236
    JournalScientific Reports
    Volume15
    DOIs
    Publication statusPublished (VoR) - 31 Oct 2025

    Fingerprint

    Dive into the research topics of 'DeepEGFR a graph neural network for bioactivity classification ofEGFR inhibitors'. Together they form a unique fingerprint.

    Cite this