Abstract
Epidermal Growth Factor Receptor (EGFR) plays a critical role in the development of several cancers.Thus, modulation/inhibition of EGFR activity is an appealing target of developing novel cancertherapeutics. With the advent of modern machine learning technologies, it is now possible to simulateinteractions with high precision between EGFR and small molecules to predict inhibitory/ modulatoryactivity at an unprecedented scale. In this work, we propose a novel machine-learning method to fastand precise classification of small compounds that are active, intermediate or inactive in inhibiting/modulating EGFR activity. We developed DeepEGFR, a novel multi-class graph neural network(GNN) model, to classify compounds into Active, Inactive, and Intermediate functional categories.DeepEGFR leverages complementary molecular representations, combining SMILES strings andmolecular fingerprint matrices (Klekota-Roth and PubChem) to capture both structural and property-based features of compounds. The model constructs an advanced molecular graph representing atomtype, formal charge, bond type, and bond order, through nodes and edges. DeepEGFR achievedsuperior performance compared to baseline machine learning algorithms (e.g., SVM, Random Forest,ANN), with approximately 94% F1-scores across training and test datasets for all activity classes. Toensure interpretability, the top 20 features identified by DeepEGFR were validated against the fivekey characteristics of FDA-approved EGFR inhibitors (Afatinib, Gefitinib, Osimertinib, Dacomitinib,Erlotinib), confirming the biological relevance of the features. Moreover, DeepEGFR successfullyidentified 300 underexplored EGFR-targeting compounds, demonstrating its potential to acceleratethe discovery of therapeutic agents. These results highlight the effectiveness of graph neural networksin advancing molecular activity classification, setting a potential new benchmark for EGFR inhibitorprediction. These findings demonstrate the DeepEGFR’s ability to highlight the promising EGFRinhibitors, that have received limited prior investigation, thereby supporting its role in facilitating therational development of targeted therapies for precision oncology.
| Original language | English |
|---|---|
| Article number | 38236 |
| Journal | Scientific Reports |
| Volume | 15 |
| DOIs | |
| Publication status | Published (VoR) - 31 Oct 2025 |
Fingerprint
Dive into the research topics of 'DeepEGFR a graph neural network for bioactivity classification ofEGFR inhibitors'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver