Abstract
Resilient microgrid control only delivers value in remote settings if it can be trained quickly and run on resource-constrained hardware. This paper applies Simple Policy Optimisation (SPO) to energy management and benchmarks it against Proximal Policy Optimisation (PPO) and a Model Predictive Control (MPC) baseline under historical Cyclone Laila conditions. A short two-stage curriculum first secures resilience, then trades within the feasible set to extend battery life using either a balanced summation reward or a Lagrangian resilience constraint; goal-based early stopping breaks training once targets are met. Under the same actor–critic scheme, SPO produced steadier updates and stronger battery preservation than PPO, achieving up to 19.7 expected years of battery life (32% longer than MPC and 7% longer than PPO) while maintaining near-perfect resilience (RI = 0.998, compared with 0.993 for PPO Lagrangian). Compared with PPO, training compute is reduced by nearly twelve times with similar inference cost, whereas MPC is around a thousand times heavier at inference. SPO therefore offers an edge-deployable, resilience-first controller that protects batteries without sacrificing continuity of supply.
| Original language | English |
|---|---|
| Title of host publication | 2025 9th International Conference on Environment Friendly Energies and Applications (EFEA) |
| Subtitle of host publication | proceedings |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331550950 |
| Publication status | Published (VoR) - 17 Feb 2026 |
| Event | International Symposium on Environment Friendly Energies and Applications - Newcastle upon Tyne, Newcastle upon Tyne, United Kingdom Duration: 4 Dec 2025 → 5 Dec 2025 Conference number: 9 https://efeaconf.com/ |
Conference
| Conference | International Symposium on Environment Friendly Energies and Applications |
|---|---|
| Abbreviated title | EFEA |
| Country/Territory | United Kingdom |
| City | Newcastle upon Tyne |
| Period | 4/12/25 → 5/12/25 |
| Internet address |
Keywords
- Microgrid Resilient Operation
- Deep Reinforcement Learning
- Energy Management
- Smart Grid
- Proximal Policy Optimisation
- Simple Policy Optimisation
Fingerprint
Dive into the research topics of 'Quicker to Train and Cheaper to Run: Edge-Ready DRL for Microgrid Resilient Energy Management'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver