TY - JOUR
T1 - MEDISURE
T2 - Towards Assuring Machine Learning-based Medical Image Classifiers using Mixup Boundary Analysis
AU - Byfield, Adam
AU - Poulett, William
AU - Jose, Anusha
AU - Tyagi, Shatakshi
AU - Shembekar, Smita
AU - Qayyum, Adnan
AU - Qadir, Adnan
AU - Bilal, Muhammad
PY - 2024/8/22
Y1 - 2024/8/22
N2 - Machine learning (ML) models are becoming integral in healthcare technologies, necessitating formal assurance methods to ensure their safety, fairness, robustness, and trustworthiness. However, these models are inherently error-prone, posing risks to patient health and potentially causing irreparable harm when deployed in clinics. Traditional software assurance techniques, designed for fixed code, are not directly applicable to ML models, which adapt and learn from curated datasets during training. Thus, there is an urgent need to adapt established software assurance principles such as boundary testing with synthetic data. To bridge this gap and enable objective assessment of ML models in real-world clinical settings, we propose Mix-Up Boundary Analysis (MUBA), a novel technique facilitating the evaluation of image classifiers in terms of prediction fairness. We evaluated MUBA using brain tumour and breast cancer classification tasks and achieved promising results. This research underscores the importance of adapting traditional assurance principles to assess ML models, ultimately enhancing the safety and reliability of healthcare technologies. Our code is available at \url{https://github.com/willpoulett/MUBA_pipeline}.
AB - Machine learning (ML) models are becoming integral in healthcare technologies, necessitating formal assurance methods to ensure their safety, fairness, robustness, and trustworthiness. However, these models are inherently error-prone, posing risks to patient health and potentially causing irreparable harm when deployed in clinics. Traditional software assurance techniques, designed for fixed code, are not directly applicable to ML models, which adapt and learn from curated datasets during training. Thus, there is an urgent need to adapt established software assurance principles such as boundary testing with synthetic data. To bridge this gap and enable objective assessment of ML models in real-world clinical settings, we propose Mix-Up Boundary Analysis (MUBA), a novel technique facilitating the evaluation of image classifiers in terms of prediction fairness. We evaluated MUBA using brain tumour and breast cancer classification tasks and achieved promising results. This research underscores the importance of adapting traditional assurance principles to assess ML models, ultimately enhancing the safety and reliability of healthcare technologies. Our code is available at \url{https://github.com/willpoulett/MUBA_pipeline}.
UR - https://www.open-access.bcu.ac.uk/15560/
U2 - 10.1109/ISBI56570.2024.10635870
DO - 10.1109/ISBI56570.2024.10635870
M3 - Article
JO - ISBI 2024
JF - ISBI 2024
ER -