DOI: https://www.doi.org/10.53289/LACC1078
Dr Claudia Lindner – Senior Research Fellow and Sir Henry Dale Fellow in Translational Medical Imaging at The University of Manchester. Dr Helen Coulshed — Associate Vice Dean (Assessment & Quality Assurance), Faculty of Natural, Mathematical and Engineering Sciences and Senior Lecturer in Chemistry Education, King’s College London. Dr Jessica Pollitt – Senior Strategy and Planning manager, Biotechnology and Biological Sciences Research Council, UK Research and Innovation.
AI systems are transforming healthcare. They can analyse chest X-rays, read heart scans and flag potential health issues faster than human doctors – in some cases, in seconds rather than minutes. Hospitals are adopting these tools to improve efficiency, reduce costs and standardise care. As the Science Technology, Engineering Mathematics and Medicine (STEMM) community and the public collectively look forward to advances in the field of medical AI and the resulting benefit on patient care, it is vital that AI models are trained responsibly so that the unique needs of all patients can be addressed to ensure societal equity and patient safety.
Fairness and bias impact AI for healthcare
In her invited talk, Dr Tiarna Lee (Kings College London) emphasised that fairness encompasses both equality (treating everyone the same) and equity (allocating resources based on need), and that protected characteristics (e.g. age, sex, and race) must be carefully monitored when developing and deploying AI tools. Dr Lee explained that bias can arise at any stage of the AI pipeline, from defining the clinical problem to data collection, preprocessing, model training, and downstream clinical decision-making, with early-stage biases propagating through the pipeline.
The datasets that are used to train many AI models are often imbalanced in terms of subject demographics and so the resulting AI models have a better performance for the overrepresented group(s) and worse for the underrepresented group(s). In her research on cardiac MRI segmentation, Dr Lee identified that models trained on imbalanced datasets demonstrate significant racial performance differences. This can lead to delayed or poorer-quality treatment for heart disease with worse outcomes overall, which then reinforces or worsens existing disparities. The talk concluded by calling for intentional and transparent development practices supported by frameworks such as FUTUREAI, which is an international consensus guideline for trustworthy and deployable artificial intelligence in healthcare.
Groups of conference attendees discussed what incentives could make sectors share responsibility for creating and maintaining diverse, representative datasets, and what practical steps could ensure how health technologies are regulated, commissioned or deployed.
Key themes
Recommendations for stakeholders:
AI developers
Market end-users