Primary supervisor
Sanoop MallisseryResearch area
Software Systems and CybersecurityCore PhD Question
How can we design future medical AI systems that remain secure, privacy-preserving, explainable, and clinically reliable when exposed to adversarial attacks, prompt injection, poisoned data, privacy leakage, and unsafe autonomous AI-agent behaviour?
This PhD investigates how next-generation healthcare AI systems can be protected before they are trusted in real clinical environments. The focus is not only on making AI models accurate, but on making them resilient, safe, and trustworthy when operating under realistic security threats.
What We Are Not Planning to Do
This project is not about building a standard disease-diagnosis model or simply improving classification accuracy on medical datasets.
It is also not focused on replacing doctors, giving direct medical advice, or developing a clinical product for hospital deployment during the PhD.
The project is not limited to ordinary machine-learning evaluation where models are tested only on clean datasets.
It is also not just a general cybersecurity project disconnected from healthcare. The security problems studied here must be directly linked to medical AI systems, clinical workflows, patient data, or healthcare decision-support environments.
What We Are Planning to Do
This project will study the security and privacy risks of AI systems used in healthcare, especially systems based on large language models, vision-language models, transformer models, and autonomous AI agents.
The project may investigate attacks such as prompt injection, adversarial medical-image manipulation, poisoned clinical records, unsafe AI-agent actions, membership inference, privacy leakage, and manipulation of AI-generated clinical recommendations.
The PhD candidate will design and evaluate a security-aware medical AI framework that can detect risky inputs, unsafe outputs, suspicious model behaviour, privacy-sensitive responses, and clinically dangerous recommendations.
The project may include the development of a runtime safety layer that works like a “cyber-immune system” for medical AI. This layer can monitor AI behaviour, assign risk scores, explain why an output may be unsafe, and prevent high-risk responses from reaching clinicians or patients without further review.
The project can use medical text, medical images, multimodal datasets, electronic health record-style data, or simulated clinical AI-agent workflows. The exact scope can be refined based on the candidate’s background and available datasets.
Possible PhD Contribution Expectations
- Medical AI Threat Model: A detailed threat model for future healthcare AI systems, covering medical LLMs, medical vision-language models, clinical chatbots, AI agents, and AI-enabled decision-support systems.
- Healthcare AI Attack Benchmark: A benchmark or simulation environment for testing how medical AI systems behave under adversarial, privacy, and prompt-injection attacks.
- Security-Aware Medical AI Framework: A novel defense framework that identifies malicious inputs, unsafe outputs, suspicious reasoning paths, and privacy-sensitive responses in medical AI systems.
- Cyber-Immune Runtime Safety Layer: A monitoring layer that continuously checks AI behavior during use and assigns risk scores before the output is trusted in a healthcare setting.
- Privacy-Preserving Medical Intelligence: Integration of methods such as federated learning, differential privacy, secure representation learning, or privacy-aware synthetic data generation to reduce patient-data exposure.
- Explainable Clinical Risk Assessment: An explainable AI component that helps clinicians understand why an AI output is considered safe, unsafe, uncertain, privacy-risky, or potentially manipulated.
Required knowledge
Essential
- Strong programming skills in Python
- Good understanding of machine learning and deep learning
- Basic knowledge of cybersecurity concepts
- Familiarity with transformer models or large language models
- Ability to read and implement research papers
- Understanding of model evaluation, experiments, and result analysis
- Interest in healthcare, patient safety, and trustworthy AI
Useful
- Experience with PyTorch or TensorFlow
- Knowledge of adversarial machine learning
- Knowledge of privacy-preserving machine learning
- Familiarity with differential privacy or federated learning
- Understanding of prompt injection, LLM safety, or AI-agent security
- Experience with medical datasets, medical images, or clinical text
- Knowledge of explainable AI methods
- Experience with NLP, computer vision, or multimodal learning
Nice to Have
- Background in digital health, biomedical informatics, or medical imaging
- Experience with vision-language models
- Experience with secure software systems or runtime monitoring
- Knowledge of electronic health records or healthcare workflows
- Interest in regulatory and ethical aspects of AI in medicine
- Experience with building prototypes, dashboards, or AI safety tools
- Ability to work across AI, security, and healthcare domains
The topic is for candidates who wants to work at the frontier of artificial intelligence, cybersecurity, and healthcare. The candidate should be interested in building future AI systems that are not only intelligent, but also secure, privacy-preserving, explainable, and safe for real-world human-centred environments.
The project is especially suitable for students interested in medical AI, trustworthy AI, AI security, LLM safety, privacy-preserving machine learning, and the future of autonomous AI systems in healthcare.