My Blog

What are GANs and how can they generate synthetic data?

This blog explores Generative Adversarial Networks (GANs) and how they can be used to generate synthetic healthcare data.

Clinic to Code to Care

This blog came out of a talk Steph Jones and I gave at Women in Data and AI in October 2025. It follows the journey of information from a patient in clinic, through how that information is coded for research, to the statistical and machine learning models it ultimately informs to help improve patient care.

What is Synthetic Data and Why Does it Matter?

This blog is the first in a series exploring synthetic data, its benefits, and its applications in various fields.

Why the Adapter Pattern is King in Health Data

This blog discusses in detail how the Adapter Pattern can bridge the critical gaps between healthcare systems to improve interoperability.

Finding Similarity with Vector Search: A Beginner's Guide

This blog comes out of an interactive workshop I gave using SurrealDB. It's a beginner's guide to vector search, a modern way to find matches based on multiple preferences at once.

Code Review for Research Code

An overview of how to conduct a code review for research code.

SNOMED and friends

This blog provides an introduction to SNOMED codes and how they are used in routine care in the UK. It also covers some of the quirks of SNOMED and the challenges of using it in research.

An Introduction to Electronic Health Records

A quick primer on what an electronic health record is and how it is used in clinical practice and research. This post is UK-focussed, but the principles are the same in many other countries.
