Patient data use case studies

Photo of man with lots of files

Contact us

Questions about these case studies or how patient data is handled?

Contact the team

During our Review of Informed Choice for Cancer Registration, patients clearly told us they would like to know what their data are being used for. We agreed that as a charity, we would highlight where we are using patient-data for research and analysis. Relevant examples of work by the Cancer Intelligence team at Cancer Research UK will be available here, and we will add to this as we embark on new projects.

Patient data – a vital tool that will help beat cancer

Cancer is the most complex health challenge that we face. There are 200 different types of cancer but even that is an oversimplification.  

At a genetic level each individual cancer is as unique as the person with the disease. Despite this complexity our research has helped double cancer survival over the last 40 years. This extraordinary progress has been powered by relentlessly increasing our understanding of cancer’s intricacies. 

But big questions about cancer remain, and our scientists are seeking the answers. Patient data has the potential to reveal some of these. Here we meet some outstanding scientists who are using patient data to grow our understanding even further, work that could underpin tomorrow’s cures. 

Predicting the future – how patient data will help anticipate how a tumour might behave 

At the heart of the CRUK Scotland Institute is a team of brilliant scientists, led by Professor Crispin Miller, who specialise in using super computers to understand how cancer behaves. 

This diverse and talented team - which includes mathematicians, physicists, biologists and software engineers - are using a range of techniques to dig deep into the huge amounts of data we have on the genetics and anatomy of tumours.   

“Biology is exciting” says Crispin. “The questions of modern cancer research, like how the DNA in one cell can define an entire living person – to me that’s as exciting as understanding the first few microseconds of the Big Bang.”  

Crispin’s goal is to identify patterns in a tumour’s DNA or its structure that might help us better understand why and how cancers develop. And ultimately this could inform how best to treat them. 

But spotting those crucial patterns is difficult because tumours are so diverse. And so Crispin’s team turn to supercomputers. These powerful tools can sift through vast amounts of information to find patterns that would be impossible for a human to spot. Then the scientists can do what they do best – work out how the patterns are driving cancer progression or making a tumour more or less sensitive to an anticancer drug.   

Professor Miller explains: “Data gives us insight into things you can’t see down a microscope.  It has the potential to start developing truly targeted therapies. To find the right therapy for the right person at the right time  that is the goal.”  

His team are using a data approach on many different cancer-related questions.  In one project, patients have kindly allowed their doctors to share anonymised scan images of tumours and the tumour genetic data. Crispin’s team are seeing whether they can build a sophisticated AI approach that has the power to use this data to predict how tumours are going to respond to treatment. This could help doctors pick the best treatment for each person.   

What nearly all Professor Miller’s work has in common is that it is only possible due to the data that people with cancer have shared.  

“Data is central to the research we are doing.  The more tumours we study, the more we understand how they differ from each other and how we can use this information to improve outcomes for people with cancer.  I am incredibly grateful to the patients who donate their data and tissue, I couldn’t do my work without it.” 

“Data is everything” – how patient data is helping us understand how cancer spreads and resists treatment 

For Dr Irene Lobon, everything is data and data is everything. From calculating nutrients in new recipes to studying tumour samples, analysing data is central to her life and work. While in her spare time Dr Lobon’s analytical skills help her to optimise her favourite recipes, in the laboratory, these skills could lead to lifesaving discoveries. 

Dr Lobon is a biomedical researcher in the Cancer Dynamics Laboratory, led by Professor Samra Turajlic at The Francis Crick Institute. She works on a multidisciplinary team featuring scientists and clinicians, all working together to better understand cancer.  

The team are studying advanced, spreading melanoma, called metastatic melanoma. This is a type of skin cancer that is particularly hard to treat and often becomes resistant to drugs. Professor Samra’s group want to know how melanoma changes over time, what happens inside the tumour as it spreads and what features lead to drug resistance. To answer these questions, the team relies on patient data. Specifically, samples from patients who have passed away from cancer. 

To access these samples, the team makes use of the PEACE (Posthumous Evaluation of Advanced Cancer Environment) study, a huge collaborative project, set up to help researchers collect samples from patients who have died from cancer. In this study, researchers take samples from participants during their treatment journey and after they have died. This huge assembly of data has allowed clinicians and scientists to study cancer in a way that was not possible before. 

Professor Turajlic’s team employed sophisticated genetic sequencing techniques to study nearly 600 separate samples from 14 different patients with metastatic melanoma from the PEACE study. “What we’ve observed is that there are many variables that contribute tiny bits to the cancer progression” said Dr Lobon.  

Uncovering these small contributions allowed the team to piece together a complex timeline of how the cancer develops.  “It’s like having a fossil. We can study cancers at different points in time.”  

The team shared the results of this extensive study with the scientific community earlier this year. The incredible information uncovered has helped scientists to understand the complex ways that melanomas use to evade drugs. The researchers hope that it could also lead to the development of completely new drugs that could provide hope for those diagnosed with metastatic melanoma.  

While Professor Samra’s team are using the PEACE study to study metastatic melanoma, there are many other scientists using the valuable data collected to investigate other cancer types. There are researchers investigating lung cancer, renal cancer and many more. What they have in common, is their reliance on patient data. 

Dr Lobon emphasised that her work would be impossible without the generous people who agree to donate their samples after they pass. “It’s extremely important. There is no other way we could get such specific information without these samples.”  

“Data is everything.”  

The origins of oesophagael cancer – how patient data is revealing how this cancer gets started 

Oesophageal cancer is one of the biggest challenges in cancer research today. The underlying biology of the disease is poorly understood and survival remains very low, with only 1 in 10 people surviving their disease for 10 years or more. It’s a big challenge – and one that researchers and clinicians can’t tackle alone. 

For over 10 years, Professor Rebecca Fitzgerald at the University of Cambridge has been spearheading a huge project that brings together scientists, doctors and nurses from across the UK with the goal of better understanding oesophageal cancer.  

Known in the research community as OCCAMS (Oesophageal Cancer Clinical and Molecular Stratification), this impressive programme of work has seen researchers collecting tumour and blood samples from people with oesophageal cancer and decoding the cancer’s genetic sequence so that they have a complete map of the cancer’s DNA.  

These samples provide a treasure trove of information, allowing scientists to see how changes in DNA sequence affects progression of oesophageal cancer. And every person’s data provides a new piece of the puzzle. 

“This really is a team effort,” says Rebecca. “By sharing information, we can see the patterns and the trends and that’s what allows the breakthroughs to happen. Right now we can’t predict how well someone will do just by how they look when we first see them in the clinic. Some people do well, and sadly some people do badly, and we need to know why.” 

And there are a lot of questions that need answering. How do genetic changes in the cancer change the way it develops over time? Are there specific risk factors that could predict if oesophageal cancer is going to recur? Why do some people respond well to treatment and some people don’t? All these questions rely on matching up clinical data (data collected by doctors about how each person’s cancer progresses) with laboratory data (data that comes from samples that scientists take away to analyse in the lab).    

“And we need a lot of data,” explains Rebecca. “Oesophageal cancer is really complicated. We can’t find the needle in the haystack by just using data from a few people. The current problem my team is working on – working out whether all oesophageal cancers start from the precursor condition Barrett’s oesophagus – is using data from 4,000 patients. And that is why we are so grateful to all the patients who allow us to do this work.” 

In the end, this work is all driven by the desire to make the situation better for people with oesophageal cancer. Already the team have been able to describe for the first time some of the DNA mutations that they think cause cancer, and there are likely to be many more findings to come. 

 “We have seen some improvements over the 20 years I have been working on this,” reflects Rebecca, “but we’ve got a long way to go. Solving this problem is what gets me out of bed in the morning. And I want to get to the end of my career and see that it is really different. I can’t do what I do without patient data – and I am not done yet.”   

Personalising radiotherapy – how patient data is helping reduce the side effects of treatment 

Radiotherapy is the gold standard of treatment for many types of cancers. In fact, more than 130,000 patients benefit from radiotherapy every year in the UK. Today, most people receive image-guided radiotherapy – that is, using imaging such as x-rays and MRIs to target the beam of radiation to the tumour site, to make it as accurate as possible.  

“Many people think about imaging at the point of diagnosis to find a tumour, or to follow up on treatment to see if its working,” says Dr Rajesh Jena, a clinician scientist based in Cambridge who is finding ways to improve radiotherapy for people with tumours of the brain and spine. “But imaging can be so much more: it can be used to personalise and even predict response to treatment.”  

With radiotherapy there can be side effects because the beam of radiation can also affect healthy tissue surrounding the tumour. Combining imaging while delivering radiotherapy can help minimise these side effects, and getting better at combining these approaches is where patient data can be so important.  

By studying images taken during radiotherapy, researchers can determine which healthy tissues were affected by the radiation, and then use that information to help predict how a new patient might respond to treatment, potentially altering the treatment plan to make it as side-effect free as possible. 

“The wealth of information an image holds comes with an incredible potential,” says Dr Jena. Such potential to inform patient care led Dr Jena and his team to develop the first continuously learning artificial intelligence (AI) medical device in his hospital. Using patient data to train the AI, they were able to develop a tool that speeds up analysis of images. The technology helps doctors by cutting the amount of time spent ‘sketching’ around healthy organs, as they create radiotherapy plans, a painstaking yet vital step in radiotherapy treatment to ensure healthy tissue is protected.  

And this data offers value far beyond direct medical treatment. “It’s not only doctors that need access to imaging data like this,” Dr Jena adds. “Patients are so generous allowing us to use this data, so we try to maximise its value by working with mathematicians, physicists, biologists and many other experts.” Bringing together different specialisms like this gives the scientists the best chance of understanding a disease as complex as cancer. 

“And we also look back at historical patient images – this is important to provide context, improve our understanding and build up a valuable database of information.” 

Thanks to patient data, Dr Jena and his team have already been able to create a technology that frees up valuable time and ultimately enable patients to receive treatment sooner. Over the next year, Dr Jena and his team are working on improving this technology by providing the AI with even more examples of patient images to learn from, allowing it to become better than ever at identifying healthy tissue.  

“We’re so grateful to have access to patient imaging data because without it we wouldn’t be able to develop technologies like this, which can make a real difference to people with cancer. We’re very careful with how we use patient data, we take care to ensure that all data is anonymised and is treated with respect.” 

 

Case studies in detail

The earlier diagnosis of lung cancer will save lives but also puts additional pressure on diagnostic services. It is therefore important that lung cancer pathways are organised to be as effective and efficient as possible and to ensure patients are given their diagnosis as soon as possible.

We hoped to improve understanding of pre-diagnostic events, intervals and patterns for lung cancer patients on a national scale, and to benchmark to the timings in the newly adopted National Optimal Lung Cancer Pathway (NOLCP).

The data used included: lung cancer registrations (2013-2015) from National Cancer Registration Analysis Service (NCRAS), diagnostic imaging data (DID) and cancer waiting times (CWT) data from NHS England; and involved data from over 100,000 patients.

Following data linkage, time intervals between events were calculated, different scenarios of events were investigated and timings of events were compared with those from the NOLCP.

Many different diagnostic scenarios exist, from simple to complex, which varies by CCG. Time intervals from imaging to diagnosis differed by source of image referral, with those ordered by GP direct access imaging having longer pathways. Benchmarking to the NOLCP timings showed small proportions (less than 6%) of patients meeting timings, this also varied widely by CCG. 

Partners: ACE Programme (funded by CRUK, Macmillan, NHS England), CRUK-NHS England Partnership, NCRAS

See also: ACE Programme website, More in-depth findings

Survival for ovarian cancer in the UK is lower than other comparable high-income countries and there is substantial regional variation within the UK. Access to and/or quality of treatment may be contributing factors.

This analysis will establish a detailed picture of ovarian cancer treatments and outcomes and is intended to provide initial insights into a planned national ovarian cancer audit benchmarking pilot, with the ultimate aim to act as a catalyst to reduce variation and drive improvements to clinical practice.

Data used will be: patients diagnosed with ovarian cancer (1st July 2014 to 31st March 2015) who were eligible for chemotherapy; information about their tumour, chemotherapy treatment received, and demographic information.

The linked data will be used to compare treatment access rates between Cancer Alliances and/ or Trust of Multi-Disciplinary Team/diagnosis, to determine the extent to which this can be explained by regional differences in patient demographics, such as age and socioeconomic status, and we will investigate the influence of healthcare system levels factors on access to treatment, such as provider and type, and consultant volume and specialisation.

Findings are due Winter 2018, but we expect to find how much geographical variation in access to treatment for ovarian cancer there is in England, and to what extent is this affected by patient demographics and healthcare system level factors.

Partners: NHS England

The purpose of this service evaluation is to establish; factors that affect access to lung cancer treatments for non-small cell lung cancer (NSCLC) patients, whether there is variation between Cancer Alliances/ hospital trusts in access to a range of treatment pathways, and the impact of patient, tumour and provider characteristics on access to treatments.

Data used will be: patients diagnosed with lung cancer (1st April 2014 to 31st March 2015) who were eligible for chemotherapy; information about their tumour, chemotherapy treatment received, and demographic information.

Understanding variation in accessing treatments will inform policy makers and commissioners regarding where efforts should be focused, to ensure equitable access to effective treatments for NSCLC, and improve patient outcomes.

The linked data will be used to calculate access rates for different treatments, and then statistical tests will measure the overall geographic variation and significance. Finally, logistic regression will identify which factors are predictive of whether people receive various treatments (e.g. age, deprivation, ethnicity).

Findings are due early 2019: we expect to establish the presence of variation at a geographic level, and the reaso

Partners: NHS England

In England, less than 60% of eligible bowel cancer screening participants take part in the programme. Cancer Research UK, in partnership with Public Health England, carried out a regional Be Clear on Cancer pilot campaign in the North West of England to encourage participation.

The campaign ran in 33 CCGs and consisted of advertising (including TV) and direct mail. Advertising ran alone for 6 weeks and was later combined for 6 weeks with direct mail in 22 CCGs to first-timers and previous non-responders. The remaining 11 CCGs continued to receive advertising only. The campaign was aimed at people aged 55-74 in the lower socioeconomic groups (C2DE) and skewed towards men; this was based on pilots previously run by Cancer Research UK.

We evaluated whether the campaign advertising, or advertising paired with sending a CRUK-endorsement letter with the kit increased uptake. Differences have also been compared across different socioeconomic groups.

The data we used to do this were extracted from the national bowel cancer screening database. We used age, location, date of test kit received, and date kit sent back. For the direct mail element of the campaign, we obtained a set of data for bowel screening test kit recipients who received a CRUK-endorsement letter, and a separate set of data for bowel screening test kit recipients who did not. We did not receive any names, addresses, or NHS numbers, so no individuals could be identified.

We split participants into 3 groups for the evaluation:

  • First timers – people who are invited for bowel screening for the first time
  • Previously screened – those who were invited for screening previously and returned completed kits
  • Previous non-responders – those who were invited previously but didn’t return completed kits

Results are promising. They showed an increase in uptake of the screening test kit during the live campaign for all groups, as well as a smaller sustained increase in the 3 months following the campaign. 

There are upcoming changes to the bowel cancer screening programme which may impact on screening uptake. We recommend waiting for an appropriate time, once changes have been implemented, to consider another campaign.

Partners: NHS England

See also: Definitions of socioeconomic groups, Previous pilots.

The aims of COVID RT are to understand why changes in radiotherapy treatment schedules were implemented during the pandemic and to then explore the impact of these changes on patient outcomes and the UK radiotherapy services. This project aims purely to understand the changes in patient's radiotherapy treatment from COVID-19.

The COVID-19 pandemic has had a significant impact on cancer patients and cancer services across the UK. During the peak of the pandemic, radiotherapy services across the UK continued to treat cancer patients in often challenging circumstances and implemented significant changes to standard practice to minimise the risks to patients of contracting COVID-19 and focus radiotherapy resources where they were most needed. The scale of these changes in radiotherapy practice, the clinical decision-making underpinning them and their impact on cancer patient outcomes is unknown. This study will give a national picture of the decision making by patients and clinical staff and the impact on patient's treatment. It also provides knowledge of how RT was used as a bridge to surgery when surgical services weren't viable due to the pandemic and therefore what are the further requirements for cohorts of patients.

Data have been collected by radiotherapy centres across England specifically for this study as part of patient's routine care. These data were submitted to NHS England (previously Public Health England) under regulation 2 of the Health Service Regulations 2002. NHS England (previously Public Health England) collected and stored these data. A pseudonymised extract of these data was created and transferred to our Cancer Research UK Secure Data Environment (SDE).  

These data are being analysed within the CRUK SDE, by named analytical staff at Cancer Research UK and University of Leeds, alongside similar data for other UK nations. Summary results will be calculated and used as the first research project to understand the affect Covid had on radiotherapy treatment. These data will not be linked to other datasets.

Partners: NHS England