Skip to main content

Together we are beating cancer

Donate now
  • For Researchers

Rewiring the life sciences: why foundations come first for AI in science

The Cancer Research UK logo
by Cancer Research UK | In depth

22 June 2026

0 comments 0 comments

Ai

AI could transform cancer research, and the UK has the ambition to lead. But the science will only deliver in full if we get the foundations right first – here Shivani Sinha tells us how we can ensure just that…

Data science and AI are already part of how cancer research is done. And yet the science may not be moving as fast as it could.

When we invest in the technology, it is easy to forget the foundations beneath it: the data we train models on, the trust and inclusivity around how it is used, the people with the skills to do the work, and the computing power to run it. These foundations will decide whether the UK’s strengths reach the people they are meant to help, or stall before they do.

Across the UK, AI is moving from promise to practice. It is already helping researchers read scans, design cancer vaccines and detect patterns across datasets too vast to analyse by hand.

The question is not whether we have the ambition. We clearly do, CRUK’s new Data Science in Cancer Research strategy is a good example of that. It is whether we can match that ambition with the harder, quieter work of building the foundations beneath it.

An exciting moment

Across the UK, AI is moving from promise to practice. It is already helping researchers read scans, design cancer vaccines and detect patterns across datasets too vast to analyse by hand.

The UK is well placed to build on this momentum: it is home to leading AI labs such as DeepMind, and Anthropic’s new London office will sit close to the Francis Crick Institute, strengthening the connection between cutting-edge AI and a world-class life sciences research base. Government investment is also increasing, from a planned twenty-fold expansion in public compute by 2030 to a new national supercomputer in Edinburgh and a Sovereign AI Fund that names life sciences as a priority. CRUK is backing the field directly too, investing £60 million through its Data Science in Research Strategy.

It is an exciting moment. But investment alone will not be enough. The quieter groundwork beneath these AI tools must keep pace.

An old problem with higher stakes

The barriers we set out are not new. Bringing the right data together, in a form researchers can readily use, has never been straightforward. People who can work fluently across data science and biology have always been in short supply, and useable computing power has often struggled to reach the researchers who need it.

What AI changes is the cost of leaving these things unfixed. A researcher could once work around a thin dataset or a clunky system through judgement and patience. A model cannot. It inherits the gaps in the data and infrastructure around it – and, when reused across multiple projects, can reproduce those flaws at scale.

Taken together, these foundational gaps mean that AI-driven research happens more slowly, works less well, or turns out less fair to the very patients whose trust the research depends on. That last point matters most, and it runs underneath everything else: trust is far easier to lose than to rebuild, especially in a climate where the public conversation about AI tends towards caution.

AI

A researcher could once work around a thin dataset or a clunky system through judgement and patience. A model cannot. It inherits the gaps in the data and infrastructure around it.

Four foundations to get right

None of this is cause for pessimism. With the right investment and support from government, these are solvable problems. The changes we are calling for fall into four broad themes.

The first is sorting out the data itself. The UK’s health data assets are world-class, but holding data and being able to use it are not the same thing. Too often a researcher with a good idea spends months not doing science, but chasing permissions across institutions, only to find the dataset they finally reach is missing the fields they needed. And even when data can be reached, it has to be in a form AI can work with: linked up rather than scattered, and held to shared standards rather than reinvented lab by lab. We see the forthcoming Health Data Research Service as a chance to build that in from the start, and cancer, with some of the most complex and sensitive data in the system, is the right test case. Get it right there, and it will work almost anywhere.

The second is making AI both inclusive and trustworthy. Who is in the data decides who benefits from the science. A model is only as good, and only as fair, as the data behind it, so the datasets we rely on need to reflect the whole population, not just the groups easiest to study. AI raises the stakes here, because it doesn’t only inherit the gaps in our data, it amplifies them. The answer isn’t to discard every imperfect dataset, but to be honest about what’s in them, so their limits are understood rather than discovered too late. Closely tied to this is trust: patients will only share sensitive data when they’re confident it will be handled with care, so we have to bring people into research decision making from the start, as partners rather than names informed after the fact.

A model is only as good, and only as fair, as the data behind it, so the datasets we rely on need to reflect the whole population.

The third is people and skills. Most of the research workforce doesn’t need to become AI specialists, but they do need enough grounding to use these tools well and to know where they fall short. Beyond that, we need more of the rare researchers who are genuinely fluent in both data science and biology, the people who can make the two worlds talk to each other. They are hard to find and easy to lose, so we also need a system that lets people move between universities, the NHS and industry without it being a one-way door.

The fourth is computing power, where the trap is to assume the answer is simply more of it. How it is shared out matters just as much. Life sciences has needs that general-purpose compute planning could potentially overlook, and an impressive national headline figure means little if researchers can’t get time on the machines.

The harder problem is that data and compute often sit in different places, when the sensible thing, for security as much as speed, is to bring the analysis to the data rather than shuttling sensitive data around. Get that right, and capacity that exists only on paper becomes something researchers can really use.

Foundations first

Get these four things right, and the rest follows. The technology and ambition are here. The task now is to step back and strengthen the foundations beneath them. CRUK is playing its part: investing in AI-ready data, building skills across the research community, and pressing government to move faster because we are moving too.

Do make sure to take a look at the Rewiring the Life Sciences report in full. If you are a researcher working at this intersection, the foundations get built faster when the people who run into them every day help shape the fix, and we would value your perspective. The point, in the end, is people living longer, better lives, free from the fear of cancer, reached sooner because the foundations were finally there to carry the research that serves them.

Shivani Profile Picture

Author

Shivani Sinha

Shiva is a Policy Advisor in Science and Research Policy at Cancer Research UK

Tell us what you think

Leave a Reply

Your email address will not be published. Required fields are marked *

Read our comment policy.

Tell us what you think

Leave a Reply

Your email address will not be published. Required fields are marked *

Read our comment policy.