1 Recommendations

Can be used with evidence generation

1.1

Five artificial intelligence (AI) technologies can be used in the NHS during the evidence generation period as options to aid the opportunistic detection of vertebral fragility fractures (VFFs). The technologies are:

  • BriefCase-Triage

  • CINA-VCF Quantix

  • HealthVCF

  • HealthOST

  • IB Lab FLAMINGO.

    These technologies can only be used:

  • within their indicated populations as outlined in their instructions for use and with consideration of the risk groups as recommended in NICE's guideline on assessing the risk of fragility fracture in osteoporosis

  • if the evidence outlined in the evidence generation plan is being generated

  • as long as they have appropriate regulatory approval, including NHS England's Digital Technology Assessment Criteria (DTAC) approval.

    Commissioners should take into account whether a technology is likely to remain available on the UK market and supported by its company when entering into a contract.

1.2

The companies must confirm that agreements are in place to generate the evidence. They should contact NICE annually to confirm that evidence is being generated and analysed as planned. NICE may revise or withdraw the guidance if these conditions are not met.

1.3

At the end of the evidence generation period (3 years), the companies should submit the evidence to NICE in a format that can be used for decision making. NICE will review the evidence and assess if the technology can be routinely adopted in the NHS.

More research is needed

1.4

More research is needed on the following AI technologies that aid the opportunistic detection of VFFs before they can be funded by the NHS:

  • Annalise Enterprise CXR/Annalise Container CXR

  • BoneView

  • TechCare Spine.

What this means in practice

Can be used with evidence generation

The 5 technologies listed in recommendation 1.1 can be used as an option in the NHS during the evidence generation period (3 years) and paid for using core NHS funding. During this time, more evidence will be collected to address any uncertainties. Companies are responsible for organising funding for evidence generation activities.

Take into account whether a technology is likely to remain available on the UK market and supported by its company before generating evidence to address the evidence gaps. Evidence generation should preferably be on technologies that will still be available in the NHS after the evidence generation period.

After the evidence generation period, NICE will review this guidance and the recommendations may change. Take this into account when negotiating the length of contracts and licence costs.

Potential benefits of use in the NHS during the evidence generation period

  • Clinical benefit: Clinical evidence suggests that AI technologies can help opportunistically detect VFFs that would otherwise have been missed. This could help identify more people with a VFF who need treatment to improve their quality of life and reduce the risk of future fractures.

  • Resources: By reducing the risk of further fractures, early detection and treatment of VFFs could reduce the demand on other costly services, such as those needed to manage hip fractures.

  • System benefit: Using AI technologies can help reduce variation in clinical practice and help healthcare professionals to implement the Royal College of Radiologists' guidance for the recognition and reporting of osteoporotic vertebral fragility fractures.

Managing the risk of use in the NHS during the evidence generation period

  • Clinical subgroups: There is no evidence to show whether the AI technologies are equally clinically effective across all age groups. Older age is a risk factor, but there are other risk factors independent of age. It is uncertain whether the opportunistic detection of VFFs in all subgroups represents value for money in the NHS.

  • Resources: Implementing the AI technologies could have a big impact on radiology services, such as increasing the number of diagnostic images that need to be reviewed by a radiologist and the number of referrals for dual-energy X-ray absorptiometry (DEXA) scans that need to be done.

  • Costs: Early results from the economic modelling show that the technology could be cost effective. But, there is uncertainty around the cost of some of the technologies and the true cost of implementing them in the NHS. Trusts should take into account the costs of the AI technologies used in this assessment when implementing the technologies. When negotiating with companies, trusts should also consider the upfront costs for implementing a technology and should monitor costs associated with its use in populations at a lower risk of osteoporosis.

  • Clinical risk: Using AI technologies to help detect VFFs on diagnostic images is considered to have a low clinical risk. This is because the technologies are used in addition to standard care in which healthcare professionals make treatment decisions. AI technologies do not replace the definitive radiology review.

  • Implementation guidance: Clear local protocols will need to be in place when using AI technologies. This is to ensure that healthcare professionals refer people with a newly identified VFF to the appropriate services.

  • Equality: There is a risk that the AI technologies may have reduced diagnostic accuracy in different populations. These include younger people who may have risk factors for VFF, people from ethnic minorities and other groups that may have been underrepresented in the AI training set.

NICE has produced tools and resources to support the implementation of this guidance.

More research is needed

There is not enough evidence to support funding the 3 technologies listed in recommendation 1.4 for the purpose of opportunistic detection of VFFs in the NHS.

Access to the technologies should be through company, research or non-core NHS funding, and clinical or financial risks should be managed appropriately.

What evidence generation and research are needed

Evidence generation and more research are needed on:

  • the diagnostic accuracy of the technologies compared with current NHS standard care, including in key subgroups such as people under 50 and people at a higher risk of a VFF

  • the failure rates of the technologies and the reasons for failure

  • the impact of identifying additional VFFs on referral rates for other services, including DEXA

  • the impact of identifying additional VFFs on treatment

  • the impact of introducing the technologies on the workload of healthcare professionals

  • the short-term impact on quality of life of identifying and managing a VFF.

The evidence generation plan gives further information on the prioritised evidence gaps and outcomes, ongoing studies and potential real-world data sources. It includes how the evidence gaps could be resolved through real-world evidence studies.

Why the committee made these recommendations

AI technologies can help healthcare professionals spot VFFs on X-ray images and CT scans involving the spine that are done for unrelated conditions (opportunistic detection). Treatment can reduce symptoms and the risk of future fractures, so detecting VFFs early has clear benefits. Preventing future fractures can also reduce the demand on radiology services and save money elsewhere in the NHS.

BriefCase-Triage, CINA-VCF Quantix, HealthVCF, HealthOST and IB Lab FLAMINGO are designed to help detect VFFs on CT scans. Diagnostic accuracy evidence comparing them with a reference standard suggests that they can help detect moderate to severe VFFs. Early economic evidence shows that they could be cost effective. So, these 5 technologies are recommended for use with evidence generation.

Annalise Enterprise CXR/Annalise Container CXR, BoneView and TechCare Spine are designed to help detect VFFs on X-ray images. Clinical evidence for Annalise Enterprise CXR/Annalise Container CXR is uncertain because it is based on studies that included mostly or only lateral chest X-ray images. In the NHS these are not commonly done and are usually only performed in specific groups. So, the evidence may not be generalisable to the NHS and the diagnostic accuracy of the technology in this context is uncertain. The usefulness of BoneView and TechCare Spine is uncertain because they only analyse spine X-ray images, which are usually done for indications relating to back pain and include a thorough review of the spine. This means VFFs are less likely to be missed on these X-ray images, so the technologies may not offer additional benefit. In addition, there is no clinical evidence for TechCare Spine. So, these 3 technologies can only be used in research.

Evidence on the diagnostic accuracy of the technologies compared with standard care in the UK is limited. So, more data should be collected to show how much better they are at detecting additional VFFs in clinical practice. More evidence is also needed on the downstream effects of the technologies for both the people having diagnostic imaging and the healthcare professionals. This should include the effect of the technologies on the rates of referral and on treatment, quality of life and healthcare professional workload. Companies should also address gaps in the evidence around how often their technologies are unable to process an image (the failure rate) and the reasons why this happens.