Interventional procedure overview of Balloon cryoablation for treating Barrett's oesophagus
Closed for comments This consultation ended on at Request commenting lead permission
Evidence summary
Population and studies description
This interventional procedures overview includes 4 prospective cohort studies, 4 retrospective analyses, and 1 systematic review and meta-analysis. The overview is based on 743 people from 8 observational studies. This Of the 743 people included, around 444 had the procedure. This figure accounts for a known overlap of 22 people between studies. However, the figure of 743 does not account for the systematic review, as CBA was only included as a subgroup, with combined population estimates not provided. There is also a notable overlap of studies included in the systematic review and the studies included in this overview as key evidence.
This is a rapid review of the literature, and a flow chart of the complete selection process is shown in figure 1. This overview presents 9 studies as the key evidence in table 2 and table 3, and lists 19 other relevant studies in appendix B, table 5.
The key evidence included 3 single-centre studies (Canto, 2018; Alshelleh 2021; Dbouk, 2022) and 5 multi-centre studies (Schölvinck, 2015; van Munster, 2018; Canto, 2020; Agarwal, 2022; Frederiks, 2022). One did not include location details (Dbouk, 2022). All others included centres in the US (N=5) (Schölvinck, 2015; Canto, 2018; Canto, 2020; Alshelleh, 2021; Agarwal, 2022), or the Netherlands (N=3) (Schölvinck, 2015; van Munster, 2018; Frederiks, 2022). None included UK centres.
Follow-up ranged from 8 weeks (van Munster, 2018) to 4 years (Dbouk, 2022). Most had at least 1 year follow-up (Canto, 2018; Canto, 2020; Alshelleh, 2021; Agarwal, 2022; Dbouk, 2022).
Study populations varied. All required a confirmation of BE. All included LGD and HGD. People with ImCA were included in 5 studies (Schölvinck, 2015; Canto, 2018; Canto, 2020; Agarwal, 2022; Dbouk, 2022). Previous ablation was allowed in 2 studies (van Munster, 2018; Canto; 2018). The remaining 6 studies only included those who were treatment naïve (Schölvinck, 2015; Canto, 2020; Alshelleh, 2021; Agarwal, 2022; Dbouk, 2022; Frederiks, 2022). Mean age ranged from 65 years (Canto, 2020) to 68 years (Frederiks, 2022). Males were more commonly included across all studies. The proportion of males ranged from 82.6% (Alshelleh, 2021) to 93% (Frederiks, 2022). All studies included both LGD and HGD, but HGD was more common. Table 2 presents further study details.
Study no. | First author, date country | Characteristics of people in the study (as reported by the study) | Study design | Inclusion criteria | Intervention | Follow up |
|---|---|---|---|---|---|---|
1 | Dbouk (2022) United States | N=59 (all CBA) Mean age: 66.8 (SD 9.6) Male gender: 54 (91.5%) Mean BMI: 29.5 (SD 5.2) LGD: 22 (37.3%) HGD: 33 (55.9%) ImCa: 4 (6.8%) Mean BE length: 5 cm (SD 4.7) <8 cm: 45 (76.3%) >8 cm: 14 (23.7%) | Prospective cohort | Treatment-naïve people with LGD, HGD, or intramucosal cancer (ImCA) | Cryoballoon ablation - cryoballoon focal ablation system, Pentax Medical, Montvale, New Jersey, United States, with touch up argon plasma coagulation (APC) for small residual columnar islands (< 5 mm) | Median 54.3 months (IQR 32.9 – 65) |
2 | Agarwal (2022) United States | N=311 (85 CBA versus 226 RFA) CBA: Mean age: 67.1 (SD 10.1) Male gender: 71 (83.5%) Mean BMI: 28.9 (SD 4.9) LGD: 32 (37.6%) HGD: 53 (62.4%) Prior resection: 51 (60.0%) RFA: Mean age: 65.6 (SD 10.0) Male gender: 177 (78.3%) Mean BMI: 30.8 (SD 5.9) LGD: 108 (47.8%) HGD: 118 (52.2%) Prior resection: 112 (49.6%) | Retrospective cohort | People with HGD or LGD (segments ≤6 cm), or ImCA using CBA or RFA as their primary ablation modality | Cryoballoon ablation - C2 focal cryoballoon, Pentax Medical Corporation, Montvale, NJ, USA versus radiofrequency ablation - Medtronic, Minneapolis, Minn, USA | Median 2 years (IQR 1.3-2.5) CBA; 1.5 years (IQR 0.8-2.5 RFA) |
3 | Frederiks (2022) Netherlands | N=56 (all CBA: 28 10-second duration versus 28 8-second duration) 10-second cohort (N=28): Mean age: 68 (SD 58-73) Male sex: 26 (93%) Mean BMI: 27 (SD 25-30) LGD: 8 (29%) HGD: 8 (29%) Adenocarcinoma: 12 (43%) Prior resection:17 (61%) 8-second cohort (N=28): Mean age: 67 (SD 59-72) Male gender: 23 (82%) Mean BMI: 28 (SD 24-30) LGD: 8 (29%) HGD: 10 (36%) Adenocarcinoma: 10 (36%) Prior resection: 19 (68%) | Retrospective analysis of prospective data | People aged ≥18 years with short BE segments (C≤2 cm and M≤5 cm) and either HGD, LGD or residual BE following prior resection, ablation therapy naïve | Cryoballoon ablation - C2 Cryoballoon Ablation System, Pentax Medical, Redwood City, Calif, USA. Twice daily proton pump inhibitors and once daily (prescribed at physicians discretion) histamine receptor antagonist used alongside. | 12 weeks |
4 | Canto (2020) United States | N=120 (all CBA) Mean age: 65 (45-83) Male gender: 102 (85%) Mean BMI: 32 (18.7-59) Mean Prague C: 1.2 (0–5) Mean Prague M: 3.2 (1–6) White ethnicity: 112 (93.3%) LGD: 29 (24%) HGD: 67 (56%) ImCa: 24 (20%) | Multi-centre prospective cohort | People aged 18 years or older with treatment naïve BE of 6 cm or less, with either HGD, LGD or ImCA | Cryoballoon ablation - C2 Cryoballoon/ Pentax Medical Corporation), with touch up APC for skipped areas if islands <5mm and fewer than 3 in number. Proton pump inhibitor and daily histamine receptor antagonists used alongside. | 1 year |
5 | Canto (2018) United States | N=41 (all CBA; 22 treatment naïve versus 19 previously ablated) Mean age: 65.7 (34-79) Male gender: 34 (85%) Mean Prague C: 1.7 (0-9) Mean Prague M: 3.9 (1-14) LGD: 13 (31.7%) HGD: 23 (56.1%) ImCa: 5 (12.2%) Prior resection: 14 (34%) Prior RFA: 19 (46%) Pre-existing stricture due to prior ablation: 9 (22%) Mean maximum BE length: 3.9 cm (1-14) | Prospective cohort | Adult people with >1 cm BE with LGD, HGD, or ImCA. Including treatment naïve or previously ablated people | Cryoballoon ablation - C2 Therapeutics, Inc, Redwood City, Calif. Proton pump inhibitor (dosed once or twice daily) and histamine receptor antagonists (at the discretion of enrolling site) used alongside. | Median 20.9 months (IQR 17.5-24.6) |
6 | van Munster (2018) Netherlands | N=46 (20 CBA versus 26 RFA) CBA (N=20): Median age: 66 (62-71) Male sex: 17 (85%) LGD: 9 (45%) HGD: 11 (55%) Prior endoscopic resection: 6 (30%) Prior ablation: 2 (paper states 20%) Prior resection and ablation: 4 (20%) RFA (N=26): Median age: 68 (63-74) Male sex: 21 (81%) LGD: 14 (54%) HGD: 12 (46%) Prior endoscopic resection: 5 (19%) Prior ablation: 9 (35%) Prior resection and ablation: 7 (27%) | Retrospective analysis of prospective data | People with flat BE and either LGD, HGD, residual BE post-resection for non-flat lesions with dysplasia or mucosal EAC or residual BE after circumferential or focal ablation | Cryoballoon ablation - C2 Therapeutics, Inc, Redwood City, Calif, versus RFA - Medtronic, Inc, Minneapolis, Minn. Proton pump inhibitor dosed twice daily alongside. | 3 months |
7 | Alshelleh (2021) United States | N=71 (46 CBA versus 25 cryospray) CBA (N=46): Mean age: 65.5 (45-83) Male gender: 38/46 (82.6%) LGD: 25 HGD/ImCa: 21 Mean BE maximum length: 3.2 cm (1-9) Prior resection: 21 (45.7%) Cryo spray (N=26): Mean age: 65 (49-84) Male gender: 21/26 (84%) LGD: 9 HGD/ImCa: 16 Mean BE maximum length: 3.6 cm (1-12) Prior resection: 15/25 (16%) | Retrospective cohort | People (≥18 years), with histologically confirmed, treatment naïve BE (LGD, HGD or ImCA) | Cryoballoon ablation - C2 Cryoballoon, Pentax Medical, Montvale, NJ versus cryospray - truFreeze, Steris Endoscopy, Mentro, OH | Mean 13 months (range 6-15) CBA; 15 months (range 9-18) cryospray |
8 | Schölvinck (2015) United States, Netherlands | N=39 (all CBA) Mean age: 66 (57 – 69) Men: 35 (90%) Median Prague C: 2 (2-4) Median Prague M: 5 (3-7) No dysplasia: 9 (23%) Indefinite dysplasia: 1 (3%) LGD: 9 (23%) HGD: 9 (23%) Early adenocarcinoma: 11 (28%) | Multi-centre prospective cohort | People aged 18-80 years who are ablation treatment naïve, with BE (LGD, HGD or ImCA), a flat treatment area, and either a Prague classification score of C≥2 and/or M ≥3, or a BE island (≥1 cm) | Cryoballoon ablation - C2 Therapeutics, Redwood City, California, USA. various CBA durations explored, including: 6 seconds (n=10), 8 seconds (n=28) and 10 seconds (n=18).Proton pump inhibitors dosed twice daily used alongside. | 8 weeks |
9 | Papaefthymiou (2024) | SLR, including 9 studies reporting on CBA (included as a subgroup analysis). There is likely significant overlap with other studies included in this overview. The following studies reported in this systematic review have also been included separately in this overview:
Further details of the combined population are not reported for the subgroup specifically. | Systematic literature review and meta-analysis | Included studies reporting on adult patients (≥18 years old) with BE and dysplasia, cryoablation balloon intervention, reporting on CED and CEIM | Cryoballoon ablation (C2 Cryoballoon Focal Ablation system, Pentax Medical, Redwood City, Calif, USA) | NR |
First author, date | Efficacy | Safety |
|---|---|---|
Dbouk (2022) | CED (per protocol; sensitivity analysis)
CE-IM
Treatment failure:
Disease progression:
| Stricture
Post-procedural bleed:
|
Agarwal (2022) | CRD
CRIM
| Strictures
Perforation
Bleeding
|
Frederiks (2022) | BE surface regression
Technical success
| Strictures
Oesophageal scarring
Bleeding
Pain
Tolerability
Medication use
|
Canto (2020) | CED (per protocol, intention-to-treat)
CE-IM (PP, ITT)
Technical success (ITT)
Treatment failure (ITT)
Disease progression (ITT)
| Adverse events
Strictures
Dysphagia (among those with stricture)
Bleeding
Perforation
Post-procedural pain:
Medication use:
|
Canto (2018) | CED
CE-IM
Technical success
Treatment failure
Disease progression
| Adverse events
Strictures
Dysphagia
Pain
Medication use
|
van Munster (2018) | BE surface regression
| Pain
Dysphagia
Medication use
|
Allselleh (2021) | CRD (CBA, Cryospray - CS)
CR-IM (CBA, Cryospray - CS)
| Strictures
|
Schölvinck (2015) | Conversion to neo-squamous epithelium
| Adverse events
Strictures
Treatment failure
Pain
Medication use
|
Papaefthymiou (2024) | CED
CEIM
Recurrence of BO:
| Adverse events
Stricture development sub-category
|
Procedure technique
Device details were given by all 9 studies. All used the C2 Cryoballoon Focal Ablation System (Pentax Medical, Redwood City, Calif, USA). The pear shaped cryoballoon was noted as an available alternative to the standard balloon in 3 studies (Canto, 2020; Agarwal, 2022; Dbouk, 2022).
Procedure details were given by 7 studies (Schölvinck, 2015; van Munster, 2018; Canto, 2018; Canto, 2020; Agarwal, 2022; Dbouk, 2022; Frederiks, 2022). Procedure details were not included in Alshelleh (2021) or the systematic review by Papaefthymiou (2024).
Setting details were given by 4, which all included the outpatient setting (Canto, 2018; van Munster, 2018; Canto, 2020; Fredericks, 2022). The inpatient setting was also included in 1 multi-centre study (Frederiks, 2022). This was to help meet site specific anaesthesia and sedation policies.
The systematic review by Papaefthymiou (2024) did not specify the CBA duration. A standard 10-second CBA duration was used across the remaining 8 studies. Additional durations were also included in 2 studies. This included 8-seconds (Schölvinck, 2015; Frederiks, 2022), and 6-seconds (Schölvinck, 2015). Frederiks (2022) initially selected a 10-second dose. However, due to comparably high stricture rates compared with comparators for the first 28 procedures, the dose was lowered to 8-seconds to improve safety while preserving efficacy.
Sedation details were given by 6 studies (Schölvinck, 2015; van Munster, 2018; Canto, 2018; Canto, 2020; Agarwal, 2022; Frederiks, 2022). Conscious sedation was specified in 2 (Schölvinck, 2015; van Munster, 2018) and intravenous propofol in 1 (Canto, 2018). Site-specific standard of care was specified in 2 (Canto, 2020; Frederiks, 2022), with either general anaesthesia, propofol or conscious sedation as clinically indicated specified in 1 (Agarwal, 2022).
Details on the maximum number of ablations or treatment sessions was variably reported. The systematic review by Papaefthymiou (2024) did not report this detail for the CBA subgroup. Three studies did not specify a maximum number of ablation sessions (Alshelleh, 2021; Dbouk, 2022; van Munster 2018). Schölvinck (2015) reported a maximum of 2 ablations per person. The remaining studies all specified a maximum of 5 ablative treatment sessions within 12 months (Agarwal, 2022; Canto 2018; Canto 2020; Frederiks 2022). Only Canto (2018) specified the number of ablations allowed per treatment session, with a maximum of 24.
Dbouk (2022) allowed "touch-up" focal ablations using either CBA or APC for small residual columnar islands < 5 mm. Canto (2020) also allowed APC treatment of small flat residual BE islands if fewer than 3 in number and all were < 5 mm in maximum diameter.
Four studies included comparators, including RFA (van Munster, 2018; Agarwal 2022) and cryosrpay (Alshelleh, 2021; Papaefthymiou, 2024). Both van Munster (2018) and Agarwal (2022) provided details of the comparator procedure. Both used focal RFA (Medtronic, Inc, Minneapolis, Minn). van Munster (2018) note that the RFA regimen consisted of either 3 applications with 12 J/cm2, or a regimen consisting of 2 applications with 12 J/cm2 followed by a cleaning step and another 2 applications with 12 J/cm2. Agarwal (2022) note that RFA was applied using standard techniques, using either a balloon-based device or focal device, depending on BE segment length. Alshelleh (2021) and Papaefthymiou (2024) did not include details of the cryospray or spray catheter procedure.
Five studies reported the use of adjunct medical therapies (Schölvinck, 2015; van Munster, 2018; Canto, 2018; Canto 2020; Frederiks 2022). All five reported use of proton-pump inhibitors. Four specified twice-daily dosage (Schölvinck, 2015; Canto, 2018; van Munster, 2018; Frederiks, 2022), while Canto (2020) specified that dosing could be once or twice daily. Three reported use of histamine receptor antagonists (Canto, 2018; Canto 2020; Frederiks, 2022). Canto (2018) reported once daily use, while Canto (2020) and Frederiks (2022) reported use at physicians' discretion.
Efficacy
Complete eradication or remission of dysplasia
CED or CRD was reported in 6 studies. This included 3 as a primary outcome (Canto, 2020; Dbouk, 2022; Alshelleh 2021), 2 as a secondary (Canto, 2018; Agarwal 2022). The systematic review by Papaefthymiou (2024) included CED from CBA as a subgroup analysis.
Canto (2020) reported CED among 76% (91/120) of people at 1-year using ITT analysis. Adjusting for key baseline differences (age, sex, BE length, and dysplasia grade), the probability of achieving CED increased to 96% (95% CI 90%–100%). No statistically significant difference in CED was observed according to baseline dysplasia (P=0.42).
Dbouk (2022) reported CED among 94.6% (54/56) of people at 1-year using PP analysis. Interim retreatment was allowed. CED reached 100% at 2-year and was maintained across 3-year (45/45) and 4-year (37/37) follow-up. A SA included those lost-to-follow-up as treatment failures. Under this, CED decreased to 96% (45/47) at 3-year, and 97% (37/38) at 4-year. One person had recurrent dysplasia at 21.7 months. This was retreated and CED was achieved by next follow-up. This resulted in a recurrence rate of 0.59 per 100 person-years. No statistically significant relationship was found between BE length and dysplasia recurrence (P=0.6).
Alshelleh (2021) reported CED among 95.6% (44/46) of people who had CBA at 18-months (retrospective analysis). This was compared to 96% (24/25) of people who had cryospray. No statistically significant difference in CED was found between CBA or cryospray (P=0.94). The differences were also not statistically significant when only considering LGD (P=0.55) or HGD (P=0.44).
Canto (2018) reported CED among 95% (39/41) of people at 1-year using ITT analysis. Among people with ultra-long BE (8 cm or more), 67% (4/6) achieved CED. This was compared to 100% (35/35) among people with shorter BE lengths. Those with longer BE were statistically significantly less likely to achieve CED (P=0.02). No statistically significant difference was found based on previous EMR status (P=0.11).
Agarwal (2022) reported that 41 of 85 people (% not reported) who had CBA achieved CRD at 1-year follow-up. This increased to 73 of 85 at 2-year follow-up. This was compared to 106 of 226 and 177 of 226 among those who had RFA. The difference in CRD among those who had CBA and RFA was not statistically significant (P=0.46). Across all people, those with longer BE were statistically significantly less likely to achieve CRD (P=0.01). Propensity score–matched analysis using 1:1 matching (85 CBA and 85 RFA cases) was performed, revealing comparable results for achieving CRD (CBA vs RFA: HR, 1.19; 95% CI, .82-1.73; P=0.36) with both ablation modalities.
Papaefthymiou reported that among 9 studies, CBA was achieved in 94% of people (CI: 89.4–98.6). A non-significant (p = 0.076), although moderate (I2 = 56.5%), heterogeneity was reported among the CBA group compared to the spray catheter group. Among the 11 spray catheter studies, CED was achieved among 80.2% of people (95%CI: 73.3–87.1), with high heterogeneity noted across studies [80.2% (95%CI: 73.3–87.1; I 2 = 86.8%, p < 0.001).
Complete eradication/remission of internal metaplasia
CE-IM or CRIM was reported in 6 studies. CE-IM or CRIM was listed as a primary outcome in 4 studies (Canto 2018, Alshelleh 2021, Agarwal 2022, Dbouk 2022), and secondary in 1 (Canto, 2020). The systematic review by Papaefthymiou (2024) reported CE-IM from CBA as a subgroup analysis.
Canto (2018) reported that 88% (35/41) of people achieved CE-IM at 1-year. No statistically significant difference in achieving CE-IM was found for people with (86%) or without (89%) prior endoscopic ablation (P=1.0). No statistically significant difference in CE-IM was found for those with ultra-long (83%) or shorter BE (88%) (P=0.57).
Alshelleh (2021) reported that 84.8% (39/46) of people who had CBA achieved CRIM at 1-year. This was compared to 90% (20/25) of people who had cryospray. No statistically significant difference in CRIM was found between the CBA or cryospray group (P=0.61). The difference was also not statistically significant for either LGD (P=0.67) or HGD (P=0.72).
Agarwal (2022) reported that 25.2% of 85 people achieved CRIM following CBA at 1-year, increasing to 69.8% at 2-year follow-up (number not reported).This was compared to 20.1% from 221 with RFA at 1-year follow-up, and 57.3% at 2-year follow-up. No statistically significant difference in CRIM was found between the CBA or RFA groups (P=0.50). Longer BE length was statistically significantly associated with a decreased chance of CRIM (P=<0.01). Prior endoscopic resection was statistically significantly associated with an increased chance of CRIM (P=0.01). Propensity score–matched analysis using 1:1 matching (85 CBA and 85 RFA cases) was performed, revealing comparable results for achieving CRIM (CBA vs RFA: HR, 1.24; 95% CI, .79-1.96; P =0.35) with both ablation modalities.
Dbouk (2022) reported that 75% (42/56) of people achieved CE-IM at 1-year follow-up. CE-IM increased to 98% (47/48) at 2-year, 98% (40/41) at 3-year, and 97% (32/33) at 4-year follow-up. No statistically significant difference in CE-IM was found after stratifying by baseline dysplasia grade (P value not reported). Recurrent IM was found in 14.6% (7/48) of people after a median of 20.7 months. The IM recurrence rate was 5 in every 100 person-years. CE-IM was maintained in 89% of people at 2-year follow-up, and 86% at 3-year follow-up. No statistically significant association was found between IM recurrence and BE length (P=0.8). No statistically significant associations were found for age, gender, race, BMI, baseline dysplasia, or BE length (P value not reported).
Canto (2020) reported that 72% (86/120) achieved CE-IM at 1-year using ITT analysis. CE-IM was 91% (86/94) under PP analysis. No statistically significant difference in CE-IM was found according to baseline dysplasia grade (P=0.61).
Papaefthymiou reported that among 9 studies, CE-IM was achieved in 87.2% of people (CI: 80.3–94.2). CE-IM was achieved among 52.7% of people among the 11 studies on spray catheters. It was noted that heterogeneity remained high across both CBA and spray catheter subgroups (percentages not reported).
Disease progression
Disease progression was reported as a secondary outcome in 3 studies (Canto, 2018; Canto 2020; Dbouk, 2022).
Canto (2018) reported 0% (0/41) progression from baseline dysplasia to oesophageal cancer at 1 year.
Canto (2020) reported that 1 patient (1/120, 0.8%) with Prague C5M6 progressed from HGD to ImCA during treatment. The patient received 3 CBA treatments. No ImCA was presented at 1-year, but 1 of 3 modules resected at 15 months showed ImCA. No residual or buried BE was found at 2-year follow-up. No other people had progression over the study period.
Dbouk (2022) reported 0% (0/59) progression of baseline disease among any patient during treatment. No new ImCA or dysplasia were noted over the 4-year follow-up.
BE surface regression
BE surface regression was reported as a primary outcome in 2 studies (van Munster, 2018; Frederiks, 2022). Both measured median regression percentage after a single CBA treatment using images/videos of the treatment area. Van Munster (2018) used 2 experts to independently rank regression, and Frederiks (2022) used 3.
Van Munster (2018) reported a median regression of 88% (IQR 63-94%) at 3 months among 20 people who had CBA. This was compared with a median regression of 90% (IQR 77-94%) among 26 people who had RFA. No statistically significant difference in median regression was found between CBA and RFA (P=0.62). Regression scoring was similar between experts with a median difference of 10% (IQR 5-20%). Regression scores for 2 people were excluded from analysis due to low image quality.
Frederiks (2022) reported a median regression of 80% (95% CI 75-90%) at 12 weeks for the 10-second CBA group. This was compared with a median regression of 80% (95% CI, 66-90%) among the 8-second CBA group. In total, 8 people had regression below 50%. This included 5 (5/27) from the 8-second group, and 3 (3/27) from the 10-second group. No statistically significant difference in median surface regression was found between the 8-second and 10-second groups (P=0.65). Regression scoring was similar between experts, with less than 30% difference in 66% (35/53) of images.
Conversion to neo-squamous epithelium
Conversion to neo-squamous was reported as a secondary outcome in 1 study (Schölvinck, 2015).
Schölvinck (2015) reported that 100% (18/18) of people in the 10-second cohort had full conversion to neo-squamous epithelium at 8-weeks. This was compared to 82% (23/28) for the 8-second group, and 60% (6/10) for the 6-second group. Increasing ablation duration was statistically significantly associated with conversion to neo-squamous epithelium (P=0.04).
Technical success
Technical success was reported in 3 studies (Frederiks 2022, Canto 2018, Canto 2020). This refers to the treatment of all visible BE as intended. Schölvinck (2015) reported number of ablations successfully performed.
Frederiks (2022) reported a technical success rate of 96% (26/27) for the 10-second CBA group. This is compared to 100% (27/27) for the 8-second CBA group. No statistically significant difference in technical success was found between the 8-second and 10-second CBA groups (P=1.0).
Canto (2018) reported a technical success rate of 98% (115/117 procedures). Details were provided. Balloon migration from pre-existing strictures caused 100% (2/2) of failures.
Canto (2020) reported a technical success rate of 96% (290/303 procedures). Details were provided. Device failure caused 69.2% (9/13), mucosal injury caused 7.7% (1/13), and balloon positioning caused 23.1% (3/13) of failures.
Schölvinck (2015) reported number of ablations successfully performed, 56/62 procedures (90.3%). Of the 6 ablations that were not successfully performed, these were attributed to device malfunction (3/6, 50%), stenosis in treatment area (1/6, 16.7%), proximity to oesophageal junction (1/6, 16.7%), and accidental CBA in squamous mucosa (1/6, 16.7%).
Treatment failure
Treatment failure was reported in 4 studies (Schölvinck 2015, Canto 2018, Canto 2020, Dbouk 2022).
Canto (2018) defined treatment failure as any patient requiring intervening alternative ablative or surgical treatment for residual BE. At 1-year follow up, 5% (2/41) of people had treatment failure. Both people had ultra-long BE, measuring 8 cm or more.
Canto (2020) did not explicitly define treatment failure. During the first or subsequent procedure, 2.5% (3/120) of people had RFA and were considered treatment failures. This was due to technical difficulties (details provided in the safety section). At 1-year follow-up, 1.6% (2/120) of peoples BE did not respond to treatment. Both had HGD. One had persistent dysplasia, despite 3 CBA and 2 EMR treatments. One achieved CE-IM at 9-months but had buried HGD at 12-months.
Schölvinck (2015) did not explicitly define treatment failure. At 8-weeks, 0% (0/18) of people in the 10-second group had 'no conversion' (less than 20%). This compared to 30% (3/10) in the 6-second and 7% (2/28) in the 8-second groups.
Of the 5 treatment areas considered failures, 40% (2/5) were not biopsied (the study noted that the CBA treatment failed at first attempt for 2 people, so this could be why biopsies were not taken for 2 people), 20% (1/5) contained no squamous epithelium, and 40% (2/5) contained mixed squamous and BE.
Dbouk (2022) defined treatment failure as any recurrence needing retreatment across the 4-year follow-up. Of the 53 people who achieved CED, 1 had recurrent LGD after 21.7 months. The dysplasia recurrence rate was 0.59 per 100 person-years. Of the 48 people who achieved CE-IM, 7 had recurrent IM after a median of 20.7 months. The IM recurrence rate was 5 per 100 person years.
Safety
Pain
Post-procedural pain was reported in 4 studies. Pain was listed as a primary outcome in 1 (van Munster, 2018), and a secondary outcome in 3 (Schölvinck 2015, Canto 2018, Canto 2020). All used a 0-10 rating scale, ranging from no pain (0) to most severe (10).
Van Munster (2018) reported a median cumulative pain score of 4 (IQR 0-16) for 20 people in the CBA group, across 14-days post-procedure. This was compared to a median score of 16 (IQR 14-44) among 26 people in the RFA group. Several secondary pain outcomes were available. Cumulative pain was statistically significantly less among the CBA group compared with the RFA group (P=0.01). The median duration of pain was 5.7 days (SD 1.1 day) for the CBA group, compared to 11.1 days (SD 1 day) for the RFA group. The duration of pain was statistically significantly shorter for the CBA group compared with the RFA group (P=<0.01). The peak pain score was 2 (IQR 2-4) for the CBA group, compared with 4 (IQR 3-7) for the RFA group. Peak pain was statistically significant lower after CBA compared with RFA (P=<0.01).
Schölvinck (2015) reported that 27% (10/37) of people had pain immediately post-procedure. The median immediate post-procedure pain score was 0 (IQR 0-2). This increased to 2.5 (IQR 2-3) if only including those who reported some pain. Pain was reported by 14% (5/37) of people 2-days post-procedure. This included a median pain score of 4 (IQR 3-6) in the treatment area, and 4 (IQR 2-5) when swallowing.
Canto (2018) reported that 27% (11/41) of people had pain requiring analgesics immediately post-procedure, with a median pain score of 1 (IQR 0-3). Among people with ultra-long BE (8 cm or more), 67% (4/6) reported pain, with a median score of 3.5 (IQR 2-8). Among those with shorter BE lengths, 20% (7/35) reported pain with a median score of 1 (IQR 0-3). Immediate post-procedural pain was statistically significantly higher for those with ultra-long BE (P=0.04). Pain was reported by 4.9% (2/41) of all people 1-day post-procedure, with a median score of 0 (IQR 0-2). No statistically significant difference was found according to BE length 1 day post-procedure. No people reported any pain on days 7 or 30 post-procedure.
Canto (2020) reported a median immediate post-procedure pain score of 2 (IQR 0-5) among 120 people. This decreased to 1 (IQR 0-2) at 1 day post-procedure. No people reported any pain on day 7 post-procedure.
Adverse events
AE were measured and included across all studies. Schölvinck (2015) included AE as a primary outcome. All other studies included AE as a secondary outcome (Canto, 2018; van Munster, 2018; Canto 2020; Alshelleh, 2021; Agarwal 2022; Dbouk, 2022; Frederiks, 2022).
Procedural AE rates were reported in 2 studies. Schölvinck (2015) reported that 15% (6/39) of people had an AE during the procedure. Canto (2020) reported that 1 person had a device related AE during the procedure.
AE rates (excluding SAE) among people during follow-up were reported for 8 studies. Schölvinck (2015) reported 0% (0/42) during 8-weeks. Canto (2018) reported 24% (10/41) during 1-year. Van Munster (2018) reported 0% (0/26) during 3-months. Canto (2020) reported 12.5% (15/120) during 1-year. Aslhelleh (2021) reported 8.7% (4/46) during 18-months. Agarwall (2022) reported 10.6% (9/85) over 2-year follow-up. Dbouk (2022) reported 10.2% (6/59) over 4-year follow-up. Further details are provided in relevant sub-sections. Papaefthymiou (2024) reported AE rates for subgroups of 9 CBA studies and 11 spray catheter studies. AE rates were similar at 15.8% (95%CI: 11.6–19.9) for the CBA subgroup and 12.1% (95%CI: 5.9–18.3) for the spray catheter. However, only the CBA subgroup achieved low heterogeneity (I2 = 24.97%, p = 0.22).
SAE rates during follow-up were reported in 4 studies. Some reported the data as SAEs for individuals undergoing the procedure, or SAEs associated with the procedures overall. Canto (2018) reported 2.4% (1/41) SAEs for people, and 0.9% (1/117) for procedures during 1-year. Canto (2020) reported 1% (3/303) for procedures during 1-year. Dbouk (2022) reported 1.7% (1/59) for people over 4-year follow-up. Frederiks (2022) reported 2 SEA over 12-weeks (proportion not provided).
Oesophageal strictures or narrowing
Stricture formation was reported in 8 studies (Schölvinck, 2015; Canto 2018; Canto 2020; Alshelleh 2021; Agarwall 2022; Dbouk 2022; Frederiks 2022; Papaefthymiou, 2024). Oesophageal stenosis was reported in 1 (van Munster, 2018).
Schölvinck (2015) reported strictures among 0% (0/39) of people over 8-weeks follow up.
Canto (2018) reported strictures among 9.8% (4/41) of people over 1-year follow up. Half (50%, 2/4) happened in people who were treatment naïve prior to CBA. These reported relevant symptoms (dysphagia) 5 and 10 weeks after CBA. Half (50%, 2/4) happened in people with previous strictures. These reported symptoms 2 and 4 days after CBA. All strictures were successfully treated using a median of 1 dilation (range 1-3).
Canto (2020) reported strictures among 12.5% (15/120) of people over 1-year follow-up. These developed after a median of 39 days (IQR 31–45). All strictures were treated using a median of 1 dilation (IQR 1-2). Previous EMR was reported in 47% (7/15) of people with strictures. No statistically significant difference in stricture rate was found among those with or without previous EMR (P=1.0). BE length was the only statistically significant predictor of strictures (P=0.04).
Alshelleh (2021) reported strictures among 8.7% (4/46) of people who had CBA over 18 months. This was compared to 12% (3/25) in the cryospray group. No statistically significant difference in stricture rate was found between CBA or cryospray (P=0.65). Differences in stricture rates were also not statistically significant when comparing those with LGD (P=0.39) or HGD (P=0.42).
Agarwall (2021) reported strictures among 10.6% (9/85) of people who had CBA over the median 2-year follow-up (IQR 1.3-2.5). This was compared to 4.4% (10/226) in the RFA group over the median 1.5-year follow-up (IQR 0.8-2.5). Statistically significantly more people who had CBA developed strictures compared to those who had RFA (P=.04). All strictures were successfully treated in both groups.
Dbouk (2022) reported strictures among 8.5% (5/59, 95% CI 2.8%-18.7%) of people within 4 months post treatment. These developed after a median of 2 months. All developed within 4 months of CBA. People with BE of 8 cm or more developed statistically significant more strictures than those with shorter lengths (28.6% versus 2.2%, P=0.01). Prior EMR (P=0.15) and baseline dysplasia (P=1) were not statistically significantly associated with stricture development.
Frederiks (2022) reported strictures among 19% (5/27) of people who had 10-second CBA over 12 weeks. This was compared to 15% (4/26) of people who had 8-second CBA. No people with 8-second CBA had severe strictures, compared with 7% (2/27) in among the 10-second group. No statistically significant difference in strictures (P=1) or severe strictures (P=0.44) was found between groups. Strictures were treated with a median of 1 dilation (range 1-8) in the 10-second group. This compared to 2 dilations (range 1-3) in the 8-second group. No statistically significant difference in the number of dilations was found between groups (P=0.78).
The systematic review and meta-analysis by Papaefthymiou (2024) reported similar rates of stricture development among the CBA and spray catheter subgroups. Of people in the CBA group, 6% (95%CI: 2.9–9.2) developed strictures, compared with 7.5% (95%CI: 3.0–11.9) for the spray catheter subgroup. Only the CBA group yielded non-significant heterogeneity (I2 = 53.7%) across studies.
Van Munster (2018) reported stenosis among 0% (0/20) of people during 3 months follow up post procedure. This was compared to 8% (2/26) in the RFA group. No statistically significant difference in stenosis was found between CBA and RFA (P=0.21).
Oesophageal perforation
Oesophageal perforation was reported in 4 studies (Schölvinck, 2015; Canto, 2018; Canto, 2020; Agarwal, 2022).
Schölvinck (2015) reported within their discussion that there was an 'absence of major bleeding or perforations' within their study population.
Canto (2018) reported no perforations (0%, 0/42) over 1-year follow-up.
Canto (2020) reported no perforations related to balloon inflation over 1-year follow-up. Perforation related to stricture dilation was reported among 1 (0.8%, 1/120) person. This was treated with oesophageal stent and the person made a full recovery.
Agarwal (2022) reported perforation among 0% (0/85) of people who had CBA and 0% (0/226) who had RFA over 2 years.
Bleeding
Bleeding was reported in 5 studies (Canto, 2018; Canto, 2020; Agarwall, 2022; Dbouk 2022; Frederiks 2022).
Canto (2018) reported an upper GI bleed among 1 person (2.4%, 1/41) over 1 year. This happened 7 days post CBA and related to a gastroesophageal junction ulcer associated with aspirin use that did not require therapy. Dbouk (2022) also reported moderate-grade bleeding in 1 person (1.7%, 1/59) over 1 year follow-up. Both reported bleeds refer to the same AE due to an overlap of 22 people between studies.
Canto (2020) reported upper GI bleed among 1 person (1/120, 0.8%) over 1 year. This happened 1 week post CBA and related to ongoing clopidogrel use. Treatment was not required.
Agarwall (2022) reported no clinically significant bleeding among people who had CBA (0%, 0/85) or RFA (0%, 0/226) over 2 years. Frederiks (2022) also note that no cases of bleeding occurred during the Euro-Coldplay study, confirmed by correspondence with the key authors.
Dysphagia
Dysphagia was reported in 4 studies (Canto, 2018; van Munster, 2018; Canto 2020; Frederiks, 2022).
Canto (2018) reported mild dysphagia from stenosis requiring dilation among 9.7% (4/41) of people over 1-year follow-up. This included 2 treatment naïve people who reported dysphagia 5- and 10-weeks post CBA. Dilation occurred at 3months for treatment.
Van Munster (2018) reported that dysphagia scores post treatment were statistically significantly lower among people who had CBA compared with RFA (P=<0.01).
Canto (2020) reported dysphagia among 12.5% (15/120) of people over 1-year. All had symptomatic oesophageal strictures requiring dilation. Dysphagia developed within 30 days for 60% (9/15), and after 30 days in 40% (6/15).
Device malfunction
Device malfunction/failure was reported in 3 studies (Schölvinck 2015, Canto 2020, Frederiks 2022).
Schölvinck (2015) reported 3 records of device malfunction. An error signal appeared on balloon inflation in 2 instances. The balloon did not make proper contact with the oesophageal wall in 1 instance. The reported device malfunctions caused 50% (3/6) of procedure failures.
Canto (2020) reported 9 instances of device-related failure. Furthers details were not provided. The device-related failure caused 69.2% (9/13) of procedure failures.
Frederiks (2022) reported 2 instances of device malfunction which required a switch from CBA to RFA. Further details were not provided, and these were excluded from the per-protocol analysis. An additional 8 device malfunctions were reported. This included 26% (7/27) of people in the 8-second group, compared with 4% (1/27) in the 10-second group. The difference between groups was statistically significant (P=0.05). Malfunctions happened either during the procedure (5/8) or set-up (3/7). Further details were not provided. All procedures were completed successfully following replacement of a CBA component.
Medication use
Analgesic medication use was reported in 5 studies (Schölvinck, 2015; Canto, 2018; van Munster, 2018; Canto, 2020; Frederiks, 2022).
Schölvinck (2015) reported no pain medication use (0%, 0/37) immediately post-procedure. After a median of 2-days, 8% (3/37) of people used additional pain medication. This included non-steroidal anti-inflammatory drugs (2/3) and acetaminophen (1/3).
Canto (2018) reported that 27% (11/41) of people required pain medication immediately post-procedure. This decreased to 4.9% (2/41) 1 day post-procedure, with none (0%, 0/41) required on either day 7 or 30. Immediately post-procedure, 67% (4.6) of people with long BE (8 cm or more) required medication compared to 20% (7/35) of those with shorter lengths. The difference in immediate post-procedure medication use was statistically significant (P=0.04).
Van Munster (2018) reported that people who had CBA used statistically significantly less pain medication than those who had RFA (P value not reported). Average use lasted 2.6 days (SD 0.7) for those who had CBA. This was compared to 6.3 days (SD 1.0) for those who had RFA. The difference in length of use was statistically significant (P=0.01). Medications used included paracetamol (10%, 2/20) and non-steroidal anti-inflammatory drugs (15%, 3/20). Types of medication used were not statistically significant between groups (P=0.09).
Canto (2020) reported that 8% of 120 people required pain medication immediately post-procedure. This reduced to 1.7% at day 1 post post-procedure, and 0.3% by day 7. More people required pain medication after their initial treatment, where 13% used analgesics.
Frederiks (2022) compared medication use for 14 days post-procedure among people receiving 10-second and 8-second CBA. The exact figures for analgesics use are not provided. Differences are presented graphically. Differences in medication use between people receiving 8-second and 10-second CBA were not found to be statistically significant (P=0.36).
Anecdotal and theoretical adverse events
Expert advice was sought from consultants who have been nominated or ratified by their professional society or royal college. They were asked if they knew of any other adverse events for this procedure that they had heard about (anecdotal), which were not reported in the literature. They were also asked if they thought there were other adverse events that might possibly occur, even if they had never happened (theoretical).
When this procedure was assessed previously, 1 specialist adviser noted device failure as an anecdotal adverse event, and considered nitrous oxide leakage from ruptured balloon was a theoretical adverse event.
Four professional expert questionnaires were submitted for this assessment. Three noted perforations as a theoretical adverse event, and pain and strictures as anecdotal events. Find full details of what the professional experts said about the procedure in the https://www.nice.org.uk/guidance/indevelopment/gid-ipg10413/documents.
Validity and generalisability
Total sample size ranged from 39 (Schölvinck, 2015) to 311 (Agarwal, 2022). The number of people who had CBA ranged from 20 (van Munster, 2018) to 120 (Canto, 2020).
Follow-up ranged from 8 weeks (Schölvinck, 2015) to 4 years (Dbouk, 2022) across studies. Most follow-up covered at least 12-months (Canto, 2018; Canto, 2020; Alshelleh, 2021; Agarwal, 2022; Dbouk, 2022).
One study did not disclose the location (Dbouk, 2022). Of those providing location details, none included the UK. Included study centres were either in the US (Schölvinck, 2015; Canto, 2018; Canto, 2020; Alshelleh, 2021; Agarwal, 2022), or the Netherlands (N=3) (Schölvinck, 2015; van Munster, 2018; Frederiks, 2022).
All studies were observational and did not include random assignment. Adjustments for potential confounders (i.e., age, gender, BE length, etc.) at least in part of the analyses were noted in 4 studies (van Munster, 2018; Canto, 2020; Agarwal, 2022; Frederiks 2022). Agarwal (2022) used propensity and score matching to minimise bias which may result from non-randomisation.
All studies used the cryoballoon focal ablation system (Pentax Medical, Montvale, New Jersey, United States). The pear shaped cryoballoon was explicitly noted as an available alternative to the standard focal balloon in 3 studies (Canto, 2020; Agarwal, 2022; Dbouk, 2022). Canto (2020) noted no technical difficulties after the pear-shaped balloon was made available.
All studies used a 10-second CBA duration as standard. This enables some comparability across studies. Two studies also measured effects using an 8-second duration (Schölvinck, 2015; Frederiks, 2022), with 1 study including an additional 6-second duration (Schölvinck, 2015). Frederiks (2022) had initially planned to only include 10-second duration, but due to an unexpected high stricture rate compared with the literature, the initial 10-second dose was lowered to 8 seconds to improve safety while preserving efficacy.
Alshelleh (2021) and Papaefthymiou (2024) did not include procedure details. All other studies provided at least some detail on procedure technique. Where reported, techniques were similar. Only slight variation was noted for sedation, due to site-specific policies.
Papaefthymiou (2024) only included CBA as a subgroup within the systematic review and meta-analysis. Therefore, details on the combined sample are very limited, which may limit the ability to generalise the findings to wider populations.
Three studies with shorter follow-up did not offer retreatment (Schölvinck, 2015; van Munster, 2018; Frederiks, 2022), and Alshelleh (2021) did not specify whether retreatment was offered. Retreatment was offered every 10-12 weeks, if required, in 4 studies (Canto, 2018; Canto, 2020; Agarwal, 2021; Dbouk, 2022). Of these, 2 allowed retreatment (Canto, 2018, Canto, 2020), and 2 classed people who had retreatment as treatment failures (Agarwal, 2021; Dbouk, 2022).
All studies reported at least some industry funding or involvement from the device manufacturer (Pentax Medical). All included authors who had financial ties with the manufacturer. Canto (2020) received a research grant from the manufacturer.
Inclusion criteria varied. All studies included people with LGD or HGD, with 5 also including people with ImCA (Schölvinck, 2015; Canto, 2018; Canto, 2020; Agarwal, 2022; Dbouk, 2022). Two studies included both previously ablated and treatment naïve people (van Munster, 2018; Canto; 2018). Six studies included only treatment naïve people (Schölvinck, 2015; Canto, 2020; Alshelleh, 2021; Agarwal, 2022; Dbouk, 2022; Frederiks, 2022).Schölvinck (2015) only included BE islands.
A range of BE lengths were represented within the inclusion criteria. No BE length was specified in 3 studies (van Munster, 2018; Aslhelleh, 2021; Dbouk, 2022). BE lengths of 6 cm or less were specified in 2 studies (Canto, 2020; Agarwal, 2022). BE lengths of 1 cm or more were specified in 2 studies (Schölvinck, 2015; Canto, 2018).
Participant overlap was explicitly reported among 22 people, included in both Canto (2018) and Dbouk (2022). There is potential for overlap among other studies, particularly the retrospective studies.
Pain outcomes were measured across variable time frames. For instance, Canto (2020) and van Munster (2018) measured pain up to 7-days post procedure, Frederiks (2022) measured pain up to 14 days post-procedure, Canto (2018) measured pain up to 30-days, and Schölvinck (2015) measured pain over the entire 3-month follow-up period. Variability in time frames presents a difficulty when trying to draw comparison between studies.
Definitions of treatment failure varied across studies. Canto (2018) and Schölvinck (2015) did not explicitly define treatment failure. Canto (2018) defined treatment failure as any requirement for alternative ablative therapies. Whereas Dbouk (2022) defined treatment failure as any requirement for re-treatment. This variability presents difficulty with trying to compare outcomes between studies.
BE length was most commonly found to be statistically significant across the various safety and efficacy outcomes. BE length has 5 reports of significance across 4 outcomes. However, conflict was identified across studies, with 3 instances of non-significance across 2 outcomes for BE length.
All included studies reported that CBA was safe and effective. The 3 studies which compared CBA with alternative ablation modalities all reported equal effect (van Muster, 2018; Alshelleh 2021; Agarwal, 2022). The 5 studies which evaluated CBA exclusively all reported that the procedure is safe and effective, with no contention across studies (Schölvinck, 2015; Canto, 2018; Canto 2020; Alshelleh, 2021; Dbouk, 2022; Frederiks, 2022).
Ongoing trials
C2 CryoBalloon™ 180 Ablation System Dose De-escalation Study. NCT03311451. N=30. Netherlands. Expected completion: December 2025.
Nitrous Oxide For Endoscopic Ablation of Refractory Barrett's Esophagus (NO FEAR-BE) (NO FEAR-BE). NCT03554356. N=70. United States. Expected completion: December 2026.
How are you taking part in this consultation?
You will not be able to change how you comment later.
You must be signed in to answer questions