The company's submission presented 2 anchored matching-adjusted indirect comparisons (MAICs): one to compare serplulimab with atezolizumab and another to compare serplulimab with durvalumab. Baseline characteristics of the ASTRUM-005 intention-to-treat population were adjusted to IMpower133 or CASPIAN data, before applying Cox proportional hazards regressions to estimate the relative efficacy between serplulimab and either atezolizumab or durvalumab. Overall, the ITCs suggested that serplulimab improves PFS and OS compared with either atezolizumab or durvalumab, with or without adjustment of baseline variables. The EAG highlighted that a limited number of characteristics was included in each ITC. The EAG noted that some characteristics, such as race and previous cancer treatment, were notably different between the trials but were not adjusted for in the base-case MAICs. The company explained that adjusting for these characteristics would result in excessively low effective sample sizes, leading to unreliable outcomes. Also, previous cancer treatment was not reported in CASPIAN. The EAG acknowledged this but noted that the uncertainty remains. At the clarification stage, the EAG requested that the company provide a multilevel network meta-regression to address the uncertainties around between-study variations because it would allow more flexibility to generate population-adjusted outcomes. But the company asserted that the MAICs were more suitable and addressed between-trial differences through the matching and reweighting of baseline data. The EAG acknowledged that the multilevel network meta-regression would also be uncertain because of the limited population overlap identified by the MAICs, but it maintained that this approach would have been useful to explore. The committee was concerned that the MAICs did not offer a robust approach for decision making in this evaluation. It said it was not appropriate to compare hazard ratios across 2 different matched populations. It also recalled the uncertainty in the generalisability of the trial populations to the NHS (see section 3.4) and that the MAICs reflected the trial populations of IMpower133 and CASPIAN. The committee noted that the results of unmatched Bucher ITCs and the MAICs were similar, implying that the treatment effect modification of the adjusted variables was not very strong. The company agreed and said it would also expect to see similar results if another adjustment method was used, such as the multilevel network meta-regression. The committee noted a multilevel network meta-regression would not address all the uncertainty around the between-study differences but would allow for comparisons to be made against the comparators in 1 population. The committee agreed that the Bucher ITCs were the best available evidence in the submission for comparing serplulimab with atezolizumab or durvalumab. But because these were highly uncertain, the committee requested to see a network meta-analysis (with time-varying hazard ratios; see section 3.9) that would allow for the relative effectiveness of serplulimab to both atezolizumab and durvalumab to be considered.
In response to the first consultation, the company produced fixed-effects fractional polynomial network meta-analyses (FP NMAs). It noted that FP NMAs offer a flexible, time-varying method that allow hazard rates to change over time, so could address potential violations of the proportional hazards assumption. The company fitted first- and second-order FP NMA models. It noted that the first-order models showed limited fit to the observed survival data and the second-order models had convergence issues and wide credible intervals which limited the reliability of the relative effect estimates. So, the company thought that the MAICs remained the most appropriate ITC approach. The EAG highlighted that if the populations across trials are assumed to be comparable when considering effect modifiers (see section 3.4), then all the ITCs should produce similar results. It also explained that when there are multiple comparators being evaluated, other ITC methods, for example, Bucher or multi-level network meta-analyses, are preferred to MAICs. This is because they either assume a similar population, or they can adjust to a common population. Considering this, the EAG stated that the Bucher ITC is a suitable method to use. The EAG agreed with the company that there were limitations with the FP NMAs and uncertainties in the assessment of proportional hazards, noting that the assumption did not hold in the CASPIAN PFS data. Accounting for these factors, the EAG selected the Bucher ITCs as the most appropriate approach for the PFS and OS hazard ratio estimates for atezolizumab, and OS hazard ratio estimate for durvalumab. The EAG chose the best-fitting second-order FP NMA model for the relative PFS estimate for durvalumab, because this model did not rely on the proportional hazards assumptions and had a better statistical fit than the first order FP NMA models.
The committee reiterated its concerns with the MAICs and considered the EAG's preferred ITCs. The committee questioned the EAG's use of 2 different models to estimate the relative PFS treatment effect for atezolizumab and durvalumab. It shared concerns about the internal validity of this, noting that each method has different underlying assumptions and that using results from different analyses means that correlations estimated in the FP NMA are not taken into account. The committee also thought that the FP NMA, although a more flexible approach, did not fit the data better than the ITCs provided at the first meeting. It noted the company's and EAG's concerns around the proportional hazards assumption and that it did not hold in the CASPIAN PFS data. So, for the durvalumab arm in particular, the committee noted that none of the ITCs fit the data well. But, the committee recalled that durvalumab is used by less than 4% of people with untreated ES‑SCLC in the NHS and that atezolizumab is the most relevant comparator in this evaluation (see section 3.2). So, the consequences of the decision risk associated with the comparison with durvalumab were less than for the comparison with atezolizumab. The committee stated that there were limitations with all of the ITCs presented and that more complex approaches such as MAICs and FP NMAs did not reduce this uncertainty. It concluded that the Bucher ITCs were highly uncertain, but remained the best available evidence presented for comparing serplulimab with atezolizumab.