본문으로 건너뛰기
← 뒤로

Relative Clinical Efficacy and Safety of Second- or Later-Line Treatments for Advanced and Metastatic Gastric Cancer: A Rapid Review and Network Meta-Analysis.

메타분석 1/5 보강
Journal of gastrointestinal cancer 📖 저널 OA 26% 2024: 1/16 OA 2025: 25/91 OA 2026: 21/74 OA 2024~2026 2026 Vol.57(1) p. 36
Retraction 확인
출처

Sharma S, McConnell D, Carey N, O'Sullivan J, Kearns P, Lowery M, McCullagh L

📝 환자 설명용 한 줄

[OBJECTIVE] To identify randomised control trials (RCTs) of treatments (recommended by the National Comprehensive Cancer Network (NCCN), the European Society for Medical Oncology (ESMO) Clinical Pract

🔬 핵심 임상 통계 (초록에서 자동 추출 — 원문 검증 권장)
  • 연구 설계 systematic review

이 논문을 인용하기

↓ .bib ↓ .ris
APA Sharma S, McConnell D, et al. (2026). Relative Clinical Efficacy and Safety of Second- or Later-Line Treatments for Advanced and Metastatic Gastric Cancer: A Rapid Review and Network Meta-Analysis.. Journal of gastrointestinal cancer, 57(1), 36. https://doi.org/10.1007/s12029-026-01407-z
MLA Sharma S, et al.. "Relative Clinical Efficacy and Safety of Second- or Later-Line Treatments for Advanced and Metastatic Gastric Cancer: A Rapid Review and Network Meta-Analysis.." Journal of gastrointestinal cancer, vol. 57, no. 1, 2026, pp. 36.
PMID 41649630 ↗

Abstract

[OBJECTIVE] To identify randomised control trials (RCTs) of treatments (recommended by the National Comprehensive Cancer Network (NCCN), the European Society for Medical Oncology (ESMO) Clinical Practice Guidelines and clinical expertise) for the second- or later-line treatment of advanced/metastatic gastric cancer. To determine the relative efficacy and safety of the treatments.

[METHODS] RCTs were identified from a Rapid Literature Review and a published systematic review. Identified RCTs were subject to data-extraction and narrative review. Eligible RCTs were included in evidence networks to determine relative efficacy and safety of the treatments.

[RESULTS] In total, 44 RCTs (pertaining to eleven treatments), were identified for data-extraction and narrative review; 37 in the second-line setting, five in the second- and later-line setting and two in the third- and later-line setting. Evidence networks were feasible for the second-line treatments only. No statistically significant differences, across treatments, for key efficacy outcomes (overall-survival, progression-free survival), and additional outcome (objective-response rate) were identified. Pembrolizumab was associated with a statistically significant decreased risk of Grade ≥ 3 treatment-related adverse effects versus paclitaxel; no other significant differences, across treatments, were identified for this outcome.

[CONCLUSION] The appreciable number of RCTs identified indicates that the treatment landscape here is rapidly evolving. The introduction of novel treatments, in the second-line setting, has not had a statistically significant impact on key efficacy outcomes, and has had little impact on safety outcomes, versus more established treatments. There remains a need for novel treatments that will have a significant benefit on efficacy and safety outcomes.

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

같은 제1저자의 인용 많은 논문 (5)

📖 전문 본문 읽기 PMC JATS · ~159 KB · 영문

Introduction

Introduction
According to GLOBOCAN estimates, in 2022, gastric cancer ranked fifth, globally, in incidence rate and was the fifth most frequent cause of cancer-related mortality. At that time, over one million individuals, across 185 countries, had advanced or metastatic gastric cancer [1]. The estimated 5-year relative survival rates for localised, regional, and distant or metastatic gastric cancers in the US, for 2014 to 2020, were 75%, 36% and 7%, respectively [2]. While curative surgical or endoscopic resection are the cornerstone treatments for early stages of gastric cancer, systemic treatments are the standard of care in the advanced setting [3].
Recent advances in molecular profiling of gastric adenocarcinomas have identified key subtypes. These include Epstein Barr virus (EBV) positive, microsatellite instability-high (MSI-H), genomically stable, and tumours with chromosomal instability (CIN) [4]. CIN subtypes show copy number variations in key oncogenes such as epidermal growth factor receptor (EGFR), human epidermal growth factor receptor 2 (HER-2), fibroblast growth factor receptor 2 (FGFR2) and mesenchymal-epithelial transition factor (MET) [4, 5].
There have been recent advances in the use of biomarker-directed therapies [6]. For example, the European Society for Medical Oncology (ESMO) Clinical Practice Guidelines 2022 recommend programmed cell death protein 1 (PD-1) inhibitors for programmed death-ligand 1 (PD-L1) positive tumours where combined positive scores (CPS) determine the eligibility for immune checkpoint inhibitors (ICIs) [6, 7]. In metastatic gastric cancer, Microsatellite Instability-high/Deficient (MSI-H)/Mismatch Repair deficiency (dMMR) is associated with increased sensitivity to immunotherapy [8]. Ongoing trials are examining the efficacy and safety of other biomarker-targeted therapies, such as those for use in the settings of FGFR2b overexpression [9, 10], MET amplification, claudin-18.2 overexpression and EBV-positive cancers [7], and novel HER-2 directed therapies alone and in combination with immune checkpoint blockade [11].
Most individuals diagnosed with gastric cancer present with advanced-stage disease, often resulting in poor outcomes [12]. Common first-line chemotherapies include fluoropyrimidine in combination with platinum compounds (fluoropyrimidine + platinum compounds) [7]. Oxaliplatin and cisplatin are amongst the commonly utilized platinum compounds [7]; oxaliplatin is preferred for older adults [13]. Fluoropyrimidines are given either intravenously (5-FU) or orally (capecitabine or S-1 (which is more commonly used in Asia)) [7, 14]. 5-FU + platinum compounds + taxane is not recommended as a standard approach due to high toxicity [7] but may be considered in the setting of oligometastatic disease [15]. In cases of platinum-intolerance, individuals may be given 5-FU monotherapy or 5-FU + irinotecan. HER-2 overexpression may be treated with trastuzumab (a monoclonal antibody, specifically targeted against the HER-2 extracellular domain) + chemotherapy [7] with pembrolizumab added in the setting of PD-L1 (CPS) ≥ 1 [16, 17]. Where the HER-2 status is negative with PD-L1 (CPS) ≥ 5, nivolumab + chemotherapy is recommended [7]. Pembrolizumab + chemotherapy is approved regardless of PD-L1 (CPS) status [18]. Zolbetuximab (a claudin-18.2 directed cytolytic antibody), is approved, in combination with fluoropyrimidine + platinum compounds, for first-line treatment of individuals with claudin-18.2-positive and HER-2 negative tumours [19, 20]. Treatment options, in the first-line treatment setting, are limited. Survival outcomes are poor following disease progression on first-line treatments [21].
Thus, over recent years, there have been rapid changes in the treatment landscapes in the second- and later-line settings [6]. Notably, in the developments of immunotherapies and targeted therapies.
The ESMO and NCCN guidelines generally recommend similar second-line chemotherapy options, but the guidelines differ in some recommendations regarding novel treatments and the sequencing of treatments. In summary, the ESMO guidelines recommend that individuals without contraindications to chemotherapy or anti-angiogenic treatment should receive ramucirumab + paclitaxel. ESMO recommends that individuals who are unable to tolerate chemotherapy should receive ramucirumab monotherapy and those who are unable to tolerate anti-angiogenic treatments should receive a taxane or irinotecan monotherapy [7]. FOLFIRI (5-FU + leucovorin + irinotecan) is recognised by ESMO as an alternative option, although the supporting evidence is limited [7]. FOLFIRI, however, is listed as a preferred option in the NCCN guidelines [22]. Other NCCN recommended preferred options are ramucirumab + paclitaxel, docetaxel monotherapy, paclitaxel monotherapy or irinotecan monotherapy [22]. With regards to immunotherapy, both ESMO and NCCN recommend pembrolizumab in the setting of MSI-high (MSI-H) or mismatch repair deficient (dMMR) tumors [7, 22]. In addition, NCCN also recommends pembrolizumab if tumor mutational burden is ≥ 10/megabase, dostarlimab as another option for MSI-H/dMMR tumors and entrectinib in individuals with NTRK gene fusion positive tumors [22]. Both guidelines recommend trastuzumab deruxtecan for HER2-positive tumors in the second- and later-line settings [7, 22]. The ESMO guidelines recommend trifluridine–tipiracil as a third-line treatment [7]; the NCCN recommends trifluridine–tipiracil in the third or later-line setting ’ [22].
Of interest here, Catenacci et al. [23], conducted a systematic literature review (SLR) of randomised clinical trials (RCTs) (published 2009 to 19 November 2019 inclusive), that had investigated the efficacy and safety of systemic treatment, for adults with advanced, unresectable or metastatic gastric cancer, or gastroesophageal junction adenocarcinoma. Amongst their findings, the authors concluded that sequencing with taxanes or irinotecan, in combination with biologics, were effective second-line regimens. It was also noted that escalating to a triplet regimen might improve treatment efficacy at the expense of added toxicity [23].
It is recognised that, despite the survival benefits associated with novel second-line treatments, the prognosis here continues to be poor [24]. Also, there remains much uncertainty regarding the relative efficacy and safety of these novel treatments. This presents challenges for health-decision makers, including policy-decision makers, health-payers, clinicians, patients, carers and families.
The objective of this research is to examine the changing landscape of treatments of interest for advanced and metastatic gastric cancer in the second- or later-line setting. Also, to investigate the relative efficacy and safety of these treatments. Treatments of interest were defined as those recommended by both the ESMO Guidelines [7] and the NCCN Guidelines [22] and also that align with Clinical Expert opinion.
To capture the relevant literature, the search conducted as part of this research aimed to identify the published RCTs of treatments of interest. Those studies that had been identified by Catenacci et al. were reviewed for eligibility. In addition, a Rapid Literature Review (RLR) was undertaken to identify RCTs that had been published after the Catenacci et al. dissemination. An RLR is a recognised, pragmatic approach used to identify literature of relevance to a research question. In order to estimate the relative efficacy and safety of the treatments of interest, network evidence syntheses were conducted, in the form of network meta-analyses (NMAs), of eligible RCTs.

Methods

Methods

Identification of Treatments of Interest
The PICOS Framework for the literature search is presented in Appendix (Table 1). The population of interest was individuals with advanced or metastatic gastric cancer in the second- or later-line setting.
The ESMO Guidelines (Lordick et al., 2022) and NCCN Guidelines (Ajani et al., 2022) were systematically reviewed to identify treatments of interest [7, 22]. In compliance with both of these guidelines, RCTs that evaluated at least one of the following treatments: paclitaxel, docetaxel, irinotecan, FOLFIRI, ramucirumab, ramucirumab + paclitaxel, pembrolizumab (in the setting of MSI-H/dMMR tumours), dostarlimab, entrectinib, trastuzumab deruxtecan (in the setting of HER-2 positive tumours) and best supportive care (BSC) were considered for inclusion. Further, the NCCN Guidelines, also recommends pembrolizumab in the setting of TMB-high tumours.
Clinical Expertise advised that all treatments, on this list, should be included along with trifluridine-tipiracil for third- and fourth-line treatment.

Search Strategy for Eligible Randomised Controlled Trials
The two-step search strategy aimed to identify eligible RCTs published from 2009 to 29 May 2024 inclusive.

Step One
A comprehensive RLR for eligible RCTs (published 20 November 2019 to 29 May 2024 inclusive), was conducted. A systematic search of Cochrane Central, Embase, and MEDLINE via Ovid was performed. The search string is presented in Appendix 1. The RLR was restricted to English-language, full-text publications. Conference papers, commentaries, abstracts, and letters were excluded. Subgroup, exploratory and post-hoc analyses of original trials were also excluded. Further information regarding our inclusion criteria is provided in Table 1.
An RLR Protocol was developed and registered with PROSPERO (PROSPERO 2024 CRD42024568042). The RLR was conducted in line with RLR Approach 1 as described by Tricco et al. [25] and Smela et al. [26]. The modified PRISMA-NMA Reporting Checklist, as outlined by National Institute for Health and Care Excellence (NICE) [27], was followed. This modified checklist contains the subset of items, from the PRISMA Reporting Checklist [28], that are specifically applicable to the reporting of an NMA. ‘The modified PRISMA-NMA Checklist is presented in Appendix (Table 3)’. The protocol was developed to capture data published subsequent to the dissemination by Catenacci et al.

Step Two
All citations housed within the SLR dissemination by Catenacci et al. were reviewed to identify eligible RCTs (published 2009 to 19 November 2019 inclusive) that had investigated the efficacy and/or safety of the treatments of interest here. Those same inclusion and exclusion criteria and restrictions, described for the RLR, were applied when identifying the relevant RCTs from Catenacci et al.

Data Extraction
The data extraction and quality assessment processes were applied to all RCTs identified via the RLR and also via the Catenacci et al. dissemination [23]. In line with RLR Approach 1 methodology, one reviewer (SS) independently performed all title and abstract screening, full-text screening, and data extraction. Verifications were conducted by a second reviewer (LM). Two reviewers (DM and NC) also verified the accuracy and comprehensiveness of all data extracted with any disagreements resolved through discussion. After the initial screening, the references of primary studies were scanned to ascertain additional relevant RCTs. Forward citation searching was performed on the identified RCTs. In cases where RCT results were reported in multiple publications, data was extracted from the most comprehensive and/or recent dissemination only.
Templates for data extraction, developed in Microsoft Excel®, included the study title, author, year of publication, jurisdiction, key population characteristics, number of trial participants, number of participants in each arm, prior treatments received, and outcomes of interest. The primary outcomes of interest were overall survival (OS), progression-free survival (PFS), while additional outcomes were, objective response rate (ORR) and Grade ≥ 3 treatment-related adverse events (TRAEs). For the time-to-event outcomes (i.e. OS and PFS), hazard ratios (HRs) with 95% confidence intervals (95% CI) were extracted. For dichotomous outcomes (i.e. ORR and Grade ≥ 3 TRAEs), absolute number of events, event rates (with 95% CIs) and relative risk (95% CI) were extracted, where available. The RCTs were categorised according to the line(s) of treatment under investigation (i.e. second-line or second- and later-line, or third- and later-line). For any study where the inclusion or data quality was unclear, discussions ensued, and clarifications were reached with one or two additional reviewers as necessary (DM, NC).
Extracted data was presented in tabular form (see Table 1) and also in a narrative summary.

Assessment of Risk of Bias
The risk of bias (RoB), associated with each RCT, was evaluated using the Cochrane Collaboration RoB 1 Tool for RCTs (Higgins, Altman et al., 2011). We have followed the lead of several recent Cochrane reviews and other authors who have opted for use of the Cochrane RoB 1 Tool in preference to the Cochrane RoB 2 Tool. These authors have cited implementation and pragmatic constraints associated with the RoB 1 Tool [29–31]. Also, the Cochrane Handbook recognises that use of the RoB 1 Tool remains acceptable given ongoing implementation challenges associated with the RoB 2 Tool [32].This evaluation was performed by one reviewer (SS) with clarifications sought from other reviewers when required. The RoB 1 Tool assesses the RoB, in RCTs, across seven domains: Random Sequence Generation, Allocation Concealment, Blinding of Participants and Personnel, Blinding of Outcome Assessment, Attrition Bias, Reporting Bias and Other Bias. Each domain is evaluated for RoB and determined to be low, unclear, or high.

Data Synthesis and Analysis

Construction of Evidence Networks
The NMA approach taken here was aligned with the European Methodological Guidelines for Quantitative Evidence Synthesis for Direct and Indirect Comparisons [33]. Specifically, evidence synthesis was carried out using Bayesian NMA in the generalised linear modelling (GLM) framework described by Dias et al. [34].
Evidence networks for the outcomes of interest (OS, PFS, ORR and Grade ≥ 3 TRAEs) were constructed.
All RCTs identified via the RLR and via Catenacci et al. [23] were assessed for inclusion in the evidence networks. Where two or more citations reported different data cuts of the same study, the most recent data cut was included. Studies reporting post-hoc subgroup analyses from already-included RCTs were excluded.
Different doses, dosing schedules, or modes of administration of the same treatment of interest were classified as a single node in the evidence networks. Therefore, any RCTs that exclusively compared two such interventions were excluded. Where the intervention in a trial arm consisted of different treatments or combinations of different treatments, these were considered as distinct nodes in the networks. Each outcome-specific evidence network was constructed to include all relevant evidence which informed comparative efficacy or safety of the treatments listed in the PICO. Thus, each evidence network included all RCTs that contributed to a path connecting two or more treatments of interest, while any RCTs that did not lie on such a path were excluded.
Feasibility of evidence synthesis, within each evidence network, relies on the underlying assumption of exchangeability, namely, that if individuals in one trial were substituted into another, the treatment effect observed would be expected to be the same [33, 34]. Exchangeability requires that studies included in the networks be sufficiently similar in terms of patient and study characteristics, to ensure that no effect-modification is present. Thus, characteristics of RCTs were assessed and compared, with particular reference to inclusion and exclusion criteria, the role of biomarker-directed therapy, and prior treatments received.
Any RCTs which differed substantially from the main study pool in terms of these characteristics were excluded from the final evidence networks. In particular, RCTs conducted in different lines of treatment were not to be pooled within the same evidence network. Therefore, the feasibility of developing separate evidence networks in the second-line setting, the second- or later-line setting, and the third- or later-line setting was investigated.

Statistical Methods
Relative treatment effects for all pairwise comparisons in the final NMAs were estimated, measured as HRs for OS and PFS, and as relative risks (RRs) for ORR, and Grade ≥ 3 TRAEs. For the outcomes of OS and PFS, treatment effects were synthesised on the log-hazard ratio scale using a GLM with a normal likelihood and an identity link function. For the dichotomous outcomes of ORR and Grade ≥ 3 TRAEs, a GLM with a binomial likelihood and a log link function [35] was used. Non-informative prior distributions (specifically, normal distributions with mean 0 and variance 10^6) were specified for all treatment-effect parameters.
Random-effects models were used in the base case to account for the presence of between-study heterogeneity. As the estimation of random-effects models can be problematic in networks with few studies, weakly informative prior distributions were used for the between-study heterogeneity parameters, specifically, a half-normal distribution with standard deviation 0.5. This choice of distribution gives a 95% prior probability that the effect observed in any given study will differ from the mean effect by a factor of less than 3 on the RR or HR scale, and was chosen to reflect the typical extent of heterogeneity encountered in meta-analyses of drugs [36, 37]. Treatment effects were summarised using posterior medians and 95% credible intervals (95%CrI) for the estimated HRs (OS and PFS) and RRs (ORR and Grade ≥ 3 TRAEs).
The relative effectiveness and associated uncertainty for all possible pairs of treatments of interest were summarized in grid tables. The grid tables were developed where the treatment and comparator (for each pair-wise comparison) were named on the X-axis and Y-axis respectively. Relative-treatment effect point estimates, for each pair-wise comparison are described in coloured grid cells. Green indicates a point estimate that favours the treatment; red indicates a point estimate that favours the comparator. The colour intensity indicates the magnitude of the point estimate. The colour intensity however does not indicate statistical significance or the width of the 95%CrI).
Due to the small number of RCTs in the final evidence networks, quantitative assessment of heterogeneity was not possible. Similarly, quantitative analysis of inconsistency could not be carried out due to the absence of closed loops. Therefore, assessment of the underlying assumption of exchangeability was carried out via a qualitative assessment of study and population characteristics.
Analysis was carried out in R (version 4.4.2) and JAGS (version 4.3.1) using the package BUGSnet [38]. The Markov chain Monte Carlo sampling was conducted in JAGS using 3 chains with a total 15,000 iterations per chain, of which 4000 were burn-in iterations. An adaptive phase consisting of 1,000 iterations was also conducted to optimise the sampler. No thinning of samples was carried out. Model convergence was assessed via examination of trace plots, posterior density plots of model parameters, and Gelman-Rubin and Geweke convergence diagnostics as described by Du et al. [39].

Results

Results

Search Strategy for Eligible Randomised Controlled Trials
Screening was managed using Covidence® and EndNote® 20 software. In total, 5,529 studies and 42 studies were identified via the RLR and Catenacci et al., respectively (n = 5,571). A total of 1,577 duplicates were removed. Title and abstract screening resulted in the elimination of 3,752 studies. Thus, 242 studies were eligible for full-text assessment. The full-text assessment identified a total of 44 RCTs eligible for inclusion in the data-extraction and narrative review. The PRISMA flow, which details the reasons for inclusion and exclusion, is illustrated in Appendix (Fig. 1). The included RCTs are summarized in Table 1.

Data Extraction
The 44 identified RCTs, underwent data extraction in line with the prespecified data collection template. The data extracted is presented in Table 1. The 44 RCTs are from a broad range of jurisdictions; from Asia [40–47] (specifically Japan [48–57], Korea [58–63], China), from Europe (specifically France and Germany [64–67]), UK [68], in addition to multinational trials across multiple continents [64–67, 69–79]. Table 1 illustrates that, for each RCT, the study title, author and year of publication along with the number of participants in each arm (range from 15 to 357 participants) was recorded. Across the 44 RCTs, 37 investigated treatments in the second- line setting, five were in the second- and later-line setting and two were in the third- and later-line setting. As advised by clinical expert opinion, prior first-line treatments, across the eligible RCTs, were categorized as: fluoropyrimidine alone; fluoropyrimidine + platinum compounds; fluoropyrimidine + platinum compounds + trastuzumab (in the HER2-positive setting); fluoropyrimidine + platinum compounds or taxane-based chemotherapy; fluoropyrimidine + platinum compounds or anthracycline or docetaxel. These prior treatment categories are described in the data extraction table.

Narrative Review of Eligible Randomised Controlled Trials (n = 44)

Second-line Setting (37 RCTs)
The 37 eligible RCTS, in this setting, were a mixture of Phase II RCTs (n = 16), Phase III RCTs (n = 18), RCTs encompassing both Phases II and III (n = 2) and one trial that did not report the phase. Of the 37 RCTS, eight implemented a double-blind design [46, 47, 58, 70, 72–75]; the remaining were open-label trials. Of the 37 RCTs, 35 compared the intervention of interest to an active comparator; two were placebo-controlled [68, 72]. Populations across the trials were, in the main, heterogeneous. Certain studies were country specific, a number compared Asian and western populations, and several studies included a global population with Asian subgroups.

Paclitaxel Versus Paclitaxel in Combination with Various Treatments
A number of RCTs assessed the efficacy of paclitaxel alone versus paclitaxel in combination with various treatments. No statistically significant differences were observed for napabucasin + paclitaxel versus paclitaxel for OS, PFS and ORR [70]. Likewise, raltitrexed + paclitaxel was not associated with statistically significant improvements in OS, PFS and ORR versus paclitaxel [80]. Similarly, trastuzumab + paclitaxel did not improve OS, PFS and ORR versus paclitaxel alone [48]. Adding olaparib to paclitaxel did not lead to a statistically significant improvement in PFS, but did lead to a statistically significant OS benefit [58]. Olaparib + paclitaxel was not associated with a statistically significant OS benefit versus paclitaxel alone [47]. In the RAINBOW trial, OS was statistically significantly longer with ramucirumab + paclitaxel than with paclitaxel alone [73]. Muro et al. reported on a subgroup analysis of the RAINBOW trial which comprised participants from East Asia. Ramucirumab + paclitaxel was associated with a statistically significant PFS benefit, but not with a statistically significant OS benefit, versus paclitaxel alone [46]. Also, Shitara et al. concluded that OS and PFS outcomes associated with ramucirumab + paclitaxel were similar between Japanese and Western subgroups of RAINBOW [74]. Satoh et al. concluded that lapatinib + paclitaxel did not statistically significantly improve OS or PFS, versus paclitaxel, in an Asian population; however, a significant ORR benefit was seen [44]. Paclitaxel + S-1 was not associated with a statistically significant benefit in OS, PFS or ORR, versus paclitaxel, in an Asian population [53].

Paclitaxel Versus Other Treatments
Paclitaxel has also been compared to other treatments. Paclitaxel was associated with a significant PFS benefit, but not with a significant OS benefit, versus 5-FU [54]. AZD4547 (a selective FGFR-1, 2, 3 tyrosine kinase inhibitor) did not significantly improve PFS, relative to paclitaxel alone, in patients with gastric cancer displaying FGFR2 polysomy or gene amplification [76]. Two studies evaluated paclitaxel versus irinotecan; in both no statistically significant differences were observed between arms for PFS and OS [49, 61]. In their four-arm study, Kawase et al., compared irinotecan-alone, paclitaxel-alone, irinotecan + S-1 and paclitaxel + S-1. A significant benefit was seen for PFS, but not for OS, when comparing paclitaxel-alone versus irinotecan alone. No PFS benefit or OS benefit was seen in the S-1 combination groups versus the monotherapy groups [41]. No PFS benefit or OS benefit was seen in the S-1 combination groups versus the monotherapy groups [41]. In three RCTs, pembrolizumab was evaluated versus paclitaxel; pembrolizumab was not associated with a statistically significant OS benefit in any of these RCTs [40, 69, 77].
Kang et al. concluded that DHP107 (a novel oral formulation of paclitaxel) was noninferior to paclitaxel for OS, PFS and ORR [62]. Likewise, Shitara et al. concluded that nab-paclitaxel (nanoparticle albumin-bound paclitaxel) was non-inferior to paclitaxel, for OS [55].

Docetaxel Versus Other Treatments
In the COUGAR-02 trial, the addition of docetaxel to active-symptom-control was associated with a statistically significant improvement in OS [68]. Yi et al. concluded that the adding sunitinib to docetaxel did not result in a statistically significant improvement in the time to progression [42]. However, elsewhere, the addition of oxaliplatin to docetaxel was associated with a statistically significant improvement in PFS [59]. Lee et al. concluded that the addition of S-1 to docetaxel resulted in a statistically significant PFS benefit but that the addition of cisplatin to docetaxel did not improve PFS [60].
In the GATSBY trial, trastuzumab emtansine was not associated with a statistically significant OS benefit when compared to physician-choice taxane (either paclitaxel or docetaxel) in individuals with HER-2 positive advanced gastric [75].
Two studies compared cisplatin + irinotecan to irinotecan; with no OS benefit was demonstrated in either [50, 52]. A statistically significantly prolonged PFS was demonstrated in one of these RCTs [50], but not in the other [52].
Five RCTs evaluated irinotecan versus different comparators. In an RCT that compared irinotecan alone to BSC, irinotecan alone was associated with a statistically significant OS benefit [65]. There were no statistical significant differences observed in OS, PFS and ORR between mFOLFIRI (irinotecan + 5-FU + leucovorin) and irinotecan alone [43]. In their three-arm trial, Roy et al. compared PEP02 (a liposomal nanocarrier formulation of irinotecan) to irinotecan alone and to docetaxel alone. No statistically significant differences in OS, PFS and ORR were noted between the three arms [71]. In another RCT there was no statistically significant difference observed in OS between S-1 + irinotecan and irinotecan alone [51]. The addition of nimotuzumab to irinotecan was not associated with a statistically significant improvement in OS and PFS versus irinotecan alone [45].

Targeted Treatments & Immunotherapy Treatments
Fuchs et al. concluded that ramucirumab + BSC was associated with a statistically significant OS benefit versus placebo + BSC [72]. Tougeron et al. randomised participants to receive FOLFIRI + durvalumab or FOLFIRI + durvalumab + tremelimumab. No statistically significant differences in OS, PFS or ORR were observed between the two arms [81].
Lorenzen et al. concluded that treatment with FOLFIRI + ramucirumab resulted in similar OS, but improved ORRs and PFS outcomes relative to those of paclitaxel + ramucirumab; however, no p-values were reported [64].
Yan et al. concluded that intermittent-apatinib + docetaxel was associated with similar OS, PFS and ORR outcomes to those of continuous-apatinib + docetaxel, with no statistically significant differences identified between the two treatment regimens [82]. In another RCT, it was concluded that toripalimab + apatinib did not improve OS, PFS and ORR compared with physician’s-choice chemotherapy (defined as either irinotecan or paclitaxel or docetaxel) [83].
Of the 37 RCTs in this setting, 19 reported event rates for Grade ≥ 3 TRAEs for the intervention versus the comparator. None of these 19 RCTs reported any statistically significant differences in Grade ≥ 3 TRAEs in the intervention arms relative to the comparator.

Second- and Later-Line Setting (n = 5 RCTs)
In this setting, three RCTs were Phase II [56, 57, 67] and two were Phase III [63, 66]. Two implemented a double-blind design [66, 67]; three were open-label [56, 57, 63]. Populations across the trials were considered to be heterogeneous. Various studies were conducted in Germany, Korea and Japan.
In their RCT, Lorenzen et al. concluded that paclitaxel + everolimus did not improve OS, PFS and ORR outcomes compared with paclitaxel [66]. Fushida et al. reported no statistically significant differences between paclitaxel + valproic acid and paclitaxel for both OS and PFS [57]. Shitara et al. reported that dose-escalated paclitaxel did not result in a statistically significant improvement in OS or ORR compared to standard dose of paclitaxel. However, PFS was statistically significantly prolonged with dose-escalated paclitaxel [56]. Kang et al. concluded that chemotherapy (either irinotecan or docetaxel) + BSC statistically significantly improved OS compared to BSC alone [63]. Moehler et al. found that the addition of sunitinib (an anti-angiogenic receptor tyrosine kinase inhibitor) to FOLFIRI did not result in statistical significant improvements in PFS, OS, or ORR outcomes compared to FOLFIRI alone [67].
Only one of the RCTs in this setting reported event rates for Grade ≥ 3 TRAEs; no statistically significant difference between arms was detected [66].

Third- and Later-line Setting (n = 2 RCTs)
Shitara et al., 2018 evaluated the efficacy of trifluridine-tipiracil compared to BSC in individuals who had previously received two or more standard-of care-regimens. Trifluridine-tipiracil was associated with a statistically significant benefit in OS, and PFS but not in ORR. Grade ≥ 3 TRAEs were reported but statistical analyses were not presented.
Shitara et al., 2020 evaluated trastuzumab deruxtecan [79] as compared with physician-choice chemotherapy (irinotecan (89%) and paclitaxel (11%)) in individuals with HER-2 positive gastric cancer who had received a median of two previous systemic treatments. Trastuzumab deruxtecan was associated with a statistically significant benefit in both OS and PFS. No statistically significant differences, between arms, were detected in ORR and Grade ≥ 3 TRAEs.

Assessment of Risk of Bias
All 44 identified RCTS were subjected to a RoB assessment using the Cochrane Collaboration RoB 1 Tool for RCTs. Most RCTs were open-label in design, utilized an intention-to-treat analysis approach, and all were generally well-balanced across treatment arms for baseline characteristics.
The Y-axis describes the seven domains under consideration. The X-axis describes the number of RCTs that comply (green) or do not comply (red) with each domain or where reporting is unclear (amber).
The RoB 1 Tool assessment outcomes are described in Fig. 1. The Y-axis describes the seven domains considered within the Tool. The X-axis describes the number of studies that comply (green) or do not comply (red) with each domain, or where reporting is unclear (amber). As illustrated, no RCTs were compliant with all seven domains. Amongst all RCTs, compliance with certain domains was high. For example, methods used to generate the allocation sequence and reporting bias were adequately described in 39 RCTs. Attrition bias was adequately described in 34. Amongst all RCTs, compliance with certain domains was low. For example, across 33 RCTs, performance bias or measures used to blind trial participants and researchers from knowledge of which intervention a participant received were, in the main, poorly described. In 24, the allocation-concealment approaches used were not described in sufficient detail. A detailed, domain-level risk of bias table for all individual RCTs (n = 44) is presented in Appendix Table 2.

Data Synthesis and Analysis

Construction of Evidence Networks
All 44 identified RCTs, across all treatment-line settings, were assessed for potential inclusion in the four outcome-specific evidence networks. The assessment-process flow and conclusions, for each outcome-specific network, are described in Appendix (Figs. 6, 7, 8 and 9).
Briefly, only those RCTs that contributed to a path connecting two or more treatments of interest were included. RCTs were necessarily excluded from the OS and PFS networks when HRs were not reported. Likewise, RCTs were excluded from the ORR and Grade ≥ 3 TRAEs networks when event rates or absolute number of events were not reported.
The exchangeability of remaining RCTs was then assessed. Any RCT that differed substantially from the main study pool in terms of participant inclusion and exclusion criteria, biomarker status and prior treatments were excluded from the final evidence networks.
This assessment process concluded that it was feasible to construct evidence networks for each outcome of interest (OS, PFS, ORR and Grade ≥ 3 TRAEs) in the second-line setting. Also, that it was not feasible to construct evidence networks, for any of the outcomes, in the second- and later-line setting or in the third- and later-line setting.

Network Meta-Analysis in the Second-Line Setting

Overall Survival
Nine RCTs, comprising eight treatments of interest, were eligible for inclusion in the evidence network for OS. The treatments were paclitaxel alone, docetaxel alone, irinotecan alone, FOLFIRI, ramucirumab, ramucirumab + paclitaxel, pembrolizumab and BSC (see Appendix (Fig. 2)).
The relative efficacy estimates (and associated uncertainty) obtained from the NMA for all pairs of included treatments are presented in Fig. 2. As shown, there are no statistically significant differences in the risk of an OS event (death) across any of the pair-wise comparisons. However, certain trends are observed. For example, BSC exhibits a numerically (but not statistically significant) higher risk of death relative to all other comparators. Likewise, FOLFIRI exhibits a numerically lower risk of death relative to all other comparators included in this evidence network.
The HR (95%CrI) for each pair-wise comparison for the treatment (X-axis) vs. the comparator (Y-axis) is described, with values of less than one indicating a lower risk of death with the treatment relative to the comparator. For example, the HR (paclitaxel vs. pembrolizumab) = 1.17 (95%CrI 0.71, 1.78), indicating a numerically (but not statistically significant) higher risk of death with paclitaxel.
Green indicates a point-estimate that favours the treatment; red indicates a point estimate that favours the comparator. The colour intensity indicates the magnitude of the point estimate (but does not indicate statistical significance or width of the 95%CrI).
Where BSC = Best supportive care, HR = Hazard ratio, CrI = Credible Interval, FOLFIRI = Folinic acid Fluorouracil Irinotecan.

Progression-Free Survival
Six RCTs, comprising five treatments, were eligible for inclusion, in the evidence network for PFS. The treatments were: paclitaxel alone, irinotecan alone, FOLFIRI, ramucirumab + paclitaxel and pembrolizumab (see Appendix (Fig. 3)).
The results of the NMA for PFS are shown in Fig. 3. No statistically significant differences in efficacy between treatments were observed for this outcome. Also, it can be seen, for example, that Pembrolizumab is associated with a numerically (but not statistically significant) higher risk of progression or death versus each of the treatments in the network. Likewise, ramucirumab + paclitaxel is associated with a numerically lower risk of progression or death versus each treatment in the network.
The HR (95%CrI) for each pair-wise comparison for the treatment (X-axis) vs. the comparator (Y-axis) is described, with values of less than one indicating a lower risk of death with the treatment relative to the comparator. For example, the HR (irinotecan vs. FOLFIRI) = 1.20 (95%Ci 0.55, 2.55), indicating a numerically (but not statistically significant) higher risk of PFS with irinotecan.
Green indicates a point-estimate that favours the treatment; red indicates a point estimate that favours the comparator. The colour intensity indicates the magnitude of the point estimate (but does not indicate statistical significance or width of the 95%CrI).
The upper 95%CrI (ramucirumab + paclitaxel vs. pembrolizumab) = 1.0007; the lower 95%CrI (pembrolizumab versus ramucirumab + paclitaxel) = 0.9993. Both values are rounded to 1.0 in the table.

Objective Response Rate
Eight RCTs, pertaining to six treatments, (see Appendix (Fig. 4)) were eligible for inclusion in the ORR evidence network. The results of the NMA for this outcome (Fig. 4) show that, across all pair-wise comparisons, there are no instances of statistically significant differences in ORR. Estimated ORR is numerically higher with docetaxel versus all other treatments in the network, and lower with irinotecan.
The RR (95%CrI) for each pair-wise comparison for the treatment (X-axis) vs. the comparator (Y-axis) is described, with values greater than 1 indicating a higher rate of objective response with the treatment relative to the comparator. For example, the RR (docetaxel vs. FOLFIRI) = 2.21 (95%CrI 0.28, 19.45), indicating a numerically (but not statistically significant) higher rate of objective response (tumour shrinkage) with docetaxel compared to FOLFIRI.
Green indicates a point-estimate that favours the treatment; red indicates a point estimate that favours the comparator. The colour intensity indicates the magnitude of the point estimate (but does not indicate statistical significance or width of the 95%CrI).

Grade ≥ 3 Treatment Related Adverse Events
A total of five studies, pertaining to five treatments, were eligible for inclusion in the Grade ≥ 3 TRAEs evidence network (see Appendix (Fig. 5)).
The results of the NMA show that the only statistically significant finding is the decreased risk of Grade ≥ 3 TRAEs for pembrolizumab versus paclitaxel. It highlights also that FOLFIRI is associated with a numerically higher (but not statistically significant) risk of Grade ≥ 3 TRAEs versus all other treatments in the network. Likewise, pembrolizumab is associated with a lower risk of Grade ≥ 3 TRAEs versus all comparators; significance was only identified in the comparison versus paclitaxel. In general, confidence intervals for this outcome were wide due to low event numbers.
The RR (95%CrI) for each pair-wise comparison for the treatment (X-axis) vs. the comparator (Y-axis) is described, with values greater than 1 indicating a higher rate of adverse events with the treatment relative to the comparator. For example, the RR (paclitaxel vs. docetaxel) = 1.59 (95%CrI 0.24, 11.04), indicating a numerically (but not statistically significant) higher rate of adverse events with paclitaxel compared to docetaxel.
Green indicates a point-estimate that favours the treatment; red indicates a point estimate that favours the comparator. The colour intensity indicates the magnitude of the point estimate (but does not indicate statistical significance or width of the 95%CrI).

Discussion

Discussion
This work involved the identification of relevant RCTs that investigated the clinical efficacy and safety of eleven treatments for advanced and metastatic gastric cancer, in the second- and later-line setting, as recommended by both ESMO and NCCN guidelines. The subsequent implementation of evidence synthesis networks allowed investigation of the relative efficacy and safety of these treatments. Primary outcomes of interest were OS and PFS; additional outcomes of interest were ORR and Grade ≥ 3 TRAEs.
In total 44 RCTs were deemed to be eligible for inclusion in the data-extraction and narrative review. This review provided useful insights into the research undertaken, to date, in this setting. It is noted, for example, that the 44 RCTs are from a broad range of jurisdictions. The majority of RCTs (n = 37) investigated treatments in the second-line setting. Between-trial heterogeneity in several key criteria (including number of participants, trial phase, previous treatments, blinding methods and patient demographics) is evident. The RCTs were associated with varying risks of bias. Differences were noted, across the RCTs, in the outcomes investigated. Not all RCTs, for example, considered the efficacy (OS, PFS and ORR) and safety (Grade ≥ 3 TRAEs) outcomes of interest here.
An understanding of the relative efficacy and safety, of all the treatments of interest, required the implementation of NMAs. Given the data availability, NMAs were feasible in the second-line setting only and only for certain treatments. Eight treatments could be included in the OS NMA, five treatments in the PFS NMA and six in the ORR NMA. No statistically significant differences in the efficacy of the treatments across each of these networks were identified; however, certain trends were noted. For example, the estimated HRs suggest that mortality may be higher with BSC compared to all other treatments in the OS network, and lower with FOLFIRI. In terms of PFS, the risk of progression or death may be higher with pembrolizumab relative to the other comparators, and lower with ramucirumab + paclitaxel. Estimated rates of objective response were highest with docetaxel and lowest with irinotecan. Five treatments could be included in the Grade ≥ 3 TRAEs NMA; this indicated that pembrolizumab is associated with a statistically significant decreased risk of Grade ≥ 3 TRAEs versus paclitaxel. No other statistically significant differences identified in the risk of Grade ≥ 3 TRAEs across the treatments in this network, and confidence intervals were wide due to low event numbers overall.
This work has investigated the relative efficacy (in terms of a number of key outcomes) along with safety of a wide range of treatments for advanced and metastatic gastric cancer, in the second-line setting. To our knowledge, this is the first NMA which has specifically considered treatments as recommended by both ESMO and NCCN. This NMA adds to previously published work, in the second-line setting. In their NMA, Badiani et al. [84] included eligible literature, up to February 2015, for six treatments. OS was the outcome of interest. Badiani et al. concluded, from their NMA, that ramucirumab + paclitaxel is associated with a statistically significant decreased OS-event risk versus each of irinotecan, paclitaxel, docetaxel, ramucirumab, and BSC [84]. Although our OS NMA outputs are directionally consistent with that of Badiani et al., none of our outputs were statistically significant. The differences in findings can, at least partly, be explained by key methodological and evidence-base differences in the approaches taken by Badiani et al. and our group. Badiani et al. used median OS from seven RCTs (published to Feb 2015) [84]. Median survival does not account for censoring or the shape of the time-to-event curve and can therefore distort relative effects in indirect comparisons (Guyot et al., 2012; Michiels et al., 2005). Our NMA instead used the HR which is the recommended measure for time-to-event outcomes. Further, we incorporated RCTs published to May 2024.
In their 2019 NMA, Cheng et al. ranked treatments for efficacy and safety effects. Amongst the findings, it was concluded that paclitaxel + olaparib and ramucirumab + paclitaxel dominated the OS ranking, and ramucirumab + paclitaxel the PFS ranking, however, these rankings and the underlying effect estimates were uncertain [85].
An evidence network was not feasible, in the present study, for the third- and later-line setting, Zhang et al. 2023 [86], however, could construct an evidence network for third-line treatments given that their PICOS differed to ours. For example, their PICO included treatments that are not recommended by ESMO and NCCN. Their work indicated that trastuzumab deruxtecan was associated with a statistically significant improvement in OS compared to chemotherapy [86].
The methodology employed for this research is robust, being aligned with best-practice guidance [33, 34]. The outputs of this work are of interest to all healthcare decision makers including policy makers, health payers, clinicians, patients, carers and families.
The limitations of this work should be noted. Given data availability, it was not possible to investigate the relative efficacy and safety of treatments for use in the third- and later-line setting. Also, it was not possible to include all treatments of interest within each of the efficacy and safety NMAs in the second-line setting. The inclusion of pembrolizumab in our networks is noted. ESMO and NCCN Guidelines recommend pembrolizumab in the setting of MSI-H/dMMR tumours [7, 22]. NCCN also recommends use in the setting of TMB-high tumours [22]. The RCTs in the second-line setting (KEYNOTE-061 and KEYNOTE-063), which informed the pembrolizumab node in our evidence networks, mainly recruited patients with PD-L1-positive disease and this was regardless of MSI-H/dMMR and TMB status. In KEYNOTE-061, a very small subgroup (5.3%) had positive MSI-H status [8]. MSI-H status was not a trial stratification factor, and thus analyses in this subgroup are at a risk of bias [77]. Also, a comparison of outcomes for this KEYNOTE-061 subgroup with those of the full populations from the other RCTs, would require the strong assumption that MSI-H status is an effect modifier for pembrolizumab versus paclitaxel, but not for other comparisons in the network. We note the MSI-H/dMMR and TMB-high status of patients in KEYNOTE-063 is not provided [40]. Thus, it is not possible to consider pembrolizumab in the settings of MSI-H/dMMR tumours and TMB-high tumours only within our NMAs. Further work is warranted here.
Further, an important consideration in interpreting the results of this NMA is the fact that many comparisons yielded wide credible intervals for relative treatment effects and were not statistically significant. This can be seen, for example, in Fig. 2 which describes the outputs of the NMA for OS for each of the comparisons versus BSC. Here all RR point estimates, for each intervention versus BSC, show that each intervention is associated with a numerical benefit in OS versus BSC. However, in each instance the credible intervals are particularly wide. This is likely to be partially a consequence of the low event rates and small sample sizes across many included RCTs, as well as the between-study heterogeneity and reliance on indirect evidence for some comparisons (which can lead to lower statistical precision). Therefore, it is possible that true differences in efficacy and/or safety between certain treatments exist but were not detected due to low statistical power (i.e., a type II error). Despite the lack of statistical significance, the results of the NMA indicate trends in the data, and further research is suggested to identify optimal treatments.
We acknowledge that the generalizability of our results may be somewhat limited given that several trials were conducted entirely in East Asian populations or included subpopulations from East Asia. Our group considered that undertaking scenario analysis to exclude such studies from the networks would have been of interest. However, unfortunately, it was not possible to undertake robust scenario analysis here, given the relatively small number of trials in our networks. Also, we have restricted our SLR to English-language full texts. This may have introduced selection bias and an under-representation of certain regions or regimens.
Future NMAs should incorporate additional on-going and planned RCTs to enhance the precision of effect estimates and provide more definitive evidence regarding the comparative efficacy and safety of all treatments of interest here. Of note, the treatments of interest, in this work, are defined as those recommended by both the ESMO and NCCN Guidelines. As such, it is beyond the scope, here, to include novel treatments, in this setting, of which there are a number. It is planned that, in time, our evidence networks, will be updated to include emerging data from RCTs of novel treatments. This will allow investigations of relative efficacy and safety of these novel treatments versus the treatments in our networks. In line with robust practice, this will be feasible, only, if such novel treatments are supported by RCTs that are interchangeable with those in our networks. Also, the certainty of the resultant NMA outputs will be improved where the RCT sample sizes are sufficient [33].

Conclusion

Conclusion
The appreciable number of RCTs captured in our search highlights that the treatment landscape for advanced and metastatic gastric cancer, in the second- and later-line setting, continues to evolve. Implementation of NMAs were feasible in the second-line setting only and only for certain treatments. NMA outputs suggest that the introduction of novel treatments, in this setting, have not had a statistically significant impact on key efficacy outcomes, and have had little impact on safety outcomes, versus more established treatments. There remains a need for novel treatments that will have a significant beneficial impact on both efficacy and safety outcomes in this setting.

Supplementary Information

Supplementary Information
Below is the link to the electronic supplementary material.

출처: PubMed Central (JATS). 라이선스는 원 publisher 정책을 따릅니다 — 인용 시 원문을 표기해 주세요.

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반

🟢 PMC 전문 열기