Patient-Perceived Overall Side Effect Bother at and After Cancer Treatment Discontinuation: An Analysis Using Commercial Cancer Trial Data.
1/5 보강
PICO 자동 추출 (휴리스틱, conf 3/4)
유사 논문P · Population 대상 환자/모집단
환자: solid tumours (renal cell carcinoma and breast cancer)
I · Intervention 중재 / 시술
treatment, completion rates at discontinuation for most items were at least 70%, and 52% to 78% at follow up
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
GP5 and most items were at least somewhat correlated (≥ 0.2 in nearly all evaluations). [DISCUSSION AND CONCLUSIONS] Persistent side effect bother and symptomatic and functional detriments at and after discontinuation suggest capturing this information post-treatment can inform understanding of tolerability, particularly with improved PRO completion.
[INTRODUCTION] There is widespread interest among patients, clinicians, regulators and other constituents in post-treatment patient-reported cancer data.
APA
Roydhouse J, Breslin M, et al. (2026). Patient-Perceived Overall Side Effect Bother at and After Cancer Treatment Discontinuation: An Analysis Using Commercial Cancer Trial Data.. Therapeutic innovation & regulatory science, 60(2), 575-590. https://doi.org/10.1007/s43441-025-00906-8
MLA
Roydhouse J, et al.. "Patient-Perceived Overall Side Effect Bother at and After Cancer Treatment Discontinuation: An Analysis Using Commercial Cancer Trial Data.." Therapeutic innovation & regulatory science, vol. 60, no. 2, 2026, pp. 575-590.
PMID
41513883 ↗
Abstract 한글 요약
[INTRODUCTION] There is widespread interest among patients, clinicians, regulators and other constituents in post-treatment patient-reported cancer data. Side effect bother is a patient-reported outcome (PRO) that can capture an important aspect of tolerability. In this study, we examined side effect bother at cancer treatment discontinuation and post-discontinuation in commercial cancer trials. We sought to understand completion rates, the extent of bother and its association with other PROs.
[MATERIALS AND METHODS] Data were evaluated from three trials in patients with solid tumours (renal cell carcinoma and breast cancer). Side effect bother was measured with the Functional Assessment of Chronic Illness Therapy (FACIT) GP5 item. Symptom items were drawn from FACIT and function items were drawn from the EQ-5D-3L. FACIT items, including the GP5, are on a 0-4 scale (higher = worse symptoms/bother), and were dichotomised as 0-1 ("low") vs 2-4 ("moderate"). EQ-5D-3L items were characterised as no problems (1) and some problems (2-3). Descriptive and correlation analyses were conducted separately for each trial.
[RESULTS] Among patients who received treatment, completion rates at discontinuation for most items were at least 70%, and 52% to 78% at follow up. More than 20% of patients had high side effect bother at and after discontinuation, and similar percentages were seen for symptom items and functioning problems. GP5 and most items were at least somewhat correlated (≥ 0.2 in nearly all evaluations).
[DISCUSSION AND CONCLUSIONS] Persistent side effect bother and symptomatic and functional detriments at and after discontinuation suggest capturing this information post-treatment can inform understanding of tolerability, particularly with improved PRO completion.
[MATERIALS AND METHODS] Data were evaluated from three trials in patients with solid tumours (renal cell carcinoma and breast cancer). Side effect bother was measured with the Functional Assessment of Chronic Illness Therapy (FACIT) GP5 item. Symptom items were drawn from FACIT and function items were drawn from the EQ-5D-3L. FACIT items, including the GP5, are on a 0-4 scale (higher = worse symptoms/bother), and were dichotomised as 0-1 ("low") vs 2-4 ("moderate"). EQ-5D-3L items were characterised as no problems (1) and some problems (2-3). Descriptive and correlation analyses were conducted separately for each trial.
[RESULTS] Among patients who received treatment, completion rates at discontinuation for most items were at least 70%, and 52% to 78% at follow up. More than 20% of patients had high side effect bother at and after discontinuation, and similar percentages were seen for symptom items and functioning problems. GP5 and most items were at least somewhat correlated (≥ 0.2 in nearly all evaluations).
[DISCUSSION AND CONCLUSIONS] Persistent side effect bother and symptomatic and functional detriments at and after discontinuation suggest capturing this information post-treatment can inform understanding of tolerability, particularly with improved PRO completion.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
📖 전문 본문 읽기 PMC JATS · ~95 KB · 영문
Introduction
Introduction
The acute effects of anti-cancer therapies such as chemotherapy are well-described in the literature [1], and treatment-related toxicity can persist well after treatment [2]. The side effects of newer therapies, such as targeted therapies or immunotherapies, can also seriously impact health-related quality of life [3]. Long-term follow-up of these therapies is also an important consideration when evaluating both adverse effects and health-related quality of life impacts [4]. For example, international regulatory guidance suggests a follow-up period of 15 years for chimeric antigen receptor T cell products (CAR-T) [5]. At the same time, there is a growing push towards the inclusion of patient-reported information about the impact of therapy in cancer trials, as evidenced by statements from regulatory bodies [6, 7]. Additionally, health-related quality of life plays a critical role in the decisions of health technology assessment bodies such as the National Institute for Health and Care Excellence (NICE) in the UK [8] and the Institute for Quality and Efficiency in Health Care (IQWiG) in Germany [9]. Payors value patient-reported outcome (PRO) data in oncology; in some cases, PROs have played an important role in decisions supporting products [10]. Payors also value post-progression data in oncology [10]. For example, a recent publication authored by individuals from IQWiG described post-progression data as “of paramount importance to patients and clinicians” [11]. Similarly, regulatory, academic, payor, and industry constituents at a 2020 US Food and Drug Administration (FDA) roundtable agreed that post-treatment PRO data in cancer clinical trials can provide important information about the persistence of symptomatic and functional impacts in patients treated for cancer [12].
Nonetheless, there are well-recognised challenges to the collection of post-treatment PRO data [12, 13]. Missing PRO data is a longstanding issue [14], and as pointed out in the literature, regulatory review [15, 16] has highlighted missing PRO data in regulatory submissions. Post-treatment PRO assessments were infrequent in a review of commercial clinical cancer trials, and when they occurred completion rates were on average 66% in either arm [17]. In contrast, median pre-treatment completion rates were over 90% and on-treatment completion rates close to 90% in commercial cancer clinical trials [18]. Similar findings of suboptimal post-discontinuation completion have been reported outside of the commercial clinical trial setting. Completion rates of 54–66% at treatment discontinuation were seen for the NRG’s GOG240 study [19], and response rates fell from 100% at randomization to 61% at 4-year follow-up and 46% at 7-year follow up in the PRO study component of SWOG S9133 [20].
Addressing and potentially improving the completion of post-treatment PRO data may be valuable [12, 13, 17]. Recommendations for post-treatment PRO data assessment include planning PRO assessments [12, 13, 17], using electronic PRO collection [13], and clearly defining specific PRO objectives [12, 13, 17]. Given concerns about long-term therapeutic impact, PRO objectives related to tolerability may be especially salient for post-treatment data collection. The low-grade but chronic adverse effect profiles of newer therapies [4] has highlighted a need for greater understanding of the longitudinal impact of treatment toxicity [21] and the inclusion of PROs [22] to aid in illuminating this. PROs related to tolerability include measures of symptomatic adverse events and overall side effect impact [6, 22]. In this study, we sought to characterise (1) the extent of overall side effect bother at the end of treatment and in post-treatment, (2) the association of overall side effect bother and symptom and function measures at the end of treatment and post-treatment, and (3) the completion rates for a single item for overall side effect bother at the end of treatment and post-treatment, using data from commercial cancer clinical trials.
The acute effects of anti-cancer therapies such as chemotherapy are well-described in the literature [1], and treatment-related toxicity can persist well after treatment [2]. The side effects of newer therapies, such as targeted therapies or immunotherapies, can also seriously impact health-related quality of life [3]. Long-term follow-up of these therapies is also an important consideration when evaluating both adverse effects and health-related quality of life impacts [4]. For example, international regulatory guidance suggests a follow-up period of 15 years for chimeric antigen receptor T cell products (CAR-T) [5]. At the same time, there is a growing push towards the inclusion of patient-reported information about the impact of therapy in cancer trials, as evidenced by statements from regulatory bodies [6, 7]. Additionally, health-related quality of life plays a critical role in the decisions of health technology assessment bodies such as the National Institute for Health and Care Excellence (NICE) in the UK [8] and the Institute for Quality and Efficiency in Health Care (IQWiG) in Germany [9]. Payors value patient-reported outcome (PRO) data in oncology; in some cases, PROs have played an important role in decisions supporting products [10]. Payors also value post-progression data in oncology [10]. For example, a recent publication authored by individuals from IQWiG described post-progression data as “of paramount importance to patients and clinicians” [11]. Similarly, regulatory, academic, payor, and industry constituents at a 2020 US Food and Drug Administration (FDA) roundtable agreed that post-treatment PRO data in cancer clinical trials can provide important information about the persistence of symptomatic and functional impacts in patients treated for cancer [12].
Nonetheless, there are well-recognised challenges to the collection of post-treatment PRO data [12, 13]. Missing PRO data is a longstanding issue [14], and as pointed out in the literature, regulatory review [15, 16] has highlighted missing PRO data in regulatory submissions. Post-treatment PRO assessments were infrequent in a review of commercial clinical cancer trials, and when they occurred completion rates were on average 66% in either arm [17]. In contrast, median pre-treatment completion rates were over 90% and on-treatment completion rates close to 90% in commercial cancer clinical trials [18]. Similar findings of suboptimal post-discontinuation completion have been reported outside of the commercial clinical trial setting. Completion rates of 54–66% at treatment discontinuation were seen for the NRG’s GOG240 study [19], and response rates fell from 100% at randomization to 61% at 4-year follow-up and 46% at 7-year follow up in the PRO study component of SWOG S9133 [20].
Addressing and potentially improving the completion of post-treatment PRO data may be valuable [12, 13, 17]. Recommendations for post-treatment PRO data assessment include planning PRO assessments [12, 13, 17], using electronic PRO collection [13], and clearly defining specific PRO objectives [12, 13, 17]. Given concerns about long-term therapeutic impact, PRO objectives related to tolerability may be especially salient for post-treatment data collection. The low-grade but chronic adverse effect profiles of newer therapies [4] has highlighted a need for greater understanding of the longitudinal impact of treatment toxicity [21] and the inclusion of PROs [22] to aid in illuminating this. PROs related to tolerability include measures of symptomatic adverse events and overall side effect impact [6, 22]. In this study, we sought to characterise (1) the extent of overall side effect bother at the end of treatment and in post-treatment, (2) the association of overall side effect bother and symptom and function measures at the end of treatment and post-treatment, and (3) the completion rates for a single item for overall side effect bother at the end of treatment and post-treatment, using data from commercial cancer clinical trials.
Methods
Methods
Population and Data Sources
This study is part of a larger study evaluating the measurement properties of a single item for overall side effect bother, the GP5 item. GP5 is described in more detail below (see Measures). For this study, we accessed commercial clinical cancer trial data from the Vivli data sharing platform. All trials in our study had received HREC/IRB approval.
For this analysis, we worked with data from three solid tumour phase III (two renal cell, one breast) trials with an end of treatment and post-treatment follow up visit listed in the questionnaire file and where there was a variable indicating treatment discontinuation at a specific treatment cycle. The two renal cell trials evaluated identical agents, but one was a first-line trial and one was not. The breast cancer trial compared a combination of agents compared to one agent.
We used de-identified secondary data for this study and obtained confirmation that HREC/IRB approval was not required from our institutional committees/boards (Northwestern University and the University of Tasmania).
Within each trial, the study population was the safety population (i.e., patients who had received treatment) and who had discontinued for either disease progression or toxicity. We did not include patients who discontinued due to death as they could not be included in the survival follow-up, and we anticipated potentially high levels of missing data for the end of treatment visit. Patients who discontinued for other reasons, such as withdrawal of consent, would also be unlikely to have visits at the time points of interest, and were not included in our study population.
Measures
We focused on five types of PRO items: (1) a side effect bother item (the primary PRO item of interest); (2) symptom items; (3) a bother item (not pertaining to side effect bother); (4) functional items; and (5) a global health item. Table 1 summarises the items and the data sources in which they are found for this analysis. Of note, the functional and global health items were only available for one data source (PALOMA) at the discontinuation visit only, and the bother item (not pertaining to side effect bother) was not available in two data sources (AXIS1, AXIS2). For the symptom items, two items (GP1, GP4) were available across all data sources and were the focus of the analysis for that broad PRO concept.
Analyses
Trials were described in terms of their clinical context (tumour type, agents being compared) and the scheduled end-of-treatment and first follow-up assessment as reported in their protocol. Population characteristics included age, baseline performance status (ECOG), and sex/gender (renal cell trials only, as the breast cancer trial was women-only).
For our analysis, we focused on three time points: (1) the end-of-treatment or discontinuation visit, at any point during the trial; (2) the discontinuation visit where it occurred earlier in the trial, defined here as within the first three cycles of treatment; and (3) the first follow-up visit after discontinuation. As there is relatively little known about the topic, we included an early discontinuation visit (as noted above, defined as within the first three cycles) to see if there were any differences in completion or severity.
To evaluate the extent of overall side effect impact at each time point, we used descriptive statistics (frequency and percentage [N, %]) to characterise GP5’s distribution, both overall and by reason for treatment discontinuation. Descriptive statistics (frequency and percentage) were used to characterise GP5 and all PRO items except the VAS, for which we used mean and standard deviation. The association between GP5 and the other measures described above was calculated in two ways. First, we evaluated the correlation at each time point using Spearman’s rho. Second, we evaluated dichotomised GP5 (less than moderate-severe: 0–1 vs moderate-severe: 2–4) against groups defined by symptom items and functional items through cross-tabulation. We focused on the items that were seen in all FACT measures (GP1, GP4); these were dichotomised as 0–1 (less than moderate-severe) vs 2–4 (moderate-severe). Where available, we also included the HI7 item (AXIS1, AXIS2 trials) and GP2 item (PALOMA trial). For the functional items in the EQ-5D, these were dichotomised as 1 (no problems) vs 2–3 (any problems). We expected that individuals with higher levels of symptoms, bother, or functional problems would be more likely to be in the moderate-severe (2–4) bother group. The descriptive and comparative analyses were repeated with GP5 dichotomised as 0–2 (less than severe) vs 3–4 (severe) for sensitivity analyses. Due to the exploratory nature of the study, available case analysis was used. We also examined within-person side effect bother between discontinuation and follow-up by estimating the number and percentage of people in each trial in change categories. This analysis was limited to people with complete responses for both the discontinuation and follow-up visits. We categorised within-person change as follows: (1) improved to less than moderate (i.e., went from 2–4 to < 2); (2) stayed less than moderate (i.e., did not increase from < 2); (3) stayed moderate (i.e., did not decrease from > 2); and (4) worsened to moderate (i.e., went from < 2 to ≥ 2).
Completion rates were calculated in several ways. First, we calculated the percentage of patients with a visit at the time points of interest. The denominator for this was the patients ‘expected’ to have the visit. For discontinuation visits, the denominator was patients who had received treatment (e.g., safety population) and had discontinued treatment for progression or toxicity. For the follow-up visit after treatment discontinuation, the denominator was restricted to patients who were listed as not having died per the survival file (e.g., would be eligible for a survival follow-up visit). Furthermore, for the two renal cell carcinoma trials, we were able to calculate the time between the survival/last contact date and discontinuation date and only considered patients with a time greater than or equal to the follow-up assessment period (e.g., patients for whom a follow-up visit could have been expected to have occur by the time of data cut). The numerator for all calculations was the occurrence of a discontinuation or follow-up visit, which was ascertained by whether the value for that visit appeared in the questionnaire dataset.
Second, we calculated the percentage of patients with a non-missing GP5 item at the time points of interest, with the ‘expected’ population as the denominator. Third, to facilitate comparison across items, we calculated the percentage of patients with non-missing GP5 and other items of interest at the time points of interest, with the number of patients with that visit as the denominator. We were interested in contextualising the GP5 item completion rate at these time points given earlier findings that GP5 completion at baseline may be lower than other items [23]. Finally, we also examined item completion by reasons for discontinuation, classified as progression/toxicity.
Each trial was analysed separately. All analyses were performed in R Studio.
Population and Data Sources
This study is part of a larger study evaluating the measurement properties of a single item for overall side effect bother, the GP5 item. GP5 is described in more detail below (see Measures). For this study, we accessed commercial clinical cancer trial data from the Vivli data sharing platform. All trials in our study had received HREC/IRB approval.
For this analysis, we worked with data from three solid tumour phase III (two renal cell, one breast) trials with an end of treatment and post-treatment follow up visit listed in the questionnaire file and where there was a variable indicating treatment discontinuation at a specific treatment cycle. The two renal cell trials evaluated identical agents, but one was a first-line trial and one was not. The breast cancer trial compared a combination of agents compared to one agent.
We used de-identified secondary data for this study and obtained confirmation that HREC/IRB approval was not required from our institutional committees/boards (Northwestern University and the University of Tasmania).
Within each trial, the study population was the safety population (i.e., patients who had received treatment) and who had discontinued for either disease progression or toxicity. We did not include patients who discontinued due to death as they could not be included in the survival follow-up, and we anticipated potentially high levels of missing data for the end of treatment visit. Patients who discontinued for other reasons, such as withdrawal of consent, would also be unlikely to have visits at the time points of interest, and were not included in our study population.
Measures
We focused on five types of PRO items: (1) a side effect bother item (the primary PRO item of interest); (2) symptom items; (3) a bother item (not pertaining to side effect bother); (4) functional items; and (5) a global health item. Table 1 summarises the items and the data sources in which they are found for this analysis. Of note, the functional and global health items were only available for one data source (PALOMA) at the discontinuation visit only, and the bother item (not pertaining to side effect bother) was not available in two data sources (AXIS1, AXIS2). For the symptom items, two items (GP1, GP4) were available across all data sources and were the focus of the analysis for that broad PRO concept.
Analyses
Trials were described in terms of their clinical context (tumour type, agents being compared) and the scheduled end-of-treatment and first follow-up assessment as reported in their protocol. Population characteristics included age, baseline performance status (ECOG), and sex/gender (renal cell trials only, as the breast cancer trial was women-only).
For our analysis, we focused on three time points: (1) the end-of-treatment or discontinuation visit, at any point during the trial; (2) the discontinuation visit where it occurred earlier in the trial, defined here as within the first three cycles of treatment; and (3) the first follow-up visit after discontinuation. As there is relatively little known about the topic, we included an early discontinuation visit (as noted above, defined as within the first three cycles) to see if there were any differences in completion or severity.
To evaluate the extent of overall side effect impact at each time point, we used descriptive statistics (frequency and percentage [N, %]) to characterise GP5’s distribution, both overall and by reason for treatment discontinuation. Descriptive statistics (frequency and percentage) were used to characterise GP5 and all PRO items except the VAS, for which we used mean and standard deviation. The association between GP5 and the other measures described above was calculated in two ways. First, we evaluated the correlation at each time point using Spearman’s rho. Second, we evaluated dichotomised GP5 (less than moderate-severe: 0–1 vs moderate-severe: 2–4) against groups defined by symptom items and functional items through cross-tabulation. We focused on the items that were seen in all FACT measures (GP1, GP4); these were dichotomised as 0–1 (less than moderate-severe) vs 2–4 (moderate-severe). Where available, we also included the HI7 item (AXIS1, AXIS2 trials) and GP2 item (PALOMA trial). For the functional items in the EQ-5D, these were dichotomised as 1 (no problems) vs 2–3 (any problems). We expected that individuals with higher levels of symptoms, bother, or functional problems would be more likely to be in the moderate-severe (2–4) bother group. The descriptive and comparative analyses were repeated with GP5 dichotomised as 0–2 (less than severe) vs 3–4 (severe) for sensitivity analyses. Due to the exploratory nature of the study, available case analysis was used. We also examined within-person side effect bother between discontinuation and follow-up by estimating the number and percentage of people in each trial in change categories. This analysis was limited to people with complete responses for both the discontinuation and follow-up visits. We categorised within-person change as follows: (1) improved to less than moderate (i.e., went from 2–4 to < 2); (2) stayed less than moderate (i.e., did not increase from < 2); (3) stayed moderate (i.e., did not decrease from > 2); and (4) worsened to moderate (i.e., went from < 2 to ≥ 2).
Completion rates were calculated in several ways. First, we calculated the percentage of patients with a visit at the time points of interest. The denominator for this was the patients ‘expected’ to have the visit. For discontinuation visits, the denominator was patients who had received treatment (e.g., safety population) and had discontinued treatment for progression or toxicity. For the follow-up visit after treatment discontinuation, the denominator was restricted to patients who were listed as not having died per the survival file (e.g., would be eligible for a survival follow-up visit). Furthermore, for the two renal cell carcinoma trials, we were able to calculate the time between the survival/last contact date and discontinuation date and only considered patients with a time greater than or equal to the follow-up assessment period (e.g., patients for whom a follow-up visit could have been expected to have occur by the time of data cut). The numerator for all calculations was the occurrence of a discontinuation or follow-up visit, which was ascertained by whether the value for that visit appeared in the questionnaire dataset.
Second, we calculated the percentage of patients with a non-missing GP5 item at the time points of interest, with the ‘expected’ population as the denominator. Third, to facilitate comparison across items, we calculated the percentage of patients with non-missing GP5 and other items of interest at the time points of interest, with the number of patients with that visit as the denominator. We were interested in contextualising the GP5 item completion rate at these time points given earlier findings that GP5 completion at baseline may be lower than other items [23]. Finally, we also examined item completion by reasons for discontinuation, classified as progression/toxicity.
Each trial was analysed separately. All analyses were performed in R Studio.
Results
Results
Included Trials and Study Population
The time frame for end of treatment assessments were not always specified; when specified, it was usually within one month of the last dose of treatment (Table 1). Follow-up assessments were highly variable, ranging from 28 days after the last dose to 6 months. When specified, windows were 7 days (Table 2). In the renal cell carcinoma studies, which recruited across both sexes, nearly all (> 70%) participants were male and had baseline ECOG scores of 0 or 1. In all trials, progression was the primary reason for discontinuation, and discontinuation due to toxicity/adverse events (AE) was infrequent (9–14% of patients across trials).
GP5 and Other PROs at End of Treatment and Follow-Up
At the treatment discontinuation visit, the percentage of patients with GP5 scores of at least 2 (“moderate to severe bother”) ranged from 30 to 57% (Table 3). At discontinuation, at least 20% of patients reported problems of some kind as reported on EQ-5D-3L; lack of energy was reported by at least 50% of patients in all trials, and pain by at least 40% (Table 3). However, the average VAS score was at least 60/100 (Table 3). When early discontinuation was considered, the percentage of patients with GP5 scores of at least 2 was at least 50%; at least 50% reported scores of at least 2 for lack of energy and at least 50% reported had scores of 2 or higher for pain (Table 3). In the breast cancer trial, < 30% of patients had scores of at least 2 on hair loss bother at all time points, and < 20% had scores of at least 2 on nausea at all time points (Table 3). At follow-up, at least 20% of patients in both AXIS trials had scores of 3 or higher for GP1 (Lack of Energy), compared to 13% in the PALOMA trial. However, at least 30% in each trial had scores of 3 or higher for GP4 (Pain). At least 10% in each trial had scores of 3 or higher for GP5, though numbers were very small in the AXIS1 trial (n = 4) (Table 3).
Analyses considering scores of at least 3 on all FACIT/FACT items, including the GP5, showed that a much smaller proportion of patients had scores of 3 or 4 on these items. For example, at the discontinuation visit, 47.1% patients had scores of 2–4 on the GP5 item, but 15.3% had scores of 3–4 on the item (Table 3). This pattern was seen at other time points (Table 3). A similar pattern was seen for the other FACIT/FACT items (Table 3).
When within-person score change was examined, at least 20% patients in each trial improved to less than moderate (scores < 2) from scores of moderate-severe (scores 2–4). In all three trials, the category with the largest percentage of patients was ‘Stayed Less than Moderate.’ Relatively few patients worsened between discontinuation and follow-up, and > 15% in each trial had persistent moderate-severe bother (“stayed moderate”) (Fig. 1).
GP5: Correlation with Other Items
Correlations of at least 0.3 were seen between GP5 and most items at the discontinuation visit (Table 4). The exceptions were self-care (0.14 and 0.19 in two trials), pain (0.29 in one trial), usual activities (0.29 in one trial) and hair loss bother (0.29 in one trial). At the early discontinuation and follow-up visits, correlations were generally 0.2 or higher for most items (Table 4).
Presence of Problems, Global Health Scores and Symptom Severity Across GP5 Categories
Regardless of the time point, there were higher percentages of patients reporting ‘No Problems’ for usual activities or mobility when GP5 was categorised as 0–1 (less than moderate-severe) or 0–2 (less than severe). The results were similar but less pronounced for self-care, where there were still high percentages of patients with ‘No Problems’ for this item yet moderate-severe (2–4) or severe (3–4) bother on GP5 (Fig. 2; Table 5). The mean VAS score decreased steadily with each GP5 category. For example, for AXIS2 the mean VAS score at discontinuation was 68.50 (SD 18.45) for the GP5 category of 0–1, compared to 64.63 (SD = 19.32) for 0–2, 56.31 (SD = 20.56) for 2–4 and 51.88 (SD = 21.45) for 3–4 (Table 5). Similar results were seen for other time points and in the other trials.
A similar pattern was seen for less than moderate-severe (0–1) symptoms (GP1 Lack of Energy, GP4 Pain) (Fig. 3; Table 6). For example, the percentage of patients who had scores of 0–1 (less than moderate-severe) on GP1 Lack of Energy in PALOMA at discontinuation was 62.5% for GP5 0–1, compared to 54.8% for GP5 0–2, 18.5% for GP5 2–4 and 9.4% for GP5 3–4 (Table 6). Similar findings were seen for other symptom items such as HI7 Fatigue (Appendix Table 8), GP2 Nausea (Appendix Table 9), and the bother item B5 Hair Loss Bother (Appendix Table 9).
Visit Occurrence and GP5 Completion as a Percentage of ‘Expected’
In the three included trials, nearly all (> 80%) patients had a discontinuation visit when expected (Appendix Table 10). GP5 completion, when considering the ‘expected’ patients as the denominator, ranged from 70 to 80% for discontinuation visits at any time, but 56–80% for early discontinuation visits. Follow-up visit occurrence ranged from 42 to 93% of expected and GP5 completion, when considering the ‘expected’ patients as the denominator, was 22–69%.
GP5 and PRO Completion Rates
Completion rates were higher at discontinuation than follow-up for all studies (Table 7). For discontinuation visits at any time, completion rates for most items were at least 75% and 76–85% for GP5 (Table 7). Completion rates at early discontinuation visits ranged from at least 57% to 83% for most items, and 64–82% for GP5 (Table 7). At follow-up, completion rates for most items were 54–78%, and 52–78% for GP5 (Table 7).
At nearly all time points and for all items, completion rates were higher when patients discontinued for progression as opposed to toxicity (Appendix Table 11). The exception was the AXIS1 trial, however at most 10 patients discontinued for toxicity at these time points. Within reasons for discontinuation, completion rates were similar between items at all time points, with no evidence of lower GP5 completion (Appendix Table 11). The exception was AXIS2 at progression, where, among patients discontinuing for toxicity (n = 25), completion for GP5 was 44% (n = 11), compared to 52% (n = 13) (Appendix Table 11).
Included Trials and Study Population
The time frame for end of treatment assessments were not always specified; when specified, it was usually within one month of the last dose of treatment (Table 1). Follow-up assessments were highly variable, ranging from 28 days after the last dose to 6 months. When specified, windows were 7 days (Table 2). In the renal cell carcinoma studies, which recruited across both sexes, nearly all (> 70%) participants were male and had baseline ECOG scores of 0 or 1. In all trials, progression was the primary reason for discontinuation, and discontinuation due to toxicity/adverse events (AE) was infrequent (9–14% of patients across trials).
GP5 and Other PROs at End of Treatment and Follow-Up
At the treatment discontinuation visit, the percentage of patients with GP5 scores of at least 2 (“moderate to severe bother”) ranged from 30 to 57% (Table 3). At discontinuation, at least 20% of patients reported problems of some kind as reported on EQ-5D-3L; lack of energy was reported by at least 50% of patients in all trials, and pain by at least 40% (Table 3). However, the average VAS score was at least 60/100 (Table 3). When early discontinuation was considered, the percentage of patients with GP5 scores of at least 2 was at least 50%; at least 50% reported scores of at least 2 for lack of energy and at least 50% reported had scores of 2 or higher for pain (Table 3). In the breast cancer trial, < 30% of patients had scores of at least 2 on hair loss bother at all time points, and < 20% had scores of at least 2 on nausea at all time points (Table 3). At follow-up, at least 20% of patients in both AXIS trials had scores of 3 or higher for GP1 (Lack of Energy), compared to 13% in the PALOMA trial. However, at least 30% in each trial had scores of 3 or higher for GP4 (Pain). At least 10% in each trial had scores of 3 or higher for GP5, though numbers were very small in the AXIS1 trial (n = 4) (Table 3).
Analyses considering scores of at least 3 on all FACIT/FACT items, including the GP5, showed that a much smaller proportion of patients had scores of 3 or 4 on these items. For example, at the discontinuation visit, 47.1% patients had scores of 2–4 on the GP5 item, but 15.3% had scores of 3–4 on the item (Table 3). This pattern was seen at other time points (Table 3). A similar pattern was seen for the other FACIT/FACT items (Table 3).
When within-person score change was examined, at least 20% patients in each trial improved to less than moderate (scores < 2) from scores of moderate-severe (scores 2–4). In all three trials, the category with the largest percentage of patients was ‘Stayed Less than Moderate.’ Relatively few patients worsened between discontinuation and follow-up, and > 15% in each trial had persistent moderate-severe bother (“stayed moderate”) (Fig. 1).
GP5: Correlation with Other Items
Correlations of at least 0.3 were seen between GP5 and most items at the discontinuation visit (Table 4). The exceptions were self-care (0.14 and 0.19 in two trials), pain (0.29 in one trial), usual activities (0.29 in one trial) and hair loss bother (0.29 in one trial). At the early discontinuation and follow-up visits, correlations were generally 0.2 or higher for most items (Table 4).
Presence of Problems, Global Health Scores and Symptom Severity Across GP5 Categories
Regardless of the time point, there were higher percentages of patients reporting ‘No Problems’ for usual activities or mobility when GP5 was categorised as 0–1 (less than moderate-severe) or 0–2 (less than severe). The results were similar but less pronounced for self-care, where there were still high percentages of patients with ‘No Problems’ for this item yet moderate-severe (2–4) or severe (3–4) bother on GP5 (Fig. 2; Table 5). The mean VAS score decreased steadily with each GP5 category. For example, for AXIS2 the mean VAS score at discontinuation was 68.50 (SD 18.45) for the GP5 category of 0–1, compared to 64.63 (SD = 19.32) for 0–2, 56.31 (SD = 20.56) for 2–4 and 51.88 (SD = 21.45) for 3–4 (Table 5). Similar results were seen for other time points and in the other trials.
A similar pattern was seen for less than moderate-severe (0–1) symptoms (GP1 Lack of Energy, GP4 Pain) (Fig. 3; Table 6). For example, the percentage of patients who had scores of 0–1 (less than moderate-severe) on GP1 Lack of Energy in PALOMA at discontinuation was 62.5% for GP5 0–1, compared to 54.8% for GP5 0–2, 18.5% for GP5 2–4 and 9.4% for GP5 3–4 (Table 6). Similar findings were seen for other symptom items such as HI7 Fatigue (Appendix Table 8), GP2 Nausea (Appendix Table 9), and the bother item B5 Hair Loss Bother (Appendix Table 9).
Visit Occurrence and GP5 Completion as a Percentage of ‘Expected’
In the three included trials, nearly all (> 80%) patients had a discontinuation visit when expected (Appendix Table 10). GP5 completion, when considering the ‘expected’ patients as the denominator, ranged from 70 to 80% for discontinuation visits at any time, but 56–80% for early discontinuation visits. Follow-up visit occurrence ranged from 42 to 93% of expected and GP5 completion, when considering the ‘expected’ patients as the denominator, was 22–69%.
GP5 and PRO Completion Rates
Completion rates were higher at discontinuation than follow-up for all studies (Table 7). For discontinuation visits at any time, completion rates for most items were at least 75% and 76–85% for GP5 (Table 7). Completion rates at early discontinuation visits ranged from at least 57% to 83% for most items, and 64–82% for GP5 (Table 7). At follow-up, completion rates for most items were 54–78%, and 52–78% for GP5 (Table 7).
At nearly all time points and for all items, completion rates were higher when patients discontinued for progression as opposed to toxicity (Appendix Table 11). The exception was the AXIS1 trial, however at most 10 patients discontinued for toxicity at these time points. Within reasons for discontinuation, completion rates were similar between items at all time points, with no evidence of lower GP5 completion (Appendix Table 11). The exception was AXIS2 at progression, where, among patients discontinuing for toxicity (n = 25), completion for GP5 was 44% (n = 11), compared to 52% (n = 13) (Appendix Table 11).
Discussion
Discussion
Moderate-severe levels of side effect bother and symptoms (scores of 2 or higher out of 4) were seen by at least 20% of patients in follow-up, underscoring the potential usefulness of post-treatment data collection. Similar results were seen at the discontinuation visit, whether it was examined at any point in the trial or when discontinuation visits only in the first three months were considered. Although fewer patients had severe bother or symptoms (scores of 3 or higher out of 4), patients with moderate side effect bother may carry residual symptoms long after treatment discontinuation. Our study also demonstrated associations between the GP5 item and functioning, symptom, and global health PROs, providing further evidence in support of the item’s psychometric validity. PRO completion, including completion of the GP5 item, was variable across studies and lower at follow-up than end-of-treatment. There was no evidence of lower completion of the GP5 item, compared to other PRO items, at any of these time points.
Like previous studies, we found low rates of post-treatment patient-reported outcome measure (PROM) completion, particularly at follow-up visits [17, 20]. To the best of our knowledge, there are no standard completion rate calculation metrics for these visits. Not surprisingly, we found lower completion if we considered the denominator to be patients for whom the visit was ‘expected,’ rather than those who had the visit. There may be different reasons why patients do not attend discontinuation or follow-up visits, and it is not clear if the number of ‘expected’ is a reasonable denominator for calculating completion rates at these visits. Development of guidelines and standards would be beneficial and aid comparison across trials. In any case, our findings of comparatively lower completion at follow-up compared to discontinuation were robust regardless of the denominator used.
It is somewhat concerning, though perhaps not surprising, that completion rates were comparatively low for discontinuation due to toxicity compared to discontinuation due to progression. If patients who discontinued due to toxicity had higher levels of side effect bother, our findings may be underestimating the level of side effect impact at discontinuation and post-treatment. However, discontinuation due to toxicity does not necessarily mean that symptomatic adverse events were a key factor; for example, trial protocols may indicate discontinuation due to lab-based toxicity. The number of patients discontinuing due to toxicity was generally small across the trials, posing challenges for definitive conclusions.
As there is interest in capturing patient perspectives at the end of treatment, for example through exit interviews [24, 25] or evaluation of how valuable patients perceived treatment to be [26], working to improve PROM completion at discontinuation is worthwhile. Additionally, collection of the reasons for missing PRO data is particularly important after treatment [12]. Earlier work has identified that GP5 completion at baseline may be lower than that for other PROs [23]; however, we found no evidence of this at either the discontinuation or post-discontinuation visits.
Another consideration for discontinuation and post-discontinuation PROM collection is the timing of follow-up PRO assessments, as schedules may affect outcomes [27–29]. Consistent with the literature [13, 17], we found PRO assessment timing at and after discontinuation to be heterogeneous. Focusing on assessment at specific post-discontinuation time points [12] or following the pre-discontinuation assessment schedule until a new treatment is taken up [13] have been suggested. In any case, if PRO collection will be undertaken post-discontinuation, working with trialists to optimise completion can help in minimising missing data and maximising data value and interpretability. Furthermore, well-designed objectives can help determine the suitability of post-discontinuation completion as well as the appropriate assessment schedule.
In our study, we found persistent symptoms and side effect bother after treatment discontinuation. Attributing PROs to the treatment investigated may be challenging as the time since discontinuation increases [13]. Exacerbating this challenge may be the possible receipt of subsequent therapy by patients who have discontinued. Interpreting PRO data and particularly PRO tolerability data when subsequent treatments have commenced may be difficult. Furthermore, patients who may have discontinued trial therapy and identified another trial for their indication may be completing PROs for that trial, which could affect completion rates for post-discontinuation evaluation for the original trial. These complexities should be considered by constituents when planning measurement strategies, and a tailored approach can be used when it is known that therapy will likely have long term, irreversible side effects (such as neuropathy).
This study had several limitations. We used data from solid tumour commercial clinical trials, and thus our findings may not necessarily generalise to trials in other settings or patients in cancer clinical care. Although we excluded patients who left treatment due to death from our analysis, it is possible that patients may have died prior to the follow-up time point and thus the completion rates could be underestimated. The analyses presented in this paper use descriptive statistics, without attempts to mitigate missing data. As such, the analyses in the paper may be subject to bias due to missing data. Lastly, the symptom items selected (lack of energy, pain, fatigue, nausea) were not specified as disease-related or symptomatic side effects. In practice, such classification may be difficult for some such items; pain, for example, may be a disease-related symptom and a symptomatic side effect, depending on the indication.
Moderate-severe levels of side effect bother and symptoms (scores of 2 or higher out of 4) were seen by at least 20% of patients in follow-up, underscoring the potential usefulness of post-treatment data collection. Similar results were seen at the discontinuation visit, whether it was examined at any point in the trial or when discontinuation visits only in the first three months were considered. Although fewer patients had severe bother or symptoms (scores of 3 or higher out of 4), patients with moderate side effect bother may carry residual symptoms long after treatment discontinuation. Our study also demonstrated associations between the GP5 item and functioning, symptom, and global health PROs, providing further evidence in support of the item’s psychometric validity. PRO completion, including completion of the GP5 item, was variable across studies and lower at follow-up than end-of-treatment. There was no evidence of lower completion of the GP5 item, compared to other PRO items, at any of these time points.
Like previous studies, we found low rates of post-treatment patient-reported outcome measure (PROM) completion, particularly at follow-up visits [17, 20]. To the best of our knowledge, there are no standard completion rate calculation metrics for these visits. Not surprisingly, we found lower completion if we considered the denominator to be patients for whom the visit was ‘expected,’ rather than those who had the visit. There may be different reasons why patients do not attend discontinuation or follow-up visits, and it is not clear if the number of ‘expected’ is a reasonable denominator for calculating completion rates at these visits. Development of guidelines and standards would be beneficial and aid comparison across trials. In any case, our findings of comparatively lower completion at follow-up compared to discontinuation were robust regardless of the denominator used.
It is somewhat concerning, though perhaps not surprising, that completion rates were comparatively low for discontinuation due to toxicity compared to discontinuation due to progression. If patients who discontinued due to toxicity had higher levels of side effect bother, our findings may be underestimating the level of side effect impact at discontinuation and post-treatment. However, discontinuation due to toxicity does not necessarily mean that symptomatic adverse events were a key factor; for example, trial protocols may indicate discontinuation due to lab-based toxicity. The number of patients discontinuing due to toxicity was generally small across the trials, posing challenges for definitive conclusions.
As there is interest in capturing patient perspectives at the end of treatment, for example through exit interviews [24, 25] or evaluation of how valuable patients perceived treatment to be [26], working to improve PROM completion at discontinuation is worthwhile. Additionally, collection of the reasons for missing PRO data is particularly important after treatment [12]. Earlier work has identified that GP5 completion at baseline may be lower than that for other PROs [23]; however, we found no evidence of this at either the discontinuation or post-discontinuation visits.
Another consideration for discontinuation and post-discontinuation PROM collection is the timing of follow-up PRO assessments, as schedules may affect outcomes [27–29]. Consistent with the literature [13, 17], we found PRO assessment timing at and after discontinuation to be heterogeneous. Focusing on assessment at specific post-discontinuation time points [12] or following the pre-discontinuation assessment schedule until a new treatment is taken up [13] have been suggested. In any case, if PRO collection will be undertaken post-discontinuation, working with trialists to optimise completion can help in minimising missing data and maximising data value and interpretability. Furthermore, well-designed objectives can help determine the suitability of post-discontinuation completion as well as the appropriate assessment schedule.
In our study, we found persistent symptoms and side effect bother after treatment discontinuation. Attributing PROs to the treatment investigated may be challenging as the time since discontinuation increases [13]. Exacerbating this challenge may be the possible receipt of subsequent therapy by patients who have discontinued. Interpreting PRO data and particularly PRO tolerability data when subsequent treatments have commenced may be difficult. Furthermore, patients who may have discontinued trial therapy and identified another trial for their indication may be completing PROs for that trial, which could affect completion rates for post-discontinuation evaluation for the original trial. These complexities should be considered by constituents when planning measurement strategies, and a tailored approach can be used when it is known that therapy will likely have long term, irreversible side effects (such as neuropathy).
This study had several limitations. We used data from solid tumour commercial clinical trials, and thus our findings may not necessarily generalise to trials in other settings or patients in cancer clinical care. Although we excluded patients who left treatment due to death from our analysis, it is possible that patients may have died prior to the follow-up time point and thus the completion rates could be underestimated. The analyses presented in this paper use descriptive statistics, without attempts to mitigate missing data. As such, the analyses in the paper may be subject to bias due to missing data. Lastly, the symptom items selected (lack of energy, pain, fatigue, nausea) were not specified as disease-related or symptomatic side effects. In practice, such classification may be difficult for some such items; pain, for example, may be a disease-related symptom and a symptomatic side effect, depending on the indication.
Conclusions
Conclusions
In conclusion, we found that PRO completion at discontinuation and follow-up was variable and tended to be suboptimal at follow-up; variable completion was seen for all evaluated PROs and not specific to the GP5. Additionally, a subset of patients had persistent side effect bother at discontinuation and follow-up. Our findings of the association between GP5 and other PROs suggests that GP5 may be a helpful measure in capturing post-treatment tolerability in some cases. Improving PRO completion at discontinuation may enhance the usefulness of post-treatment tolerability data particularly when the long-term side effects of anti-cancer therapy need further evaluation.
In conclusion, we found that PRO completion at discontinuation and follow-up was variable and tended to be suboptimal at follow-up; variable completion was seen for all evaluated PROs and not specific to the GP5. Additionally, a subset of patients had persistent side effect bother at discontinuation and follow-up. Our findings of the association between GP5 and other PROs suggests that GP5 may be a helpful measure in capturing post-treatment tolerability in some cases. Improving PRO completion at discontinuation may enhance the usefulness of post-treatment tolerability data particularly when the long-term side effects of anti-cancer therapy need further evaluation.
출처: PubMed Central (JATS). 라이선스는 원 publisher 정책을 따릅니다 — 인용 시 원문을 표기해 주세요.
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- Prostate Cancer Care for Men with an Intellectual Disability: A Population-based Cohort Study of Symptoms, Diagnosis, Treatment, and Survival.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Association between polygenic risk scores and cardiovascular events in prostate cancer patients receiving androgen deprivation therapy in Han Chinese.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Comprehensive analysis of androgen receptor splice variant target gene expression in prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.