Evidence reviews for the clinical and cost-effectiveness of routine MRI or CT of the brain in the management of people with lung cancer prior to therapy with curative intent
Evidence review B
NICE Guideline, No. 122
Evidence reviews for the clinical and cost-effectiveness of routine MRI or CT of the brain in the management of people with lung cancer prior to radical therapy with curative intent
Review questions
RQ1.3: What is the clinical and cost-effectiveness of routine MRI or CT of the brain in the management of people with lung cancer prior to radical therapy with curative intent?
Introduction
This area was identified for review because observational data calculating the prevalence of brain metastases in people with various stages of NSCLC selected for treatment with curative intent has been published since the last guideline (O’Dowd 20141). This data enabled the effectiveness and cost-effectiveness of various imaging strategies to be calculated. The 2011 NICE lung cancer guideline recommended that MRI or CT scan should be considered before treatment with curative intent, especially for (patients otherwise thought to have) stage III NSCLC. MRI brain may be more accurate at detecting brain metastases compared to CT brain. However, there is reduced availability and increased cost for MRI compared to CT. The prevalence of brain metastases is likely to by population subgroup.
Table 1
PICO table.
Methods and process
This evidence review was developed using the methods and process described in Developing NICE guidelines: the manual (2014). Methods specific to this review question are described in the review protocol in appendix A, and the methods section in appendix B. In particular, the minimally important differences (MIDs) used in this review are summarised in appendix B.
There was a deviation from the protocol: for the diagnostic test accuracy outcomes (sensitivity and specificity), the population of interest was increased from NSCLC stages I-IIIA to include all stages (NSCLC stages IIIB and IV too). This was to obtain more accuracy data for imaging to detect brain metastases. Studies that only have participants with NSCLC stages I-IIIA are few and have relatively low numbers of participants. Where we used studies that included participants with stages IIIB and IV, we downgraded for indirectness.
A consultant neuroradiologist was co-opted onto the committee to provide advice and expertise for this research question. Declarations of interest were recorded according to NICE’s 2018 conflicts of interest policy.
Clinical evidence
Included studies
This review was conducted as part of a larger update of the NICE Lung cancer: diagnosis and management guideline (CG121). A systematic literature search for randomised controlled trials (RCTs), systematic reviews of RCTs and observational studies including cohort trials with a no date limit yielded 4,216 references.
Papers returned by the literature search were screened on title and abstract, with 24 full-text papers ordered as potentially relevant references.
Nine papers representing 9 unique studies were included after full text screening:
Table of included studies
For the search strategy, please see appendix C. For the clinical evidence study selection flowchart, see appendix D. For the full evidence tables and full GRADE profiles for included studies, please see appendix E and appendix F.
Excluded studies
Details of the studies excluded at full-text review are given in appendix H along with a reason for their exclusion.
Summary of clinical studies included in the evidence review
Outcomes and sample sizes
See full evidence tables and Grade profiles Appendix E and Appendix F.
Quality assessment of clinical studies included in the evidence review
See appendix F for full GRADE tables.
Economic evidence
Standard health economic filters were applied to the clinical search for this question, and a total of 401 citations was returned. Details of the literature search are provided in Appendix C. Following review of titles and abstracts, 2 full-text studies were retrieved for detailed consideration, of which none were included in our review.
Summary of original economic model
The de-novo cost-utility analysis developed for this guideline (see Appendix I for full details) included three strategies; no imaging (i.e. proceed straight to treatment with curative intent), imaging with CT brain, followed by MRI brain if positive and imaging with MRI brain. Patients in the model were divided into three categories; negative, positive with 1–3 brain metastases and positive with 4+ metastases. These were decided upon as the most clinically relevant patient groups. The model examined patients with NSCLC stage I, stage II and stage IIIA separately. Patients found to be negative exited the model because the tests were assumed (based on the evidence identified and the committee’s experience) to have a specificity of 100%. CT and MRI were also assumed to have a sensitivity of 100% for detecting 4+ metastases in the model’s base case. After imaging or no imaging, patients could therefore be true positive with 1–3 brain metastases, true positive with 4+, false negative with 1–3 or undetected with 4+. This final group only existed in the no imaging strategy in the base case. Following detection of brain metastases, radical treatments shifted from more to less invasive techniques and radical treatments were assumed to be used less frequently. Patients also received appropriate treatment for their brain metastases. After initial imaging and treatment, patients entered the long term part of the model where their overall and progression-free survival was modelled using data from relevant RCTs and cohort studies. Patients received indicated treatments upon progression and death.
The model found that imaging was not cost-effective in stage I NSCLC, that CT followed by MRI if positive could be cost-effective in stage II disease and MRI was the dominant strategy (the cheapest and most effective) in stage IIIA disease. These results were robust to plausible sensitivity and scenario analyses. The most important parameters in the model were the prevalence of brain metastases, the proportion of positives who had 4+ metastases and the extent to which the treatment plan was assumed to change following initial imaging.
Evidence statement
MRI brain
Diagnostic accuracy data: meta-analysis
Very low-quality evidence from 4 observational studies on 624 people with stage I to stage IV lung cancer considered for radical treatment found that for MRI brain the sensitivity was 94.1% (68.6 – 99.9) and the specificity was 99.9% (91.0 – 100.0).
Diagnostic accuracy data: Yokoi 1999
Very low-quality evidence from 1 observational study on 177 people with stage I to stage IV lung cancer considered for radical treatment found that for MRI brain the sensitivity was 50% (26.1 – 73.9) and the specificity was 99.7% (97.2 – 100). This data was excluded from the meta-analysis above due to clinical implausibility; the sensitivity was too low. This was a post hoc decision by the guideline committee.
Effectiveness data (change in treatment plan: initially operable people who had metastases detected by imaging)
Very low-quality evidence from 4 prospective cohort studies and 1 retrospective cohort study reporting data on 558 people with stage I to stage IIIA lung cancer considered for radical treatment found that the percentage who were found to have brain metastases using MRI brain ranged from 1.5% (CI 0.19% - 5.57%) to 21.4% (8.3% - 31%).
CT brain with contrast
Diagnostic accuracy data: meta-analysis
Very low-quality evidence from 3 observational studies on 418 people with stage I to stage IV lung cancer considered for radical treatment found that for CT brain the sensitivity was 74.6% (11.5 – 99.7) and the specificity was 99.7% (85.2 – 100.0).
Diagnostic accuracy data: Yokoi 1999
Very low-quality evidence from 1 observational study on 177 people with stage I to stage IV lung cancer considered for radical treatment found that for CT brain the sensitivity was 12.5% (2.9 – 40.2) and the specificity was 99.7% (96.8 – 100). This data was excluded from the meta-analysis above due to clinical implausibility; the sensitivity was too low. This was a post hoc decision by the guideline committee.
Effectiveness data (change in treatment plan: initially operable people who had metastases detected by imaging)
Very low-quality evidence from 1 prospective cohort study reporting data on 152 people with stage I to stage IIIA lung cancer considered for radical treatment found that the percentage who were found to have brain metastases using CT brain was 6.29% (2.92 – 11.6).
Health economics evidence statement
Evidence from one directly applicable health economic model with minor limitations developed for this guideline found that brain imaging was not cost-effective in patients with stage I NSCLC otherwise being considered for treatment with curative intent. The model found that a strategy of CT followed by MRI if positive was the most cost-effective for stage II disease at a threshold of £30,000/QALY and might have been cost-effective at a threshold of £20,000/QALY. MRI alone was the most cost-effective strategy in stage III disease.
The committee’s discussion of the evidence
Interpreting the evidence
The outcomes that matter most
The committee agreed that the outcome that matters most is not causing harm by offering treatment options with curative intent, particularly surgical options, in patients who have brain metastases. Radical treatment options for lung cancer are associated with risks, side effects, high healthcare resource use and are not expected to alter the prognosis of many people with brain metastases. Another important outcome is the potential benefit of being able to offer alternative treatments to patients who have brain metastases. Early identification and appropriate management may slow disease progression and increase overall survival.
The quality of the evidence
The quality of the evidence included in the clinical review was very low. The committee noted that there is no agreed gold standard for assessing the presence of brain metastases and therefore the data on sensitivity in the included studies is particularly unreliable. The original health economic model developed for this review question included a large amount of evidence of varying quality, including a large number of assumptions and extrapolations from indirect data but overall the committee considered it a robust analysis for decision making. This was because its conclusions for each disease stage were not sensitive to plausible variations in any of the input parameters. In the meta-analyses of sensitivity and specificity data for MRI and CT brain, we excluded Yokoi 1999 from the analysis because the sensitivity data in this study are implausible compared to the sensitivity of modern MRI and CT brain imaging.
Benefits and harms
Imaging of the brain for those being considered for surgery or radical radiotherapy should prevent the use of radical treatment options in some patients for whom it is not expected to be beneficial. In addition, patients found to have brain metastases could be considered for other treatments such as stereotactic radiosurgery or brain surgery, which would be expected to improve their prognosis although each treatment would carry its own risks and side effects. The committee agreed that some patients feel anxiety on undergoing MRI but agreed that the scan was a safe and highly accurate way to detect brain metastases.
Cost-effectiveness and resource use
The recommendations for this area were based on the health economic model developed for this update (see Appendix I). The economic model examined three strategies; no imaging, CT (followed by MRI if the CT was positive) and MRI alone in patients with stage I, stage II and stage IIIA NSCLC being considered for treatment with curative intent separately. Early identification of brain metastases within the model led to an increase in Quality Adjusted Life Years (QALYs) because earlier management of brain metastases led to slower rates of progression and higher overall survival. There were costs associated with the initial imaging and subsequent treatment of brain metastases but also some savings from patients receiving less radical treatment, particularly surgery. Broadly, there were two types of patients within the economic model, those with 1–3 brain metastases, many of whom would receive radical treatment for their primary tumour as well as their metastases, and those with 4+ metastases who were modelled to no longer receive radical treatment but to move to systemic therapy. The committee were aware there would be some exceptions to these groupings in practice but felt the split was clinically meaningful and that it was a distinction that had often been made in the evidence base. Because of the associated cost savings, the proportion of positive patients who have 4+ brain metastases was an important but uncertain parameter in the economic model. The committee noted this uncertainty in its interpretation of the evidence for different stages of NSCLC.
To calculate test outcomes in the model, a diagnostic test accuracy meta-analysis was undertaken. This found that the sensitivity of CT and MRI were 74% and 94% respectively and that both modalities had a specificity of ~100%. The committee thought this was reasonable, particularly in relation to MRI so there were no patients with a false diagnosis of brain metastases included in the economic model. While the prevalence would likely be affected by the mixed population in some of the studies, the committee did not think the sensitivity of the tests would be and understood that these values would be thoroughly tested in scenario analysis in the model.
The evidence on the prevalence of brain metastases within the model came from a retrospective cohort analysis that had extrapolated data on patients treated with curative intent who had subsequently developed brain metastases. The authors of this paper used tumour doubling times to calculate how many patients would have had detectable brain metastases at the time of their radical treatment. The committee understood the limitations of this kind of analysis but also considered it to be the best available source of evidence that was relevant to the decision problem. The paper reported the estimated prevalence for stages I, II and IIIA separately.
The base case ICERs for CT-MRI versus No Imaging and MRI versus CT-MRI in stage I patients were greater than £30,000/QALY gained. There were no sensitivity analyses that moved these values close to £20,000/QALY gained. This was primarily because of the low prevalence of brain metastases in Stage I patients. The committee also noted that for every 100 MRI scans performed, only 3 patients would be found positive for brain metastases. They therefore decided that it was highly unlikely that imaging in Stage I represented a cost-effective use of NHS resources.
The base case ICERs for CT-MRI versus No Imaging and MRI versus CT-MRI in stage II patients were £21,000/QALY and £48,000/QALY respectively. There were no plausible sensitivity analyses that made MRI cost-effective compared to CT-MRI. The primary reasons for these findings are that CT was assumed to have very good sensitivity for identifying patients who have 4+ brain metastases and these patients are the most important in the cost-effectiveness calculations within the model because they are no longer likely to receive radical treatment, leading to significant cost savings. The committee noted that only a small number of people with 1–3 brain metastases would be missed on initial CT that might have been detected had MRI been the first test. They therefore decided to recommend a strategy of CT, followed by MRI if positive in the Stage II NSCLC population.
For stage IIIA patients, MRI was the dominant strategy (it was both cheaper and more effective) and remained either dominant or the most cost-effective strategy in all plausible sensitivity analyses. This is because all stage IIIA patients found to be positive for brain metastases are highly unlikely to receive radical treatment, leading to significant cost savings in the model. These savings, coupled with the relatively high prevalence of brain metastases and the clinical benefits of early diagnosis mean that the most sensitive test, MRI, is the most cost-effective.
The committee noted a number of limitations in the economic model relating to its data inputs and assumptions but also noted the findings were robust to all plausible sensitivity analyses and were therefore confident that it was reliable as the basis for decision making for this review question.
Other factors the committee took into account
The committee was aware that there are pressures on imaging services, particularly MRI scanners and that some patients prefer not to receive MRI scans but agreed that these considerations should not affect the recommendations. Some of the evidence that underpinned the health economic model was of low quality or based on committee assumption. In particular, they considered that due to the non-contemporary nature of the studies, the sensitivity of CT and MRI are likely to be underestimated with the use of thin collimation and volumetric imaging having improved the accuracy of both modalities in recent years. The committee was satisfied that these concerns had been addressed by an extensive range of sensitivity analyses. The main evidence for the prevalence of brain metastases came from a paper where the population of interest had not received contrast enhanced PET-CT as part of their staging. The committee acknowledged that in centres where contrast enhanced PET-CT is routine, the prevalence of brain metastases in the population of interest might be lower. While the specificity of MRI was thought to be 100% as regards brain metastases from lung cancer, the committee noted that several differential diagnoses such as infection and primary brain tumour might be detected by the scan. They considered this an ancillary benefit of imaging.
For recommendations 1.3.23, 1.3.24, and 1.3.25 the committee agreed that ‘clinical stage’ should be written, rather than ‘stage’. This is to ensure that healthcare professionals understand that the brain imaging should be performed before surgery. After surgery, a pathologist is able to confirm the stage. Other recommendations do not require this clarification because it is normally obvious to clinicians whether a stage is clinical or not.
For recommendation 1.3.25, the committee agreed that the stage should be clinical stage III, which includes IIIA and IIIB. This is because some people with stage IIIB disease will receive radical radiotherapy and it is highly likely that MRI brain would be just as cost-effective in these patients as in patients with stage IIIA NSCLC.
Appendix A. Review protocols
Review protocol for the clinical and cost-effectiveness of routine MRI or CT of the brain in the management of people with lung cancer prior to radical therapy with curative intent?
Table
Diagnostic sensitivity and specificity Staging sensitivity and specificity
Appendix B. Methods
Priority screening
The reviews undertaken for this guideline all made use of the priority screening functionality with the EPPI-reviewer systematic reviewing software. This uses a machine learning algorithm (specifically, an SGD classifier) to take information on features (1, 2 and 3 word blocks) in the titles and abstract of papers marked as being ‘includes’ or ‘excludes’ during the title and abstract screening process, and re-orders the remaining records from most likely to least likely to be an include, based on that algorithm. This re-ordering of the remaining records occurs every time 25 additional records have been screened.
- Research is currently ongoing as to what are the appropriate thresholds where reviewing of abstract can be stopped, assuming a defined threshold for the proportion of relevant papers it is acceptable to miss on primary screening. As a conservative approach until that research has been completed, the following rules were adopted during the production of this guideline:
- In every review, at least 50% of the identified abstract (or 1,000 records, if that is a greater number) were always screened.
- After this point, screening was only terminated when the threshold was reached for a number of abstracts being screened without a single new include being identified. This threshold was set according to the expected proportion of includes in the review (with reviews with a lower proportion of includes needing a higher number of papers without an identified study to justify termination), and was always a minimum of 250.
- A random 10% sample of the studies remaining in the database when the threshold were additionally screened, to check if a substantial number of relevant studies were not being correctly classified by the algorithm, with the full database being screened if concerns were identified.
- As an additional check to ensure this approach did not miss relevant studies, the included studies lists of included systematic reviews were searched to identify any papers not identified through the primary search.
Evidence synthesis and meta-analyses
Where possible, meta-analyses were conducted to combine the results of studies for each outcome. For mean differences, where change from baseline data were reported in the studies and were accompanied by a measure of spread (for example standard deviation), these were extracted and used in the meta-analysis. Where measures of spread for change from baseline values were not reported, the corresponding values at study end were used and were combined with change from baseline values to produce summary estimates of effect. All studies were assessed to ensure that baseline values were balanced across the treatment/comparison groups; if there were significant differences in important confounding variables at baseline these studies were not included in any meta-analysis and were reported separately.
When averages were given as medians, we presented them as they were.
Evidence of effectiveness of interventions
Quality assessment
Individual RCTs and quasi-randomised controlled trials were quality assessed using the Cochrane Risk of Bias Tool. Cohort studies were quality assessed using the CASP cohort study checklist. Each individual study was classified into one of the following three groups:
- Low risk of bias – The true effect size for the study is likely to be close to the estimated effect size.
- Moderate risk of bias – There is a possibility the true effect size for the study is substantially different to the estimated effect size.
- High risk of bias – It is likely the true effect size for the study is substantially different to the estimated effect size.
Each individual study was also classified into one of three groups for directness, based on if there were concerns about the population, intervention, comparator and/or outcomes in the study and how directly these variables could address the specified review question. Studies were rated as follows:
- Direct – No important deviations from the protocol in population, intervention, comparator and/or outcomes.
- Partially indirect – Important deviations from the protocol in one of the population, intervention, comparator and/or outcomes.
- Indirect – Important deviations from the protocol in at least two of the following areas: population, intervention, comparator and/or outcomes.
Methods for combining intervention evidence
Meta-analyses of interventional data were conducted with reference to the Cochrane Handbook for Systematic Reviews of Interventions (Higgins et al. 2011).
Where different studies presented continuous data measuring the same outcome but using different numerical scales (e.g. a 0–10 and a 0–100 visual analogue scale), these outcomes were all converted to the same scale before meta-analysis was conducted on the mean differences. Where outcomes measured the same underlying construct but used different instruments/metrics, data were analysed using standardised mean differences (Hedges’ g).
A pooled relative risk was calculated for dichotomous outcomes (using the Mantel–Haenszel method). Both relative and absolute risks were presented, with absolute risks calculated by applying the relative risk to the pooled risk in the comparator arm of the meta-analysis.
Fixed- and random-effects models (der Simonian and Laird) were fitted for all syntheses, with the presented analysis dependent on the degree of heterogeneity in the assembled evidence. Fixed-effects models were the preferred choice to report, but in situations where the assumption of a shared mean for fixed-effects model were clearly not met, even after appropriate pre-specified subgroup analyses were conducted, random-effects results are presented. Fixed-effects models were deemed to be inappropriate if one or both of the following conditions was met:
- Significant between study heterogeneity in methodology, population, intervention or comparator was identified by the reviewer in advance of data analysis. This decision was made and recorded before any data analysis was undertaken.
- The presence of significant statistical heterogeneity in the meta-analysis, defined as I2≥50%.
In any meta-analyses where some (but not all) of the data came from studies at high risk of bias, a sensitivity analysis was conducted, excluding those studies from the analysis. Results from both the full and restricted meta-analyses are reported. Similarly, in any meta-analyses where some (but not all) of the data came from indirect studies, a sensitivity analysis was conducted, excluding those studies from the analysis.
Meta-analyses were performed in Cochrane Review Manager v 5.3.
Minimal clinically important differences (MIDs)
The Core Outcome Measures in Effectiveness Trials (COMET) database was searched to identify published minimal clinically important difference thresholds relevant to this guideline. However, no relevant MIDs were found. In addition, the Guideline Committee were asked to specify any outcomes where they felt a consensus MID could be defined from their experience. The committee agreed that they could not specify any MIDs. Because all studies were cohort studies without a comparator, none of the studies had a line of no effect with which to rate imprecision. Therefore, imprecision was rated according to number of participants. If the number of participants in one arm was 40 or less, the committee agreed that the imprecision would most likely be serious. If the number of participants in one arm was 25 or less, the committee agreed that the imprecision would most likely be very serious.
GRADE for pairwise meta-analyses of interventional evidence
GRADE was used to assess the quality of evidence for the selected outcomes as specified in ‘Developing NICE guidelines: the manual (2014)’. Data from RCTs was initially rated as high quality and the quality of the evidence for each outcome was downgraded or not from this initial point. If non-RCT evidence was included for intervention-type systematic reviews then these were initially rated as either moderate quality (quasi-randomised studies) or low quality (cohort studies) and the quality of the evidence for each outcome was further downgraded or not from this point, based on the criteria given in Table 4. The committee agreed that the outcomes of cohort studies with one arm (no comparator) would be described using a narrative synthesis.
Table 2. Rationale for downgrading quality of evidence for intervention studies
The quality of evidence for each outcome was upgraded if any of the following five conditions were met:
- Data from non-randomised studies showing an effect size sufficiently large that it cannot be explained by confounding alone.
- Data showing a dose-response gradient.
- Data where all plausible residual confounding is likely to increase our confidence in the effect estimate.
Publication bias
Publication bias was assessed in two ways. First, if evidence of conducted but unpublished studies was identified during the review (e.g. conference abstracts, trial protocols or trial records without accompanying published data), available information on these unpublished studies was reported as part of the review. Secondly, where 10 or more studies were included as part of a single meta-analysis, a funnel plot was produced to graphically assess the potential for publication bias.
Evidence statements
Evidence statements for pairwise intervention data are classified in to one of four categories:
- Situations where the data are only consistent, at a 95% confidence level, with an effect in one direction (i.e. one that is ‘statistically significant’), and the magnitude of that effect is most likely to meet or exceed the MID (i.e. the point estimate is not in the zone of equivalence). In such cases, we state that the evidence showed that there is an effect.
- Situations where the data are only consistent, at a 95% confidence level, with an effect in one direction (i.e. one that is ‘statistically significant’), but the magnitude of that effect is most likely to be less than the MID (i.e. the point estimate is in the zone of equivalence). In such cases, we state that the evidence could not demonstrate a meaningful difference.
- Situations where the data are consistent, at a 95% confidence level, with an effect in either direction (i.e. one that is not ‘statistically significant’) but the confidence limits are smaller than the MIDs in both directions. In such cases, we state that the evidence demonstrates that there is no difference.
- In all other cases, we state that the evidence could not differentiate between the comparators.
Diagnostic test accuracy evidence
In this guideline, diagnostic test accuracy (DTA) data are classified as any data in which a test result or the output of an algorithm – is observed in some people who have the condition of interest at the time of the test and some people who do not. Such data either explicitly provide, or can be manipulated to generate, a 2x2 classification of true positives and false negatives (in people who, according to the reference standard, truly have the condition) and false positives and true negatives (in people who, according to the reference standard, do not).
The ‘raw’ 2x2 data can be summarised in a variety of ways. Those that were used for decision making in this guideline are as follows:
- Sensitivity is the probability that the feature will be positive in a person with the condition.
- sensitivity = TP/(TP+FN)
- Specificity is the probability that the feature will be negative in a person without the condition.
- specificity = TN/(TN+FP)
Meta-analysis of diagnostic test accuracy was undertaken for this guideline using univariate random effects models, which were effectively four simple meta-analyses of a proportion. We were unable to fit a bivariate model due to having a small number of studies for both CT and MRI. Bayesian methods were chosen in order to handle zero-cells without the need for a continuity correction with vague prior distributions being assigned to sensitivity and specificity for the two tests. Random effects models were preferred based on DIC being more than 3–5 points lower for sensitivity and because of heterogeneity in study populations, methods and settings. While the DIC for the random effects model for specificity was not 3–5 points lower, it was still preferred due to heterogeneity in study populations, methods and settings. Further details can be found in Appendix F (GRADE tables), Appendix I (Cost-utility analysis) and Appendix K (WinBUGS code).
Quality assessment
Individual studies were quality assessed using the QUADAS-2 tool, which contains four domains: patient selection, index test, reference standard, and flow and timing. Each individual study was classified into one of the following two groups:
- Low risk of bias – Evidence of non-serious bias in zero or one domain.
- Moderate risk of bias – Evidence of non-serious bias in two domains only, or serious bias in one domain only.
- High risk of bias – Evidence of bias in at least three domains, or of serious bias in at least two domains.
Each individual study was also classified into one of three groups for directness, based on if there were concerns about the population, index features and/or reference standard in the study and how directly these variables could address the specified review question. Studies were rated as follows:
- Direct – No important deviations from the protocol in population, index feature and/or reference standard.
- Partially indirect – Important deviations from the protocol in one of the population, index feature and/or reference standard.
- Indirect – Important deviations from the protocol in at least two of the population, index feature and/or reference standard.
Modified GRADE for diagnostic test accuracy evidence
GRADE has not been developed for use with diagnostic studies; therefore a modified approach was applied using the GRADE framework. GRADE assessments were only undertaken for sensitivity and specificity. This is because the committee agreed that these two measurements are the ones that that matter most to clinicians and people with stage I to stage IIIA lung cancer being considered for radical treatment. GRADE quality ratings were calculated using the same criteria as for intervention studies, given in Table 4. Neither sensitivity nor specificity have a line of no effect with which to rate imprecision. Therefore, imprecision was rated according to number of participants. If the number of participants in one arm was 40 or less, the committee agreed that the imprecision would most likely be serious. If the number of participants in one arm was 25 or less, the committee agreed that the imprecision would most likely be very serious.
Appendix C. Literature search strategies
Scoping search strategies
Scoping searches Scoping searches were undertaken on the following websites and databases (listed in alphabetical order) in April 2017 to provide information for scope development and project planning. Browsing or simple search strategies were employed.
Clinical search literature search strategy
Main searches
Bibliographic databases searched for the guideline
- Cochrane Database of Systematic Reviews – CDSR (Wiley)
- Cochrane Central Register of Controlled Trials – CENTRAL (Wiley)
- Database of Abstracts of Reviews of Effects – DARE (Wiley)
- Health Technology Assessment Database – HTA (Wiley)
- EMBASE (Ovid)
- MEDLINE (Ovid)
- MEDLINE Epub Ahead of Print (Ovid)
- MEDLINE In-Process (Ovid)
Identification of evidence for review questions
The searches were conducted between October 2017 and April 2018 for 9 review questions (RQ).
Searches were re-run in May 2018.
Where appropriate, in-house study design filters were used to limit the retrieval to, for example, randomised controlled trials. Details of the study design filters used can be found in section 3.
Search strategy
Table
Medline Strategy, searched 13th February 2018 Database: Ovid MEDLINE(R) 1946 to Present with Daily Update
Study Design Filters
Health Economics literature search strategy
Sources searched to identify economic evaluations
- NHS Economic Evaluation Database – NHS EED (Wiley) last updated Apr 2015
- Health Technology Assessment Database – HTA (Wiley) last updated Oct 2016
- Embase (Ovid)
- MEDLINE (Ovid)
- MEDLINE In-Process (Ovid)
Search filters to retrieve economic evaluations and quality of life papers were appended to the review question search strategies. For some health economics strategies additional terms were added to the original review question search strategies (see sections 4.2, 4.3 and 4.4) The searches were conducted between October 2017 and April 2018 for 9 review questions (RQ).
Searches were re-run in May 2018.
Searches were limited to those in the English language. Animal studies were removed from results.
Economic evaluation and quality of life filters
Health economics search strategy
Table
Medline Strategy, searched 5th December 2017 Database: Ovid MEDLINE(R) 1946 to Present with Daily Update
Appendix D. Evidence study selection
Appendix E. Clinical evidence tables
Download PDF (583K)
Appendix F. GRADE tables
Brain MRI: intervention evidence: operable people who had metastases detected by MRI brain
Brain MRI: intervention evidence: change in staging for people who were operable
Brain CT: intervention evidence: operable people who had metastases detected by CT brain
Brain CT: intervention evidence: change in staging for people who were operable
Diagnostic accuracy evidence: meta-analysis
Diagnostic accuracy evidence: Yokoi 1999
Appendix G. Excluded Studies
Excluded clinical studies
Excluded economic studies
Appendix H. References
Clinical Studies - Included
- Earnest F, Ryu J H, Miller G M, Luetmer P H, Forstrom L A, Burnett O L, Rowland C M, Swensen S J, and Midthun D E (1999) Suspected non-small cell lung cancer: incidence of occult brain and skeletal metastases and effectiveness of imaging for detection--pilot study. Radiology 211(1), 137–45 [PubMed: 10189463]
- Hochstenbag M M, Twijnstra A, Hofman P, Wouters E F, ten Velde, and G P (2003) MR-imaging of the brain of neurologic asymptomatic patients with large cell or adenocarcinoma of the lung. Does it influence prognosis and treatment?. Lung Cancer 42(2), 189–93 [PubMed: 14568686]
- Kim S Y, Kim J S, Park H S, Cho M J, Kim J O, Kim J W, Song C J, Lim S P, and Jung S S (2005) Screening of brain metastasis with limited magnetic resonance imaging (MRI): clinical implications of using limited brain MRI during initial staging for non-small cell lung cancer patients. Journal of Korean Medical Science 20(1), 121–6 [PMC free article: PMC2808557] [PubMed: 15716616]
- Yohena T, Yoshino I, Kitajima M, Uehara T, Kanematsu T, Teruya T, Ikeda J, and Ichinose Y (2004) Necessity of preoperative screening for brain metastasis in non-small cell lung cancer patients without lymph node metastasis. Annals of Thoracic & Cardiovascular Surgery 10(6), 347–9 [PubMed: 15658906]
Clinical studies – Excluded
- Axelsson R, Bach-Gansmo T, Castell-Conesa J, McParland B J, and Study Group (2010) An open-label, multicenter, phase 2a study to assess the feasibility of imaging metastases in late-stage cancer patients with the alpha v beta 3-selective angiogenesis imaging agent 99mTc-NC100692. Acta Radiologica 51(1), 40–6 [PubMed: 20001475]
- de Cos Escuín, J S, Menna D M, González M A, Quirantes J Z, Vicente C D, and Calvo M C (2007) Silent brain metastasis in the initial staging of lung cancer: evaluation by computed tomography and magnetic resonance imaging. Arch Bronconeumol 43, 386–91 [PubMed: 17663891]
- Ferrigno D, and Buccheri G (1994) Cranial computed tomography as a part of the initial staging procedures for patients with non-small-cell lung cancer. Chest 106(4), 1025–9 [PubMed: 7924469]
- Hudson Z, Internullo E, Edey A, Laurence I, Bianchi D, and Addeo A (2017) Brain imaging before primary lung cancer resection: a controversial topic. Ecancermedicalscience 11, 749 [PMC free article: PMC5493439] [PubMed: 28717395]
- Kormas P, Bradshaw J R, and Jeyasingham K (1992) Preoperative computed tomography of the brain in non-small cell bronchogenic carcinoma. Thorax 47(2), 106–8 [PMC free article: PMC463584] [PubMed: 1549816]
- Lahde S, Paivansalo M, Rainio P, Merikanto J, and Karkola P (1990) Assessing resectability of lung cancer: the role of computed tomography of the mediastinum, upper abdomen and head. European Journal of Radiology 10(1), 48–55 [PubMed: 2311606]
- Lee H Y, Lee K S, Kim B T, Cho Y S, Lee E J, Yi C A, Chung M J, Kim T S, Kwon O J, and Kim H (2009) Diagnostic efficacy of PET/CT plus brain MR imaging for detection of extrathoracic metastases in patients with lung adenocarcinoma. Journal of Korean Medical Science 24(6), 1132–8 [PMC free article: PMC2775863] [PubMed: 19949671]
- Li Y, Jin G, and Su D (2017) Comparison of Gadolinium-enhanced MRI and 18FDG PET/PET-CT for the diagnosis of brain metastases in lung cancer patients: A meta-analysis of 5 prospective studies. Oncotarget 8(22), 35743–35749 [PMC free article: PMC5482613] [PubMed: 28415747]
- Mujoomdar A, Austin J H. M, Malhotra R, Powell C A, Pearson G D. N, Shiau M C, and Raftopoulos H (2007) Clinical predictors of metastatic disease to the brain from non-small cell lung carcinoma: Primary tumor size, cell type, and lymph node metastases. Radiology 242(3), 882–888 [PubMed: 17229875]
- Na II, Lee T H, Choe D H, Cheon G J, Kim C H, Koh J S, Baek H, Ryoo B Y, Yang S H, and Lee J C (2008) A diagnostic model to detect silent brain metastases in patients with non-small cell lung cancer. European Journal of Cancer 44(16), 2411–7 [PubMed: 18707868]
- O’Dowd E L, Kumaran M, Anwar S, Palomo B, and Baldwin D R (2014) Brain metastases following radical surgical treatment of non-small cell lung cancer: is preoperative brain imaging important?. Lung Cancer 86(2), 185–9 [PubMed: 25239395]
- Ohno Y (2007) Whole-body MR imaging vs. FDG-PET:comparison of accuracy of M-stage diagnosis for lung cancer patients. J Magn Reson Imaging 26(3), 498–509 [PubMed: 17729341]
- Ohno Y (2008) Non-small cell lung cancer: whole-body MR examination for M-stage assessment-utility for whole-body diffusion-weighted imaging compared with integrated FDG PET/CT. Radiology 248(2), 643–54 [PubMed: 18539889]
- Plathow C (2008) Positron emission tomography/computed tomography and whole-body magnetic resonance imaging in staging of advanced nonsmall cell lung cancer-initial results. Invest Radiol 43(5), 290–7 [PubMed: 18424949]
- Salbeck R, Grau H C, and Artmann H (1990) Cerebral tumor staging in patients with bronchial carcinoma by computed tomography. Cancer 66(9), 2007–11 [PubMed: 2224799]
- Salvatierra A, Baamonde C, Llamas J M, Cruz F, and Lopez-Pujol J (1990) Extrathoracic staging of bronchogenic carcinoma. Chest 97(5), 1052–8 [PubMed: 2184992]
- Seute T, Leffers P, ten Velde, G P, and Twijnstra A (2008) Detection of brain metastases from small cell lung cancer: consequences of changing imaging techniques (CT versus MRI). Cancer 112(8), 1827–34 [PubMed: 18311784]
- Suzuki K, Yamamoto M, Hasegawa Y, Ando M, Shima K, Sako C, Ito G, and Shimokata K (2004) Magnetic resonance imaging and computed tomography in the diagnoses of brain metastases of lung cancer. Lung Cancer 46(3), 357–60 [PubMed: 15541821]
- van de Pol, M, van Oosterhout, A G, Wilmink J T, ten Velde, G P, and Twijnstra A (1996) MRI in detection of brain metastases at initial staging of small-cell lung cancer. Neuroradiology 38(3), 207–10 [PubMed: 8741187]
- Yokoi K, Kamiya N, Matsuguma H, Machida S, Hirose T, Mori K, and Tominaga K (1999) Detection of brain metastasis in potentially operable non-small cell lung cancer: a comparison of CT and MRI. Chest 115(3), 714–9 [PubMed: 10084481]
Health Economic studies – Included
None
Health Economic studies – Excluded
- Colice, G.L., Birkmeyer, J.D., Black, W.C., Littenberg, B. and Silvestri, G., 1995. Cost-effectiveness of head CT in patients with lung cancer without clinical evidence of metastases. Chest, 108(5), pp.1264–1271. [PubMed: 7587427]
- Wernicke, A. Gabriella, Menachem Z. Yondorf, Bhupesh Parashar, Dattatreyudu Nori, KS Clifford Chao, John A. Boockvar, Susan Pannullo, Philip Stieg, and Theodore H. Schwartz. “The cost-effectiveness of surgical resection and cesium-131 intraoperative brachytherapy versus surgical resection and stereotactic radiosurgery in the treatment of metastatic brain tumors.” Journal of neuro-oncology 127, no. 1 (2016): 145–153. [PubMed: 26725100]
Appendix I. Cost-utility analysis
Background
Brain metastases (BM) are a frequent complication from non-small cell lung cancer (NSCLC) but routine imaging of the brain is not undertaken, especially in early stage disease. The 2011 guideline included a recommendation to “Consider MRI or CT of the head in patients selected for treatment with curative intent, especially in stage III disease” but it is not known how widely this guidance is implemented in UK practice or whether practice differs by cancer stage. Detecting BM prior to treatment with curative intent is valuable as it may alter the treatment plan. For example, patients initially indicated for surgery may be switched to less invasive treatment as the chance for cure is greatly reduced if they are found to have BM. Early detection of BM may also lead to better outcomes for patients in that they may be able to receive BM-specific treatment that will better their prognosis.
The prevalence of BM is thought to be relatively low in patients with early stage NSCLC and, given that CT and MRI have limited availability, the committee were interested in examining the cost-effectiveness of routine imaging separately in patients with stage I, II and III disease. An important motivator for the inclusion of this review question in the guideline update was the publication of the O’dowd 2014 paperb, which tried to estimate the prevalence of BM in the population of interest.
Methods
Population, interventions/comparators and outcomes
The populations in the model are patients with stage I, II and III NSCLC who are otherwise selected for treatment with curative intent; either surgery or radical radiotherapy to the lung. These patients have already received the standard lung cancer staging investigations of chest CT, whole body non-contrast-enhanced PET-CT and any necessary biopsy procedures. The cancer stage is expected to be correct in all respects except for the potential for occult BM. Patients in the model are either negative for BM, are positive with 1–3 BM or are positive with 4+ BM. The distinction is clinically important in that patients with 1–3 BM often receive radical brain treatment and then may go on to receive radical treatment to their lung whereas patients with 4+ BM receive treatment that is systemic and palliative in nature.
The strategies examined in the model were No Imaging (i.e. straight to radical treatment), CT of the brain followed by MRI if positive and MRI of the brain alone. Outcomes were measured in quality adjusted life years (QALYs).
Model Structure
This section is intended to give a structural overview of the model and its underpinning assumptions. Derivation of parameters is discussed in the Model Parameters section.
Short Term Model
The model begins with a series of decision trees which determine the results of the diagnostic tests undertaken on 1,000 theoretical patients. Following this, patients have the potential to be either True Negative (TN), True Positive (1–3), True Positive (4+), False Positive (1–3) or False Negative (4+). There are no TPs or FPs in the No Imaging strategy as no test has taken place. In Figure 1, p is the prevalence of BM, pr(1–3) is the proportion of patients with BM that have 1–3 BM, pr(4+) is the proportion of patients with BM that have 4+ BM, seMRI and spMRI are the sensitivity and specificity of MRI. Sensitivity is expected to be higher for patients with 4+ BM.
Figure 1. Diagnostic Decision Trees (No Imaging and MRI only Strategies)
In Figure 2, p is the prevalence of BM, pr(1–3) is the proportion of patients with BM that have 1–3 BM, pr(4+) is the proportion of patients with BM that have 4+ BM, sect, seMRI, spCT and spMRI are the sensitivity and specificity of CT and MRI. Sensitivity is expected to be higher for patients with 4+ BM. Patients who are identified as positive (4+) do not receive a confirmatory MRI in the base case analysis.
Figure 2. Diagnostic Decision Tree for CT-MRI Strategy
Following initial imaging, those patients who are found to be negative receive the treatment with curative intent that they had initially been indicated for (comprising various types of surgery and radical radiotherapy). Many of the patients who are found to be positive (1–3) receive radical treatment for both their brain metastases and on their lung. Patients who are found to be positive (4+) are assumed to receive treatments that are systemic or palliative in nature rather than radical. The exact breakdown of these treatments is discussed in the parameters section of this report. An important assumption of this analysis is that specific treatments do not affect patients’ prognoses. The reason for this assumption is that both the patient group under study and their treatment options are very heterogeneous so the model would have quickly become unmanageably complicated and would have required a large number of parameters for which data do not exist. We therefore chose to model broad groups of patients for whom robust data do exist based on the outcomes of the diagnostic tests.
We assume that the sensitivity and specificity of MRI, when used in the MRI alone strategy is the same when used on the patient population who have been confirmed as positive with CT scanning. The committee indicated this assumption was reasonable. Another important assumption of the model is that the testing strategies do not generate any genuine False Positives. This is because the specificity of MRI for detecting brain metastases was found to be ~100% in the clinical review. The committee stated that they believe this to be true; while evidence from the clinical review showed MRI scans identifying phenomena that mimicked lesions such as “flow related enhancements” and may detect differential diagnoses such as primary brain tumours or infections, the committee were of the view that MRI would not falsely detect brain metastases and that a patient would not be managed as if they had brain metastases when they did not. While there might be the odd highly unusual contradiction to this assumption, for the purposes of the model it was reasonable to assume there were no False Positives.
If there are no False Positives then there is no value in modelling True Negatives as the numbers will not differ by strategy. Therefore all True Negative patients exit the model after initial imaging. Included in the True Negative patients exiting the model are those few patients with differential diagnoses such as primary brain tumours and infections as it is assumed they will be managed in a cost-effective way elsewhere for those conditions, in addition to receiving appropriate treatment for their NSCLC. It was thought this cohort are small enough that the potential gain in net monetary benefit from incidentally identifying them via imaging was assumed not to affect the conclusions of the model.
The diagnostic decision trees and the initial treatments that patients receive are assumed, for the purposes of the model, to occur instantaneously. That is, there are no negative effects from delay due to imaging and all patients are assumed to receive some initial treatment before any deaths or progressions occur. Addressing this limitation would have required a number of evidence-free assumptions about the effects of delay that would have likely only had a minor effect on the results.
Long Term Model
At the end of the diagnostic decision tree there are four broad patient groups to model the outcomes for; True Positives (1–3), True Positives (4+), False Negatives (1–3) and False Negative (4+), all of whom have BM. A Partitioned Survival Analysisc (PartSA) model was chosen as it is the most common structure for modelling advanced cancers and due to the availability of relevant data to calculate the model’s parameters. A PartSA model makes use of overall (OS) and progression free survival (PFS) curves to partition patients into three mutually exclusive states at any given point in time; ‘dead’, ‘alive and progression free’, ‘alive and progressed’. At each time point the proportion of patients in the dead state is given by one minus the overall survival curve, the proportion of patients alive and progression free is equal to the progression free survival curve and therefore the number alive and progressed is equal to one minus the sum of the other two groups.
The model is a state membership rather than a state transition model so some assumptions are needed to model transition related events. It would not possible for any patient to transition from the progressed to the progression free state but it would be possible for a patient to transition from the progression free state to either the progressed state and for patients in either state to transition to the dead state. The number of transitions assumed to occur to the dead state from cycle to cycle is therefore equal to the difference in the dead state membership and the number of transitions from the progression free to the progressed state is equal to the difference in the progression free state minus the number of first events that are deaths (these data need to be obtained from trials). Both types of transition events incur important one off costs in NSCLC patients so it was necessary to characterise their occurrences explicitly in this way. Figure 3 shows how the OS and PFS curves dictate the proportions of patients in each state in a typical PartSA model.
Figure 3. Typical Partitioned Survival Analysis Model State Membership
Overall Survival
For this specific model True Positives (1–3) were assumed to proceed along an OS curve that was obtained from trials of relevant patients. The OS of patients who were True Positive (4+) was calculated by applying a hazard ratio (HR) relevant to the proportional hazard between these two groups. As is often the case in diagnostic models that try to capture the outcomes for patients with an incorrect diagnosis, some strong structural assumptions were needed for the False Negative cohorts. Patients who were False Negative (1–3) were assumed to begin with a HR of 1 versus the TP (1–3) group and were then assumed to gradually progress to having an equal HR to the TP (4+) group over the average time to intracranial progression in a trial of patients with BM multiplied by 2 (it was assumed that the vast majority of patients would have developed 4+ BM by this time). Evidence on the natural history of BM from the O’Dowd paper as well as the trials used to inform parameters in this model lent credence to the assumption that BM grow and proliferate over a relatively short time period. The committee confirmed that this assumption was reasonable, given their clinical experience of managing these patients. The overall survival curve for patients who were FN4+ was calculated by applying an initial hazard ratio representative of Whole Brain Radio Therapy (WBRT) treatment to the overall survival curve for patients who were TP4+. This parameter was taken from an RCT for use of WBRT versus best supportive care in patients with a good performance status and brain metastases from NSCLC (Mulvenna et al. 2016). Because the TP4+ patients were treated with WBRT and the FN4+ patients were not, we considered this a reasonable approximation. The hazard ratio declined uniformly, cycle by cycle (to represent patients gradually presenting symptomatically) and became equal to one at two times the average time to presentation, by which time all patients who were likely to progress intracranially were assumed to have progressed.
Progression-Free Survival
Progressions were defined as either intracranial or extracranial or both together and could occur at initial or distant sites or both together. Data on the PFS curve was obtained from a trial of relevant patients with 1–3 BM (Kocher et al. 2011). This PFS curve was used directly for the cohort who were TP (1–3) but a series of assumptions needed to be made to translate it to the other groups. The PFS curve for TP (1–3) was divided by the OS curve for TP (1–3) to give a proportion alive and progression free at each time point, this was multiplied by the OS curve for TP (4+) to give the PFS curve for TP (4+). It was assumed that the difference in survival between patients that were TP (1–3) and TP (4+) was directly attributable to BM. The committee thought this a reasonable assumption as the multivariable regression that had provided the relevant hazard ratio had controlled for other patient level factors. This assumption then extends to the difference in OS for the FN (1–3) group. To try to approximate this relationship, the model accelerated the PFS curve by an acceleration factor that would ensure the absolute difference in the area under the FN (1–3) and TP (1–3) PFS curves from time point 0 to 42 weeks (as discussed earlier, this was the point at which all intracranial progressions in the FN group were assumed to have occurred) equal to the absolute difference in the area under the corresponding OS curves. This assumption was tested in sensitivity analysis. We applied the same logic for the (4+) population as the (1–3) population for PFS, accelerating the PFS curve for FN (4+) to a value that ensured the absolute difference in the area under the curve between the FN (4+) and TP (4+) population was equal to the difference in their corresponding overall survival curves at 42 weeks. The combination of acceleration factors and the multiplicative approach to PFS curves has the advantage of preserving the relationship of PFS and OS in the different patient groups and in sensitivity analyses but the disadvantage that there will be a very small amount of “double-counting” progressions following 42 weeks. Because the PFS curve will still be multiplied by a lower OS curve but the internal logic of the model is that the FN patients who are going to progress are assumed to progress by this point and that all differences in OS are attributable to PFS, a lower PFS curve beyond 42 weeks is perhaps inconsistent. It can be seen in Figure 4 that the effect of this is a very minimal, however, and might reflect a clinically reasonable ‘tail’ of late progression.
Figure 4 shows a diagram of the structure of the partitioned survival analysis component of the economic model.
The average age at the start of the model was 60 (the average age in relevant BM trials), the model was run on a weekly cycle length for 10 years in the base case. While a few patients were left alive at the end of the time horizon, the committee were mindful that every patient within the model has NSCLC and BM and found it highly unlikely that anyone would survive beyond this time point. Due to small patient numbers, this issue was not expected to meaningfully affect the conclusions of the model, however.
Patients existing in the progression free and progressed states accrued QALYs as a multiple of relevant utility values and time in state. They also accrued routine NSCLC management costs for existing in both states. Progression and death events both accrued one-off event costs, which are discussed in more detail in the Model Parameters section.
Both costs and QALYs were discounted at 3.5% and a half cycle correction was applied.
Model Parameters
Prevalence of BM
As stated in the section detailing the model structure, the prevalence of BM in the three populations of interest was obtained from a paper by O’Dowd et al 2014. This was a retrospective study of 646 NSCLC patients undergoing treatment with curative intent at a UK hospital so was seen by the committee as the most relevant source of data for this parameter. The analysis included 41 patients who had been identified as having BM in a maximum follow up period of 2 years. The size of the BM and a tumour doubling time of 58.48 daysd were used to estimate the proportion of patients who had BM at the time of their radical treatment. The paper estimated that 71% of these metastases were above 5mm in diameter and 83% were above 2mm. The committee felt that the 2mm cut-off was the more relevant for modern MRI scanners but the 5mm cut-off was used in sensitivity analysis. The prevalence values were multiplied by the proportion detectable to calculate the proportion of detectable BM in the model.
Based on the natural history of NSCLC, one would expect the prevalence of BM to be higher in stage IIIA than in stage II. The equivalence observed in this data could be due to the patients having received a staging PET-CT, which could have detected the larger and more obvious BM and therefore ruled them out of receiving radical treatment. The patients in this study occupy the same point in the care pathway as the patients in this decision problem so the committee thought the data were directly relevant but recognised that in centres that use contrast enhanced PET-CT at initial staging, the prevalence of BM might be lower.
Diagnostic Test Accuracy
Sensitivity (Se) is the probability that a diagnostic test will correctly identify a positive patient as positive. Specificity (Sp) is the probability that a diagnostic test will correctly identify a negative patient as negative. In order to determine these parameters, we used studies reporting the relevant data that had been identified as part of the clinical sift for this question. The relevant data are in Table 4.
Table 4. Diagnostic Test Accuracy of CT and MRI
There are a number of limitations to these studies; several were old and therefore used out of date equipment, there was a relatively significant prevalence of patients with stage IIIB NSCLC and above in the studies (although the committee assessed this limitation as minor as regards the accuracy of the tests), there were a small number of positive patients on which to base the sensitivity calculations and the method for determining sensitivity was of varying quality. Nevertheless, these were the only empirical data available and the committee were content to use them in the base case analysis. For this base case, they decided to exclude the data from Yokoi 1999 as the sensitivity values looked implausibly low at 9% for CT and 50% for MRI.
We performed independent meta-analyses for Se and Sp for both MRI and CT using WinBUGS. We attempted to fit bivariate models (i.e. where Se and Sp were correlated) but did not have enough studies for the MCMC algorithm to be stable. The WinBUGS code can be found in Appendix L and the results are in Table 5.
Table 5. Results of DTA Meta-Analyses
The committee chose to prefer random effects models for Se and Sp for both CT and MRI, which reflected a combination of the heterogeneity of the studies and the DIC statistics. This gave Se values of 74.6% for CT and 94.1% for MRI and Sp values of 99.7% for CT and 99.9% for MRI.
The committee examined the data on False Positives in the underpinning studies and decided that they were not relevant to current practice, particularly for MRI. This was because the source of False Positives in the Lee 2009 study was listed as ‘flow related enhancements’, which are thought to no longer be a factor. The committee agreed that in their experience there would be no genuine False Positives (i.e. those that would lead to someone being treated for BM when they did not, in fact, have BM) following an MRI scan. As discussed earlier, differential diagnosis, while a consequence of imaging were not expected to affect the conclusions of the model due to small numbers. A specificity value of 100% (rather than 99.9%) was therefore used in the model and because there were no False Positives in any of the strategies, long term outcomes for False Positives and True Negatives were not modelled. This value was necessarily fixed at 100% in the probabilistic sensitivity analysis.
In the base case, the Se of both CT and MRI for detecting people with 4+ BM was fixed at 100% on the advice of the committee. While this assumption was relaxed in sensitivity analysis for CT, the committee thought it highly implausible that MRI would not detect someone with 4+ BM of above 2mm in diameter.
Number of BM
As discussed in the model structure section, the committee indicated that the number of brain metastases identified could significantly alter subsequent treatment decisions. They specified two broad patient groups of interest, those who had 1–3 BM and those who had 4+ BM. The committee’s a priori assumption was that 90% of positive patients would have 1–3 BM. We also identified data in a relevant populatione showing the proportion to be 74% (CI 55% - 89%). These data, while quite uncertain, are very important in the model as the initial treatments received by patients with 1–3 BM are far higher in cost than those received by the patients with 4+ BM. Therefore, the higher we believe the proportion of patients with 4+ BM to be, the more cost-effective imaging will be. In the base case, we used the 74% value for the number of positive patients would have 1–3 BM, examining the effects of the 90% value in sensitivity analysis.
Survival Curve Parameter Estimation Method
All survival curve parameters used in the model were obtained from studies using the algorithm from Guyot 2012f. The algorithm makes use of Kaplan-Meier (KM) curves that are digitised using graph digitisation software (Enguageg was used for this purpose) and the numbers at risk (often published beneath KM curves in studies) at various time points to estimate synthetic individual patient survival and censorship data. The synthetic individual patient data is then amenable to survival analysis and statistics such as hazard ratios and parametric survival curve parameters may be obtained in the normal way. STATAh was used for this purpose. This method has been extensively validated, with survival analysis statistics generated using synthetic data very closely mirroring those produced using the relevant real trial data in a large number of examples (see also Guyot 2012).
Overall Survival Curves
No fully direct data were identified that would have enabled us to estimate survival curves for the populations of interest within the model. Instead a number of partially applicable studies were discussed with the committee:-
- Kocher 2011i, an RCT in a European setting that investigated Whole Brain Radiotherapy (WBRT) + Radical Treatment versus Radical Treatment alone in patients with 1–3 brain metastases (only 53% of whom had NSCLC). N=359
- Brown 2016j, an RCT in a US setting that investigated WBRT + Stereotactic Radiosurgery (SRS) versus SRS alone for people with 1–3 brain metastases (only 69% of whom had ‘lung’ cancer). N=213
- Sperduto 2016k, a retrospective study in a US setting that estimated prognostic indicators for the survival of people with NSCLC and brain metastases. N=2,186
- The IASLC Lung Cancer Staging Project 2015, a retrospective study in a European setting that underpinned the TNM8 NSCLC staging criteria. N=1,059
The most relevant data from the Sperduto study were survival curves relating to the group with a GPA 2.5–3 (age under 70, good Karnofsky Performance Status, absent of extracranial metastases 1–4 BM and EGFR/ALK status unknown). The committee discussed all the relevant survival curves and the strengths and limitations of the studies. They concluded that the IASLC TNM8 data only included sparse data on people with BM so should be excluded from the analysis but were unable to decide which of the Kocher, Brown and Sperduto studies was the most relevant to the patient group who were True Positive (1–3). For Kocher and Brown, the study arms that did not receive WBRT were used as this is not standard treatment for people with 1–3 BM. The committee noted that the Kocher and Brown studies had been used in the economic model conducted for NICE’s Guideline on Brain tumours and brain metastasesl and that the median and interquartile range values for all three curves were similar and clinically plausible. They therefore requested that the OS curve in the model should be based on a meta-analysis of all three.
Figure 5. KM Estimates for OS in the TP (1–3) Group
For the purposes of economic modelling, we decided to fit parametric survival models to these KM data because we wanted the curves to be able to work flexibly with a cycle length and time horizon defined by ourselves within the economic model. The best fitting models were selected using Akaike’s Information Criterion (AIC). We also restricted our selection to models with a log relative-hazard form rather than an accelerated failure time form. This was because we wanted to use a variety of published hazard ratios to simulate other patient groups within the model. Table 6 shows the AIC statistic was smallest for the Gompertz model in all three datasets
Table 6. AIC Statistics for Log Relative-Hazard Models for OS Curves
As per the committee’s instructions, we then meta-analysed the shape and scale parameters of the Gompertz curves to obtain the final parameters the curve that represented the OS of patients who were TP (1–3) within the model. In theory it might have been preferable to fit a bivariate model and meta-analysed both the shape and scale parameters together, accounting for correlations, but we felt that independent meta-analyses were reasonable given the small number of studies and the lack of observed correlations between the shape and scale parameters within studies.
Random effects models were chosen due to heterogeneity between the study participants, settings and treatments. The results are in Table 7.
Table 7. Shape and Scale Parameters from the Gompertz Overall Survival Models
Overall survival curves then needed to be estimated for other groups within the model. It was agreed the best source of evidence for the survival difference between the TP (1–3) group and the TP (4+) group was the hazard ratio of people with 1–4 versus 5+ BM published in the Sperduto study. This hazard ratio came from a multivariable regression so was controlling for a number of other relevant factors and although the difference in the populations is slightly indirect, the committee agreed that it was a reasonable approximation. The Sperduto study publishes separate hazard ratios for people with and without adenocarcinoma histology. We obtained data on the number of patients in our model cohort who were expected to have adeno and non-adeno histology and weighted the hazard ratio accordingly. Separate scenario analyses for these two population groups were also conducted. The hazard ratio obtained from an earlier GPA paper by Sperduto that related to the difference in OS between two broad GPA groups that were representative of the difference between 1–4 and 4+ metastases was also obtained via digitising the relevant survival curves and used in sensitivity analysis.
Table 8. Hazard Ratios and acceleration factors for OS and PFS used within the Model
Due to the lack of directly relevant data, estimating the OS curves for False Negative patients required some further assumptions, which were discussed in the Model Structure section. OS for patients who were FN (4+) was modelled as being equal to patients who were TP (4+). This was because the committee were unaware of any evidence that earlier detection would significantly affect OS in this group. People in this group were assumed not to be indicated for any radical therapy to their brain and the effect of WBRT on OS is uncertain. The hazard ratio for patients who were FN (1–3) versus TP (1–3) was assumed to begin at 1 at the beginning of the model and progress uniformly, cycle by cycle, to 1.16 (see Table 8) over the 2* median time to intracranial progression observed in the Brown 2016 trial, which was 21 weeks. By week 42, the HR for this group was therefore equal to the group with 4+ BM as it was assumed that the vast majority of the patients would have intracranially progressed. These assumptions were tested in sensitivity analyses.
Progression Free Survival
The same Kocher 2011 and Brown 2016 BM trials that provided data on OS also provided data on PFS. For Kocher 2011 we obtained the PFS curves through a personal communication with the trialistsn. The committee were shown both survival curves and concluded that the Kocher 2011 PFS data (again, the no WBRT) arm should be used to model PFS in the base case for people who were TP (1–3) because it showed both intra and extracranial progression and was conducted in a European setting. We digitised the PFS curves from Kocher and Brown and fitted parametric survival models to them via the method described in the Overall Survival Section.
Table 9. AIC Statistics for Parametric Survival Curves fit to PFS Data
Based on the AIC statistics shown in Table 9, we selected a log logistic form for the Kocher data and a lognormal form for the Brown data. In order for the Kocher PFS curve to interact properly with the OS curves within the model we set up the model so that it calculated, cycle-by-cycle, the people alive and progression-free as a proportion of those alive as dictated by the Kocher OS curve. This gave us a ‘PFS multiplier’ curve that we could then use with other survival curves. The result of this is that, whichever OS curve is used (Kocher, Brown, meta-analysed curve, adeno only e.g.), the proportion of people alive and progression free will remain constant, even though the raw number will change.
The committee considered whether the PFS curves should be meaningfully altered for FN patients to reflect the lack of management that they receive and concluded that they should be. The method for doing this for the FN (1–3) population has already been described in the Model Structure section and details the process by which we arrived at an acceleration factor of 11% during the time that these patients remain undiagnosed. For the FN (4+) patients who would have been treated with WBRT, had they been identified at initial imaging, we calculated an acceleration factor by fitting a loglogistic regression to both arms of the Kocher 2011 PFS data with the study arm representing ‘no WBRT’ as an independent variable. The acceleration factor associated with this variable was 30.4% (s.e. 11.4%, p=0.001).
Progression and Death Events
Progression is an important concept to capture in NSCLC models because it often triggers challenge of the cancer with another or repeat of therapy. Such therapies are typically of defined and relatively short duration such as 10 sessions of WBRT or 4 cycles of SACT.
The implementation of progression cost within the model is somewhat complex. As partitioned survival analyses are state membership rather than state transition models, there are no transition probabilities between the progression free and progressed health states so these have to be estimated. In our model, these data are only important for cost accrual.
A one-off cost of death was applied by calculating the difference in the overall survival curve (people in the dead state) from cycle to cycle. It is not possible to use this same logic to calculate the number of progressions from the progression free to the progressed state because some of these progressions are deaths. Similarly, one cannot easily treat deaths from the progressed state any differently to deaths from the progression free state without making some assumptions. Our model assumes they had a homogenous cost although this might not be true in reality. This limitation was assessed as minor because the overall proportion of progressions that were deaths was very similar across strategies.
The committee indicated to us that they expected half of FN patients to present with mild to moderate symptoms to their cancer nurse. Upon presentation these patients would undergo imaging, at which point their BM would be discovered. The other half of FN patients were expected to present as an emergency with severe symptoms, resulting in an A&E visit, a non-elective inpatient stay and the requisite imaging.
It was not straightforward to determine what treatments the different populations in the model would receive when experiencing the various events in the progression decision trees (see paragraph below) and we had no evidence to inform these parameters other than committee assumption. Firstly, we needed to determine which False Negative (1–3) patients would still receive radical brain treatment upon intracranial progression. Since we assumed that 50% of people would progress as a routine presentation with mild symptoms, the committee agreed that it would be reasonable to assume that 50% of patients would receive radical brain treatment if intracranial progression was part of their first event (whether alone or along with extracranial progression). This assumption could be changed to apply to only patients whose first event was intracranial alone or who experienced any intracranial event. 80% of patients who were FN (4+) were assumed to receive SACT upon intracranial progression (the same proportion as if they had been identified early). Underlying intracranial progression event costs that applied to all patients were also applied; 80% received WBRT, 5% SRS and 5% SACT. For the patients who were TP (4+), the WBRT was removed as they had received this intervention on initial diagnosis.
To calculate the weighted average cost of a progression event we obtained the progression event decision trees (see Table 33 for those data) from the Kocher 2011 trial for patients with initial treatment with WBRT (TP4+) and without WBRT (all other patients). Deaths were assigned a cost of £0 because they are already accounted for via the method detailed above. Those who did not progress at all were removed from the decision tree because they are not relevant to the calculation.
60% of patients who progressed extracranially alone first were assumed to receive SACT. 20% of these patients as well as any patients who had intracranial and extracranial progression, whether together or consecutively in any order were assumed to receive a single dose of palliative radiotherapy.
All treatment assumptions were provided by the committee. The constituent and resulting cost data are provided from Table 28 onwards. Death costs are available in Table 22.
State Membership Costs
The longer term partitioned survival analysis model contains three possible membership states for simulated patients; progression free survival, progressed and dead. Patients in each of these states consume resources at differing amounts, and therefore incur differing total costs for each given unit of time they have membership of the states.
In order to arrive at state membership costs for the aforementioned states, we examined the literature to uncover the types of resource that commonly were used in each membership state, and the associated numbers of units consumed each month. We developed this information into a table and presented it to the committee alongside up to date prices for resource units from the English NHS. The committee used this table as a starting point to a discussion to validate these data for use within the economic model. The committee made changes to this table based on their experience of the NHS, excising some resource use, unit usage and costs, whilst adding others. The committee also agreed that the state membership costs were the same, despite the stage of cancer the patient experiences.
The committee agreed that patients stop incurring ongoing costs when they die.
Here we present tables to show the final membership costs of progression free survival (Table 10), progressed (Table 11), agreed by the committee to be valid for use in the economic model:
Table 10. Long term model - Progression free survival membership
In order to arrive at the costs for each patient for each month whilst they have membership of the progression free survival state, we multiplied the percentage of patients who are assumed to use the resource type each month, by the number of units used by those patients, by the unit cost to obtain the total weighted cost. For progression free survival patients this was £296.06. We then multiplied this value by the number of months in a year (12) and divided by the number of cycles the model uses each year (52) to obtain a progression free survival cycle cost of £68.32.
Table 11. Long term model - Progression membership
In order to arrive at the costs for each patient for each month whilst they have membership of the progressed state, used the same approach as the progression free survival state. The resulting figures are a weighted average progressed state membership cost of £923.24 each month, and a cycle cost of £213.06.
Initial treatments
The committee were consulted on the types of treatments that patients would be eligible to receive, and what percentage of patients eligible would receive them, given the number of brain metastases detected by the initial diagnostic strategy. The committee were also consulted with regards to the costs of such treatments. Here we present how we calculated the costs for each of the treatments used in the model, all of which were validated by the committee.
Surgical treatments for primary tumours
Table 12 shows the costs of surgical procedures for primary tumours in patients with lung cancer. There are no reference costs that apply to the specific treatments listed so the committee chose the most appropriate from the full range of available thoracic procedure reference costs. The cost of ‘Complex resections and other resections’ was calculated by averaging the cost of lobectomy and pneumonectomy.
Radiotherapy treatments for primary tumours
Stereotactic Ablative Radiotherapy (SABR)
Stereotactic Ablative Radiotherapy (SABR), is an emerging technology. It is a specialised radiotherapy treatment planning technique resulting in a high dose to the target with steep dose gradients resulting in rapid dose fall off outside the target area. This results in high biologically effective dose (BED) while minimising the dose received by the normal tissues, and could potentially minimise the radiotherapy treatment toxicity and side effects. SABR is currently provisioned by the NHS through the Commissioning through Evaluation (CtE) programme, whilst it awaits a full formal review for general use in the NHS. The CtE tariff (Table 13) reimburses three different treatment regimens, 3 fractions, 5 fractions and 8 fractions. These tariffs have been identified by Leeds Teaching Hospital as bundled tariffs, meaning that they include payments for all related planning and treatment.
In order to obtain the cost of SABR for an average patients, the tariff costs must be weighted by the proportion of patients receiving each regimen. This information was provided by Leeds Teaching Hospital, NHS Trust. When the costs of each regimen are weighted against the proportion of patients who receive each treatment regimen, the average cost of SABR for a patient is calculated to be £5,178.78. The costs of SABR are expected to decline with routine adoption.
From the NLCA data, we find that overall, for stage I and II NSCLC, 53.9% of patients receive SABR. Using this, an assumption made by the committee that a patient would be twice as likely to receive SABR in stage I as stage II disease, and the data found in Table 14, we calculate that 63.38% of patients in stage I and 31.69% of patients with stage II NSCLC receive SABR.
Table 14. Patients who presented with NSCLC from the NLCA Report 2017
Continuous hyperfractionated accelerated radiotherapy (CHART)
Continuous hyperfractionated accelerated radiotherapy (CHART) is a method of delivering standard external beam radiotherapy in a more intense regimen than conventional radiotherapy. The CHART regimen used in the model assumes 55Gy delivered over 36 sessions over 12 days, including weekends.
Table 15. CHART for primary tumour
To calculate the total cost of CHART, the number of resource units used is multiplied by the resource unit cost. The cost of hospital inpatient stay is calculated as the initial cost of first 5 days stay (£1,590) added to the remainder of hospital inpatient stay days (12–5) multiplied by the cost of excess bed days (£313). When these costs are added together, this results in the total cost of CHART for each patient as £8,037.25.
Hyper fractionated accelerated radiotherapy
Hyper fractionated accelerated radiotherapy in our model was defined as the delivery of 55Gy over 20 sessions over the course of four weeks. Table 15 shows the how the cost of hyper fractionated accelerated radiotherapy was calculated. This is the most common form of radical radiotherapy practiced in the UK NHS today.
Table 16. Hyper fractionated accelerated radiotherapy
To calculate the cost of hyper fractionated accelerated radiotherapy, we multiply the number of resource units by the cost of each unit, and add them together. This results in the cost of hyper fractionated accelerated radiotherapy for each patient at £2,536.81.
Standard fractionated radiotherapy
Standard fractionated radiotherapy in our model was defined as the delivery of 60–66 Gy over 30–33 sessions over the course of 6 – 6.5 weeks. Table 17 shows the how the cost of standard fractionated accelerated radiotherapy was calculated.
Table 17. Standard fractionated radiotherapy
To calculate the cost of hyper fractionated accelerated radiotherapy, we multiply the number of resource units by the cost of each unit, and add them together. This results in the cost of standard fractionated radiotherapy for each patient at £3,611.46.
Fractionated radiotherapy for local control – 36 Gy over 12 sessions
The costing for fractionated radiotherapy for local control – 36 Gy over 12 sessions, is shown in Table 18. The total cost for fractionated radiotherapy for local control – 36 Gy over 12 sessions was found to be £1,652.16.
Table 18. Fractionated radiotherapy for local control 36 Gy over 12 sessions
Fractionated radiotherapy for local control – 20 Gy over 5 sessions
The costing for fractionated radiotherapy for local control – 20 Gy over 5 sessions, is shown in Table 19. The total cost for fractionated radiotherapy for local control – 20 Gy over 5 sessions was found to be £899.91.
Table 19. Fractionated radiotherapy for local control 20 Gy over 5 sessions
Radiotherapy for local control is given to some stage IIIA patients who are positive for brain metastases within the model.
Treatments for brain tumours
Stereotactic radiosurgery
The cost of stereotactic radiosurgery, £3,555.65, was taken from the model which was created for NICE Guideline NG99 (Brain tumours (primary) and brain metastases in adults). As the NICE Brain Tumour model did not specify a standard deviation for the cost of stereotactic radiosurgery, we assumed this to be a quarter of the mean price (£888.91).
Surgical brain resection
The cost of surgical brain resection, £7,031.94, was taken from NICE Guideline NG99 (Brain tumours (primary) and brain metastases in adults). As the guideline did not specify a standard deviation for the cost of surgical brain resection, we assumed this to be a quarter of the mean price (£1,757.98).
Whole brain radiotherapy (WBRT)
Whole brain radiotherapy (WBRT) included in our model consisted of preparation 10 fractions.
Table 20. Whole brain radiotherapy
To calculate the cost of WBRT, we multiply the number of resource units by the cost of each unit, and add them together. This results in the cost of WBRT at £1,524.34.
Systemic Anti-Cancer Therapy (SACT)
There are a very large number of systemic therapy options available in NSCLC (see RQ 3.3 of this update for a full algorithm) so costing them all and factoring in their differential benefits in this patient population would have been impractical and subject to high uncertainty. These treatment options have typically been the subject of NICE Technology Appraisals and therefore represent cost-effective additions to the care pathway, but additions that the committee was aware were unlikely to add much in terms of net monetary benefit. This is because Technology Appraisal approved drugs in advanced cancer rarely have base case ICERs significantly lower than the upper limit of the ICER range normally considered cost-effective by NICE. The committee also noted that much of the evidence in this model came from survival data collected before many of these drugs were widely available. They therefore thought that the net monetary benefit associated with systemic therapy could reasonably be approximated using the costs of a representative platinum doublet chemotherapy. Systemic anti-cancer therapy (SACT) treatment in our model therefore consisted of Vinorelbine (oral), Carboplatin (IV), and Dexamethasone (oral). In the base case, patients received 4 cycles for each course of SACT. Each course of SACT required a quarter of an hour of an Agenda for Change band 4 member of staff to book an outpatient appointment.
The dose of oral Vinorelbine required for patients is 60mg/mg2, which equates to 120mg on days 1 and days 8 of each cycle. We assumed that the Carboplatin dose required equated to a target AUC 5mg/ml/min, based on a surface area of 1.73m2 and an eGFR of 90. This translated to a requirement of 575mg of Carboplatin required for infusion each cycle. The dosage regimen of dexamethasone was calculated based on the advice of the guideline committee as 8mg twice a day over the first week, tapering down over the remaining 3 weeks.
Table 21. Systemic Anti-Cancer Therapy
The sum of resource use in Table 21 summates to the cost of each SACT cycle as £750.84. Therefore, the cost of all 4 cycles is £3,003.36.
Death event
To calculate the cost of a death event in the mode, we used resource costs from Georghiou and Bardsley (2014), given over to the patient in the final three months of their lives. From this, study, we sum the average hospital costs, local authority funded care, district nursing care, GP contacts costs and inflate them to 2018 levels using a four yearly inflation factor of ~6% (PSSRU HCHS). As patients accrue the death event costs during the final three months of their lives, we account for this by removing the state based costs incurred by these patients for being in the model for 3 months with health states weighted by the proportion of people who die directly from the progression free and progressed states.
This results in the death event total cost (less the weighted state membership costs) to be £5,152.88 (SE £1,288.22).
Patients groups considered by the model
The model considers treatment strategies for stage I, stage II and stage IIIA patients. The stage IIIA patient group consist of five broad treatment strategies; those treated with Chemotherapy and Surgery (CS), Chemotherapy and Radiotherapy (CR), Chemotherapy, Radiotherapy and Surgery (CRS), Radiotherapy only (R) and Surgery only (S). The committee agreed that if they were to deliberate a separate recommendation for each of these five identified treatment strategies for stage III, the resulting guidance would be impractical. Therefore, we have combined and weighed each of the treatment strategies for stage IIIA patients into a single treatment strategy within the model.
Table 23. Treatment strategy split for stage IIIA NSCLC patients
As it was not directly reported in the NLCA Annual report, the committee advised that only roughly one out of six patients who received chemotherapy and surgery would also receive radiotherapy. Using this information in combination with the data from the National Lung Cancer Audit (NLCA) Report 2017, we calculated the percentage of patients who receive each treatment strategy (shown in Table 23).
Initial Treatments for False Negatives
Whilst the ‘no imaging’ strategies and both imaging strategies result in false negative patients, with between one and three brain metastases, only the ‘no imaging’ strategy result in false negative patients with more than three brain metastases. Since there is no way to distinguish false negatives from true negatives, false negative patients continue to receive the planned initial radical treatment.
The committee agreed that the split between patients who received each treatment for their primary tumour was the same for both stage I and stage II lung cancer patients. As discussed above, patients receiving each type of treatment for stage IIIA lung cancer were weighted into a single model arm.
Here, in Table 24, we present the initial treatment strategies for false negative patients, as taken from the NLCA Annual report 2017 and confirmed by the committee for each aforementioned group.
Initial Treatments for True Positives (1–3)
Of the three strategies considered by the model, No Imaging, CT followed by MRI, and MRI alone, only the latter two diagnostic strategies are able to confirm the presence of any number of a brain metastases. Table 25 shows the committee consensus for what treatments would be given to those with 1–3 detected brain metastases and treatments would be given to those eligible to receive radical treatment therapy.
Initial Treatments for True Positives (4+)
Table 26 shows the committee consensus for what treatments would be given to those with more than 3 detected brain metastases. The committee assumed that 15% of patients with more four or more detected brain metastases receive radiotherapy for local control, 92.5% would receive WBRT and 80% of stage I and II patients would receive SACT, with 100% of stage IIIA patients receiving SACT. In our model, patients with more than 3 brain metastases do not receive any radical therapy.
Radiotherapy for local control
Patients with any number of brain metastases may receive radiotherapy for local control, as indicated in Table 25 or Table 26. Where this is the case, 25% of patients who receive radiotherapy for local control receive 36 Gy over 12 sessions, whilst the remaining 75% of patients receive 20 Gy over 5 sessions.
Initial Imaging Strategies
As described earlier, received either an MRI scan alone, or a CT scan, followed by a confirmatory MRI scan, or no imaging.
Table 27 the costs of imaging modalities used in the model.
Progression and presentation
As discussed earlier, half of patients who were FN are expected to present as an emergency with severe symptoms, whilst the other half are expected to present in a routine appointment with their cancer nurse after experiencing mild symptoms. In the model, both of these types of presentation are associated with significantly different resource use and associated cost.
Here in Table 28, we present the cost of emergency presentation and in Table 29 for non-emergency routine presentation for FN patients.
Table 28. FN Emergency presentation resource use and cost
Table 29. FN routine presentation resource use and cost
Summing the costs gives a total for emergency presentation of £2,038.55 and routine presentation as £491.65. Assuming 50% of intracranial progressions for FN patients are of each type, the average cost in the model is £1,265.10.
Intracranial and extracranial progression event
As noted in the sections on progression above, there are several different types of progression events, including intracranial, extracranial, and both intracranial and extracranial. Each one of these pathways is associated with different levels of resource use and therefore overall cost. Here we present the average cost associated with each type of progression event within the progression decision trees (see below).
Table 30. Intracranial Progression Event Treatment cost
Therefore, we calculate the cost of an intracranial progression event to be £1,547.42 (SE of £386.86).
The additional cost of an Intracranial Progression Event Cost for TP4+ patients is the same as shown in Table 30, except that instead of 80% of patients receiving WBRT, no patients receive WBRT. This results in the cost of an Intracranial Progression Event Cost for TP4+ patients as £327.95.
The additional cost of an Intracranial Progression Event for FN patients with 1–3 brain metastases was calculated to be £4,087.65, which assumes that 50% of patients presenting late will be treated with radical treatment, whilst the cost of an Intracranial Progression Event Cost for FN patients with more than 3 brain metastases was calculated to be £2,402.69, which is simply the cost of SACT multiplied by the assumed probability that those patients would receive it (80%).
Table 31. Extracranial Progression Event Treatment cost
The cost of an extracranial progression event is the sum of these values; £1,828.50 (SE of £457.12).
Table 32. Intracranial and Extracranial Progression Event Treatment cost
The cost of an intra and extracranial progression event (whether occurring together or separately) is the sum of these values; £26.48 (SE of £6.62).
Intracranial and extracranial progression event decision tree
The trialists for Kocher 2011 provided additional data of probabilities of progression events after intracranial progression (Table 33).
Table 33. Progression and death event probabilities for patients who are given or not given WBRT
These probabilities were used to calculate the number of patients who would experience each type of progression event and the weighted cost (Table 33).
Table 34. Weighted cost of a progression event for each type of patient in the model
Table 34 shows the final weighted cost of a progression event that is arrived at under the base case assumptions in the model.
Utilities
The three health states in the long-term model are associated with utility scores, which are shown in Table 35. Patients who spend time in one or more of these states in the long-term model accumulate QALYs. A final modifying factor for the total number of QALYs a patient may accumulate is the QALY loss associated with surgery. In the base case the data from Lester-Coll 2016 were used for the progression-free and post-progression survival states with the Nafees 2008 data being used in sensitivity analysis.
Results
Stage I
Table 36. Stage I – Base case fully incremental results
Table 37. Stage I – Base case results and scenario analyses
Table 38. Total strategy and strategy per patient cost for stage I patients
Figure 6. Stage I - CT then MRI vs No Imaging using INMB of £20,000/QALY (Base case ICER £42,962)
Figure 7. Stage I - CT then MRI vs No Imaging using INMB of £30,000/QALY (Base case ICER £42,962)
Figure 8. Stage I - MRI vs CT then MRI using INMB of £30,000/QALY (Base case ICER £88,070)
Figure 9. Stage I – Cost-effectiveness acceptability curve (CEAC) (5000 PSA iterations)
Figure 10. Stage I - CT followed by MRI compared to No Imaging (5000 PSA iterations)
Stage II
Table 39. Stage II – Base case fully incremental results
Table 40. Stage II – Base case results and scenario analyses
Table 41. Total strategy and strategy per patient cost for stage II patients
Figure 11. Stage II - CT then MRI vs No Imaging using INMB of £20,000/QALY (Base case ICER £21,095)
Figure 12. Stage II - CT then MRI vs No Imaging using INMB of £30,000/QALY (Base case ICER £21,095)
Figure 13. Stage II - MRI vs CT then MRI using INMB of £30,000/QALY (Base case ICER £47,532)
Figure 14. Stage II – CEAC (5000 PSA iterations)
Figure 15. Stage II - CT followed by MRI compared to No Imaging (5000 PSA iterations)
Stage IIIA
Table 42. Stage III – Base case fully incremental results
Table 43. Stage IIIA - Base case results and scenario analyses
Table 44. Total strategy and strategy per patient cost for stage IIIA patients
Figure 20. Stage IIIA – CEAC (5000 PSA iterations)
Figure 21. Stage IIIA - CT followed by MRI compared to No Imaging (5000 PSA iterations)
Figure 22. Stage IIIA – MRI compared to CT followed by MRI (5000 PSA iterations)
Discussion
This model calculated the number of cases of brain metastases (BM) that might be detected using MRI brain, CT brain followed by MRI brain, and no imaging strategies. The model combined the prevalence of brain metastases and the proportion detectable (as shown in Table 3) with the sensitivity of the test to calculate the number of true positive (1–3 or 4+), and false negative patients (1–3 and 4+) for each of the imaging strategies by NSCLC stage (Table 38, Table 41, Table 44). For stage I patients, MRI scanning alone produced 36.8 true positive and 1.7 false negatives per 1,000 patients imaged compared to 30.0 true positive and 8.49 false negatives for CT followed by MRI. For both strategies, 10 of the true positives have 4+ brain metastases and none of the false negatives do. For stage II patients, MRI scanning alone produced 75.6 true positive and 3.5 false negatives compared to 61.62 true positive and 17.43 false negatives for CT followed by MRI. For both strategies, 20 of the true positives have 4+ brain metastases and none of the false negatives do. For stage IIIA patients, MRI scanning alone produced 74.1 true positive and 3.4 false negatives compared to 60.47 true positive and 17.1 false negatives for CT followed by MRI. For both strategies, 20 of the true positives have 4+ brain metastases and none of the false negatives do.
If opportunity cost were not a concern, then it would be logical to give all patients who have received initial staging for their lung cancer and are being considered for radical treatment with curative intent an initially more expensive MRI scan (£180) because it is the most sensitive and jointly most specific strategy.. As the opportunity costs are important, the purpose of this economic analysis was to establish cost-effectiveness of these strategies at thresholds of £20,000 and £30,000 per QALY gained.
The key driving factors in this model was the overall prevalence of brain metastases, the proportion of positive patients with 4+ metastases and the costs of radical treatments. The prevalence of brain metastases used in this analysis (shown in Table 3) in stage II and III were similar to each other, both being around double that in stage I.
Base case, probabilistic sensitivity analysis, and sensitivity analyses showing the overall cost-effectiveness of the imaging strategies versus one another for all stages of NSCLC considered are presented in this report.
Stage I NSCLC
For stage I patients with NSCLC, the results table (Table 36) showed that all ICERS were above £30,000 per QALY, except for when an adenocarcinoma hazard ratio and prevalence were used. The one-way sensitivity analysis (OSA) of CT followed by MRI compared to No Imaging when QALYs are valued at £20,000 (Figure 6) showed that no plausible variations in any of the parameters could make CT followed by MRI cost-effective compared to No Imaging. However, for the same analysis, when QALYs are valued at £30,000 (Figure 7), the upper bound of the 95% confidence interval for the prevalence of brain metastases could make CT followed by MRI cost effective compared to no imaging. The OSA of MRI compared to CT followed by MRI when QALYs are valued at £30,000 (Figure 8) showed that the only situation where MRI could be cost-effective compared to CT followed by MRI was when the cost of MRI scanning of one area (with pre and post contrast) was at its lowest possible value of £127.33.
For stage I patients with NSCLC, the results of the probabilistic sensitivity analysis were very similar to the base case results. The cost-effectiveness acceptability curve (CEAC) (Figure 9) showed that we would have to be prepared to pay around £46k/QALY for the probability of cost-effectiveness of CT followed by MRI to be as high as no imaging. On the graph of the PSA of 5000 iterations of CT followed by MRI compared to ‘no imaging’ (Figure 10), we can see that the average iteration marker (yellow diamond with the red border) is firmly above the light blue line denoting a threshold of £30,000/QALY. The majority of the density of the 5,000 iterations are above the above the £30,000 per QALY threshold line.
Based on these results, we can conclude that no imaging strategy involving the use of either technology (CT or MRI) for detecting brain metastases in stage I NSCLC patients prior to radical treatment with curative intent is cost-effective at willingness-to-pay thresholds of £20,000 or £30,000 per QALY. This is primarily due to the low prevalence of detectable brain metastases in the stage I population (~3.8%). Varying this value to the highest extreme of its confidence interval yielded an ICER of £29,067 per QALY for CT followed by MRI compared to no imaging.
Stage II NSCLC
For patients with stage II NSCLC, we carried out the same analysis as we carried out for stage I NSCLC patients. The only difference was the prevalence of detectable BM (~8%). In the deterministic base case, we found that ICER for CT followed by MRI was £21,095 – just over the threshold of £20,000 per QALY gained, but well under £30,000 per QALY. The ICER for MRI alone compared to CT followed by MRI was well in excess of £30,000 per QALY.
The results of the PSA followed a very similar pattern to the deterministic base case. The scatterplot of 5,000 PSA iterations (Figure 15) shows the average iteration marker between the dark purple line denoting a threshold of £20,000 per QALY, and the light blue line denoting a threshold of £30,000 per QALY. Most of the iterations fall evenly on either side of both of these lines demarcating these thresholds, showing reasonable uncertainly in the average ICER in relation to the common decision thresholds.
Of the 22 scenario analyses we performed shown in Table 38, only two scenarios (where the proportion of patients with 1–3 brain metastases came from the committee, and where non-adenocarcinoma hazard ratios and prevalence were used) exceeded the threshold of £30,000 per QALY. Three of these scenario analyses (where treatment with curative intent for all brain events, ‘Acceleration factor for the Kocher progression free survival (FN 1–3 brain mets) (30%)’ and ‘Adenocarcinoma hazard ratio and adenocarcinoma prevalence’) ICERs were below the £20,000 per QALY threshold.
For stage II patients with NSCLC, neither for the base case, the PSA or any of the incremental analysis for MRI alone when compared to CT followed by MRI shown in Table 40 had an ICER below the £30,000 per QALY threshold.
The OSA for CT followed by MRI compared to ‘no imaging’ at a willingness-to-pay threshold of £20,000 per QALY (Figure 11) showed that model was sensitive to a large number of parameters, which when varied within their plausible ranged could cause the INMB to be above zero, therefore rendering CT followed by MRI a cost-effective strategy. A further OSA analysis of the same pairwise comparison, using a willingness-to-pay threshold of £30,000 per QALY (Figure 12), showed that only three parameters (prevalence of brain metastases in stage II, proportion of patients with 1–3 brain mets (Yokoi), and the hazard ratio), would be able to take the INMB into negative territory, thus rendering CT followed by MRI not cost-effective in comparison to the ‘no imaging strategy’. The final OSA conducted for stage II NSCLC (Figure 13) showed that just two parameters (sensitivity of CT and the cost of MRI of one area) when varied within their plausible range, could render MRI cost-effective as compared to CT followed by MRI, a willingness-to-pay threshold of £30,000 per QALY.
The CEAC for stage II patients with NSCLC (Figure 14) shows that ‘no imaging’ strategy has the highest probability of being cost-effective until around a willingness-to-pay of £23,000 QALY, at which point it is equally likely to be as cost-effective as CT followed by MRI at 48%. From here, as the willingness-to-pay increases, the probability of CT followed of MRI being the most cost-effective strategy increases until around a willingness-to-pay threshold of £34,000 per QALY where the probability is around 67%. At a willingness-to-pay threshold of £30,000 per QALY, CT followed by MRI has the highest probability of being the most cost-effective strategy at around 62%, whilst the ‘no imaging’ strategy has a probability of around 22%.
We can conclude with a fair amount of certainty that CT followed by the MRI is the most cost-effective strategy at a willingness-to-pay threshold of £30,000 per QALY for detecting brain metastases in stage II NSCLC patients prior to radical treatment with curative intent but it is uncertain whether the ‘true’ ICER for imaging lies above or below the £20,000 threshold.
Stage IIIA NSCLC
As discussed previously, the stage IIIA NSCLC patient group consist of five broad treatment strategies; those treated with Chemotherapy and Surgery (CS), Chemotherapy and Radiotherapy (CR), Chemotherapy, Radiotherapy and Surgery (CRS), Radiotherapy only (R) and Surgery only (S). We combined and weighted each of the treatment strategies for stage IIIA patients into a single treatment strategy within the model, with the split between each of the treatment shown in Table 23.
In the base case, the PSA with 5,000 iterations and every analysis shown in Table 43, CT followed by MRI is a dominant strategy as compared to the no imaging strategy (which means that CT followed by MRI produced more benefits and cost less as compared to the no imaging strategy). In the base case, PSA and 16 of the 21 sensitivity analyses presented in the same table, MRI compared to CT followed by MRI, was a dominant strategy (meaning that MRI produced more benefits and cost less than CT followed by MRI). Of the strategies where MRI only was not dominant compared to CT followed by MRI, three had ICERS below £20,000 per QALY, one had an ICER between £20,000 and £30,000 per QALY, and one had an ICER above £30,000 per QALY.
The OSA associated with the stage IIIA analysis of CT followed by MRI compared to no imaging when QALYs are worth £20,000 (Figure 16) and £30,000 (Figure 17) both show that no parameter varied within their plausible threshold was able to make CT followed by MRI cost-ineffective. A further two analyses of MRI only compared to CT followed by MRI where QALYs are worth £20,000 (Figure 18) and £30,000 (Figure 19) show that the parameters concerned with the sensitivity of CT scanning, when increased to 0.997, and the parameter concerned with the prevalence of brain metastases in stage IIIA patients, when lowered to 0.046, could render MRI cost-ineffective at both these thresholds.
The CEAC for stage IIIA (Figure 20) shows that the MRI only strategy has an equal chance of being cost-effective compared to CT followed by MRI at a willingness-to-pay threshold of £0 per QALY, whilst the probability of the ‘no imaging’ strategy is around 6%. As the willingness-to-pay increases, the probability of the MRI only strategy being the most cost-effective also increase whilst both CT followed by MRI and no imaging decrease. In Figure 21 showing 5,000 PSA iterations of CT followed by MRI compared to ‘no imaging’, we can see that the average iteration marker is firmly in the south east quadrant, showing that the average cost of the of CT followed by MRI as compared to the ‘no imaging’ strategy was lower, and produced more QALYS, and thus rendering CT followed by MRI a dominant strategy for this comparison. Furthermore, the vast majority of the iterations on this figure fall below the dark purple line demarcating a threshold of £20,000 per QALY, which in turn gives us considerable confidence that the ICER is below £20,000 per QALY.
A further similar comparison of MRI alone compared to the CT followed by MRI strategy (Figure 22) showed that the average iteration marker is still in the south-east quadrant, meaning that MRI alone is a dominant strategy as compared to CT followed by MRI, although not as pronounced as CT followed by MRI compared to ‘no imaging.
Based on these results, we can conclude that for people with stage IIIA NSCLC, CT followed by MRI is preferable to the ‘no imaging’ strategy as it is dominant. However, a further pairwise comparison of MRI alone as the sole imaging strategy as compared to CT followed by MRI shows MRI alone to be the dominant strategy, and therefore the overall most cost-effective strategy for detecting brain metastases in stage IIIA NSCLC patients prior to radical treatment with curative intent.
In summary:-
- No imaging strategy was cost-effective for stage I patients, mainly because of the low prevalence of BM.
- CT-MRI could be considered cost effective compared to no imaging for stage II patients. MRI is not cost-effective compared to CT-MRI, mainly because CT has a good sensitivity for identifying patients who are TP (4+), who contribute the most cost-benefit in the model.
- MRI is cost effective for stage IIIA patients, mainly because it is the most sensitive test and identifying a case contributes both QALY gains and cost savings
Strengths and Limitations
Our model has a number of important strengths; it is the only directly applicable health economic model to examine whether NSCLC patients selected for curative intent should receive brain imaging in a UK setting and includes a number of original pieces of evidence synthesis for survival and diagnostic accuracy data. We made use of a wide range of sensitivity and scenario analyses to explore the uncertainty in the model and can be confident that our conclusions, certainly for stages I and IIIA, are robust to plausible variations in parameters.
The model is also characterised by a number of important limitations; the diagnostic accuracy data was of low quality, the prevalence data came from a retrospective analysis, the proportion of people with 4+ brain mets was an important but highly uncertain parameter, the costs of systemic therapy were crudely captured, the effectiveness of treatment pathways was crudely captured, the survival curves and progression data were drawn from partly indirect populations and a large amount of parameters were underpinned by committee assumptions (the proportion of patients receiving different potential treatments upon diagnosis and progression, the health state occupancy costs and the consequences upon presentation). We also had to make a number of assumptions about the way that survival curves for the different groups were related to each other and the way that False Negative patients’ progression would be accelerated. While we think that all of these assumptions were justified and we tested them in sensitivity analysis, they are not based on directly observed data in the population of interest (although this limitation is common to at least some populations in all economic models examining diagnostic test accuracy).
Appendix J. Research recommendations
Table
Question What is the effectiveness and cost-effectiveness of performing contrast enhanced CT brain routinely at the time of initial diagnosis/staging CT in people with suspected lung cancer?
Table
Potential criterion Explanation
Appendix K. WinBUGS Code
This codeset was used to meta-analyse the diagnostic test accuracy data for use in the model. It includes data from all the studies included in the clinical review minus Yokoi 1999 because the committee wished to exclude it through lack of clinical plausibility. The example below uses data from the studies reporting sensitivity and specificity for MRI. The same codeset was used for the CT data.
Random Effects model{ for(i in 1:4){ N1[i] <- tp[i] + fn[i] # Number of patients with disease tp[i] ~ dbin(tpr[i], N1[i]) logit(tpr[i]) <- lt[i] lt[i] ~ dnorm(mean1, prec1) N0[i] <- tn[i] + fp[i] # Number of patients without disease fp[i] ~ dbin(fpr[i], N0[i]) logit(fpr[i]) <- lf[i] lf[i] ~ dnorm(mean0, prec0) } # Vague priors: mean1 ~ dnorm(0, 0.01) # Mean logit(tpr) sd1 ~ dunif(0,5) # Between-study SD in logit(tpr) mean0 ~ dnorm(0, 0.01) # Mean logit(fpr) sd0 ~ dunif(0,5) # Between-study SD in logit(fpr) prec1 <- pow(sd1, -2) # Precision prec0 <- pow(sd0, -2) # Precision logit(summtpr) <- mean1 # Define summary TPR logit(summfpr) <- mean0 # Define summary FPR summspec <- 1 - summfpr # Summary specificity } # Initial values: list(mean1 = 0, sd1 = 1, mean0 = -1, sd0 = 0.5) list(mean1 = 2, sd1 = 0.5, mean0 = -2, sd0 = 1) # Data: tp[] fn[] fp[] tn[] 6 0 0 23 5 0 0 51 10 1 0 86 37 6 7 392 END
Footnotes
- 1
O’Dowd et al (2014) Brain metastases following radical surgical treatment of non-small cell lung cancer: is preoperative brain imaging important? Lung Cancer. 2014 Nov;86(2):185–9 [PubMed: 25239395]
- b
O’Dowd et al (2014) Brain metastases following radical surgical treatment of non-small cell lung cancer: is preoperative brain imaging important? Lung Cancer. 2014 Nov;86(2):185–9 [PubMed: 25239395]
- c
NICE DSU TSD 19: Partitioned survival analysis for decision modelling in health care: a critical review (2007)
- d
Yoo H, Nam B-H, Yang H-S, Shin SH, Lee JS, Lee SH. Growth rates of metastatic brain tumors in non-small cell lung cancer. Cancer 2008;113(5):1043–7. [PubMed: 18618515]
- e
Yokoi et al Chest. 1999 Mar;115(3):714–9. [PubMed: 10084481]
- f
Guyot et a (2012) Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan-Meier survival curves. BMC Medical Research Methodology [PMC free article: PMC3313891] [PubMed: 22297116]
- g
- h
- i
Kocher et al (2011) Adjuvant Whole-Brain Radiotherapy Versus Observation After Radiosurgery or Surgical Resection of One to Three Cerebral Metastases: Results of the EORTC 22952–26001 Study. Journal of Clinical Oncology [PMC free article: PMC3058272] [PubMed: 21041710]
- j
Brown et al (2016) Effect of Radiosurgery Alone vs Radiosurgery With Whole Brain Radiation Therapy on Cognitive Function in Patients With 1 to 3 Brain Metastases. JAMA [PMC free article: PMC5313044] [PubMed: 27458945]
- k
Sperduto et al (2016) Estimating Survival in Patients With Lung Cancer and Brain Metastases. JAMA Oncology [PMC free article: PMC5824323] [PubMed: 27892978]
- l
The National Institute for Health and Care Excellence (2018). Brain tumours (primary) and brain metastases in adults
- m
National Lung Cancer Audit (2018). NLCA annual report 2017. [online] Available at: https://www
.rcplondon .ac.uk/projects/outputs /nlca-annual-report-2017 [Accessed 7 Aug. 2018]. - n
EORTC Data Centre (2018) Personal Communication with NICE Centre for Guidelines
- o
Improvement
.nhs.uk. (2018). Reference costs | NHS Improvement. [online] Available at: https://improvement .nhs .uk/resources/reference-costs/ [Accessed 6 Aug. 2018]. - p
Curtis, Lesley A. and Burns, Amanda (2017) Unit Costs of Health and Social Care 2017. Report number: https://doi
.org/10.22024/UniKent/01 .02/65559. Personal Social Services Research Unit, University of Kent, 260 pp. ISBN 978-1-911353-04-1. (doi:10.22024/UniKent/01.02/65559) (Full text available) [CrossRef] - q
Drugtariff
.nhsbsa.nhs.uk. (2018). NHS Electronic Drug Tariff. [online] Available at: http://www .drugtariff .nhsbsa.nhs.uk/#/00446515-DC_2 /DC00446511/Home [Accessed 14 May 2018]. - r
GOV
.UK. (2018). Drugs and pharmaceutical electronic market information tool (eMIT). [online] Available at: https://www .gov.uk/government /publications /drugs-andpharmaceutical-electronic-market-information-emit [Accessed 7 Aug. 2018]. - s
Lester-Coll, Nataniel H., Charles E. Rutter, Trevor J. Bledsoe, Sarah B. Goldberg, Roy H. Decker, and B. Yu James. “Cost-effectiveness of surgery, stereotactic body radiation therapy, and systemic therapy for pulmonary oligometastases.” International Journal of Radiation Oncology* Biology* Physics95, no. 2 (2016): 663–672. [PubMed: 27055395]
- t
Nafees, B., Stafford, M., Gavriel, S., Bhalla, S. and Watkins, J., 2008. Health state utilities for non small cell lung cancer. Health and quality of life outcomes, 6(1), p.84. [PMC free article: PMC2579282] [PubMed: 18939982]
- u
Bendixen, M., Jørgensen, O.D., Kronborg, C., Andersen, C. and Licht, P.B., 2016. Postoperative pain and quality of life after lobectomy via video-assisted thoracoscopic surgery or anterolateral thoracotomy for early stage lung cancer: a randomised controlled trial. The Lancet Oncology, 17(6), pp.836–844. [PubMed: 27160473]
- v
Dominant here refers to the intervention being less expensive and more effective than the comparator.
Final
Evidence reviews
These evidence reviews were developed by the NICE Guideline Updates Team
Disclaimer: The recommendations in this guideline represent the view of NICE, arrived at after careful consideration of the evidence available. When exercising their judgement, professionals are expected to take this guideline fully into account, alongside the individual needs, preferences and values of their patients or service users. The recommendations in this guideline are not mandatory and the guideline does not override the responsibility of healthcare professionals to make decisions appropriate to the circumstances of the individual patient, in consultation with the patient and/or their carer or guardian.
Local commissioners and/or providers have a responsibility to enable the guideline to be applied when individual health professionals and their patients or service users wish to use it. They should do so in the context of local and national priorities for funding and developing services, and in light of their duties to have due regard to the need to eliminate unlawful discrimination, to advance equality of opportunity and to reduce health inequalities. Nothing in this guideline should be interpreted in a way that would be inconsistent with compliance with those duties.
NICE guidelines cover health and care in England. Decisions on how they apply in other UK countries are made by ministers in the Welsh Government, Scottish Government, and Northern Ireland Executive. All NICE guidance is subject to regular review and may be updated or withdrawn.