Skip Navigation

JNCI Journal of the National Cancer Institute 2006 98(11):774-782; doi:10.1093/jnci/djj210
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplemental Table
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (10)
Right arrow Request Permissions
Google Scholar
Right arrow Articles by Stout, N. K.
Right arrow Articles by Fryback, D. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stout, N. K.
Right arrow Articles by Fryback, D. G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press.

ARTICLE

Retrospective Cost-effectiveness Analysis of Screening Mammography

Natasha K. Stout, Marjorie A. Rosenberg, Amy Trentham-Dietz, Maureen A. Smith, Stephen M. Robinson, Dennis G. Fryback

Affiliations of authors: Center for Risk Analysis, Harvard School of Public Health, Boston, MA (NKS); Department of Actuarial Science and Risk Management (MAR), Department of Biostatistics and Medical Informatics (MAR), Department of Population Health Sciences (ATD, MAS, DGF), Department of Industrial and Systems Engineering (SMR, DGF), University of Wisconsin, Madison, WI; University of Wisconsin Comprehensive Cancer Center, Madison, WI (MAR, ATD, MAS, DGF)

Correspondence to: Natasha K. Stout, PhD, Center for Risk Analysis, Harvard School of Public Health, 718 Huntington Ave., Boston, MA 02115 (e-mail: nstout{at}hsph.harvard.edu).


    ABSTRACT
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Background: Many guidelines recommend screening mammography every 1–2 years for women older than 40 years; more than 70% of women now participate in routine screening. No studies have examined the societal impact of screening practices over the past decade in the United States on costs and quality-adjusted life-years (QALYs). We performed a retrospective cost-effectiveness analysis comparing actual and alternative screening mammography scenarios. Methods: We used a discrete-event simulation model of breast cancer epidemiology to estimate the costs and the number of QALYs that were associated with observed screening mammography patterns in the United States from 1990 to 2000 for women aged 40 years or older. We also estimated costs and QALYS for no screening and for 64 alternative screening scenarios. Incremental cost-effectiveness ratios were computed. Sensitivity analyses were performed on key parameters. Results: Actual U.S. screening patterns from 1990 to 2000 accrued 947.5 million QALYs and cost $166 billion over the lifetimes of the screened women, resulting in a gain of 1.7 million QALYs for an additional cost of $62.5 billion compared with no screening. Among those polices that were not dominated—i.e., for which no alternative existed that produced more QALYs for lower costs—screening all women aged 40–80 years annually per some U.S. guidelines was the most expensive option, costing $58 000 per additional QALY gained compared with the next most costly alternative, screening all women aged 45–80 years annually. Many alternative screening scenarios generated more QALYs for less cost (with savings up to $6 billion) than actual screening patterns over the study period. Sensitivity analysis showed that conclusions about the cost-effectiveness of screening mammography policies were highly sensitive to small, short-term detrimental effects on quality of life from the screening test itself. Conclusions: Choosing among the efficient policies to guide current screening recommendations requires consideration of costs to promote participation in screening and measurement of acute quality-of-life effects of mammography.



    INTRODUCTION
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Since its introduction as a screening tool in the 1980s, screening mammography has been widely disseminated in the United States. Mammography guidelines have gradually broadened and now recommend that all women older than 40 years be screened annually or biennially (1,2). In 2001, nearly 70% of U.S. women reported having had a mammogram within the past 2 years (3). Breast cancer mortality rates in the United States have slowly declined over the past decade, probably in part because of the dissemination of screening mammography (4). However, this mortality benefit has been offset by smaller (i.e., nonlethal) but frequent risks associated with mammography, including false-positive test results (5); subsequent, often invasive, diagnostic follow-up tests; and the potential for overdiagnosis (6). With these tradeoffs in mind, an analysis of screening guidelines relative to actual screening performance may help to inform current policies regarding screening mammography.

Previous cost-effectiveness studies (712) have concluded that, in general, future implementation of screening mammography would be cost-effective compared with no screening. However, none of those studies evaluated the gap in screening performance between recommended guidelines for screening mammography and screening as it has been implemented in the United States historically because none included the observed screening dissemination as a basis for comparison. Furthermore, previous analyses tended to focus on narrow age ranges or on particular age cohorts, not on the entire U.S. population over time. Because the choice of study population can potentially affect the cost-effectiveness of medical interventions (13) and because breast cancer risk and the accuracy of screening mammography are functions of age, it is important to evaluate screening mammography practices across the entire population that would be affected.

Recently, positive new light has been shed on the debate over the effectiveness of screening mammography for reducing breast cancer mortality. A consortium of investigators who used seven independent statistical and simulation models of the natural history of breast cancer reported that routine screening use has contributed to the observed reduction in U.S. breast cancer mortality seen over the last 25 years (14,15). Given this new evidence, we have used one of those seven models to conduct a cost-effectiveness analysis of screening mammography. Our analysis posed a different question from that posed by many cost-effectiveness analyses. Specifically, we retrospectively examined the cost-effectiveness of screening programs for the U.S. female population for 1990–2000. We asked what happens to the total expenditures and quality-adjusted life-years (QALYs) if other screening practices, many of them congruent with recommended guidelines, had been substituted for the actual screening practices of the 1990s. We considered observed screening patterns as well as no screening and alternative screening scenarios that encompass U.S. guidelines to evaluate how effectively resources are actually being spent to improve women's health in the United States.


    METHODS
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Model Overview

To compute total costs and health effects associated with different screening mammography scenarios, we used a discrete-event simulation model of breast cancer epidemiology in a population over time that was developed at the University of Wisconsin as part of the National Cancer Institute's Cancer Intervention and Surveillance Modeling Network. One of the original purposes of this model was to investigate the individual and combined contributions of changes in mammography screening and adjuvant therapies for breast cancer from 1975 through 2000 to the observed reductions in breast cancer mortality from 1990 through 2000 (14). As such, the model incorporates secular trends in breast cancer risk, screening use, and treatment dissemination. By simulating the individual life histories of women aged 20 years or older who were born in 1891 through 1980 in proportion to their prevalence in the U.S. population and aggregating the outcomes, the model can replicate population-level U.S. cancer surveillance data corresponding to calendar years 1975 through 2000.

An overview of model structure relevant to this analysis is provided here. Unobservable model parameters that controlled the natural history and detection of breast cancer were estimated through the process of calibration to fit observed age-adjusted, stage-specific breast cancer incidence data from the Surveillance, Epidemiology, and End Results (SEER) program (16) and breast cancer mortality data for 1975 through 2000 reported by National Center for Health Statistics (National Center for Health Statistics, National Health Interview Survey Web site http://www.cdc.gov/nchs/deaths.htm links to survey description documentation and data). The model has been cross-validated against data from the Wisconsin Cancer Reporting System, a cancer registry independent of SEER (17). Separate analyses based on this model produced results that were congruent with those from analyses based on other models of the natural history of breast cancer (17). Model design, assumptions, and validation have also been described elsewhere (1719) and are available at http://www.cisnet.cancer.gov/profiles.

Model Specification

Individual life histories were generated through four model components: the natural history of breast cancer, breast cancer detection, breast cancer treatment, and breast cancer mortality. The natural history component modeled breast cancer occult inception and progression as a function of a woman's birth year and current age. Secular trends in breast cancer incidence that account for changes in risk factor prevalence were incorporated into the likelihood of cancer onset (20). After onset, breast cancers were assumed to progress in disease stage according to a stochastic Gompertz-type growth model that controls the size and the extent of the tumors (2124). Tumors were assigned a SEER historical stage (in situ, localized, regional, or distant) based on tumor size and spread to lymph nodes at time of diagnosis in the model for statistical tabulation. It is assumed that not all in situ or small localized invasive cancers progress to be a lethal threat to the woman (17).

Breast cancer can be detected by either of two methods: screening mammography or routine clinical detection (i.e., self and/or clinical breast examinations). In the model, the sensitivity of mammography and the likelihood of a routine clinical detection were functions of a woman's age and tumor size. Also, the sensitivity of mammography was assumed to improve over time, corresponding to improvements in mammographic technology. These parameters were estimated via model calibration and correspond to estimates of sensitivity derived from observational data, which ranged from 60% to 90% (2527). Specificity of mammography, a function of age, ranges from 88% to 90% (28). Routine clinical detection was also assumed to improve over time, coinciding with increasing dissemination of mammography and of the awareness by both women and physicians of breast cancer as a disease. In the model, routine clinical detection could occur at any time, whereas the timing of screening mammograms was based on the assigned screening scenario. Screening scenarios considered in this analysis are described in a following section.

The model assumed that all women received treatment at the time of breast cancer detection. Adjuvant therapy with chemotherapy and/or tamoxifen was assigned in accordance with observed U.S. dissemination patterns (29). The effectiveness of treatment was a function of age at diagnosis, tumor characteristics (stage at detection and estrogen receptor status), and receipt of adjuvant treatment (30,31), and was modeled independent of the method of breast cancer detection. An effective treatment was assumed to halt breast cancer progression.

In the model, death could be due to breast cancer or to other causes. We used U.S. birth cohort–based life tables, with breast cancer as a cause of death removed (32,33), to assign date of death due to causes other than breast cancer to each woman. A date of death from breast cancer based on mortality derived from women who were diagnosed with distant-stage disease in the SEER program (16) was assigned to a woman after her breast cancer progressed to the distant or metastatic stage. The timing and cause of death were determined by the earlier of the two dates of death (breast cancer or other causes).

Quality of Life

Age-specific quality-of-life utility weights were specified for four health states: healthy; breast cancer diagnosed and treated in the in situ or localized stage; breast cancer diagnosed and treated in the regional stage; and distant-stage breast cancer (Table 1). The age-specific quality-of-life utility weights for healthy women were derived from EuroQol EQ-5D quality-of-life utility scores. The EQ-5D is a widely used utility-based measure for general health (34). These data were collected in the 2000 wave of the Medical Expenditure Panel Survey, a nationally representative survey of health care usage, insurance, and costs in the United States (http://www.meps.ahrq.gov/). We used percentages of the age-specific EQ-5D scores for healthy women to estimate the potential negative effects of a breast cancer diagnosis and treatment on a woman's quality of life for a limited period. Our values were consistent with treatment-specific quality-of-life weights reported in other studies (35). We assigned a utility weight for every 6-month period in a woman's lifetime; the sum of those weights across the woman's lifetime equaled the QALYs accrued for that woman (13).


View this table:
[in this window]
[in a new window]
 
Table 1.  Yearly quality-of-life weights

 
Costs

The costs incorporated into the model (Table 2) included the average cost of a screening mammogram ($70), the average cost of diagnostic follow-up to a positive or abnormal screening result or the average cost of diagnosis from clinical detection ($533), and the treatment costs (range: $12 000–$27 000 depending on the stage at detection of breast cancer; treatment costs include the costs of initial treatment, of continuing treatment, and of terminal treatment for in situ, localized, regional, distant) (3638). Indirect costs associated with time and travel to and from the test and future medical care costs were not included in this analysis. All cost estimates were adjusted to calendar year 2000 dollars by using the medical care component of the Bureau of Labor Statistics Consumer Price Index (39).


View this table:
[in this window]
[in a new window]
 
Table 2.  Costs associated with breast cancer screening and treatment

 
Screening Scenarios

Actual screening patterns were replicated by stochastically assigning women to an age at first screening mammogram and to a subsequent screening frequency that was in accordance with observed U.S. dissemination and usage patterns (40). These mammography dissemination and usage patterns, which were composed of a mixture of frequent and infrequent screening patterns and included non-users, were informed by and are consistent with data from the National Health Interview Survey (National Center for Health Statistics, National Health Interview Survey Web site http://www.cdc.gov/nchs/nhis.htm) and the Breast Cancer Surveillance Consortium (http://breastscreening.cancer.gov/) (41). Under this screening scenario in the simulation model, the percentage of women who had never been screened by mammography decreased from 75% in 1985 to 25% in 2000 and the percentage of women who were screened every 1 or 2 years increased from 20% to 50% (15,40) and population participation in screening mammography increased from 0% in the early 1980s to nearly 70% in 2000, closely replicating age-specific screening participation patterns observed in the United States during this period (3).

The "no screening" scenario assumed no mammographic screening in the population after 1989. Sixty-four additional scenarios, each with a particular fixed screening schedule beginning in 1990, were modeled. The fixed-schedule scenarios varied by the age at the first screen (40, 45, 50, or 55 years) and at the last screen (65, 70, 75, or 80 years) and by the screening interval (1, 2, 3, or 5 years). To mimic the initiation of a new screening program, we assumed that all women in the age range indicated by the screening scenario participated and that in 1990, the starting year, all women in the age range specified by the screening scenario began screening. Because past screening use affects current screening outcomes (18) and thus may affect the cost-effectiveness of the screening programs for all scenarios, including the no-screening scenario, we assumed that the actual (historical) screening dissemination patterns had occurred before the start of the study in 1990. This assumption makes the "actual practice" scenario consistent with the full history of screening practices, so that each of the other scenarios represents a hypothetical change to historical practices beginning in 1990.

Cost-effectiveness Analysis

Our analysis examined the performance of screening mammography from 1990 to 2000. The population investigated included all women in the United States who were aged 40 years or older during the study period, comprising birth cohorts from 1891 to 1960 and representing approximately 95 million women. The women received screening mammograms during this 10-year period in accordance with the specified screening scenario. The costs and effects of screening, of a breast cancer diagnosis, and of treatment for breast cancer were calculated for each woman starting in calendar year 1990 and ending on the date of her death. For women who died after calendar year 2000, rates of breast cancer onset, detection by routine clinical finding, and treatment effectiveness were defined according to the prevailing rates in the year 2000. Evaluations of the costs and effects of the screening scenarios were meant to capture the lifetime effects of the 10-year period of screening in the population. A 3% annual discount rate was applied to both costs and QALYs, as recommended by the Panel on Cost-effectiveness in Health and Medicine (13).

We conducted a separate run of the simulation model for each screening scenario to compute the total costs and QALYs for that scenario; each run of the model used the same period; natural history assumptions, including those underlying the onset of breast cancer; and population. A unique feature of the model is that it uses common random numbers, a technique of variance reduction (42,43) in which the same random number sequences are used across runs. This technique allows for counterfactual analyses because the same population of women is compared across screening scenarios. Thus the population under investigation at the start of the analysis in 1990 was identical in the prevalence of underlying breast cancer across all screening scenarios. Therefore, differences in costs and QALYs across scenarios are the direct result of the screening pattern assigned to each woman and their consequent effects on health. For each screening scenario, this process was repeated 25 times by using different random numbers to generate a distribution of results. Outputs are reported as the mean and 95% confidence interval (CI) for the replications. These confidence intervals provide an indication of variation due to stochastic sources of variation within the model.

Sensitivity Analyses

In general, the purpose of a cost-effectiveness analysis is to construct a model that produces as meaningful an estimate as possible of the costs, QALYs, and incremental cost-effectiveness ratios of different interventions (13). Traditional statistical hypothesis testing is not appropriate. Instead, to understand the effect of key model parameters and assumptions on the stability of the incremental cost-effectiveness ratios, sensitivity analyses are performed (13). In a sensitivity analysis, cost-effectiveness ratios are recomputed by changing influential model variables. The extent of change in the ratios provides an indication of the robustness of model conclusions due to model uncertainty. For our sensitivity analysis, we varied the level of population participation in the alternative screening scenarios and the quality of life associated with a screening mammogram across plausible ranges. Costs, QALYs, and incremental cost-effectiveness ratios were recomputed with these changes in parameter values and compared with our initial result to isolate the effect of that parameter on the conclusions drawn about the cost-effectiveness of screening mammography.


    RESULTS
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Actual Screening Patterns

The actual screening patterns for women aged 40 years or older resulted in 947.5 million QALYs (1.596 billion QALYs undiscounted) at a total cost of $166 billion ($301 billion undiscounted) for breast cancer detection and treatment. These totals reflect the observed history of screening dissemination in the United States from 1990 to 2000 and the future effects of this decade of screening on the population. By comparison, in the absence of any screening during the 1990s (no-screening scenario), 945.8 million QALYs (1.592 billion QALYs undiscounted) would have accrued at a cost of $103 billion ($199 billion undiscounted) as a result of routine clinical detection and subsequent treatment. Therefore, as a result of the screening program in place during the 1990s, U.S. women gained 1.7 million QALYs at an additional cost of nearly $63 billion. The model also predicts that, with screening, approximately 25% more breast cancers were detected overall compared with no screening.

Alternative Screening Scenarios

The total costs of the alternative scenarios ranged from $117 billion to $252 billion ($219 billion to $426 billion undiscounted, respectively), and the total QALYs accrued ranged from 945.3 million to 949.6 million (1.593 billion to 1.602 billion QALYs undiscounted, respectively). The screening scenarios with the shorter screening intervals and the broader age ranges were the most costly but accrued the most QALYs. Figure 1 depicts the total costs and QALYs accrued for all simulated scenarios and for actual screening patterns and no screening. The no-screening scenario was the least costly option and accrued the fewest QALYs. Scenarios that had higher costs and that accrued fewer QALYs were dominated by scenarios that accrued more QALYs for the same or lower cost. In Fig. 1, the solid line that connects the 11 screening scenarios that were not dominated (i.e., the "efficient" policies in economic terminology, that is, policies for which no alternative policy exists that results in more QALYs for lower costs), represents the efficient frontier (i.e., the set of efficient policies) (13).


Figure 1
View larger version (11K):
[in this window]
[in a new window]
 
Fig. 1. Costs and quality-adjusted life-years associated with simulated screening policies. Each dot represents one of the 64 alternative policies analyzed. The solid line connects the efficient policies (i.e., policies for which no alternative policy exists that results in more quality-adjusted life-years for lower costs). The star indicates the cost and quality-adjusted life-years associated with actual U.S. screening patterns, which consisted of a mixture of frequent and infrequent screening patterns and included non-users. The diamond shows the cost and quality-adjusted life-years associated with no screening after 1990. Total quality-adjusted life-years and total cost were discounted at 3%.

 
Actual screening practice from 1990 to 2000, depicted by the star in Fig. 1, was not on the efficient frontier; many of the alternative scenarios achieved more QALYs for the same or lower costs. For example, compared with the actual screening patterns over the study period, implementing the efficient strategy of screening all women aged 45–70 years every 3 years could have saved approximately $6 billion while accruing an equivalent number of QALYs.

Cost-effectiveness Analysis

To understand the relative performance of the efficient screening scenarios, we performed an incremental cost-effectiveness analysis. Incremental cost-effectiveness ratios were computed by comparing the differences in costs and QALYs between the efficient policies ranked in increasing order of costs. Table 3 shows total costs, total QALYs, and incremental cost-effectiveness ratios for the 11 efficient scenarios.


View this table:
[in this window]
[in a new window]
 
Table 3.  Costs, QALYs, and incremental cost-effectiveness ratios for efficient screening scenarios*

 
Screening and treatment for the scenario of screening all women aged 55–70 years every 5 years would cost $27 000 per additional QALY compared with no screening. Screening the same age group but increasing the frequency of screening to every 3 years would cost $28 000 per additional QALY. The incremental increase in QALYs became increasingly expensive as the age ranges became wider and the screening frequency increased.

Policies that resemble current U.S. guidelines for mammography screening (e.g., screening all women aged 40–80 years annually) were on the efficient frontier. However, they were also the most costly screening scenarios as shown by their location in the upper right tail of the frontier in Fig. 1. Screening all women in the United States aged 40–80 years every year resulted in a gain of more than 2 million QALYs at an incremental cost of $87 billion more than the cost of actual screening patterns (i.e., 949.6 million at a cost of $252 billion for the policy resembling the guidelines, compared with 947.5 million QALYs at a cost of $166 billion for actual screening patterns), a cost of $40 000 per additional QALY. Most of these additional costs are from the screening mammograms themselves and their associated follow-up, because the model predicted that more than 2.4 times more mammograms would be performed under this scenario than under actual screening practice. As a result of these additional mammograms, many more women would experience a false-positive mammogram, but there would be 25% fewer breast cancer deaths over the lifetimes of this population compared with the number of breast cancer deaths observed under actual screening patterns. (Supplementary Table 1, containing additional simulation model outcomes, including the number of mammograms, the number of screen and clinical detected cases, and the number of breast cancer deaths for each screening scenario, can be found at http://jncicancerspectrum.oxfordjournals.org/jnci/content/vol98/issue11.)

Sensitivity Analysis

We found that the results were sensitive to the level of population participation in the alternative screening scenarios. Our initial assumption for these scenarios was 100% participation from the onset of the screening program in 1990 to the end of the study in 2000. Actual participation rates were considerably less than 100% over the study period, ranging from approximately 40% in 1990 to approximately 70% in 2000 (3). Figure 2 compares the screening scenarios under three population participation rates: 100% (initial assumption), 70%, and 50%. With 50% participation, both the costs and the QALYs achieved in the screening scenarios were reduced compared with those for higher participation; the efficient frontier shifted downward and approached the total cost and QALYs achieved by actual screening patterns. Thus, actual screening patterns appear efficient compared with the efficient screening scenarios under the assumption of 50% participation but not under the assumption of 100% participation.


Figure 2
View larger version (11K):
[in this window]
[in a new window]
 
Fig. 2. Sensitivity analysis comparing 100%, 70%, and 50% participation in the simulated screening policies. Costs and quality-adjusted life-years are shown for the efficient policies for each of the participation assumptions. The upper efficient frontier (solid line) shows the performance assuming 100% population participation and is the same as in Fig. 1. The two lower lines show the efficient frontiers under assumptions of 70% (short-dashed line) and 50% (long-dashed line) participation from the onset of the screening program in 1990. The star indicates the cost and quality-adjusted life-years associated with actual U.S. screening patterns, which consisted of a mixture of frequent and infrequent screening patterns and included nonusers, and remained unchanged under various assumptions about population participation in the alternative screening scenarios. The diamond shows the cost and quality-adjusted life-years associated with no screening after 1990.

 
Model conclusions about the efficiency of screening scenarios were also sensitive to assumptions about quality-of-life effects associated with screening mammography itself. In the first analysis, we assumed that the screening mammogram per se did not affect quality of life. In sensitivity analyses, we assumed that screening participation had small, short-term detrimental effects on quality of life, an approach used in other analyses (44). We also assumed that false-positive mammograms had a somewhat larger detrimental effect on quality of life than routine mammograms. The quality-of-life weights associated with these assumptions are listed in Table 1. As expected, inclusion of these small, short-term decrements to quality of life following a screening mammogram reduced the QALYs gained from the screening scenarios, an effect that was more pronounced in the scenarios with more frequent screening. Thus, because the costs were not affected by the quality-of-life assumption but the QALYs were reduced, the efficiencies of screening scenarios that recommend more frequent screening for more women were also reduced. For example, whereas the cost per additional QALY for screening all women aged 40–80 years every year compared with current practice was $40 000 in the analysis that assumed no quality-of-life effects, this ratio increased to $128 000 per additional QALY when detrimental quality-of-life effects for a screening mammogram were included.

Table 4 shows results of the incremental cost-effectiveness analysis of the efficient screening scenarios that included the assumption about the detrimental quality-of-life effects associated with a screening mammogram. The set of efficient policies under this assumption differed slightly from those under the original analysis. Screening women aged 40–80 years every year in accordance with current U.S. guidelines was no longer among the efficient policies. The broadest and most intense screening scenario among the efficient policies was one that screened women aged 45–80 years every year at a cost of more than $300 000 per additional QALY accrued beyond those accrued by screening women aged 45–80 years every 2 years.


View this table:
[in this window]
[in a new window]
 
Table 4.  Sensitivity analysis for inclusion of quality-of-life effects from a screening mammogram: costs, QALYs, and incremental cost-effectiveness ratios for efficient screening scenarios*

 

    DISCUSSION
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
We estimated that, compared with no screening, the gain in total QALYs from actual screening practice was 1.7 million QALYs for the study population of 95 million women. This gain of 0.02 QALYs per woman (or 0.05 QALYs per woman, undiscounted) in the population from 10 years of the current screening practices may appear to be small. However, many more women participate in screening mammography than actually benefit, as indicated by the low incidence of breast cancer. Our estimate of a gain of 0.02 QALYs per woman is consistent in magnitude with the health gains reported from breast cancer screening programs specifically and in general for screening and prevention programs for other diseases (11,45).

Compared with no screening during the study period (1990–2000), the prevailing screening mammography practices incurred an additional $63 billion in screening, diagnosis, and treatment expenses over the lifetimes of the women in the population to achieve the 0.02 QALY per woman gain. Although $63 billion is a large expenditure in total, it is an attractive investment if one considers the commonly accepted benchmark for cost-effectiveness of $50 000 per QALY (46); we found that the incremental cost per QALY accrued was approximately $37 000 for actual screening patterns compared with no screening. It is difficult to directly compare this result with those of previously published analyses because past results varied considerably, in part because of differences in the analytic assumptions (including the age structure of the study population, incorporation of past screening, and quality adjustment of life-years and screening scenarios evaluated) (7,47). However, our results reiterate the overall conclusion from these previous analyses that, in general, a screening mammography program can be cost-effective compared with no screening (8,11,46,48).

When we compared actual screening patterns with alternative screening scenarios, the results suggested that there is room for improvement in current screening mammography practice. In Fig. 1, screening scenarios located on the efficient frontier above and to the left of the actual practice scenario (starred) gained the same or more QALYs at the same or lower costs. However, these scenarios were generally less inclusive or intensive than the current screening recommendations and thus are unlikely to be acceptable to the general public. Screening scenarios located on the efficient frontier above and to the right of the actual practice scenario are polices that gained more QALYs compared with current practice, although they did so at a higher cost. Although substantial gains in QALYs relative to observed practice could have been achieved by these policies, all of which would be judged to be attractive investments (less than $50 000/QALY) if compared incrementally with observed practice, the total expenditures associated with these policies are large and this fact should be factored into decisions about future improvements to current screening mammography practice. There are caveats to this conclusion, however, and we turn to them next.

Two additional aspects of the screening program appear to be important for assessing its costs and effectiveness beyond those accounted for in a cost-effectiveness ratio: the costs required to increase participation, and the potentially detrimental quality-of-life effects associated with a screening mammogram. Compared with the actual screening rate in 2000, annual screening with a participation rate of 100% among women aged 40–80 years, as called for by some current U.S. recommendations (i.e., the most expensive policy in Table 3), would incur increased costs and also gain QALYs. The increased costs would result from additional mammograms, additional follow-up testing, and treatment. The gain in QALYs would result from additional reduced breast cancer mortality. Compared with current practice, the additional QALYs would cost $40 000 per QALY. However, this figure does not include the costs of attaining 100% screening participation. Although costs to increase participation have rarely been examined in previous cost-effectiveness analyses for screening mammography, Mandelblatt et al. (49) provided an example in which screening programs may not always remain cost-effective when these costs are included. Our population analysis suggested that we could spend as much as $20 billion (or $34 per woman per year for the population of 95 million women for the 10-year screening period) to increase participation and still keep the incremental cost-effectiveness ratio lower than the commonly accepted threshold of $50 000 per QALY (i.e., adding up to $20 billion for increasing participation to the total costs of current screening practice would raise the cost per QALY from $37 000 to approximately $50 000). However, even though such an expenditure would be cost-effective, it is large enough in total magnitude that total costs as well as cost-effectiveness may be an important consideration in policy-making decisions concerning an expenditure of this amount. Paying to promote adherence to less-intensive screening protocols (i.e., policies less far out on the efficient frontier curve in Fig. 1 but still to the right of the star) would incur less total costs and still gain many incremental QALYs.

The second factor to consider with respect to evaluating screening policies is the short-term pain and anxiety reported by some women after a screening mammogram—in particular, after a false-positive screening mammogram (5053). Modeling these transient health states is difficult. Nonetheless, based on our results it appears to be important to account for small yet common negative effects associated with mammography in the design and evaluation of a screening program, especially because of the high risk of a false-positive test result (5,54). We found that incorporating modest negative quality-of-life effects due to the screening mammogram itself attenuated the QALYs achievable from screening, thus dramatically increasing the incremental cost per QALY for alternative screening scenarios. For example, the cost per additional QALY accrued to achieve 100% participation for all women aged 40–80 years in annual screening from actual practice was $128 000 when a modest disutility was included. Any additional spending to increase participation in recommended screening programs would increase the cost per QALY even further beyond the $50 000/QALY threshold. Because women's perceptions of potential negative quality-of-life effects from the mammogram itself may contribute to lower participation rates, perhaps a substantial portion of the $34 per woman per year that would be required to increase participation could be spent to reduce women's apprehensions concerning physical discomfort from a mammogram; this approach would promote the double effect of increasing QALYs and participation. In any event, policies that appear societally attractive from the perspective of incremental costs and benefits appear considerably more complicated when the disutility of the mammogram itself is considered.

Our analysis has limitations. First, the natural history of breast cancer was unobservable. As with any model, unobservable parameters were constrained by structural assumptions and relevant parameters that were fit from the data. In developing our simulation model, we fit our submodel of the natural history and progression of breast cancer by using 25 years of SEER data. Our assumptions about the natural history of breast cancer were biologically plausible, and the model produced results that were consistent with observed U.S. trends in breast cancer incidence and mortality (17) and with other models of the natural history of breast cancer (15). Second, our analysis focused on the direct medical costs associated with breast cancer screening, follow-up, and treatment. A full societal approach would incorporate personal time costs for women being screened, followed up, and treated—data that we did not have. Incorporating these personal time costs would further reduce the amount of money that could be spent on increasing participation. Finally, we did not consider screening strategies that specified screening age ranges and frequency by individual risk of breast cancer. A future direction is to explore these types of stratified screening strategies that differentially target women who may be at higher risk for breast cancer with more frequent screening.

Current breast cancer screening practice appears to have contributed substantially to the health of women in the United States. However, because many of the alternative screening scenarios examined in our analysis were more effective and less costly than the actual screening patterns from 1990 to 2000, it is likely that current screening programs can be improved. Choosing among the efficient policies to guide current screening recommendations is a complex, high-level decision that is beyond the scope of one cost-effectiveness analysis. Policies directed to moving from current screening practice to full participation in current guidelines (i.e., annual screening for all women aged 40 years or older) are attractive from a cost-effectiveness standpoint. Yet, they are expensive investments in total magnitude. Our analysis identifies other important factors that should also be considered in guiding future screening policy, including the costs of increasing participation and of short-term disutility associated with a mammogram. An investment in studies that will provide empiric data on these two important quantities should be a high priority for policy makers who are hoping to improve population-based programs for breast cancer screening.


    NOTES
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Supported by an Agency for Healthcare Research and Quality Training Grant to the University of Wisconsin (HS00083, PI: D. G. Fryback), by the National Cancer Institute CISNET Consortium (CA88211, PI: D. G. Fryback), and by the Harvard Center for Risk Analysis.

The sponsors had no role in the study design, data collection and analysis, or the interpretation of the findings.

A previous version of this work was presented at the 2004 Annual Meeting of the Society for Medical Decision Making.

We thank Dr. Eric Feuer and the members of the National Cancer Institute's Cancer Intervention and Surveillance Modeling Network for their invaluable feedback on the development of the breast cancer simulation model. We gratefully acknowledge helpful discussions with Dr. Sue Goldie, Dr. Patrick Remington, and Dr. Vipat Kuruchittham. We also gratefully acknowledge the National Cancer Institute–funded Breast Cancer Surveillance Consortium for providing information on breast cancer screening practices (grant numbers U01CA63740, U01CA86076, U01CA86082, U01CA63736, U01CA70013, U01CA69976, U01CA63731, and U01CA70040).


    REFERENCES
 Top
 Notes
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 

(1) Smith RA, Cokkinides V, Eyre HJ. American Cancer Society guidelines for the early detection of cancer, 2004. CA Cancer J Clin 2004;54:41–52.[Abstract/Free Full Text]

(2) US Preventive Services Task Force. Screening for breast cancer: recommendations and rationale. Ann Intern Med 2002;137:344–6.[Abstract/Free Full Text]

(3) Breen N, Wagener DK, Brown ML, Davis WW, Ballard-Barbash R. Progress in cancer screening over a decade: results of cancer screening from the 1987, 1992, 1998 National Health Interview Surveys. J Natl Cancer Inst 2001;93:1704–13.[Abstract/Free Full Text]

(4) Ries LA, Eisner MP, Kosary CL, Hankey BF, Miller AB, Clegg L, et al., eds. SEER Cancer Statistics Review, 1975–2000. Bethesda (MD): National Cancer Institute; 2003.

(5) Elmore JG, Barton MB, Moceri VM, Polk S, Arena PJ, Fletcher SW. Ten-year risk of false positive screening mammograms and clinical breast examinations. N Engl J Med 1998;338:1089–96.[Abstract/Free Full Text]

(6) Zahl PH, Strand BH, Maehlen J. Incidence of breast cancer in Norway and Sweden during introduction of nationwide screening: prospective cohort study. BMJ 2004;328:921–4.[Abstract/Free Full Text]

(7) Brown ML, Fintor L. Cost-effectiveness of breast cancer screening: preliminary results of a systematic review of the literature. Breast Cancer Res Treat 1993;25:113–8.[CrossRef][ISI][Medline]

(8) de Koning HJ, van Ineveld BM, van Oortmarssen GJ, de Haes JC, Collette HJA, Hendriks JH, et al. Breast cancer screening and cost-effectiveness; policy alternatives, quality of life considerations and the possible impact of uncertain factors. Int J Cancer 1991;49:531–7.[ISI][Medline]

(9) van der Maas PJ, de Koning HJ, van Ineveld BM, van Oortmarssen GJ, Habbema JD, Lubbe KT, et al. The cost-effectiveness of breast cancer screening. Int J Cancer 1989;43:1055–60.[ISI][Medline]

(10) van Oortmarssen GJ, Habbema JD, van der Maas PJ, de Koning HJ, Collette HJ, Verbeek AL, et al. A model for breast cancer screening. Cancer 1990;66:1601–12.[CrossRef][ISI][Medline]

(11) Lindfors KK, Rosenquist J. The cost-effectiveness of mammography screening strategies. JAMA 1995;274:881–4.[Abstract]

(12) Salzmann P, Kerlikowske K, Phillips KA. Cost-effectiveness of extending screening mammography guidelines to include women 40 to 49 years of age. Ann Intern Med 1997;127:955–65.[Abstract/Free Full Text]

(13) Gold MR, Siegel JE, Russell LB, Weinstein MC, eds. Cost-effectiveness in health and medicine. New York (NY): Oxford University Press;1996.

(14) NCI Statistical Research and Applications Branch. Cancer Intervention and Surveillance Modeling Network. 1999. Available from the National Cancer Institute at http://cisnet.cancer.gov/. [Last accessed: April 2002.]

(15) Berry DA, Cronin KA, Plevritis SK, Fryback DG, Clarke L, Zelen M, et al. Effect of screening and adjuvant treatment on mortality from breast cancer. N Engl J Med 2005;353:1784–92.[Abstract/Free Full Text]

(16) SEER. SEER*Stat Database: Incidence and survival—SEER 9 Regs, Nov 2002 Sub (1973–2000); 2003.

(17) Fryback DG, Stout NK, Rosenberg MA, Trentham-Dietz A, Kuruchittham V, Remington PL. The Wisconsin breast cancer epidemiology simulation model. J Natl Cancer Inst Monogr. In press 2006.

(18) Stout NK. Quantifying the benefits and risks of screening mammography for women, researchers, and policy makers: an analysis using a simulation model [Dissertation]. University of Wisconsin; 2004.

(19) Stout NK, Fryback DG, Rosenberg MA, Remington PL, Trentham-Dietz A, Tappa LM. Construction, verification, and calibration of a discrete event simulation model of breast cancer epidemiology in Wisconsin. Med Decis Making 2002;22:559.

(20) Cancer Intervention and Surveillance Modeling Network. Breast base case parameters. Available from the National Cancer Institute at http://cisnet.cancer.gov/projects/parameters_br.html. [Last accessed: June 2004.]

(21) Shwartz M. A Mathematical model used to analyze breast cancer screening strategies. Oper Res 1978;26:937–55.[CrossRef]

(22) Shwartz M. Validation and use of a mathematical model to estimate the benefits of screening younger women for breast cancer. Cancer Detect Prev 1981;4:595–601.[Medline]

(23) Spratt JA, von Fournier D, Spratt JS, Weber EE. Decelerating growth and human breast cancer. Cancer 1993;71:2013–9.[CrossRef][ISI][Medline]

(24) Spratt JS, Spratt JA. Growth rates. In: Donegan WL, Spratt JS, eds. Cancer of the Breast. 5th ed. Philadelphia (PA): Saunders; 2002. pp. 443–76.

(25) Kerlikowske K, Barclay J. Outcomes of modern screening mammography. J Natl Cancer Inst Monogr 1997;22:105–11.

(26) Mushlin AL, Kouides RW, Shapiro DE. Estimating the accuracy of screening mammography: a meta-analysis. Am J Prev Med 1998;14:143–52.[CrossRef][ISI][Medline]

(27) Shen Y, Zelen M. Screening sensitivity and sojourn time from breast cancer early detection clinical trials: mammograms and physical examinations. J Clin Oncol 2001;19:3490–9.[Abstract/Free Full Text]

(28) Kerlikowske K, Carney PA, Geller BM, Mandelson MT, Taplin SH, Malvin K, et al. Performance of screening mammography among women with and without a first-degree relative with breast cancer. Ann Intern Med 2000;133:855–63.[Abstract/Free Full Text]

(29) Mariotto A, Feuer EJ, Harlan LC, Wun LM, Johnson KA, Abrams J. Trends in use of adjuvant multi-agent chemotherapy and tamoxifen for breast cancer in the United States: 1975–1999. J Natl Cancer Inst 2002;94:1626–34.[Abstract/Free Full Text]

(30) Early Breast Cancer Trialists' Collaborative Group. Polychemotherapy for early breast cancer: an overview of randomized clinical trials. Lancet 1998;352:930–42.[CrossRef][ISI][Medline]

(31) Early Breast Cancer Trialists' Collaborative Group. Tamoxifen for early breast cancer: an overview of the randomised trials. Lancet 1998;351:1451–67.[CrossRef][ISI][Medline]

(32) Human Mortality Database. U.S. Female Death Rates by Year of Birth. 2003. Available from University of California, Berkeley (USA), and Max Planck Institute for Demographic Research (Germany) at http://www.mortality.org. [Last accessed: March 2003.]

(33) Rosenberg MA. Competing Risks. J Natl Cancer Inst Monogr. In press 2006.

(34) EuroQol Group. EuroQol—a new facility for the measurement of health-related quality of life. Health Policy 1990;16:199–208.[CrossRef][ISI][Medline]

(35) Earle CC, Chapman RH, Baker CS, Bell CM, Stone PW, Sandberg EA, et al. Systematic overview of cost-utility assessments in oncology. J Clin Oncol 2000;18:3302–17.[Abstract/Free Full Text]

(36) Brown ML, Fintor L. U.S. screening mammography services with mobile units: results from the National Survey of Mammography Facilities. Radiology 1995;195:529–32.[Abstract/Free Full Text]

(37) Farria D, Feig SA. An introduction to economic issues in breast imaging. Radiol Clin North Am 2000;38:825–42.[CrossRef][Medline]

(38) Taplin SH, Barlow WE, Urban N, Mandelson MT, Timlin DJ, Ichikawa L, et al. Stage, age, comorbidity, and direct costs of colon, prostate, and breast cancer care. J Natl Cancer Inst 1995;87:417–26.[Abstract/Free Full Text]

(39) U.S. Bureau of Labor Statistics. Consumer price index, medical care component (U.S. city average, not seasonally adjusted) Series ID: CUUS0000SAM. 2004. Available from Bureau of Labor Statistics at http://www.bls.gov/data/home.htm. [Last accessed: April 2004.]

(40) Cancer Intervention and Surveillance Modeling Network. Input parameter generator interfaces. Available from the National Cancer Institute at http://cisnet.cancer.gov/interfaces/. [Last accessed: June 2004.]

(41) Cronin KA, Yu B, Krapchow M, Miglioretti DL, Fay MP, Izmirlan G, et al. Modeling the dissemination of mammography in the United States. Cancer Causes Control 2005;16:701–12.[CrossRef][ISI][Medline]

(42) Fishman GS. Common pseudorandom numbers. Discrete-event simulation: modeling, programming and analysis. New York (NY): Springer; 2001. pp. 142–9, 312–5.

(43) Law AM, Kelton WD. Simulation modeling and analysis. 3rd ed. Boston (MA): McGraw-Hill; 2000.

(44) Mandelblatt JS, Wheat ME, Monane M, Moshief R, Hollenberg JP, Tang J. Breast cancer screening for elderly women with and without comorbid conditions: a decision analysis model. Ann Intern Med 1992;116:722–30.[ISI][Medline]

(45) Wright JC, Weinstein MC. Gains in life expectancy from medical interventions—standardizing data on outcomes. N Engl J Med 1998;339:380–6.[Abstract/Free Full Text]

(46) Kerlikowske K, Salzmann P, Phillips KA, Cauley JA, Cummings SR. Continuing screening mammography in women aged 70 to 79 years: impact on life expectancy and cost-effectiveness. JAMA 1999;282:2156–63.[Abstract/Free Full Text]

(47) Mushlin AI, Fintor L. Is screening for breast cancer cost-effective? Cancer 1992;69:1957–62.[CrossRef][ISI][Medline]

(48) Mandelblatt J, Saha S, Teutsch S, Hoerger T, Sui AL, Atkins D, et al. The cost-effectiveness of screening mammography beyond age 65 years: a systematic review for the U.S. Preventive Services Task Force. Ann Intern Med 2003;139:835–42.[Abstract/Free Full Text]

(49) Mandelblatt JS, Schechter CB, Yabroff KR, Lawrence WF, Dignam J, Muenning P, et al. Benefits and costs of interventions to improve breast cancer outcomes in African American women. J Clin Oncol 2004;22:2554–66.[Abstract/Free Full Text]

(50) Aro AR, Pilvikki Absetz S, van Elderen TM, van der Ploeg E, van der Kamp LJ. False-positive findings in mammography screening induces short-term distress—breast cancer-specific concern prevails longer. Eur J Cancer 2000;36:1089–97.[CrossRef][ISI][Medline]

(51) Gram IT, Lund E, Slenker SE. Quality of life following a false positive mammogram. Br J Cancer 1990;62:1018–22.[ISI][Medline]

(52) Lampic C, Thurfjell E, Bergh J, Sjoden PO. Short- and long-term anxiety and depression in women recalled after breast cancer screening. Eur J Cancer 2001;37:463–9.[CrossRef][ISI][Medline]

(53) Lindfors KK, O'Connor J, Parker RA. False-positive screening mammograms: effect of immediate versus later work-up on patient stress. Radiology 2001;218:247–53.[Abstract/Free Full Text]

(54) Elmore JG, Barton MB, Moceri VM, Fletcher SW. Cumulative risk of a false-positive mammogram over a 10-year period. J Gen Intern Med 1997;12:197.

Manuscript received July 12, 2005; revised April 10, 2006; accepted April 13, 2006.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Clin. Endocrinol. Metab.Home page
K. Cheung, S. A. Roman, T. S. Wang, H. D. Walker, and J. A. Sosa
Calcitonin Measurement in the Evaluation of Thyroid Nodules in the United States: A Cost-Effectiveness and Decision Analysis
J. Clin. Endocrinol. Metab., June 1, 2008; 93(6): 2173 - 2180.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
J. S. Mandelblatt, J. Cullen, W. F. Lawrence, A. L. Stanton, B. Yi, L. Kwan, and P. A. Ganz
Economic Evaluation Alongside a Clinical Trial of Psycho-Educational Interventions to Improve Adjustment to Survivorship Among Patients With Breast Cancer
J. Clin. Oncol., April 1, 2008; 26(10): 1684 - 1690.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
A. N.A. Tosteson, N. K. Stout, D. G. Fryback, S. Acharyya, B. A. Herman, L. G. Hannah, E. D. Pisano, and for the DMIST Investigators
Cost-Effectiveness of Digital Mammography Breast Cancer Screening
Ann Intern Med, January 1, 2008; 148(1): 1 - 10.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
L. G. Kim, R. A. P. Scott, H. A. Ashton, S. G. Thompson, and for the Multicentre Aneurysm Screening Study Group
A Sustained Mortality Benefit from Screening for Abdominal Aortic Aneurysm
Ann Intern Med, May 15, 2007; 146(10): 699 - 706.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
A. B. Knudsen, P. M. McMahon, and G. S. Gazelle
Use of Modeling to Evaluate the Cost-Effectiveness of Cancer Screening Programs
J. Clin. Oncol., January 10, 2007; 25(2): 203 - 208.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplemental Table
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (10)
Right arrow Request Permissions
Google Scholar
Right arrow Articles by Stout, N. K.
Right arrow Articles by Fryback, D. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stout, N. K.
Right arrow Articles by Fryback, D. G.