© The Author 2006. Published by Oxford University Press.
ARTICLE |
Projecting Absolute Invasive Breast Cancer Risk in White Women With a Model That Includes Mammographic Density
Affiliations of authors: Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Philadelphia, PA (JC); Information Management Services, Rockville, MD (DP, RA); Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Bethesda, MD (BG, CS, MHG); Cancer Genetics and Epidemiology Program, Georgetown University Lombardi Comprehensive Cancer Center, Washington, DC (CB); Medical School, Rouen University Hospital, Inserm U 657, Rouen, France (JB)
Correspondence to: Mitchell H. Gail, MD, PhD, Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Bethesda, MD 20892 (e-mail: gailm{at}mail.nih.gov).
| ABSTRACT |
|---|
|
|
|---|
Background: To improve the discriminatory power of the Gail model for predicting absolute risk of invasive breast cancer, we previously developed a relative risk model that incorporated mammographic density (DENSITY) from data on white women in the Breast Cancer Detection Demonstration Project (BCDDP). That model also included the variables age at birth of first live child (AGEFLB), number of affected mother or sisters (NUMREL), number of previous benign breast biopsy examinations (NBIOPS), and weight (WEIGHT). In this study, we developed the corresponding model for absolute risk. Methods: We combined the relative risk model with data on the distribution of the variables AGEFLB, NUMREL, NBIOPS, and WEIGHT from the 2000 National Health Interview Survey, with data on the conditional distribution of DENSITY given other risk factors in BCDDP, with breast cancer incidence rates from the Surveillance, Epidemiology, and End Results program of the National Cancer Institute, and with national mortality rates. Confidence intervals (CIs) accounted for variability of estimates of relative risks and of risk factor distributions. We compared the absolute 5-year risk projections from the new model with those from the Gail model on 1744 white women. Results: Attributable risks of breast cancer associated with DENSITY, AGEFLB, NUMREL, NBIOPS, and WEIGHT were 0.779 (95% CI = 0.733 to 0.819) and 0.747 (95% CI = 0.702 to 0.788) for women younger than 50 years and 50 years or older, respectively. The model predicted higher risks than the Gail model for women with a high percentage of dense breast area. However, the average risk projections from the new model in various age groups were similar to those from the Gail model, suggesting that the new model is well calibrated. Conclusions: This new model for absolute invasive breast cancer risk in white women promises modest improvements in discriminatory power compared with the Gail model but needs to be validated with independent data.
| INTRODUCTION |
|---|
|
|
|---|
Gail et al. (1) used data from the Breast Cancer Detection Demonstration Project (BCDDP) to develop a model for the absolute risk of breast cancer for women in a given age interval. This model, known as the Gail model, included age at menarche (AGEMEN), age at birth of first live child (AGEFLB), number of previous benign breast biopsy examinations (NBIOPS), and number of first-degree relatives (mother or sisters) with breast cancer (NUMREL). We call these standard risk factors. This model was recalibrated to data from National Cancer Institute's (NCI's) Surveillance, Epidemiology, and End Results (SEER) program (2), and the resulting model, called Gail model 2, is available at http://www.cancer.gov/bcrisktool/. This model has been used to design prevention trials, such as the Breast Cancer Prevention Trial (3), and to counsel women about their individual risks (4). Although the model accurately predicts the numbers of cancers observed in subsets of women defined by various combinations of age and risk factors (i.e., it is well calibrated) (2,5), it has been criticized because the distribution of risks in women who develop breast cancer has considerable overlap with the distribution of risks in women who do not develop breast cancer (5). The corresponding low concordance statistic has been described as a lack of discriminatory power (5).
We investigated whether mammographic density could be used to improve the discriminatory power of the model because increased mammographic density has been associated with strongly increased breast cancer risk (68) and with high attributable risk (6). We defined the variable DENSITY in terms of the average percent dense area of the two breasts on craniocaudal mammograms. We previously obtained measurements of the variable DENSITY in 7251 women from the BCDDP and determined that the reader reliability of these measurements was sufficient to warrant their inclusion in risk models (9).
As a first step toward developing a model of absolute risk that includes DENSITY, we developed a relative risk model (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). The technical challenge was to estimate efficiently the effects of standard risk factors, which were measured on large numbers of women, and the effects of DENSITY, which was measured only on subsets of the women for which all of the other risk factors had been measured (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). Chen et al. (unpublished results) examined several possible relative risk models. A parsimonious model that captured the most important effects included AGEFLB, NBIOPS, NUMREL, DENSITY, and weight (WEIGHT). Unlike the original model of Gail et al. (1), this model did not include AGEMEN, an interaction between AGEFLB and NUMREL, or an interaction between NBIOPS and whether age equaled or exceeded 50 years because these terms were not statistically significantly associated with breast cancer risk in the multivariable model. Chen et al. (unpublished results) analyzed the concordance statistic from the new model and found increases in concordance ranging from 0.01 to 0.09 in the seven 5-year age groups studied. It seemed worthwhile, therefore, to develop the corresponding model for absolute risk.
In this study, we developed a model to estimate absolute invasive breast cancer risk that was based on age and on DENSITY, WEIGHT, AGEFLB, NBIOPS, and NUMREL. We also compared estimates from this model of absolute risk, which included DENSITY, with those from Gail model 2 (2). Calculations of the variance needed to put confidence intervals (CIs) on the absolute risk projections are described in the Appendix.
| METHODS AND DATA SOURCES |
|---|
|
|
|---|
Strategy to Estimate Absolute Risk
The components (1) needed to project absolute risk are the relative risk model, the baseline hazard rate for invasive breast cancer at age t [h1(t)], and the corresponding hazard of mortality from nonbreast cancer causes [h2(t)]. We used estimates of h2(t) from the National Center for Health Statistics for January 1, 1996, through December 31, 2000 (10). We obtained h1(t) by multiplying SEER hazard rates (11) from January 1, 1996, through December 31, 2000, for invasive breast cancer,
by [1 AR(t)], where AR(t) is the attributable risk for the entire relative risk model at age t. It is a technical challenge to estimate the attributable risk because the joint distribution of all the risk factors in the general population is not available. We, therefore, obtained an estimate of the joint distribution of age, WEIGHT, AGEFLB, NBIOPS, and NUMREL from the National Health Interview Survey (NHIS) for 2000 (12). We completed our model for the joint distribution of these factors with DENSITY by using BCDDP data to model the conditional distribution of DENSITY given age, WEIGHT, AGEFLB, NBIOPS, and NUMREL. From resulting estimates of the relative risk model (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results), h1(t), and h2(t), we computed absolute risk from formulas 5 and 6 in Gail et al. (1).
Estimating the Relative Risk Function That Incorporates Mammographic Density
The study design of the BCDDP, sampling to obtain information on factors in the Gail model, and sampling of data on mammographic density have been reported previously (9) (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). Chen et al. (unpublished results) described how to account for the patterns of missing data when estimating the relative risk function of these variables and how to calculate the variance of the relative risk function. In brief, the BCDDP recruited 284 780 women, including 243 221 white women, from January 1, 1973, through December 31, 1975, and followed the women with annual screening for up to 5 years (Fig. 1). More than 99% of the women were between the ages of 35 and 74 years at recruitment, with a median age of 50 years. The period of BCDDP from January 1, 1973, through December 31, 1979, was referred to as the screening phase of the study. At the end of the screening phase, follow-up was extended for all women who had developed breast cancer; all those who had a benign breast biopsy during the screening phase; all those who were recommended for biopsy examination during the screening phase but did not have it; and a subset of those remaining "normal" women who had not had a breast cancer diagnosis, a breast biopsy, or a recommendation for breast biopsy during the screening phase (Fig. 1). Normal subjects were selected by frequency matching on age, date of entry into the BCDDP screening phase, race, center, and length of participation in screening to women who developed breast cancer or who had a biopsy in the screening phase. Follow-up continued through December 31, 1998; we used follow-up data through December 31, 1995. We refer to the period from January 1, 1980, through December 31, 1995, as the follow-up phase of BCDDP and to the three groups of women (i.e., women who had biopsy examinations, women recommended for a biopsy examination but who did not receive it, and normal women) as subcohorts.
|
To study the risk factors for breast cancer, a nested casecontrol study was conducted within the screening phase in 1979, and data were collected on standard risk factors, including age at menarche, age at birth of first live child, number of breast biopsy examinations, and number of first-degree relatives (mother or sisters) with breast cancer (Fig. 1). Women who had either invasive or in situ breast cancer were defined as case patients. Control subjects for this study were selected from those who did not have a breast cancer diagnosis, a breast biopsy, or a recommendation for breast biopsy during the screening phase and were matched to case patients on age at entry in 5-year intervals, race, study center, 6-month calendar time of screening, and length of follow-up (at least as long as that of the case patient). Gail et al. (1) used data on the four standard risk factors and age from the white women in this study to estimate relative odds parameters with a logistic regression model. These standard risk factors were available for most women in the follow-up phase. Of the 21 243 women in the benign biopsy subcohort, 7773 women in the recommended-for-biopsy subcohort, and 21 629 women in the normal subcohort, we excluded 433, 101, and 780 women, respectively, who did not have data on WEIGHT and AGEFLB. There were no women with missing NBIOPS or NUMREL data, which were updated during the follow-up phase. We were able to obtain DENSITY measurements from mammograms at baseline in the period from January 1, 1973, through December 31, 1975 for approximately half the case patients and control subjects in the screening phase. A nested casecontrol sample was selected from each subcohort in the follow-up phase, and an effort was made to retrieve screening-phase baseline mammographic images. DENSITY measurements were obtained for approximately half of the case patients and control subjects in these nested casecontrol studies (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results) (Fig. 1).
Chen et al. (unpublished results) used a logistic regression model to describe the odds ratios (ORs) of breast cancer for the screening-phase data and a piecewise exponential model to estimate the corresponding hazard ratios for data from the follow-up phase. We use the term relative risk to refer to the relative hazard obtained from follow-up phase data and the odds ratios from screening-phase data, which closely approximate the relative hazard (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). Chen et al. developed efficient statistical methods to estimate the relative risk parameters by use of data from all subjects, including those without DENSITY measurements, and they analyzed the screening-phase data and the three follow-up subcohorts separately. Summary log relative risks were obtained from the four sets of log relative risk estimates by multivariate-weighted least squares, in which each component log relative risk vector was multiplied by the inverse of the corresponding covariance matrix and then multiplied by the inverse of the sum of the four covariance matrix inverses.
The final log relative risk model included main effects in five continuous variables: age at birth of first live child (AGEFLB), coded as 0, 1, 2, or 3 for ages younger than 20 years, 2024 years, 2529 years or nulliparous, or older than 29 years, respectively; number of biopsies (NBIOPS), coded as 0, 1, or 2 for zero, one, or more than one biopsy examination, respectively, at the time the case patients and control subjects were interviewed; number of affected relatives (NUMREL), coded as 0, 1, or 2 for zero, one, or more than one sister or mother with breast cancer at the time the case patients and control subjects were interviewed; body weight (WEIGHT), coded as 0, 1, 2, 3, 4, or 5 for the weight ranges 100 or less, 101125, 126150, 151175, 176200, and more than 200 pounds at entry in BCDDP for case patients and control subjects in the screening phase and at the beginning of follow-up otherwise; and DENSITY, coded as 0, 1, 2, 3, or 4 for mammographic density measurements in the ranges 0%, 1%24%, 25%49%, 50%74%, and 75%100% on the initial screening-phase mammograms. The baseline density 0% corresponds to women with no dense breast tissue. We measured mammographic density in percent by measuring the percentage of breast area outlined on a cranialcaudal image that was dense for each breast. The mammographic density in percent was defined as the average of the two percentages of dense area for the two breasts. The natural logarithm of the relative risk is given as described by Chen et al. (unpublished results) by
![]() | [1] |
Thus, for a woman with an AGEFLB of 2, NBIOPS of 1, NUMREL of 1, WEIGHT of 3, and DENSITY of 2, the natural logarithm of the relative risk is 10.3 = exp[(0.158 x 2) + (0.256 x 1) + (0.444 x 1) + (0.216 x 3) + (0.333 x 2)] times the hazard of a woman of the same age with all risk factors at the baseline level of zero. We denote coefficients in Eq. 1 by the vector
. Unreported analyses showed that the effect of WEIGHT was very similar when model 1 was fitted separately to women younger than 50 years old and to women 50 years old or older. We present factors (Table 1), corresponding to terms in Eq. 1, that can be used to calculate relative risks easily.
|
Estimating the Baseline Age-Specific Breast Cancer Hazard Rate [h1(t)] and the Hazard of Death From Causes Other Than Breast Cancer [h2(t)]
We used mortality data from the National Center of Health Statistics (10) from January 1, 1996, through December 31, 2000, to estimate h2(t), and we assumed that h2(t) was known without error and did not vary with the risk factors in our model. To estimate h1(t) as in Gail et al. (1), we used the formula
![]() | [2] |
where
is the composite invasive breast cancer incidence rate available from the SEER program (11) between January 1, 1996, through December 31, 2000; AR(t) is the population attributable risk for women of age t; and
is assumed to be known without error. The challenge was to estimate AR(t).
Because AR(t) changes very little with age, we assumed that AR(t) was a constant for women younger than 50 years old and possibly another constant for women 50 years old or older. Let the risk factors be described by vector X' = (X1,X'2), where X1 = (DENSITY) and X'2 = (AGEFLB, NBIOPS, NUMREL, WEIGHT). Let p(x|It) be the population mass function for the risk factors X given age is in interval It, where It corresponds to t < 50 years or t
50 years. In the analyses that follows, we use quantities such as p(x|It) and its estimate interchangeably. Letting r(x) = exp(
Tx) denote the relative risk at X = x compared with X = 0, we computed AR(It) from
![]() | [3] |
Ideally, we would have survey data from the general population on the distribution of X, but such data on DENSITY are not available. We obtained data on the probability mass distribution of X2 in the general population from the NHIS. The NHIS is based on a complex weighted cluster sampling design (12). Design-based consistent estimates and variance calculations must take this design into account. We used the program SUDAAN (13) for this purpose. We indexed the eight age intervals (<45, 4549, 5054, 5559, 6064, 6569, 7074,
75 years) by i = 0, 1, 2, ... 7. From NHIS, we estimated the mass distributions p(x2, t = i), from which conditional mass functions such as p(t = i|x2,It) and p(x2|It) were computed.
To obtain the required conditional joint probability mass function, p(x|It), we assumed that the conditional distribution of X1 for a given age interval t = i and X2 = x2 was the same in the BCDDP population as in the general population. From the data from control subjects with DENSITY measurements in each of the three subcohorts, we separately estimated p(x1|x2,t,It). We estimated
instead of directly estimating
because age is strongly associated with DENSITY (9); thus, inclusion of age may lead to more precise prediction of DENSITY. More importantly, the normal subcohort was sampled by frequency matching on age in 5-year intervals. Its age distribution is not necessarily equal to that of the general population; therefore, naive estimation without stratifying on age would not be expected to yield consistent estimates of p(x1|x2,It) for the general population. To accommodate the large number of cells defined by x2 and t, we further modeled p(x1|x2, t = i, t
50) and p(x1|x2, t = i, t < 50) by the same polytomous logistic regression model, which included trend effects on t and components of x2. This assumption of a common logistic model is justified by the good fit of the linear trend model in t. The model we fit was
![]() | [4] |
In Eq. 4,
0c is an intercept for DENSITY level c, where c = 1, 2, 3, or 4. For most of the women in the follow-up subcohorts, the values of X2 were not available when DENSITY was measured at the beginning of the screening phase. Instead, we used the values determined at the beginning of the follow-up phase in 1980 to predict X1 from Eq. 4.
The components of
c are the coefficients given separately in Table 2 for the three subcohorts. Such a coefficient estimates the change in the natural logarithm of a relative probability from a unit increase in the corresponding covariate in X2. The relative probability is the ratio of the probability that DENSITY = c divided by the probability that DENSITY = 0. For example, the coefficient 1.80 for WEIGHT in Table 2 implies that a unit increase in weight category is associated with a decrease in the ratio of the probability that DENSITY = 4 to the probability that DENSITY = 0 by a factor of exp(1.80) = 0.165. The differing intercepts and coefficients for the various subcohorts imply, for example, that the ratio of the probability that DENSITY = 4 to the probability that DENSITY = 0 is greater in the subcohort of women given a biopsy examination in the screening phase and in the subcohort of women recommended for biopsy examination but not given a biopsy examination than in the normal subcohort.
|
For each subcohort, we estimated p(x1|x2,It) from
![]() | [5] |
To account for the fact that the distribution of DENSITY may differ in the three follow-up subcohorts (Table 1) and that "normal" women were underrepresented in the follow-up study, we combined the subcohort-specific estimates by reweighting them to obtain
![]() | [6] |
where p(subcohort|It) refers to proportions of white women in age stratum It in each of the three follow-up subcohorts in the entire BCDDP population at the end of the screening phase. These proportions were 0.099, 0.868, and 0.032, respectively, for the benign breast disease, normal, and recommended-for-biopsy subcohorts among those younger than 50 years; corresponding proportions for those 50 years old or older were 0.082, 0.885, and 0.033. The joint mass distribution
from Eq. 6 was substituted into Eq. 3 to estimate AR(It), which was substituted into Eq. 2 to obtain h1(t).
Estimating Absolute Risk and Its Variance
We calculated absolute risk of invasive breast cancer from age a to age a +
, where
is the duration of the risk projection interval, in a woman with relative risk r(x) from formula 5 of Gail et al. (1):
![]() | [7] |
To compute the absolute risk from Eq. 7 and its variance, we assumed that the hazards
and h2(t) were constant on 5-year intervals as in Eq. 6 in Gail et al. (1). To compute the variance of the estimate of
(x), we took into account random variation and covariation of estimates of the following quantities: the log relative risk parameters
, the log odds ratio parameters
= (
0c,
c,
c), where c = 1, 2, 3, or 4, to model
in each subcohort,
and
Because values of
and
were estimated by use of data from the BCDDP but values of
and
were from the NHIS, their estimates were independent. However, estimates of
and
were correlated for two reasons. First, most of the control subjects from the screening phase were included in the normal subcohort, and a large fraction of them remained healthy and were, therefore, used to estimate
. Consequently, there was correlation between the contribution to the estimate of
from the screening phase and
estimated from the women in the normal subcohort who remained free of disease. Second, there was a correlation between the contribution to the estimate of
from each subcohort and the estimate of
from the same cohort. Estimates of
and
were correlated as well. Confidence limits on the estimated absolute risk were calculated from the variance of the estimate of
(x) by using the delta method to compute the variance of log[
(x)/(1
(x)], computing symmetric two-sided 95% confidence intervals on the logit-transformed risk, and back-transforming the result. The detailed variance calculations are presented in the Appendix. Numerical analysis that was based on formulas in the Appendix revealed that the largest contribution to the variance of (1 AR), and hence to the variance of absolute risk for a woman at the baseline level of risk factors, is from the variance of
Indeed this source contributes at least 20 times more than any other source to the variance of (1 AR).
To provide the reader with an approximate confidence interval on
(x), we regressed the upper confidence limits on
(x), which were calculated from the variance estimates in the Appendix, on
(x) and
2(x) (upper locus in Fig. 2). A similar regression analysis was obtained for the lower confidence limits (lower locus in Fig. 2). The points used in the regressions were chosen to cover a broad range of absolute risks as follows. For each of the 14 starting ages of 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, and 85 years, we considered projection intervals with a length of 5, 10, 20, or 30 years, which were subject to the constraint that the starting age plus the duration of the projection interval was no greater than 90 years. This yielded 47 possible age intervals over which projections were to be made. For each such age interval, we computed the absolute risk for each of the 1080 possible risk factor combinations, resulting in 47 x 1080 = 50 760 pairs (upper confidence limit, absolute risk) and 50 760 pairs (lower confidence limit, absolute risk). Thus, each of these quadratic regressions was based on 50 760 points. The squared multiple correlations for the upper and lower confidence intervals were 0.9991 and 0.9989, respectively. Thus, the loci in Fig. 2 provide good fits to the calculated confidence limits and explain 99.9% of the variation in the confidence limits in these 50 760 scenarios.
|
| RESULTS |
|---|
|
|
|---|
Relative Risk
We tabulated the relative risk factors from Eq. 1 (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results) to facilitate computation of the combined relative risk. As described previously (1,10), we included an adjustment factor F for information on atypical hyperplasia (Table 1). As an example, we consider a woman with AGEFLB = 2, NBIOPS = 1, NUMREL = 1, WEIGHT = 3, and DENSITY = 2, and we assume it is unknown whether atypical hyperplasia was present on the biopsy (factor F). From Table 1, her combined relative risk is calculated as 1.37 x 1.29 x 1.56 x 1.91 x 1.95 x 1.00 = 10.3. To estimate the absolute risk for such a woman, we need to combine this relative risk with information on the age interval over which risk is to be projected (see Table 5), as described below.
|
Distribution of Covariates in the General Population, Attributable Risks, and Hazard Rates
The conditional distribution of DENSITY for age interval i, NUMREL, NBIOPS, AGEFLB, and WEIGHT is given by Eq. 4 with estimated parameters in Table 2 for each of the three BCDDP subcohorts. Density decreases with increasing WEIGHT and age interval i, as indicated by the negative coefficients (Table 2). As an example, consider a woman in the normal subcohort with AGEFLB = 1, NBIOPS = 0, and NUMREL = 1. If she is aged 51 years (age group i = 3) and weighs 160 pounds (WEIGHT = 3), then from Eq. 4 and data in Table 2, we calculate her chance of having DENSITY = 4 (i.e.,
75% dense tissue) as exp[2.42 (0.6 x 3) (0.06 x 1) + (0.88 x 0) + (0.95 x 1) (1.87 x 3)]/(1 + 2.829 + 1.139 + 0.372 + 0.017) = 0.0031. If instead this woman were in age group i = 1 (i.e., age 4549 years) and had WEIGHT = 0, the probability of having DENSITY = 4 would be 0.1083, nearly 35 times as large.
Combining data on NUMREL, NBIOPS, AGEFLB, WEIGHT, and age with the conditional distributions in Table 2 and averaging as in Eq. 6, we estimated the joint distribution of all risk factors. The marginal distributions from this joint distribution are displayed for US women in Table 3. For comparison, we gave the marginal distribution among white control women with DENSITY data in the BCDDP screening phase. Examining the results for US women, we note that 9.8% of women younger than 50 years old have no dense tissue, compared with an estimated 21.7% of older women. As anticipated, the proportion of women with affected first-degree relatives and the proportion of women with biopsies increased with age. The BCDDP screening-phase control subjects tended to have higher levels of these risk factors than women in the general US population. In particular, BCDDP screening-phase control subjects had higher proportions with affected first-degree relatives, with biopsy examinations, and with delayed age at birth of first live child. BCDDP screening-phase control subjects 50 years old or older also had higher proportions with dense breast tissue.
|
From Eq. 3, we estimated the attributable risk (i.e., AR) from the joint distribution of risk factors and the relative risk model (Eq. 1). Attributable risks of breast cancer associated with breast density, AGEFLB, NUMREL, NBIOPS, and WEIGHT were 0.779 (95% CI = 0.733 to 0.819) and 0.747 (95% CI = 0.702 to 0.788), respectively, for women younger than 50 years and 50 years or older. The quantity (1 AR), which is needed in Eq. 2, was estimated to be 0.221 (95% CI = 0.181 to 0.267) among women younger than 50 years old and 0.253 (95% CI = 0.212 to 0.298) among those 50 years old or older. The corresponding values of (1 AR) in Gail model 2 (2) were 0.579 and 0.579. Thus, the attributable risk in the new model that included DENSITY is considerably larger than the attributable risk in the model with standard risk factors only. This difference accounts for the fact that the baseline hazard rates h1(t) are considerably lower in the new model (Table 4) than in the Gail model (2). Table 4 also displays the hazards of competing causes of mortality h2(t).
|
Absolute Risk of Invasive Breast Cancer
From the piecewise constant hazards version of Eq. 7 and the hazards in Table 4, we calculated the absolute risks of invasive breast cancer for various initial ages at counseling, follow-up intervals, and initial relative risks (Table 5). First, we assumed that the relative risks determined at the time of initial counseling remained constant over the risk projection interval. For example, if we consider a 50-year-old woman with a relative risk of 3.0, from Table 5, her absolute risks to ages 55, 80, and 90 years are, respectively, 0.01, 0.08, and 0.094 (or 1%, 8%, and 9.4%).
As a further illustration, consider a 50-year-old white woman with AGEFLB = 0, NBIOPS = 1, NUMREL = 1, WEIGHT = 2, and DENSITY = 2. If we assume that it is not known whether atypical hyperplasia was found in her breast biopsy examination, then from Table 1, we estimate her relative risk as 1.00 x 1.29 x 1.56 x 1.54 x 1.95 x 1.00 = 6.04. To estimate her absolute risk over 10 years from Table 5, we interpolated between the entries for relative risks of 5 and 10. The resulting estimate is 3.8 + [1/(10 5)](7.5 3.8)(6.04 5) = 3.8 0.77 = 4.57%.
The 95% confidence interval for such a projection can be approximated from Fig. 2. The actual loci are 0.0006 + 1.1649
(x) 0.0701
2(x) for the upper locus and 0.0004 + 0.8622
(x) + 0.0157
2(x) for the lower locus. By substituting
(x) = 0.0457, we obtain a 95% confidence interval of 0.040 to 0.052 for the absolute risk of 0.0457 in the previous example, in agreement with Fig. 2. The widths of the confidence intervals increase with increasing absolute risk (Fig. 2).
A Gauss program (14) has been developed that computes the estimates of absolute risk and corresponding confidence intervals exactly for any combination of risk factors and age interval over which risk is to be projected. Using this program, we found for the previous example that the relative risk was 6.03 and the exact absolute risk estimate was 0.046 (estimated 95% CI = 0.041 to 0.052). These exact calculations agreed well with the approximate values obtained by interpolation in Table 5 and from Fig. 2.
Comparison Between the New Model With Mammographic Density and Gail Model 2
We compared 5-year absolute risk estimates from the new model with those from Gail model 2, which is used in the NCI's Breast Cancer Risk Assessment Tool (http://www.cancer.gov/bcrisktool/), for the white control women with complete data on NUMREL, NBIOPS, AGEFLB, WEIGHT, and DENSITY in the screening phase of BCDDP. Data from 270 women who were selected from the 1744 control subjects with complete data are shown in Fig. 3. There were 270 unique relative risks among these 1744 control subjects, and one woman was randomly selected from each of the sets of women having these 270 unique relative risks. The equiangular line, on which the ordinate equals the abscissa, is presented in Fig. 3 to divide projections from the new model that exceed the projections from Gail model 2 (which are above the line) from those projections from the new model that are less than the projections from Gail model 2 (which are below the line). The new model tends to have higher projections than Gail model 2 when the projections from the new model are more than 0.01 and lower projections than Gail model 2 when projections are less than 0.01. Among the 444 women with projected 5-year risk of less than 0.01 according to the new model, the projections from the new model were lower than those from Gail model 2 in 285 (64.2%). Among the remaining 1300 women with projected risks of at least 0.01, the projections from the new model were lower than those from Gail model 2 in only 390 (30.0%). Similar patterns were observed in strata defined by age groups (unreported analyses).
|
To understand discrepancies between the new model and Gail model 2, one must consider the two relative risk functions and the fact that the baseline hazard
is lower in the new model than in Gail model 2 (Table 4). An inspection of the discrepancies between the new model and Gail model 2 indicated that the former tended to have higher risk projections for women in the higher categories of DENSITY and WEIGHT, as might be expected, because these factors are not present in Gail model 2. For example, the 5-year estimated risks for a 68-year-old woman with NBIOPS = 0, AGEMEN = 1, AGEFLB = 2, NULREL = 0, WEIGHT = 2, and DENSITY = 4 were 2.3% and 4.6%, respectively, for Gail model 2 and the new model; the corresponding combined relative risks were 1.70 and 7.98. The relative risk parameters for the standard risk factors in Gail model 2 differ from those in the new model, but direct comparisons are complicated because Gail model 2 includes interactions between NBIOBS and an indicator for an age of at least 50 years and between NUMREL and AGEFLB. Apart from the previous comments on DENSITY and WEIGHT and on high versus low estimated risks, we could not identify general classes of women for whom the new model gave predictably different absolute risk estimates from Gail model 2. | DISCUSSION |
|---|
|
|
|---|
To estimate absolute risk, we combined our previously developed relative risk model (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results), which includes mammographic density, with information on the distribution of risk factors from the NHIS and BCDDP studies, with SEER invasive breast cancer rates, and with national mortality hazard rates. We showed how to obtain an estimate of relative risk (Table 1) and use this relative risk to find the absolute risk estimate (Table 5). Variance calculations assume that SEER composite invasive breast cancer incidence rates and national mortality rates are known without error. However, these calculations allow for random variation in the relative risk estimation, which is complicated by the patterns of missing data (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results); for the complex sampling in NHIS to estimate the joint distribution of age, AGEFLB, NBIOPS, NUMREL, and WEIGHT; and for estimation of the conditional distribution of DENSITY in the BCDDP subcohorts, given other risk factors (Table 2). These complex variance calculations are described in the Appendix. Approximate confidence intervals on absolute risk projections can be obtained from Fig. 2, and a Gauss program is available to compute the confidence intervals analytically.
The relative risk model used in this study was obtained by combining data from the BCDDP screening phase with data from the three follow-up subcohorts, even though there was evidence of heterogeneity of relative risk estimates between the screening phase and the follow-up phase (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). We used this strategy in the expectation that the resulting absolute risk projections would work reasonably well over both short-term and longer term intervals. Independent validation studies are needed to test this strategy. The calculations in the paper also assume that a woman's relative risk remains constant over the risk projection interval. If one anticipates that a factor such as NUMREL, DENSITY, or WEIGHT will change over time, one could adapt Eq. 7 by allowing the relative risk to vary with risk factors X(t) that change with age t. For example, if risk factors remain constant, a 40-year-old woman with NBIOPS = 1, AGEFLB = 2, NULREL = 0, WEIGHT = 1, and DENSITY = 4 will have an absolute risk of 16.1% to age 70 years. If, instead, her DENSITY decreases to 3 and her WEIGHT increases to 2 at age 55 years, then her projected risk between ages 40 and 70 years would be 15.0%.
We found large attributable risks of 0.779 (95% CI = 0.733 to 0.819) for women who were younger than 50 years and of 0.747 (95% CI = 0.702 to 0.788) for women who were 50 years old or older. These attributable risks are larger than the values near 0.48 found for the Gail model (1). The larger attributable risks with the new model reflect the high prevalence of women with DENSITY of more than 0 (Table 3) and the large relative risks associated with DENSITY (Table 1), which are factors that have been noted previously (6).
Independent validation studies are needed to check the relative risks as well as the calibration (2,5) and discriminatory power (5) of the absolute risk model. A model is well calibrated if the observed numbers of breast cancer diagnoses in various subgroups of the population agree well with expected numbers based on the model (15). We have indirect evidence that the new model is well calibrated. Previous studies have shown that Gail model 2 is well calibrated (2,5). Even though predictions from Gail model 2 and the new model can differ appreciably (Fig. 3), the average predicted 5-year risk from the two models is quite similar for women in the following age intervals: younger than 45, 4655, 5665, and older than 65 years. When we analyzed data from the 1744 white control subject women in the screening phase with complete data on the standard risk factors DENSITY and WEIGHT, we found that the average predicted 5-year risks in these intervals were 0.8%, 1.5%, 2.2%, and 1.9%, respectively, for the new model and 0.7%, 1.5%, 2.1%, and 1.9% for Gail model 2. These data suggest that the new model gives average predictions close to those of Gail model 2 and is, therefore, well calibrated. Independent cohort data are needed to examine calibration in subgroups defined by factors such as age and levels of DENSITY and to check the model's applicability to nonwhite women. Ursin et al. (16) found similar relative risks for breast cancer associated with mammographic density among white, Asian American, and African American women.
The most widely used statistic to assess "discriminatory power" is the concordance statistic (5,15) or area under the receiver-operating curve. Unlike tests of calibration, which require cohort data, the concordance statistic can be estimated from a sample of women with breast cancer and from a sample of control subjects. By analyzing data from the case patients and control subjects from the screening phase of the BCDDP, Chen et al. (unpublished results) showed that the new model had greater age-specific concordance than Gail model 2. For example, for women aged 6064 years, the concordance of Gail model 2 was estimated as 0.602, compared with 0.664 for the new model. The difference 0.063 had standard error 0.026, indicating a statistically significant increase in concordance (two-sided P = .015). Statistically significant increases in concordance were found for the age intervals of 4549, 5054, 6064, and 70 years old or older, but increases were present in all seven age groups and ranged from 0.009 to 0.094. The unweighted average age-specific concordance for Gail model 2 was 0.596, compared with 0.643 for the new model. Thus, adding mammographic density improves the discriminatory power modestly compared to Gail model 2. Whether this improvement is sufficient to justify the additional expense of obtaining data on mammographic density may depend on the particular application of the risk projection model (15). In any case, it would be useful to have additional independent data to compare the concordance statistic of the new model with that of Gail model 2.
Tice et al. (17) studied whether mammographic density, measured by the categories of the Radiology Breast Imaging Reporting and Data System (BI-RADS) of the American College of Radiology, could add to the discriminatory power of the Gail model. They found that BI-RADS was an independent predictor of risk but that it barely increased the concordance above levels provided by the Gail model alone. For example, for women with complete information on Gail model covariates, the concordance was 0.70 from the Gail model alone and 0.71 with the addition of BI-RADS. One possible explanation for the smaller increase in concordance than found by Chen et al. (unpublished results) is that BI-RADS may be less informative than DENSITY. The difference may also be related to the fact that Tice et al. did not stratify their concordance calculations on age. Analysis of data from women of all ages results in higher concordance for the Gail model; this result may make it harder to detect the incremental value of BI-RADS.
Several limitations of our data and approach should be noted. As mentioned previously, to obtain relative risk estimates that might apply over shorter or longer time intervals, we combined data from the screening phase with data from the follow-up phase, even though estimates of relative risks for DENSITY and NUMREL were larger in the screening phase. Because DENSITY data were not available in NHIS, we needed to assume that the conditional distribution of DENSITY given age, AGEFLB, NBIOPS, NUMREL, and WEIGHT was the same in NHIS as in BCDDP to estimate the joint distribution of all these variables in the general U.S. population. Because medical practices and diagnostic procedures have changed over time, relative risks that are based on BCDDP data from 1973 to 1995 may differ somewhat from current relative risks. In particular, our data on DENSITY were based on mammograms taken from 1973 through 1975 and were later assessed by a single evaluator, who outlined the dense areas (9). Measurements that are based on newer technologies or that are made by multiple evaluators might yield different relative risks for DENSITY, although correlations among readings of the same films made by the evaluator in this study at different times were similar to correlations among different evaluators reported in other studies (9) (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). For these reasons, it is important that independent validation data be obtained to assess the relative risk features and absolute risk projections from this model.
A Gauss program is available to researchers from the corresponding author for further testing of this model. If the new model is shown to be valid in independent evaluations, a more convenient program could be made available for counseling and other applications.
| APPENDIX |
|---|
|
|
|---|
This appendix describes the statistical methods needed to compute confidence intervals on our estimates of absolute risk. This material documents our methods, and the approach may be useful to others who are developing risk models based on various data sources. The practitioner interested in using Tables 1 and 5 to project risk and Fig. 2 to approximate the confidence intervals can skip this Appendix.
We obtained the asymptotic formulas for calculating the variance of the absolute risk estimates by applying the delta method (18). For density levels c = 1, 2, 3, or 4 in Eq. 4, let
and
Further, let
and
The projected absolute risk
is a function of
, and
. That is,
Taylor series expansion of
around the true values of these parameters yields
![]() | [A1] |
The variance of the absolute risk estimate can be obtained from Eq. A1 in terms of the variances and covariances of the parameter estimates, which are identified by hats over the parameters and by substituting estimates of terms such as
To use Eq. A1 in this way, we calculated the following quantities in Eq. A1.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
and
![]() |
The formulas for
are exactly the same as for
except for suitable changes of superscripts from l to h that index probabilities conditional on t < 50 and t
50, respectively.
We obtained the variance of
by computing the variance of the right-hand side of Eq. A1. We noted that
because different data sources were used for each estimate. To obtain
we estimated the covariance matrix C for
and
with the software program SUDAAN (13), which appropriately takes into account the cluster sampling scheme for the NHIS. Then we obtained the variancecovariance matrix between
and
by applying the delta method. The variancecovariance matrix between
and
was obtained in a similar fashion.
We calculated the covariance between
and
which were both estimated from the BCDDP data. Recall that
was obtained as the linear combination of relative risk estimates from the screening phase and the three follow-up subcohorts,
in which w denotes matrix weights and the subscript s indexes the screening phase. We calculated
and
as described below. We included 3093 of the 3146 screening-phase control subjects in the follow-up study, for 3017 (97.5%) in the normal subcohort, 48 (1.6%) in the benign subcohort, and 28 (0.9%) in the biopsy-recommended subcohort. Thus, we treated
and
as zero and calculated
as described below.
We present the method for calculating
as an example. Let nnormal be the total number of subjects in the normal cohort,
be the baseline breast cancer hazard parameters in the piecewise exponential model as described previously (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results), and
be the parameters in the polytomous regression model for the distribution of DENSITY conditioned on age and weight used in the EM algorithm (Chen J, Ayyagari R, Chatterjee N, Pee DY, Schairer C, Byrne C, et al.: unpublished results). Let
be the score functions from the observed data log likelihood for the normal subcohort for
,
, and
, respectively. The vector of scores is given by Eq. 7 in Chen et al. (unpublished results). We then wrote the influence function for
as
![]() |
where
and so on. In this expression, the summations are over all members of the normal subcohort. The quantity
is the vector of influences of individual j on the components of
and
and
are corresponding influence vectors for
and
The covariance matrix of
can be estimated from the influences. For example,
can be estimated as
and
can be estimated as
where the summations are over all nnormal members of the subcohort. If
is the number of noncase patients in the normal subcohort who have DENSITY data, then the influence function for
can be obtained similarly:
![]() |
As before,
can be estimated by
and
can be estimated from
where now the summations are over the
members of the normal subcohort who influence
namely, the noncase patients in the normal subcohort who have DENSITY data.
To calculate
we noted that among the screening-phase control subjects who were included in the normal follow-up subcohort, 4.9% developed breast cancer and 95.1% remained healthy. Only the latter were used for the calculation of
Let
denote the number of individuals with nonzero influence on both
and
From the individual influences for
namely
obtained as described by Chen et al. (unpublished results), one can estimate
as
where the sum is over the
individuals who influence both
and
We derived the formula for the variance of the attributable risk similarly, as
The Taylor series expansion is
![]() |
The variance can be obtained in a similar fashion as for the predicted risk. In fact, terms required to compute the absolute risk can be used for the calculation of the attributable risk.
| NOTES |
|---|
|
|
|---|
The National Cancer Institute has supported numerous studies that produced data on breast cancer risk factors and disease outcomes for women in the Breast Cancer Detection Demonstration Project, including, recently, data on mammographic density. The authors assume all responsibility for the proper use of these data, the analysis and interpretation of the data, the writing of the manuscript, and the decision to submit the manuscript for publication, subject to standard clearance procedures for manuscripts in the Division of Cancer Epidemiology and Genetics.
| REFERENCES |
|---|
|
|
|---|
(1) Gail MH, Brinton LA, Byar DP, Corle DK, Green SB, Schairer C, et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst 1989;81:187986.
(2) Costantino JP, Gail MH, Pee D, Anderson S, Redmond CK, Benichou J, et al. Validation studies for models projecting the risk of invasive and total breast cancer incidence. J Natl Cancer Inst 1999;91:15418.
(3) Fisher B, Costantino JP, Wickerham DL, Redmond CK, Kavanah M, Cronin WM, et al. Tamoxifen for prevention of breast cancer: report of the National Surgical Adjuvant Breast and Bowel Project P-1 study. J Natl Cancer Inst 1998;90:137188.
(4) Gail MH, Costantino JP, Bryant J, Croyle R, Freedman L, Helzlsouer K, et al. Weighing the risks and benefits of tamoxifen treatment for preventing breast cancer. J Natl Cancer Inst 1999;91:182946.
(5) Rockhill B, Spiegelman D, Byrne C, Hunter DJ, Colditz GA. Validation of the Gail et al. model of breast cancer risk prediction and implications for chemoprevention. J Natl Cancer Inst 2001;93:35866.
(6) Byrne C, Schairer C, Wolfe J, Parekh N, Salane M, Brinton LA, et al. Mammographic features and breast cancer risk: effects with time, age, and menopause status. J Natl Cancer Inst 1995;87:16229.
(7) Boyd NF, Byng JW, Jong RA, Fishell EK, Little LE, Miller AB, et al. Quantitative classification of mammographic densities and breast cancer risk: results from the Canadian National Breast Screening Study. J Natl Cancer Inst 1995;87:6705.
(8) Harvey JA, Bovbjerg VE. Quantitative assessment of mammographic breast density: relationship with breast cancer risk. Radiology 2004;230:2941.
(9) Benichou J, Byrne C, Capece LA, Carroll LE, Hurt-Mullen K, Pee DY, et al. Secular stability and reliability of measurements of the percentage of dense tissue on mammograms. Cancer Detect Prev 2003;27:26674.[CrossRef][ISI][Medline]
(10) Surveillance, Epidemiology, and End Results (SEER) Program SEER*Stat Database: MortalityAll COD, Total U.S. (19692000) with Expanded VA County Detail. National Cancer Institute, DCCPS, Surveillance Research Program, Cancer Statistics Branch, released July 2003. Underlying mortality data provided by NCHS (http://www.cdc.gov/nchs). Available at http://www.seer.cancer.gov. [Last accessed: January 15, 2004.]
(11) Surveillance, Epidemiology, and End Results (SEER) Program. Public-use data (19962000), released 2003, based on November 2002 submission. Available at: http://www.seer.cancer.gov. [Last accessed: January 15, 2004.]
(12) National Center for Health Statistics. National Health Interview Survey (NHIS). Centers for Disease Control and Prevention, U.S. Department of Health and Human Services. Available at: http://www.cdc.gov/NCHS/nhis.htm. [Last accessed: July 22, 2002].
(13) Research Triangle Institute. SUDAAN example manual, release 9.0. Research Triangle Park (NC): Research Triangle Institute; 2004.
(14) The Gauss System, version 6. Maple Valley (WA): Aptech Systems; 2005.
(15) Gail MH, Pfeiffer RM. On criteria for evaluating models of absolute risk. Biostatistics 2005;6:22739.[Abstract]
(16) Ursin G, Ma H, Wu AH, Bernstein L, Salane M, Parisky YR, et al. Mammographic density and breast cancer in three ethnic groups. Cancer Epidemiol Biomarkers Prev 2003;12:3328.
(17) Tice JA, Cummings SR, Ziv E, Kerlikowske K. Mammographic breast density and the Gail model for breast cancer risk prediction in a screening population. Breast Cancer Res Treat 2005;94:11522.[CrossRef][ISI][Medline]
(18) Rao CR. Linear statistical inference and its applications. New York (NY): Wiley; 1965. p. 31922.
Manuscript received March 1, 2006; revised June 5, 2006; accepted July 20, 2006.





















