Literature
Home医源资料库在线期刊放射学杂志2003年1月第226卷第2期

Concordance of Breast Imaging Reporting and Data System Assessments and Management Recommendations in Screening Mammography1

来源:放射学杂志
摘要:ABSTRACTTopABSTRACTINTRODUCTIONMATERIALSANDMETHODSRESULTSDISCUSSIONREFERENCESPURPOSE:ToexaminehowfrequentlyBreastImagingReportingandDataSystem(BI-RADS)mammographicscreeningassessmentswereassociatedwithexpectedclinicalmanagementrecommendations。MATERIALSANDMETHODS:Seven......

点击显示 收起

1 From the Center for Health Studies, Group Health Cooperative of Puget Sound, 1730 Minor Ave, Suite 1600, Seattle, WA 98101 (S.H.T.); Affiliations for all other authors and author contributions are listed at the end of this article. Received March 20, 2001; revision requested April 23; revision received June 20; accepted August 9. Supported by grants U01CA86076 (W.E.B., L.E.I.), U01CA86082 (P.A.C.), U01CA63736 (M.B.D.), U01CA70013 (B.M.G.), U01CA63740 (K.K., E.A.S., V.L.E.), U0169976 (R.D.R.), U01CA63731 (S.H.T.), CAU01CA70040 (B.C.Y.), and R01CA63146 (N.U.) from the National Cancer Institute. .


     ABSTRACT

Top
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
PURPOSE: To examine how frequently Breast Imaging Reporting and Data System (BI-RADS) mammographic screening assessments were associated with expected clinical management recommendations.

MATERIALS AND METHODS: Seven Breast Cancer Surveillance Consortium mammography registries recorded screening assessments and recommendations in 1997 to identify the proportion of women in each BI-RADS category. The first screening assessment for a woman without cancer or a prior mammogram within 9 months was associated with its independently recorded recommendation.

RESULTS: Among 292,795 women, screening assessments included 269,022 (91.9%) with a "negative" or "benign finding," and 267,103 (99.3%) of these women were recommended for normal interval follow-up. Among 11,861 (4.1%) women with screening assessments of "probably benign finding," 4,782 (40.3%) were recommended for short interval follow-up as expected on the basis of the BI-RADS, but a high proportion (36.9%) were recommended for additional imaging. Among 1,625 (0.6%) women with "suspicious abnormality," most were recommended for biopsy (48.7%) or clinical examination and/or surgical consult (9.0%), but many were recommended for additional imaging (38.7%). Among 243 (0.1%) women with screening assessments "highly suggestive of malignancy," a majority were recommended for biopsy (73.3%) or clinical examination and/or surgical consult (18.1%) consistent with BI-RADS, but some were recommended for additional imaging (6.6%).

CONCLUSION: BI-RADS assessments and management recommendations are consistent for negative and benign assessments, but inconsistencies were found in assessments and recommendations for mammographic abnormalities.

 

Index terms: Breast, ACR Reporting and Data System • Breast neoplasms, diagnosis, 00.30 • Breast radiography, quality assurance, 00.11 • Cancer screening, 00.11, 00.30


     INTRODUCTION

Top
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
With the diffusion of screening mammography, there has been a growing interest in standardizing terminology to encourage consistency between assessments and management recommendations, as well as facilitate radiologists’ audits of their interpretive performance (13). The diffusion occurred as promotional efforts increased attention to the mortality reductions afforded by screening mammography (4,5). To address concerns about terminology, the American College of Radiology created the Breast Imaging Reporting and Data System (BI-RADS), which includes six numbered assessment categories of 0–5 with associated management recommendations (Table 1) (2). The BI-RADS committee intended that almost all screening assessments would be category 1, 2, or 0 (Sickles EA, oral communication, 2001). At the beginning of a National Cancer Institute–sponsored effort to develop a national consortium of mammography registries (the Breast Cancer Surveillance Consortium ) (6), radiologists at consortium facilities recorded BI-RADS assessments independently of recommendations. This recording occurred in accordance with customary practices of the time and provides a baseline for BI-RADS use because it occurred before BI-RADS usage was required in April 1999 by the Mammography Quality Standards Act (7).


fig.ommitted TABLE 1. American College of Radiology BI-RADS Categories

 

 
The relationship between the assessment and management recommendation has implications for clinical care, teaching, and evaluating the screening interpretations of radiologists. When radiologists provide inconsistent recommendations for given assessments, it may create confusion among referring clinicians about whether and how to conduct further evaluations. Valid computations of performance for medical audits are predicated on a consistent clinical meaning for each assessment (8). The assessment is used to classify the interpretation as "positive" or "negative" for the measurement of sensitivity, specificity, and positive predictive value (7,8). If a specific assessment category implies additional evaluation for one radiologist and biopsy for another, then they are using the BI-RADS differently and continuing the confusion the system is intended to reduce. Although the text accompanying an interpretation may clarify some inconsistencies, the BI-RADS categoric classification system was intended to avoid the ambiguity of free text. If the categories are used in ways other than those intended, there may be opportunities to either change the terminology or address teaching to the areas of inconsistent use.

To our knowledge, no one has evaluated the use of the American College of Radiology BI-RADS assessment and management recommendations across multiple facilities and geographic locations in routine clinical practice. Important work has been done to evaluate the positive predictive value of BI-RADS assessments (9,10) and between-reader variation in the findings descriptions and management recommendations (11). However, these studies used unique populations or specifically drawn samples. The purpose of this study was to examine how frequently BI-RADS screening assessments were associated with the expected clinical management recommendation in diverse routine practice settings.


     MATERIALS AND METHODS

Top
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
Setting
The mammography registries of the Breast Cancer Surveillance Consortium record assessments from multiple radiology facilities within their respective states of Colorado, North Carolina, New Hampshire, New Mexico, California (San Francisco), Washington (2 registries), and Vermont (6). Data from the mammography registries are sent to a central data repository at the BCSC Statistical Coordinating Center. All activities of the Breast Cancer Surveillance Consortium are reviewed and approved by the institutional review boards at each site. Individual informed consent was not required because of strict maintenance of anonymity and the observational nature of the study (12).

Subjects
This study includes mammographic information for women 40 years of age and older without a personal history of breast cancer who underwent a screening examination between January 1 and December 31, 1997, at any of the 158 participating mammography facilities within the seven BCSC registries. Throughout the manuscript, "facility" refers to the physical location where mammographic imaging is performed. A mammography registry refers to the sum of all the included facilities (range, 2–49 facilities per registry). The number of radiologists interpreting mammograms within a registry ranges from 10 to 165. The number of screening examinations per registry ranges from 11,563 to 80,911.

This report is restricted to 292,795 screening examinations identified as the first occurrence for a woman in the study period after the following exclusions: women with a prior history of breast cancer, women with additional imaging during the same visit, if the examination had no recorded assessment, and if the examination had a diagnostic indication. To further ensure that we included only screening mammograms, we also excluded any examination in 1997 that was preceded by a mammogram within 9 months.

Measurements
This study examined BI-RADS use by measuring the following: (a) the proportion of women assigned BI-RADS assessment categories 0–5 across the BCSC registries, (b) the proportion of women assigned specific recommendations, and (c) the proportion of assessments having the expected recommendation according to the BI-RADS.

We (S.H.T., L.E.I.) included the first recorded "routine screening" examination during the study for women in the registry and referred to it as the "index" examination. Demographic information (age, race, ethnicity, education), prior mammography experience, and assessments are collected in the course of usual care and pooled in a registry before being transferred periodically to the BCSC Statistical Coordinating Center. Encrypted unique identifiers for women, facilities, and radiologists maintain the anonymity of all data (12). The unique encrypted code is used to link records in the Statistical Coordinating Center over time (12).

Recording systems varied across the diverse facilities, but all included paper or automated summaries with an assessment and a management recommendation. Each assessment and management recommendation was recorded separately by the radiologist for each mammographic examination by using terminology consistent with the BI-RADS (described more fully later). The detailed text that might explain a recommendation could not be systematically recorded and summarized across facilities and was therefore not available for analysis. Examinations in which recording software created an automatic link between an assessment and management recommendation were excluded. This exclusion removed one registry (New Mexico) and several facilities that together accounted for 38,526 (11%) of the 350,632 women otherwise eligible for the study.

Assessments by radiologists at the study facilities corresponded with the six BI-RADS categories (Table 1) (3). These categories include one incomplete (category 0) and five final assessments (categories 1–5). Some facilities recorded two assessments at each examination, one for each breast, while others recorded a single assessment for each woman.

Recommendations for clinical follow-up were collected on the same form but recorded separately from the assessment. A single recommendation was given for 290,804 (99.3%) of all 292,795 assessments. For the 1,991 (0.7%) assessments with multiple recommendations, we included in this analysis the highest numbered recommendation among the following five ordered recommendations: (a) normal interval follow-up, (b) short interval follow-up (ie, <1 year), (c) needs additional imaging, (d) clinical examination or surgical consult, and (e) biopsy or fine-needle aspiration. The order reflects an increasing likelihood of malignancy. The data do not allow the separation of clinical examination and surgical consult recommendations. Women (14,814 [4.8%] of 308,003) with no recorded recommendation were categorized as "unknown" and not included in the tables or analysis. Some women (394 [0.1%] of 308,003) were given recommendations other than those listed earlier, which were recorded as "other" but not included in the tables. Recording of additional imaging did not include examinations to correct for technical deficiencies.

Because some registries recorded assessments and management recommendations for each breast, we established a systematic method for assigning the assessment and management recommendation for a woman. We (S.H.T., L.E.I.) rank ordered the assessments based on potential cancer risk ("negative," "benign finding," "probably benign finding," "need additional imaging evaluation," "suspicious abnormality," "highly suggestive of malignancy"). When assessments were not the same for both breasts we (S.H.T., L.E.I.) assigned the woman the higher assessment. The recommendations for the breast were ordered from 1 (low) to 5 (high) as shown. When examining the relationship between assessments and recommendations, the breast with the higher assessment was included for each woman and associated with the recommendation for the same breast.

Each case was classified as having received a recommendation consistent with the BI-RADS assessment if one of the following conditions (a–d) was met based on the third edition of the BI-RADS: (a) assessment of "need additional imaging evaluation" (category 0) with additional views or ultrasonography recommended, (b) assessment of "negative" or "benign finding" (category 1 or 2) with a normal interval follow-up recommended, (c) assessment of "probably benign finding" (category 3) with a recommendation for short interval follow-up, and (d) assessment of "suspicious abnormality" or "highly suggestive of malignancy" (categories 4 or 5) with a recommendation for clinical examination or surgical consult and biopsy or fine-needle aspiration.

Cancer Rates
Because preliminary findings suggested some potential discord between how assessments of abnormalities were associated with recommendations, we also calculated cancer rates and/or examination for categories 0, 3, 4, and 5. BCSC registry data include cancers identified through local Surveillance Epidemiology and End Results registries, state cancer registries, and pathology databases. A file of uniquely encrypted records of women with cancer can then be linked to the file of assessments for those women. We (S.H.T., L.E.I.) linked the index examinations to cancer occurrence within the next 365 days for the subset of registries where the data was available (Colorado, North Carolina, New Hampshire, Washington [one registry], and Vermont).

Data Analysis
We (S.H.T., L.E.I.) examined the data in the following three ways: (a) first assessment by mammography registry, (b) first recommendation by mammography registry, and (c) first assessment by recommendation for all registries combined. We then evaluated the proportion of assessments having recommendations consistent with the BI-RADS by age categories (40–49 years, 50–59 years, 60–64 years, and 65 years or older) and prior mammography experience (<1 year, 1–2 years, 3–4 years, >5 years, and never) for each assessment category by using a 2 statistic to compare proportions. A P value of less than .05 was considered to indicate a statistically significant difference.


     RESULTS

Top
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
The Breast Cancer Surveillance Consortium population reported here includes 292,795 women with the following ages and mammography experience: (a) age; 32% (n = 93,389) 40–49 years, 41% (n = 119,487) 50–64 years, and 27% (n = 79,919) 65 years or older; and (b) mammography experience; 94% (n = 273,952) report ever having had a mammogram. Among those for whom we had the relevant data, women had the following characteristics: (a) race (n = 237,815), 90% (n = 213,278) Caucasian; 6% (n = 14,652) African-American; 0.5% (n = 1,145) Native American or Eskimo; 2% (n = 5,214) Asian or Pacific Islander; and 1.5% (n = 3,526) Other; (b) Hispanic ethnicity, 3.5% (7,404 of 213,069); (c) education (n = 220,562), 9% (n = 20,387) less than high school; 29% (n = 63,470) high school graduate; 28% (n = 61,591) some college; 34% (n = 75,114) college or postgraduate; and (d) mammography experience, 70% (134,842 of 191,842) had a mammogram within the previous 2 years.

Table 2 shows the first assessments for the index examinations (n = 292,795) for 1997. A majority (n = 269,022; 91.88%) of women had "negative" or "benign finding" assessments (category 1 or 2, respectively), but registries differed in the proportion of each. Across all registries combined, the "negative" assessment was used much more commonly (78.89%) than the "benign finding" assessment (12.99%). Not shown in the Table is that cancer rates were 10 in 1,000 examinations for category 3, 34 in 1,000 for category 0, 167 in 1,000 for category 4, and 684 in 1,000 for category 5.


fig.ommitted TABLE 2. First Mammography Assessment for Index Screening Examinations for 1997

 

 
Table 3 shows the recommendations among all women with an assessment (n = 292,795). As noted earlier, the majority (n = 290,804; 99.3%) had a single recommendation, but some (n = 1,991; 0.76%) had two or more. In cases where two recommendations were given, one is listed in the table based on the ordinal scale described in the methods mentioned. Normal interval follow-up is the most common recommendation (92.06%). Additional imaging was the next most common recommendation (5.13%). Short interval follow-up (2.05%) was less common. Biopsy or fine-needle aspiration (0.4%) and clinical examination or surgical consult (0.36%) were the least common recommendations resulting from the screening mammogram.


fig.ommitted TABLE 3. Frequency of Recommendations for 1997

 

 
Table 4 shows the recommendations by assessments. The percentages reflect the proportion receiving a recommendation within an assessment category. The assessment and recommendation were consistent with the BI-RADS for categories 1, 2, and 0. For example, "incomplete assessments" (category 0) were almost always (95.5%) associated with a recommendation for additional imaging. Similarly, 99.62% of the women with "negative" assessments and 97.27% of the women with a "benign finding" were recommended for normal interval follow-up. A small proportion (28 [0.01%] of 292,795) of the examinations had the highest assessment and highest management recommendation in opposite breasts.


fig.ommitted TABLE 4. Recommendations according to Mammography Assessment for 1997

 

 
Assessments and recommendations were less consistent with the BI-RADS among women with assessments categories 3, 4, and 5. Among women with a "probably benign finding," a moderate proportion (40.32%) was recommended for "short interval follow-up," but "additional imaging" also was common (36.88%), and many women (18.77%) were recommended for "normal interval follow-up." Additional imaging was recommended for a relatively high proportion of women (38.71%) with an assessment of "suspicious abnormality." Even among women with an assessment "highly suggestive of malignancy," 6.58% were recommended for additional imaging.

The proportion of management recommendations consistent with assessments differed across age categories for "probably benign finding" (category 3) (P < .001) and "suspicious abnormality" (category 4) (P < .001). Among women ages 40–49, 50–59, 60–64, and 65 years or older, the proportion recommended for short interval follow-up was 38.9% (1,615 of 4,155), 39.2% (1,396 of 3,559), 33.6% (386 of 1,147), and 43.3% (1,299 of 3,000), respectively. Among women ages 40–49, 50–59, 60–64, and 65 years of age or older in category 4, the proportions recommended for biopsy, fine-needle aspiration, or clinical examination or surgical consult were 52.3% (262 of 501), 56.1% (249 of 444), 54.2% (96 of 177), and 65.6% (330 of 503), respectively. Among women in category 4, a higher proportion were recommended for additional imaging rather than biopsy, fine-needle aspiration, or clinical examination or surgical consult among women ages 40–49 years (220 [43.9%] of 501) compared with women ages 65 years and older (160 [31.8%] of 503).

The proportion of consistent recommendations differed for women with probably benign assessments whose last mammograms were more recent (P < .001). For women with a mammogram obtained within less than 1 year previously, 1–2 years, 3–4 years, more than 5 years, and never, the proportion recommended for short interval follow-up was 40.7% (374 of 919), 33.7% (1,369 of 4,063), 41.1% (546 of 1,328), 48.9% (387 of 791), and 53.7% (780 of 1,453), respectively, among women with a "probably benign finding" assessment. Also among women with a "probably benign finding" assessment, 28% (257 of 919) of those whose most recent prior mammogram occurred within less than 1 year were recommended for normal interval follow-up compared with 21.0% (852 of 4,063) of those whose most recent mammogram occurred 1–2 years previously. Additional imaging was recommended 26.7% (245 of 919) and 41.6% (1,688 of 4,063) of the time among women whose most recent prior mammogram occurred less than 1 year and 1–2 years earlier, respectively.


     DISCUSSION

Top
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
In this study, we examine the use of the BI-RADS assessments and management recommendations for the largest population of U.S. women reported to date to our knowledge. The population in this study is demographically similar to the random sample of U.S. women in the 1997 Behavioral Risk Factor Surveillance Survey (13) except for a smaller proportion of women in the oldest and youngest age categories. We found that most "negative" or "benign finding" assessments are associated with a recommendation that is consistent with the BI-RADS, and these screening assessment categories account for 91.9% (269,022 of 292,795) of all women. This consistency reflects a strong contribution of the BI-RADS to achieving the intentions of the creators of the reporting system. However, there is some inconsistency with the BI-RADS for assessments and management recommendations among the 8.1% (n = 23,773) of women with a breast abnormality. While some of the inconsistency may be addressed by teaching, clarifications in documentation, and changes in the reporting system, the inconsistency may also reflect clinical practice that needs further exploration. For example, the BI-RADS categories 3 and 4 show inconsistency that differs by patient age and when the previous mammogram occurred. This systematic inconsistency suggests that there may be something about the clinical practice of screening mammography that is not currently accommodated by the BI-RADS.

Free text might explain some apparent inconsistencies between assessments and recommendations in our data, although the text was not available for analysis. Rarely (0.11%, category 1; 0.42%, category 2; 3.13%, category 3), women with a negative, benign, or probably benign assessment were referred for clinical and/or surgical evaluation. While an incorrect use of the BI-RADS might explain the discrepancy, it is also possible that the discrepancy is due to the presence of a clinical finding. Even though the examination was intended for screening, findings occur, and depending on that finding the radiologist may recommend surgical evaluation or reexamination by the referring clinician. A limitation in these data are that we cannot separate a recommendation for clinical evaluation from one for a surgical evaluation. The surgical recommendation is inconsistent with the BI-RADS, but the text might show that it was clinically indicated if a physical finding was present. Automated mammography reporting systems might include standard text to address this situation, in which clinical findings are driving the recommendation.

Inconsistencies are much more common among the 8.1% of examinations with a breast abnormality on the image, particularly among examinations where there is a "probably benign" or "suspicious" abnormality. When assessments were recorded for each breast, a small proportion of the time (0.01% of all assessments), the most serious recommendation occurred in the breast that was contralateral to the most serious assessment. While it is a small proportion, it points out the need to confirm the laterality of any lesion before subsequent evaluation or biopsy.

Assessments and management recommendations for "probably benign" lesions are not completely consistent with the BI-RADS. Use of the assessment has been shown to vary with facility and age (14), and some controversy exists about whether it should ever be used until after full problem-solving imaging (15,16). Despite the interpretive challenges, the group of lesions appears to be highly predictive of a benign lesion in some centers, and associated with a cancer rate of 1% in our data (9,17). This suggests that the assessment category is being used correctly, since the expected rate of cancer for these lesions is less than 2% (18).

Since the cancer rate among the "probably benign" lesions is low, the problem with the implementation of the BI-RADS category 3 appears to be with the associated management recommendation. While a relatively high proportion of women with "probably benign finding" assessments received the expected short interval follow-up recommendation (4,782 [40.3%] of 11,861), a high proportion were recommended for additional imaging (n = 4,374; 36.9%), and some were recommended for "normal interval follow-up" (n = 2,227; 18.8%). We hypothesize that the high proportion recommended for additional imaging may reflect interest in conveying a level of concern to a referring clinician. The radiologist may want clarification of an ambiguous lesion that is judged likely to be cystic and therefore order additional imaging with ultrasonography. However, the radiologist may want to be clear that the risk of cancer is low so that the referring physician’s discussion conveys the appropriate level of breast cancer risk to the woman. The substantial proportion of women with annual follow-up may be due to a suggested protocol of following the abnormality annually after two 6-month follow-up mammograms (17). Our result that women with a "probably benign finding" were more likely to have a recommendation for annual mammography if they had a mammogram in the previous year supports this interpretation, but consideration should also be given to other explanations. For example, a recent prior mammogram was associated with less use of additional evaluation, suggesting that the comparison film provided a reassuring reference. This same reassurance might explain why annual follow-up, rather than 6-month follow-up, was recommended. However, more research into radiologist’s thinking in these circumstances is needed before we can provide an adequate explanation.

Improvements in the BI-RADS use might occur through some modifications in documenting instructions and teaching, although our results show that the current terminology establishes consistent recommendations for a majority of women. For example, the BI-RADS instructions might further improve consistency by specifying how to categorize "probably benign findings" that appear stable after 6- and 12-month follow-up assessments. Consideration could also be given to stating that such follow-up studies be classified as diagnostic. Teaching about the intended use of the BI-RADS category 0 could improve consistency if it meant that the category was used more frequently. The BI-RADS committee intended that additional imaging occur after category 0 assessments (Sickles EA, oral communication, 2001). Screening interpretations would therefore include categories 1, 2, and 0. Any additional imaging would be done after a category 0 interpretation, and then a category 1–5 would be assigned. It is expected that less than 10% of screening examinations would require recall for additional imaging (8). We report 5.1% of women being recommended for additional imaging, but they are distributed among several assessments, such that category 0 accounts for only 3.43% of all interpretations. Teaching might reduce recommendations for additional imaging after assessments other than "category 0, need additional imaging."

Changes in terminology could also be considered to further improve consistency. For example, we found that categories 4 and 5 assessments commonly include a proportion of women recommended for additional imaging. As noted earlier, this is not the intended use of the BI-RADS and may reflect a misunderstanding of how to use the categories. However, it may also be that the radiologist wants the evaluation of associated abnormalities to occur before a biopsy of an obvious lesion. The referring physician is therefore provided with a recommendation for additional imaging in order for an evaluation to be done in the appropriate order and to convey the radiologist’s level of concern. Because we commonly found additional imaging recommended among category 3, 4, and 5 assessments, some consideration could be given to modifying category 0 in order for the level of concern to be stated clearly, such as "incomplete assessment of a probably benign finding" and "incomplete assessment of a suspicious finding." In the former group, the imaging would be done primarily to clarify an ambiguity. In the latter group, the additional imaging would be done to clarify associated lesions and establish the extent of disease. Whether such a change would lead to more confusion or increased consistency needs further evaluation. It seems unlikely that teaching alone will eliminate the problem, if the current inconsistent use addresses a need to convey the radiologist’s level of concern. Now that the Mammography Quality Standards Act is in place, and more teaching has occurred about the BI-RADS, it will be important to see whether the inconsistency has been reduced.

One might also conclude that the BI-RADS should be implemented with only the assessment and recommendation linked by the recording software. However, implementation of recording software is still underway throughout the U.S., and many radiologists continue to dictate the assessments and recommendations. Education may increase consistency in the use of the BI-RADS terminology, but more needs to be done to evaluate the clinical concerns that may be limiting radiologists’ association of assessments and recommendations in practice. Such work would be facilitated by software that allowed automated coding to indicate that a recommendation is based on a clinical finding when there is no mammographic finding, since the BI-RADS is intended to address only mammographic findings. However, radiologists must guide the care of people, and so recommendations may be based on more than mammographic findings. Some facilities within our registries and some recording software have already implemented wording to explain management recommendations based on clinical knowledge. Such software might also identify inconsistencies between assessments and recommendations that might avoid the rare event (0.01%) noted in our data, in which the highest assessment and recommendation appeared to be in opposite breasts.

A limitation of this work is that these data were from 1997, before the Mammography Quality Standards Act required the recording of the BI-RADS assessments. However, The American College of Radiologists first published its BI-RADS in 1993, and radiologists in these registries had been using the terminology since at least 1996 and, in many cases, well before that time (1,3). The Food and Drug Administration requirement for using the BI-RADS assessments was implemented in April of 1999 and does not require their linkage to recommendations. It is unlikely that the requirement itself will therefore change the relationship between the assessments and the recommendations, but future research should evaluate this possibility.

Another limitation of this study is that it is not possible to assess how the recording systems used might have limited the association of interpretations and recommendations. The ability to independently record assessments and recommendations was a feature of recording sheets and software that was requested by radiologists at the start of this study. Registry sites used slightly different approaches but most simply had paper systems where the assessment and recommendation could each be indicated. Some facilities continued to dictate in addition to recording the findings on a paper form. While the structure of the form may have allowed the dissociation between the BI-RADS assessments and recommendations, it is impossible to assess the magnitude of the effect on this dissociation of differences in the recording systems themselves. It seems unlikely that the forms themselves did anything more than allow radiologists to record assessments and recommendations in accordance with their practice.

In conclusion, we provide evidence of how well terminology consistent with the BI-RADS was implemented in practice by 1997. While BI-RADS terminology was used consistently for the 91.9% of women with a "negative" or "benign finding" assessment, there is room for improvement in its use among the 8.1% of women who have a mammographic finding that needs additional attention.

The apparent inconsistency in use may also be important for planning educational efforts and for future research. Although the radiologists in this study all used the BI-RADS, that practice was not a requirement until April of 1999 (Mammography Quality Standards Act). In accordance with the requirement, the American College of Radiology has paid a great deal of attention to education regarding the BI-RADS terminology, including publishing the third edition of the BI-RADS manual, mailing of the manual to all American College of Radiology–accredited facilities, developing a continuing medical education course on the use of the BI-RADS, and creating a CD-ROM–based mammography interpretive skills self-assessment examination that emphasizes proper use of BI-RADS terminology (Sickles EA, oral communication, 2001) (3). Future teaching might address the inconsistencies reported here by stressing the appropriate use of "need additional imaging evaluation" and the distinction between "incomplete" and "final" assessments for screening interpretations, which is currently well documented in the BI-RADS manual. Future work should evaluate the effect of educational efforts, whether the inconsistencies reported here appear in other settings, whether the inconsistencies persist over time, and whether modifications in the reporting system are needed to address clinical realities.

 

     REFERENCES

Top
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 

  1. Kopans DB, D’Orsi CJ, Adler DD, et al. Breast Imaging Reporting and Data System Reston, Va: American College of Radiology, 1993.

  2. D’Orsi CJ, DeBor M. Reporting and communication. In: Bassett LW, Jackson VP, Jahan R, Fu YS, Gold RH, eds. Diagnosis of disease of the breast. Philadelphia, Pa: Saunders, 1997; 103-125.

  3. D’Orsi CJ, Bassett LW, Feig S, et al. American College of Radiology (ACR): Breast Imaging Reporting and Data System (BI-RADS) 3rd ed. Reston, Va: American College of Radiology, 1998.

  4. Tabar L, Vitak B, Chen HH, et al. The Swedish Two-County Trial twenty years later: updated mortality results and new insights from long-term follow-up. Radiol Clin North Am 2000; 38:625-651.

  5. Kerlikowske K, Grady D, Rubin SM, Sandrock C, Ernster VL. Efficacy of screening mammography: a meta-analysis. JAMA 1995; 273:149-154.

  6. Ballard-Barbash R, Taplin SH, Yankaskas BC, et al. Breast Cancer Surveillance Consortium: a national mammography screening and outcomes database. AJR Am J Roentgenol 1997; 169:1001-1008.

  7. Food and Drug Administration. Quality Mammography Standards. Final Rule-21 CFR Parts 16 and 900 [Docket No. 95N-0192] RIN 0910-AA24 edition Washington, DC: Department of Health and Human Services, 1997 Oct 28.

  8. Linver MN, Osuch JR, Brenner RJ, Smith RA. The mammography audit: a primer for the mammography quality standards act (MQSA). AJR Am J Roentgenol 1995; 165:19-25.

  9. Orel SG, Kay N, Reynolds C, Sullivan DC. BI-RADS categorization as a predictor of malignancy. Radiology 1999; 211:845-850.

  10. Liberman L, Feng TL, Dershaw DD, Morris EA, Abramson AF. US-guided core breast biopsy: use and cost effectiveness. Radiology 1998; 208:717-723.

  11. Berg WA, Campassi C, Langenberg P, Sexton MJ. Breast imaging reporting and data system: inter- and intraobserver variability in feature analysis and final assessment. AJR Am J Roentgenol 2000; 174:1769-1777.

  12. Carney PA, Geller BM, Moffett H, et al. Current medico-legal issues and confidentiality issues in a large multi-center research program: the National Cancer Institute’s Breast Cancer Surveillance Consortium. Am J Epidemiol 2000; 15:371-378.

  13. National Center for Chronic Disease Prevention and Health Promotion. Behavioral risk factor surveillance system 1984-1995 survey data. Behavioral Surveillance Branch CD-ROM Washington, DC: Centers for Disease Control and Prevention, U.S. Department of Health and Human Services,; Series 1, No. 1.

  14. Caplan LS, Blackman D, Nadel M, Monticciolo DL. Coding mammograms using the classification "probably benign finding short interval follow-up suggested. AJR Am J Roentgenol 1999; 172:339-342.

  15. Hall FM. Probably benign breast nodules: follow-up of selected cases without initial full-problem solving imaging (editorial). Radiology 1995; 194:305.

  16. Sickles EA. Probably benign breast nodules: follow-up of all cases requires initial full problem-solving imaging. Radiology 1995; 194:305-306.

  17. Sickles EA. Probably benign breast lesions: when should follow-up be recommended and what is the optimal follow-up protocol?. Radiology 1999; 213:11-14.

  18. Sickles EA. Management of probably benign breast lesions. Radiol Clin North Am 1995; 33:1123-1130.

作者: Stephen H. Taplin MD MPH Laura E. Ichikawa MS 2007-5-12
医学百科App—中西医基础知识学习工具
  • 相关内容
  • 近期更新
  • 热文榜
  • 医学百科App—健康测试工具