Sensitivity and specificity of classification systems for fatness in adolescents 2004年80卷第3期 | 39康复网

Martin G Neovius, Yvonne M Linné, Britta S Barkeling and Stephan O Rossner

¹ From the Obesity Unit, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden

² The data-collection phase of this study was funded by the EuropeanCommission, Quality of Life and Management of Living Resources,Key action 1 "Food, nutrition and health" program as part ofthe project entitled "Dietary and genetic influences on susceptibilityor resistance to weight gain on a high fat diet" (QLK1-2000-00515).The analysis phase was funded by Arbetsmarknadens Forsakrings-ochAktiebolag.

³ Address reprint requests to MG Neovius, Obesity Unit, KarolinskaInstitutet, Karolinska University Hospital, SE-141 86 Stockholm,Sweden. E-mail: martin.neovius{at}medhs.ki.se.

ABSTRACT
Background: Various body mass index (BMI) standards have beenproposed for defining overweight in adolescence, but few studieshave evaluated their diagnostic accuracy.

Objective: We compared the sensitivity and specificity of BMI-basedclassification systems for detecting excess fatness in adolescents.

Design: A cross-sectional analysis of 474 adolescents aged 17y was used. Body composition was measured by using densitometry.The international BMI-based systems recommended by the InternationalObesity Task Force and the World Health Organization were evaluatedon the basis of their sensitivity and specificity for detectingexcess body fat. Receiver operating characteristic analysiswas performed to derive cutoffs to maximize the sum of sensitivityand specificity. True positives were defined by using the percentagebody fat cutoffs proposed by Williams et al (Am J Public Health1992;82:358-63).

Results: For both classification systems, the specificity foroverweight was high for both sexes (0.95–1.00). The sensitivitywas fairly high for the males (0.72–0.84) but was verylow for the females (0.22–0.25). For the males, a BMIcutoff equal to the 85th percentile on a Swedish BMI referencechart maximized the sum of sensitivity and specificity whilehaving both high sensitivity (0.92) and high specificity (0.92).For the females, larger tradeoffs in specificity were neededto improve sensitivity. The mean (±SE) areas under thereceiver operating characteristic curves for the males and thefemales were 0.97 ± 0.02 and 0.85 ± 0.02, respectively.

Conclusions: Recommended international classification systemshave very high specificity, which results in few cases of nonoverweightadolescents being mislabeled as overweight. However, the sensitivityis very low in female adolescents. Thus, many overweight femaleadolescents could be missed in intervention programs that usethe proposed international BMI cutoffs as selection criteria.

Key Words: Adolescent overweight • body mass index • classification systems • percentage body fat • diagnostic accuracy • sensitivity • specificity • receiver operating characteristic analysis

INTRODUCTION
Despite the growing concern about adiposity-related problemsamong the young, no universally accepted classification systemfor adolescent obesity exists. Although body mass index (BMI;in kg/m²) is widely used for classification of adult overweightand obesity, its use in adolescents is controversial (1-3).The limitations of BMI as a measure of adiposity in the pediatricpopulation are larger than those in the adult population becauseBMI varies with age, sex, and maturation (4, 5). An additionalcomplication, for all age groups, is that relative risks associatedwith certain BMI values seem to be population dependent (6).Thus, universal classification systems are difficult to design.Currently, there are a number of proposed systems. For example,the International Obesity Task Force (IOTF) and the World HealthOrganization (WHO) have recommended different internationalclassification systems for childhood and adolescent obesity(7, 8). In addition to these systems, national variants exist(9). The controversy around the classification systems makesit difficult to monitor global and national trends, make comparisonsbetween studies, stratify for public health measures, and screenin clinical practice. Furthermore, messages to the media andhence the public might be confusing when population prevalenceestimates fluctuate depending on the choice of classificationsystem (10).

The classification system proposed by Cole et al (IOTF/Cole)(7), which is recommended by the IOTF, is gaining increasingacceptance. This system was derived mainly for global monitoring(7). However, a trend toward recommending the use of the IOTF/Colereference for clinical practice and public health measures atthe national level has developed (11). Such recommendationsare made despite the fact that the reference has not been thoroughlyevaluated in terms of screening ability and relation to morbidityand mortality (11). Two studies have hitherto focused on evaluatingclassification systems on their ability to detect excess bodyfat among children and adolescents (12, 13). A limiting factorin such attempts is the absence of reference values definingoverweight or obesity in terms of percentage body fat (%BF).The studies found that the IOTF/Cole system is highly specificbut is insensitive for finding obesity (12, 13). However, bothstudies arbitrarily defined true positives for obesity as thetop 5% in the study population. Hence, the standards used werenot anchored to health outcomes, which can be done by usinga health-related criterion to define overweight or obesity asabove certain threshold values of %BF (14, 15). The purposeof the present study was threefold: 1) to evaluate the sensitivityand specificity of recommended, international, BMI-based classificationsystems for detecting fatness, 2) to compare these systems witha national reference, and 3) to examine the influence on theanalyses of the choice of reference values for excess fatness.

SUBJECTS AND METHODS
The subjects in the Stockholm Weight Development Study were481 adolescents (n = 279 females and 202 males). Body-compositiondata were available for 474 of the subjects. The adolescentswere a subset of the offspring of 1423 women who participatedin the Stockholm Pregnancy and Weight Development Study in 1984–1985(16). The local Ethical Committee of Huddinge University Hospitalgranted ethical approval for the study. Written informed consentwas obtained from each mother, and verbal consent was also obtainedfrom each adolescent.

The BodPod Body Composition System (Life Measurement Instruments,Concord, CA) was used to measure the subjects’ weightto the nearest 0.1 kg while they were dressed only in underwear.The subjects’ standing height was measured to the nearest0.5 cm while they stood against a wall-mounted stadiometer.BMI was determined as Quetelet’s index (kg/m²).

%BF was measured by using air-displacement plethysmography withthe BodPod. The equipment was used in an enclosed room withoutwindows, where a constant environment could be kept. A seriesof repeated measurements was performed on phantoms of knownweights and volumes for the assessment of methodologic error.Two measurements were performed on each fasting subject accordingto the manufacturer’s instructions and recommendations,with the subject wearing tight-fitting underwear or a swimsuitand a swim cap (17, 18). A single air-displacement plethysmographyprocedure consisted of 2 measurements of body volume. If thesediffered by >150 mL, a third measurement was performed. Byusing preprogrammed equations, predicted lung volume was usedto calculate body volume. Appropriate corrections for thoracicgas volume and skin surface area artifact were applied to thisraw measurement to obtain actual body volume. The final resultreported by the instrumentation was calculated from the averageof the raw measurements or from the average of the closest 2measurements when 3 measurements were required. Data on bodydensity were converted to %BF by using the equation of Siri(19), as used by the software supplied by the manufacturer.

BMI-based classification systems
The IOTF/Cole system consists of sex-specific BMI percentilecurves that at age 18 y pass through the BMI cutoffs for adultoverweight and obesity of 25 and 30, respectively (7). The definitionsof adolescent overweight and obesity are thereby linked to adultrisk. The percentile curves were produced from large-surveydata from the United Kingdom, the United States, Holland, Singapore,Hong Kong, and Brazil (n = 97 876 males and 94 851 females)(7). The reference is recommended by the IOTF and is widelyused (4).

The WHO/MDD system was derived by Must, Dallal, and Dietz (MDD)from data collected in 1971–1974 as part of the firstUS National Health and Nutrition Examination Survey (NHANESI) (8). It is a sex- and age-specific percentile-based systemin which overweight and obesity (or at risk of overweight andoverweight) are defined as BMI values above the 85th and 95thpercentiles, respectively. The reference has been recommendedby several health organizations, including a WHO Expert Committee(20).

He et al (21) derived age- and sex-specific percentiles froma longitudinal study of 3650 full-term infants born in Swedenin the 1970s. In comparison with American BMI reference values,the Swedish values are much lower, especially at the higherpercentiles (21). Cutoffs for classification of overweight havenot been derived. In the present article, the age- and sex-specificBMI cutoff of the IOTF/Cole system that corresponds to a BMIof 25 at age 18 y, the WHO/MDD 85th percentile, and the He etal 85th percentile were used in defining subjects as normal-weightor overweight.

Definition of excess body fat
There are no generally accepted %BF cutoffs for excess fatnessor for overweight or obesity in children and adolescents. Severalprevious studies have defined childhood or adolescent obesityas the fattest 5% in the sample as determined by various measuresof %BF (2, 12, 13). Such a method sets the true prevalence toa fixed percentage, and although persons with higher %BF thanother persons in the group may be identified, the relation toincreased morbidity risk remains unclear and may vary. However,Williams et al (15) published %BF cutoffs derived from findingsof a significant overrepresentation of selected cardiovascularrisk factors, such as high blood pressure and unfavorable lipoproteinprofiles. In a sample of 3320 subjects aged 5–18 y, Williamset al (15) found that %BF values of 25% and 30% for males andfemales, respectively, were suitable to define excess fatness.%BF estimates were derived from skinfold thickness measurements,a method that has limitations in adolescents (4, 22). However,through the methodology used to convert the measurements to%BF, the typical errors due to heterogeneity in fat-free masswere minimized, as described by Sardinha et al (14). Therefore,it is less likely that any bias occurred (14). In the presentstudy, these criterion-based cutoffs were used as referencevalues for defining overweight to avoid setting the prevalenceto a fixed percentage by using %BF cutoffs that are unrelatedto metabolic risk.

Statistical analyses
Statistical analyses were performed by using SPSS for WINDOWS(version 11.5; SPSS Inc, Chicago). Sensitivity for fatness wasdefined as the probability of the respective systems to classifysubjects with excess fatness as overweight (true positives).Specificity was defined as the probability of classifying subjectswithout excess fatness as nonoverweight (true negatives). Receiveroperating characteristic (ROC) analysis was performed to determinecutoff values to minimize the total number of misclassificationsand evaluate the general performance of BMI in reflecting bodyfatness. ROC analysis describes the clinical performance ofscreening tests in terms of diagnostic accuracy or the abilityto correctly classify subjects into clinically relevant subgroups,as defined by a reference test (23). The diagnostic accuracyof the screening measure is evaluated by summarizing the potentialof the test to discriminate between the absence and presenceof a health condition. In the present study, the diagnosticaccuracy referred to the ability of BMI to discriminate overweightfrom nonoverweight as assessed by %BF measured with the useof air-displacement plethysmography and as defined by the %BFcutoffs proposed by Williams et al (15). In the ROC analysis,the true-positive rate (sensitivity) is plotted against thefalse-positive rate (1 – specificity) across a range ofvalues from the diagnostic test. In the present study, sex-specificcurves were constructed with %BF as the reference test and BMIas the diagnostic test. Thereafter, BMI cutoffs maximizing thesum of sensitivity and specificity were derived.

The area under the ROC curve was used as a measure of the overallperformance of the ROC curve because it reflects the probabilitythat the diagnostic test will classify correctly (24). The areaunder the ROC curve can take values between 0 and 1, where 1is a perfect screening test and 0.5 is a test equal to chance.In the ROC curves below, a line was plotted at a 45° angleto represent an area under the ROC curve of 0.5. Positive [sensitivity/(1– specificity)] and negative [(1 – sensitivity)/specificity]likelihood ratios were also calculated to express the odds thata given value of a screening test outcome would be expectedin a person with or without the target disorder, respectively.

RESULTS
Subject characteristics are presented in Table 1. The mean BMIvalues did not differ significantly between the sexes, whereasthe mean %BF was significantly higher in the females than inthe males (P < 0.001). The mean %BF for the males was almost9 percentage points lower than the recommended 25%BF cutofffor overweight in males, whereas the mean %BF for the femaleswas nearly equivalent to the proposed cutoff of 30% (Table 2).This explains the high prevalence of true positives for overweightamong the females, in comparison with the prevalence of overweightas defined by various BMI-based references (Table 2).

View this table:
TABLE 1. Subject characteristics¹

View this table:
TABLE 2. BMI (in kg/m²) and percentage body fat (%BF) cutoffs for 17-y-old males (n = 200) and females (n = 274) from 4 references¹

%BF and BMI were significantly correlated in both the malesand the females (males, r = 0.74, P < 0.01; females, r =0.72, P < 0.01). However, high correlational validity doesnot guarantee clinical validity of classification systems. Therefore,the nature and extent of misclassifications were evaluated byROC analysis.

The IOTF/Cole and WHO/MDD classification systems were highlyspecific for both sexes, but their sensitivity was very lowfor the females (Table 3). Thus, almost all adolescents labeledas overweight were truly overweight, whereas 75% of the trulyoverweight females were mislabeled as normal-weight. The resultwas similar for the widely used adult BMI cutoff of 25, whichwill be applied when the adolescents become 18 y of age.

View this table:
TABLE 3. Sensitivity and specificity for excess fatness for BMI-based references¹

Cutoffs were derived through ROC analysis to maximize the sumof sensitivity and specificity (Table 3). For the males, thiscutoff was equivalent to the 85th percentile on the SwedishBMI percentile charts. The 85th percentile for the females tradedsome specificity for a moderate increase in sensitivity, which,however, was still very low. For the females, the optimal cutoffderived by ROC analysis improved the sensitivity relative tothat of the international references but required large tradeoffsin specificity.

A comparison of the performance of the optimal cutoffs betweenthe sexes is further illustrated by the resulting positive andnegative likelihood ratios (Table 3). With the use of the optimalBMI cutoff derived from ROC analysis, a truly overweight malewould be 12 times as likely as a truly normal-weight male tobe classified as overweight, whereas a truly normal-weight malewould be only 0.09 times as likely to be classified as overweight.For the females, the optimal system performed much worse, withpositive and negative likelihood ratios of 3 and 0.30, respectively.

In both sexes, BMI was significantly better than chance as adiagnostic test for overweight (P < 0.001). The area underthe ROC curve was 0.97 for the males and 0.85 for the females,which indicates a lower probability for BMI values to producethe correct diagnosis in the females than in the males (Figures1 and 2). The lower area under the curve explains the less sensitiveand specific optimal cutoff for the females and the lower positiveand higher negative likelihood ratios.

View larger version (18K):
FIGURE 1.. Receiver operating characteristic curve for male adolescents. BMI was significantly better than chance as a diagnostic test for excess fatness [ (±SE) area under the curve = 0.97 ± 0.02; n = 200]. The 45° line represents chance as a diagnostic test (area under the curve = 0.5). Excess fatness was defined according to the percentage body fat cutoffs proposed by Williams et al (

View larger version (19K):
FIGURE 2.. Receiver operating characteristic curve for female adolescents. BMI was significantly better than chance as a diagnostic test for excess fatness [ (±SE) area under the curve = 0.85 ± 0.02; n = 274]. The 45° line represents chance as a diagnostic test (area under the curve = 0.5). Excess fatness was defined according to the percentage body fat cutoffs proposed by Williams et al (
Because there are no generally accepted reference values todefine overweight or obesity by %BF in adolescents, the resultsin the presented analysis will be determined by the choice of%BF cutoff. Therefore, the influence of different %BF cutoffson the definition of true positives was also analyzed (Table4). For the males, %BF cutoffs from 17.5% to 30% were examined.To produce equal sensitivity and specificity with the use ofthe IOTF/Cole system, true overweight in 17-y-old males wouldhave to be considered at a %BF > 30%, ie, 5 percentage pointshigher than the cutoff proposed by Williams et al (15). Forthe females, the corresponding %BF cutoff would need to be near40%, ie, 10 percentage points above the proposed cutoff.

View this table:
TABLE 4. Sensitivity analysis of the choice of percentage body fat (%BF) cutoff for definition of overweight and its influence on sensitivity and specificity for fatness¹

DISCUSSION
Few studies have evaluated proposed BMI-based classificationsystems for adolescent obesity for their respective diagnosticaccuracy in detecting fatness (12, 13). In several studies,correlational analyses between different measures of fatnesswere conducted, but such studies can show only the closenessof association, not the extent and type of misclassifications.Therefore, ROC analysis was used to evaluate the clinical validityof BMI as a diagnostic tool for detecting excess fatness. Weevaluated the sensitivity, specificity, and positive and negativelikelihood ratios of the BMI-based classification systems recommendedfor international use by the IOTF and the WHO (7, 20). The resultswere compared with a national BMI reference, and BMI cutoffsmaximizing the sum of sensitivity and specificity were alsoderived from the sample.

For identification of Swedish adolescents with excess fatness,the IOTF/Cole and WHO/MDD classification systems were shownto have very high specificity in both sexes, but the sensitivitywas very low in the females. All the females classified as overweightwere truly overweight, but 75% of the truly overweight femaleswere misclassified as having normal weight. Thus, many overweightfemales would be missed in intervention programs using BMI asthe selection criterion. Through ROC analysis, cutoffs werederived for the males to further improve the tradeoff betweensensitivity and specificity. This cutoff was identical to the85th BMI percentile for 17-y-old Swedish males from a nationalreference (21). For the females, BMI proved to be less validin classifying persons with excess fat as overweight. The cutoffsfor optimizing the tradeoff in females were much lower thanthe ones recommended by the IOTF and the WHO, but the positivelikelihood ratio was low, and the negative likelihood ratiowas fairly high.

The choice of %BF cutoff to define true positives or true overweightdetermines the results in this kind of analysis and could beused as an argument to reject the results. However, to produceequal sensitivity and specificity when using the IOTF/Cole systemto classify overweight, the %BF cutoffs would need to be 30%for males and nearly 40% for females. For males and females,respectively, the cutoffs suggested in the literature are 25%and 30% (criterion-based for 5–18 y of age) (15), 20%and 30% (criterion-based for 9–15 y of age) (25), and21% and 34% (%BF at 17 y of age corresponding to a BMI of 25at 18 y of age) (26). From these suggested cutoffs and the resultsfrom the various %BF cutoffs provided in the present article,it seems fairly safe to conclude that the IOTF/Cole and WHO/MDDclassification systems are highly specific in both sexes butare insensitive for overweight in 17-y-old Swedish females.

Evidence of screening ability and relation to morbidity is availablefor some national BMI-based classification systems (27, 28)but is scarce for the proposed international systems (29). Ina comparison of screening ability for obesity in a British populationbetween 1990 reference data for the United Kingdom and the IOTF/Colesystem, Reilly et al (30) found that the IOTF/Cole system hadlow sensitivity in girls and very low sensitivity in boys. Thisresult is in fairly good agreement with our results for overweight17-y-old females but not with our results for males. In theaforementioned study, true positives for obesity were definedas subjects belonging to the top 5% of the %BF distribution(30). The use of that kind of distribution-based definitionof excess fatness has been criticized (14), because the average%BF associated with a specific percentile may vary considerably(2, 14). Thus, male and female children and adolescents withthe highest %BF in the group may be identified, but they donot necessarily need to be overweight or display elevated cardiovascularrisk factors (14).

In the present study, %BF cutoffs defined by a biological endpointwere used instead (15). Proposed %BF cutoffs derived from abiological endpoint approach can be criticized on the groundsof study population, sample size, and chosen endpoints. Thecutoffs used in the present study can specifically be criticizedfor not being age specific (26). Thus, a systematic underestimationof the proportion of excess adiposity in younger subjects andan overestimation in older subjects are likely to result, especiallyin females (26). These effects may have contributed to the largediscrepancy in prevalence estimates between the BMI-based and%BF-based classification systems in the females. There are noreported prevalence estimates of overweight for 17-y-old Swedesbased on %BF or morbidity. In 1998 the prevalence of overweight(BMI > 25) in 16-84-y-old women was 38%, but the prevalencein 16–24-y-old women was only 12% (31).

Furthermore, the %BF cutoffs used were derived from a biracialAmerican sample. However, reference values for healthy %BF rangeshave not been published for Swedish, Scandinavian, or Europeanpopulations. Using cutoffs anchored to metabolic risk appearsto be the best available alternative and has been used for otherEuropean samples in similar evaluations (14). In addition, possiblepopulation differences are likely to be within the %BF cutoffranges included in the sensitivity analysis in the present article,which supports the conclusions of the present study.

The choice of method for estimating %BF is also a source ofpotential variation in results between studies. Mei et al (32)used both dual-energy X-ray absorptiometry and skinfold-thicknessmeasurements to estimate %BF when comparing the sensitivityand specificity for fatness of BMI compared with those of weight/height³;Sardinha et al (14) used dual-energy X-ray absorptiometry whenevaluating BMI, triceps skinfold thickness, and upper arm girth;Reilly et al (13) and Fu et al (12) used bioelectrical impedancewhen evaluating the IOTF/Cole classification system; and inthe present study, densitometry by air-displacement plethysmographywas used. Air-displacement plethysmography has been proven toproduce %BF estimates of comparable accuracy to those producedby dual-energy X-ray absorptiometry and hydrostatic weighing(33).

With the assumption that air-displacement plethysmography producesaccurate and valid %BF measurements and that true positivesfor overweight are defined by the %BF cutoffs proposed by Williamset al (15), the results from the present study clearly showthe tradeoffs between sensitivity and specificity when applyingdifferent classification systems for overweight. Which systemto recommend for national use is not obvious, because such arecommendation is dependent on the purpose of the system. Theoptimal cutoffs derived in the present study maximize the sumof sensitivity and specificity, which may be considered optimalfor selective public health interventions. For clinical practice,minimizing the number of false positives is often preferredto avoid the stigma associated with being mislabeled as obesein adolescence. However, many true positives will be missedas a consequence, unless ancillary measures are used in conjunction.

This study examined only the diagnostic accuracy of the classificationsystems for detecting fatness. Future studies need to evaluatethe diagnostic accuracy for directly detecting cardiovascularrisk factors. Such studies have been conducted to some extentin adults and prepubertal children (34, 35).

In conclusion, the tradeoff between sensitivity and specificityshould be analyzed in detail before making general recommendationsabout classification systems for overweight. The diagnosticdemands on a classification system intended for use in clinicalpractice are different from those on systems intended for publichealth use or monitoring. Therefore, recommendations shouldbe explicit regarding the setting in which suggested systemsshould be used. A multipurpose system may be the easiest toimplement but would not suit the varying demands of public health,clinical practice, and monitoring. An international referenceis a compromise to obtain acceptable, comparable prevalenceestimates at the global level. At the national level, giventhe probable population differences in relative risks at certainBMI values, the seriousness of the adolescent obesity problem,and its character as a major cost driver through obesity-relatedillnesses, customized systems derived from national data arelikely to be more efficient. Such systems should therefore bedeveloped.

ACKNOWLEDGMENTS
We especially thank Catharina Grimming, Eva Hedlund, Maria Saxer,and Karin Vagstrand for providing help and support to the study.We also thank James Stubbs (Rowett Institute) and Paul Higgins(University of Alabama at Birmingham) for valuable commentsand discussions and the unit for Preventive Nutrition, KarolinskaInstitutet, for providing BodPod equipment support.

YML and BSB were the lead epidemiologists on the project andwere primarily responsible for developing the study design forthe Stockholm Pregnancy and Women’s Nutrition Study (1999)and the follow-up Stockholm Weight Development Study (2002).They also supervised the data collection and helped in editingthe manuscript. MGN provided critical input for the conceptionof this particular article, was responsible for conducting theanalyses, performed the statistical analyses, and drafted themanuscript. SOR was the principal investigator; conceived theidea of the 3 studies in 1984, 1999, and 2002, respectively;assisted with the study design; and provided help with manuscriptrevision. None of the authors had any conflicts of interest.

REFERENCES

Maynard LM, Wisemandle W, Roche AF, Chumlea WC, Guo SS, Siervogel RM. Childhood body composition in relation to body mass index. Pediatrics 2001;107:344–50.
Lazarus R, Baur L, Webb K, Blyth F. Body mass index in screening for adiposity in children and adolescents: systematic evaluation using receiver operating characteristic curves. Am J Clin Nutr 1996;63:500–6.
Lindsay RS, Hanson RL, Roumain J, Ravussin E, Knowler WC, Tataranni PA. Body mass index as a measure of adiposity in children and adolescents: relationship to adiposity by dual energy x-ray absorptiometry and to cardiovascular risk factors. J Clin Endocrinol Metab 2001;86:4061–7.
Burniat W, Cole TJ, Lissau I, Poskitt E. Child and adolescent obesity. 1st ed. Cambridge, United Kingdom: Cambridge University Press, 2002.
Guo SS, Chumlea WC, Roche AF, Siervogel RM. Age- and maturity-related changes in body composition during adolescence into adulthood: the Fels Longitudinal Study. Int J Obes Relat Metab Disord 1997;21:1167–75.
WHO Expert Consultation. Appropriate body-mass index for Asian populations and its implications for policy and intervention strategies. Lancet 2004;363:157–63.
Cole TJ, Bellizzi MC, Flegal KM, Dietz WH. Establishing a standard definition for child overweight and obesity worldwide: international survey. BMJ 2000;320:1240–3.
Must A, Dallal GE, Dietz WH. Reference data for obesity: 85th and 95th percentiles of body mass index (wt/ht²) and triceps skinfold thickness. Am J Clin Nutr 1991;53:839–46.
Guillaume M. Defining obesity in childhood: current practice. Am J Clin Nutr 1999;70(suppl):126S–30S.
Rolland-Cachera MF, Castetbon K, Arnault N, et al. Body mass index in 7–9-y-old French children: frequency of obesity, overweight and thinness. Int J Obes Relat Metab Disord 2002;26:1610–6.
Reilly JJ. Assessment of childhood obesity: national reference data or international approach? Obes Res 2002;10:838–40.
Fu WP, Lee HC, Ng CJ, et al. Screening for childhood obesity: international vs population-specific definitions. Which is more appropriate? Int J Obes Relat Metab Disord 2003;27:1121–6.
Reilly JJ, Dorosty AR, Emmett PM. Identification of the obese child: adequacy of the body mass index for clinical practice and epidemiology. Int J Obes Relat Metab Disord 2000;24:1623–7.
Sardinha LB, Going SB, Teixeira PJ, Lohman TG. Receiver operating characteristic analysis of body mass index, triceps skinfold thickness, and arm girth for obesity screening in children and adolescents. Am J Clin Nutr 1999;70:1090–5.
Williams DP, Going SB, Lohman TG, et al. Body fatness and risk for elevated blood pressure, total cholesterol, and serum lipoprotein ratios in children and adolescents. Am J Public Health 1992;82:358–63.
Ohlin A, Rossner S. Maternal body weight development after pregnancy. Int J Obes 1990;14:159–73.
Dempster P, Aitkens S. A new air displacement method for the determination of human body composition. Med Sci Sports Exerc 1995;27:1692–7.
McCrory MA, Gomez TD, Bernauer EM, Mole PA. Evaluation of a new air displacement plethysmograph for measuring human body composition. Med Sci Sports Exerc 1995;27:1686–91.
Siri WE. Body composition from fluid spaces and density: analysis of methods. 1961. Nutrition 1993;9:480–92.
WHO. Physical status: the use and interpretation of anthropometry. Geneva: WHO, 1995.
He Q, Albertsson-Wikland K, Karlberg J. Population-based body mass index reference values from Goteborg, Sweden: birth to 18 years of age. Acta Paediatr 2000;89:582–92.
Lohman TG. Applicability of body composition techniques and constants for children and youths. Exerc Sport Sci Rev 1986;14:325–57.
Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem 1993;39:561–77.
Hanley JA. The robustness of the "binormal" assumptions used in fitting ROC curves. Med Decis Making 1988;8:197–203.
Dwyer T, Blizzard CL. Defining obesity in children by biological endpoint rather than population distribution. Int J Obes Relat Metab Disord 1996;20:472–80.
Taylor RW, Jones IE, Williams SM, Goulding A. Body fat percentages measured by dual-energy X-ray absorptiometry corresponding to recently recommended body mass index cutoffs for overweight and obesity in children and adolescents aged 3–18 y. Am J Clin Nutr 2002;76:1416–21.
Dietz WH. Health consequences of obesity in youth: childhood predictors of adult disease. Pediatrics 1998;101:518–25.
Morrison JA, Barton BA, Biro FM, Daniels SR, Sprecher DL. Overweight, fat patterning, and cardiovascular disease risk factors in black and white boys. J Pediatr 1999;135:451–7.
Reilly JJ, Wilson ML, Summerbell CD, Wilson DC. Obesity: diagnosis, prevention, and treatment; evidence based answers to common questions. Arch Dis Child 2002;86:392–4.
Reilly JJ, Savage SA, Ruxton CH, Kirk TR. Assessment of obesity in a community sample of prepubertal children. Int J Obes Relat Metab Disord 1999;23:217–9.
The Swedish Council on Technology Assessment in Health Care. Obesity—problems and solutions: a systematic literature review. 1st ed. Gothenburg, Sweden: Elanders Graphic Systems, 2002.
Mei Z, Grummer-Strawn LM, Pietrobelli A, Goulding A, Goran MI, Dietz WH. Validity of body mass index compared with other body-composition screening indexes for the assessment of body fatness in children and adolescents. Am J Clin Nutr 2002;75:978–85.
Fields DA, Goran MI, McCrory MA. Body-composition assessment via air-displacement plethysmography in adults and children: a review. Am J Clin Nutr 2002;75:453–67.
Ito H, Nakasuga K, Ohshima A, et al. Detection of cardiovascular risk factors by indices of obesity obtained from anthropometry and dual-energy X-ray absorptiometry in Japanese individuals. Int J Obes Relat Metab Disord 2003;27:232–7.
Higgins PB, Gower BA, Hunter GR, Goran MI. Defining health-related obesity in prepubertal children. Obes Res 2001;9:233–40.

Received for publication December 5, 2003. Accepted for publication April 2, 2004.

作者： Martin G Neovius