Seminars in Hematology
Volume 45, Issue 3, Pages 189-195, July 2008

Interpreting Diagnostic Test Accuracy Studies

  • Patrick M.M. Bossuyt

    • Address correspondence to Patrick M.M. Bossuyt, PhD, Department of Clinical Epidemiology and Biostatistics, Academic Medical Center, University of Amsterdam, Room J1b-214, PO Box 22700, 1100 DE Amsterdam, the Netherlands.

References 

  1. Knottnerus JA, van Weel C, Muris JW. Evaluation of diagnostic procedures. BMJ. 2002;324:477–480
  2. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD initiative (Standards for Reporting of Diagnostic Accuracy). Clin Chem. 2003;49:1–6
  3. Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3:32–35
  4. Hilden J, Glasziou P. Regret graphs, diagnostic uncertainty and Youden's index. Stat Med. 1996;15:969–986
  5. Hilden J. Prevalence-free utility-respecting summary indices of diagnostic power do not exist. Stat Med. 2000;19:431–440
  6. Hunink MG, Glasziou P, Siegel JE, Weeks JC, Pliskin JS, Elstein AS, et al. Decision Making in Health and Medicine: Integrating Evidence and Values. Cambridge, UK: Cambridge University Press; 2001
  7. Righini M, Aujesky D, Roy PM, Cornuz J, De Moerloose P, Bounameaux H, et al. Clinical usefulness of d-dimer depending on clinical probability and cutoff value in outpatients with suspected pulmonary embolism. Arch Intern Med. 2004;164:2483–2487
  8. McGee S. Simplifying likelihood ratios. J Gen Intern Med. 2002;17:646–649
  9. Glas AS, Lijmer JG, Prins MH, Bonsel GJ, Bossuyt PM. The diagnostic odds ratio: A single indicator of test performance. J Clin Epidemiol. 2003;56:1129–1135
  10. Rostoff P, Piwowarska W, Gackowski A, Konduracka E, El Massri N, Latacz P, et al. Electrocardiographic prediction of acute left main coronary artery occlusion. Am J Emerg Med. 2007;25:852–855
  11. Linn S. New patient-oriented diagnostic test characteristics analogous to the likelihood ratios conveyed information on trustworthiness. J Clin Epidemiol. 2005;58:450–457
  12. Kraemer HC. Evaluating Medical Tests: Objective and Quantitative Guidelines. Newbury Park, CA: Sage Publications; 1992
  13. Rutjes AW, Reitsma JB, Vandenbroucke JP, Glas AS, Bossuyt PM. Case-control and two-gate designs in diagnostic accuracy studies. Clin Chem. 2005;51:1335–1341
  14. Smith TP. Pulmonary embolism: What's wrong with this diagnosis? AJR Am J Roentgenol. 2000;174:1489–1497
  15. Anderson DR, Kahn SR, Rodger MA, Kovacs MJ, Morris T, Hirsch A, et al. Computed tomographic pulmonary angiography vs ventilation-perfusion lung scanning in patients with suspected pulmonary embolism: A randomized controlled trial. JAMA. 2007;298:2743–2753
  16. Schrecengost JE, LeGallo RD, Boyd JC, Moons KG, Gonias SL, Rose CE, et al. Comparison of diagnostic accuracies in outpatients and hospitalized patients of d-dimer testing for the evaluation of suspected pulmonary embolism. Clin Chem. 2003;49:1483–1490
  17. Wells PS, Anderson DR, Rodger M, Stiell I, Dreyer JF, Barnes D, et al. Excluding pulmonary embolism at the bedside without diagnostic imaging: Management of patients with suspected pulmonary embolism presenting to the emergency department by using a simple clinical model and d-dimer. Ann Intern Med. 2001;135:98–107
  18. Moons KG, van Es GA, Deckers JW, Habbema JD, Grobbee DE. Limitations of sensitivity, specificity, likelihood ratio, and Bayes' theorem in assessing diagnostic probabilities: A clinical example. Epidemiology. 1997;8:12–17
  19. Diamond GA. Reverend Bayes' silent majority (An alternative factor affecting sensitivity and specificity of exercise electrocardiography). Am J Cardiol. 1986;57:1175–1180
  20. Irwig L, Bossuyt P, Glasziou P, Gatsonis C, Lijmer J. Designing studies to ensure that estimates of test accuracy are transferable. BMJ. 2002;324:669–671
  21. Pepe MS. The Statistical Evaluation of Medical Tests for Classification and Prediction. Oxford, UK: Oxford University Press; 2003
  22. Bachmann LM, Puhan MA, ter Riet G, Bossuyt PM. Sample sizes of studies on diagnostic accuracy: Literature survey. BMJ. 2006;332:1127–1129
  23. Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. 2005;58:982–990
  24. Harbord RM, Deeks JJ, Egger M, Whiting P, Sterne JA. A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics. 2007;8:239–251
  25. Deeks JJ. Systematic reviews in health care: Systematic reviews of evaluations of diagnostic and screening tests. BMJ. 2001;323:157–162
  26. Smidt N, Rutjes AW, van der Windt DA, Ostelo RW, Reitsma JB, Bossuyt PM, et al. Quality of reporting of diagnostic accuracy studies. Radiology. 2005;235:347–353
  27. Siddiqui MA, Azuara-Blanco A, Burr J. The quality of reporting of diagnostic accuracy studies published in ophthalmic journals. Br J Ophthalmol. 2005;89:261–265
  28. Rama KR, Poovali S, Apsingi S. Quality of reporting of orthopaedic diagnostic accuracy studies is suboptimal. Clin Orthop Relat Res. 2006;447:237–246
  29. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD Initiative. Ann Intern Med. 2003;138:40–44
  30. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, et al. The STARD statement for reporting studies of diagnostic accuracy: Explanation and elaboration. Clin Chem. 2003;49:7–18
  31. Smidt N, Rutjes AW, van der Windt DA, Ostelo RW, Bossuyt PM, Reitsma JB, et al. The quality of diagnostic accuracy studies since the STARD statement: Has it improved? Neurology. 2006;67:792–797
  32. Lilienfeld DE. Abe and Yak: The interactions of Abraham M. Lilienfeld and Jacob Yerushalmy in the development of modern epidemiology (1945-1973). Epidemiology. 2007;18:507–514
  33. Ledley RS, Lusted LB. Probability, logic and medical diagnosis. Science. 1959;130:892–930
  34. Miettinen OS, Henschke CI, Yankelevitz DF. Evaluation of diagnostic imaging tests: diagnostic probability estimation. J Clin Epidemiol. 1998;51:1293–1298
  35. Moons KG, Harrell FE. Sensitivity and specificity should be de-emphasized in diagnostic accuracy studies. Acad Radiol. 2003;10:670–672
  36. Guggenmoos-Holzmann I, van Houwelingen HC. The (in)validity of sensitivity and specificity. Stat Med. 2000;19:1783–1792
  37. Perera R, Heneghan C. Making sense of diagnostic tests likelihood ratios. Evidence Based Med. 2006;11:130–131
  38. Grimes DA, Schulz KF. Refining clinical diagnosis with likelihood ratios. Lancet. 2005;365:1500–1505
  39. Jaeschke R, Guyatt G, Sackett DL. Users' guides to the medical literature (III. How to use an article about a diagnostic test. A. Are the results of the study valid? Evidence-Based Medicine Working Group). JAMA. 1994;271:389–391
  40. Chien PF, Khan KS. Evaluation of a clinical test (II: Assessment of validity). BJOG. 2001;108:568–572
  41. Puhan MA, Steurer J, Bachmann LM, ter Riet G. A randomized trial of ways to describe test accuracy: The effect on physicians' post-test probability estimates. Ann Intern Med. 2005;143:184–189
  42. Fischer JE, Bachmann LM, Jaeschke R. A readers' guide to the interpretation of diagnostic test properties: Clinical example of sepsis. Intensive Care Med. 2003;29:1043–1051
  43. Bossuyt PM, Irwig L, Craig J, Glasziou P. Comparative accuracy: Assessing new tests against existing diagnostic pathways. BMJ. 2006;332:1089–1092
  44. Hutchinson JM, Gigerenzer G. Simple heuristics and rules of thumb: Where psychologists and behavioural biologists might meet. Behav Processes. 2005;69:97–124
  45. Feinstein AR. Misguided efforts and future challenges for research on “diagnostic tests.” J Epidemiol Community Health. 2002;56:330–332
  46. Mrus JM. Getting beyond diagnostic accuracy: Moving toward approaches that can be used in practice. Clin Infect Dis. 2004;38:1391–1393
  47. Fryback DG, Thornbury JR. The efficacy of diagnostic imaging. Med Decis Making. 1991;11:88–94
  48. Bossuyt PM, Lijmer JG, Mol BW. Randomised comparisons of medical tests: Sometimes invalid, not always efficient. Lancet. 2000;356:1844–1847
  49. Lord SJ, Irwig L, Simes RJ. When is measuring sensitivity and specificity sufficient to evaluate a diagnostic test, and when do we need randomized trials? Ann Intern Med. 2006;144:850–855

PII: S0037-1963(08)00060-7

doi: 10.1053/j.seminhematol.2008.04.001
