Transcript Slide 1
IS THERE A BENEFIT TO STANDARDIZING METHODS FOR GRADING EVIDENCE AND MAKING RECOMMENDATIONS – IF SO, IS GRADE "THE ONE"? Stakeholder Summit on Using Quality Systematic Reviews to Inform Evidence-based Guidelines US Cochrane Center June 4 and 5, 2009 Yngve Falck-Ytter, M.D. Assistant Professor of Medicine Case Western Reserve University Disclosure In the past 5 years, Dr. Falck-Ytter received no personal payments for services from industry. His research group received research grants from Three Rivers, Valeant and Roche that were deposited into non-profit research accounts. He is a member of the GRADE working group which has received funding from various governmental entities in the US and Europe. Some of the GRADE work he has done is supported in part by grant # 1 R13 HS016880-01 from the Agency for Healthcare Research and Quality (AHRQ). Content Why revisiting guideline methodology? GRADE approach Quality of evidence Strength of recommendations Why societies have adopted GRADE Reassessment of clinical practice guidelines Editorial by Shaneyfelt and Centor (JAMA 2009) “Too many current guidelines have become marketing and opinion-based pieces…” “AHA CPG: 48% of recommendations are based on level C = expert opinion…” “…clinicians do not use CPG […] greater concern […] some CPG are turned into performance measures…” Confidence in evidence There always is evidence “When there is a question there is evidence” Evidence alone is never sufficient to make a clinical decision Better research greater confidence in the evidence and decisions Hierarchy of evidence Randomized Controlled Trials Cohort Studies and Case Control Studies Case Reports and Case Series, Non-systematic observations Expert Opinion BIAS Expert Opinion Expert Opinion STUDY DESIGN Reasons for grading evidence? People draw conclusions about the quality of evidence and strength of recommendations Systematic and explicit approaches can help to protect against errors, resolve disagreements communicate information and fulfill needs be transparent about the process Change practitioner behavior However, wide variation in approaches GRADE working group. BMJ. 2004 & 2008 8 Which grading system? P: In patients with acute hepatitis C … I : Should anti-viral treatment be used … C: Compared to no treatment … O: To achieve viral clearance? Evidence Recommendation Organization B Class I AASLD (2009) II-1 -/- VA (2006) 1+ A SIGN (2006) -/- “Most authorities…” AGA (2006) Level of evidence in GI CPGs AASLD A B C Multiple RCTs or meta-analysis Single randomized trial, or nonrandomized studies Only consensus opinion of experts, case studies, or standard-of-care AGA ACG Good Consistent, well-designed, well conducted studies […] 1. Multiple published, well-controlled (?) randomized trials or a well designed systemic (?) metaanalysis Fair Limited by the number, quality or consistency of individual studies […] 2. One qualitypublished (?) RCT, published welldesigned cohort/ case-control studies Poor … important flaws, gaps in chain of evidence… 3. Consensus of authoritative (?) expert opinions based on clinical evidence or from well designed, but uncontrolled or non-rand. clin. trials ASGE A. RCTs B. RCT with important limitations C. Observational studies D. Expert opinion 10 What to do? 11 Limitations of existing systems Confuse quality of evidence with strength of recommendations Lack well-articulated conceptual framework Criteria not comprehensive or transparent GRADE unique breadth, intensity of development process wide endorsement and use conceptual framework comprehensive, transparent criteria Focus on all important outcomes related to a specific question and overall quality Grades of Recommendation Assessment, Development and Evaluation GRADE Working Group David Atkins, chief medical officera Dana Best, assistant professorb Martin Eccles, professord Francoise Cluzeau, lecturerx Yngve Falck-Ytter, associate directore Signe Flottorp, researcherf Gordon H Guyatt, professorg Robin T Harbour, quality and information director h Margaret C Haugh, methodologisti David Henry, professorj Suzanne Hill, senior lecturerj Roman Jaeschke, clinical professork Regina Kunx, Associate Professor Gillian Leng, guidelines programme directorl Alessandro Liberati, professorm Nicola Magrini, directorn James Mason, professord Philippa Middleton, honorary research fellowo Jacek Mrukowicz, executive directorp Dianne O’Connell, senior epidemiologistq Andrew D Oxman, directorf Bob Phillips, associate fellowr Holger J Schünemann, professorg,s Tessa Tan-Torres Edejer, medical officert David Tovey, Editory Jane Thomas, Lecturer, UK Helena Varonen, associate editoru Gunn E Vist, researcherf John W Williams Jr, professorv Stephanie Zaza, project directorw a) Agency for Healthcare Research and Quality, USA b) Children's National Medical Center, USA c) Centers for Disease Control and Prevention, USA d) University of Newcastle upon Tyne, UK e) German Cochrane Centre, Germany f) Norwegian Centre for Health Services, Norway g) McMaster University, Canada h) Scottish Intercollegiate Guidelines Network, UK i) Fédération Nationale des Centres de Lutte Contre le Cancer, France j) University of Newcastle, Australia k) McMaster University, Canada l) National Institute for Clinical Excellence, UK m) Università di Modena e Reggio Emilia, Italy n) Centro per la Valutazione della Efficacia della Assistenza Sanitaria, Italy o) Australasian Cochrane Centre, Australia p) Polish Institute for Evidence Based Medicine, Poland q) The Cancer Council, Australia r) Centre for Evidence-based Medicine, UK s) National Cancer Institute, Italy t) World Health Organisation, Switzerland u) Finnish Medical Society Duodecim, Finland v) Duke University Medical Center, USA w) Centers for Disease Control and Prevention, USA x) University of London, UK Y) BMJ Clinical Evidence, UK GRADE uptake Where GRADE fits in Prioritize problems, establish panel Systematic review Searches, selection of studies, data collection and analysis Prepare evidence profile: Quality of evidence for each outcome and summary of findings Assess overall quality of evidence Decide direction and strength of recommendation Draft guideline Consult with stakeholders and / or external peer reviewer Disseminate guideline Implement the guideline and evaluate GRADE Assess the relative importance of outcomes GRADE: Quality of evidence The extent to which our confidence in an estimate of the treatment effect is adequate to support particular recommendation. Although the degree of confidence is a continuum, we suggest using four categories: High Moderate Low Very low 17 Quality of evidence across studies Outcome #1 Quality: High Outcome #2 Quality: Moderate Outcome #3 Quality: Low III V II IB Old system GRADE Determinants of quality RCTs start high Observational studies start low What lowers quality of evidence? 5 factors: Detailed design and execution Inconsistency of results Indirectness of evidence Imprecision Publication bias What is the study design? 20 1. Design and execution Study limitations (risk of bias) For RCTs: Lack of allocation concealment No true intention to treat principle Inadequate blinding Loss to follow-up Early stopping for benefit Cochrane Risk of bias graph in RevMan 5 22 2. Consistency of results Look for explanation for inconsistency patients, intervention, comparator, outcome, methods Judgment variation in size of effect overlap in confidence intervals statistical significance of heterogeneity I2 Heterogeneity Pagliaro L et al. Ann Intern Med 1992;117:59-70 24 3. Directness of Evidence Indirect comparisons No head-to-head comparison Drug A versus drug B Tenofovir versus entecavir in hepatitis B treatment Differences in patients (early cirrhosis vs end-stage cirrhosis) interventions (CRC screening: flex. sig. vs colonoscopy) comparator (e.g., differences in dose) outcomes (non-steroidal safety: ulcer on endoscopy vs symptomatic ulcer complications) 4. Imprecision Small sample size small number of events wide confidence intervals uncertainty about magnitude of effect 5. Reporting Bias (Publication Bias) Reporting of studies publication bias number of small studies Reporting of outcomes Quality assessment criteria Quality of evidence Study design Lower if… High (4) Randomized trial Study limitations (design and execution) Moderate (3) Low (2) Very low (1) Inconsistency Observational study Indirectness Imprecision Higher if… What can raise the quality of evidence? Publication bias 28 BMJ 2003;327:1459–61 29 Quality assessment criteria Quality of evidence Study design Lower if… Higher if… High (4) Randomized trial Study limitations Large effect (e.g., RR 0.5) Very large effect (e.g., RR 0.2) Inconsistency Evidence of dose-response gradient Indirectness All plausible confounding would reduce a demonstrated effect Moderate (3) Low (2) Very low (1) Observational study Imprecision Publication bias 30 Categories of quality High Further research is very unlikely to change our confidence in the estimate of effect Moderate Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate Low Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate Very low Any estimate of effect is very uncertain 31 Judgments about the overall quality of evidence Most systems not explicit Options: Benefits Primary outcome Highest Lowest Beyond the scope of a systematic review GRADE: Based on lowest of all the critical outcomes 32 GRADE evidence profile Strength of recommendation “The strength of a recommendation reflects the extent to which we can, across the range of patients for whom the recommendations are intended, be confident that desirable effects of a management strategy outweigh undesirable effects.” Although the strength of recommendation is a continuum, we suggest using two categories : “Strong” and “Weak” Desirable and undesirable effects Desirable effects Mortality reduction Improvement in quality of life, fewer hospitalizations/infections Reduction in the burden of treatment Reduced resource expenditure Undesirable effects Deleterious impact on morbidity, mortality or quality of life, increased resource expenditure 4 determinants of the strength of recommendation Factors that can weaken the strength of a recommendation Explanation Lower quality evidence The higher the quality of evidence, the more likely is a strong recommendation. Uncertainty about the balance of benefits versus harms and burdens The larger the difference between the desirable and undesirable consequences, the more likely a strong recommendation warranted. The smaller the net benefit and the lower certainty for that benefit, the more likely is a weak recommendation warranted. Uncertainty or differences in values The greater the variability in values and preferences, or uncertainty in values and preferences, the more likely weak recommendation warranted. Uncertainty about whether the net benefits are worth the costs The higher the costs of an intervention – that is, the more resources consumed – the less likely is a strong recommendation warranted. Implications of a strong recommendation Patients: Most people in this situation would want the recommended course of action and only a small proportion would not Clinicians: Most patients should receive the recommended course of action Policy makers: The recommendation can be adapted as a policy in most situations Implications of a weak recommendation Patients: The majority of people in this situation would want the recommended course of action, but many would not Clinicians: Be prepared to help patients to make a decision that is consistent with their own values/decision aids and shared decision making Policy makers: There is a need for substantial debate and involvement of stakeholders Critical Outcome Critical Outcome Important Outcome Not High Moderate Low Very low Summary of findings & estimate of effect for each outcome Systematic review Grade down P I C O Outcome 1. 2. 3. 4. 5. Grade up RCT start high, obs. data start low Risk of bias Inconsistency Indirectness Imprecision Publication bias 1. Large effect 2. Dose response 3. Confounders Guideline development Formulate recommendations: • For or against (direction) • Strong or weak (strength) By considering: Quality of evidence Balance benefits/harms Values and preferences Revise if necessary by considering: Resource use (cost) Rate overall quality of evidence across outcomes based on lowest quality of critical outcomes • • • • “We recommend using…” “We suggest using…” “We recommend against using…” “We suggest against using…” Summary, and Why institutions adopt GRADE 1. GRADE is gaining acceptance as international standard 2. GRADE has criteria for evidence assessment across a range of questions and outcomes 3. Criteria for moving from evidence to recommendations 4. Simple, transparent, systematic 5. Balance between simplicity and methodological rigor