Building a Bridge over Troubled QALYs: Developing

Download Report

Transcript Building a Bridge over Troubled QALYs: Developing

Third Plenary Session
ISPOR 14th Annual International Meeting
Effectiveness in CEA: QALYS
Life expectancy multiplied by
health-related quality of life:
Quality-adjusted life years
Calculating QALYS:
If,
HRQL= 0.7
And,
A treatment gives 10 extra years of
life (@ 0.7 per year)
Then….
People receiving the treatment gain
an average of 7 QALYs each
A QALY is a QALY is a QALY
Saves
Lives
Improves
HRQL
#People HRQL
LE
100
x 0.8 x 50
=
=
QALYs
4000
10,000
=
4000
x
0.1
x
4
The cost-effectiveness of one thing
compared to another…
Cost treatment 1 – Cost treatment 2
Effectiveness treatment 1 – Effectiveness treatment 2
= COST per QALY
For example…
Cost
Life Expectancy HRQL
Group A
$80,000
2 Years
Group B
$ 8,000
1 Year
X
X
.6
.8
QALYS
=
=
Cost-effectiveness:
$80,000 - $8,000 = $72,000 = $180,000/QALY
1.2 – 0.8
0.4
1.2
0.8

The QALY is a widely used measure of
health gain.

Long-standing criticism of the theoretical
basis, and practical application of,
QALYs.

Plenary session, 10th Annual International
Meeting (Kahneman), May 2005.

Issues panel, 11th Annual International
Meeting (Fryback, Kahneman, McGuire),
May 2006.

Two-day invitational consensus
development workshop, November 2007.



Funding from AHRQ and NCI.
25 participants.
Discussion of:
- the basics of QALYs;
- the main challenges surrounding QALYs;
- retaining and enhancing QALYs;
- the use of QALYs in decision-making;
- towards a consensus on the QALY
Milton C. Weinstein PhD
Harvard School of Public Health, Boston MA, USA
George Torrance PhD
McMaster University, Hamilton, ON, Canada
Alistair McGuire PhD
London School of Economics, London, UK
Method for valuing health effectiveness
in cost-effectiveness analysis for
resource allocation decisions
 Values health based on time spent in
health states
 Endorsed by US Panel and NICE for
reference case


Represent individual patient preferences

Reflect equity, fairness, or political goals
Grounded in decision science (based on
expected utility theory)
 Individuals move through health states
over time.
 Each health state has a “value”
 Health = value-weighted time (QALYs)


Perfect health = 1.0, dead = 0.0

Interval scale properties
 e.g., 0.2  0.4 = 0.6  0.8

States worse than dead have
negative value
Value = preference (desirability)
 Valued by whom?

 individuals experiencing a health state or
illness
 individual who may or may not experience
that health state in the future
 individuals considering the health of a
community

(Values are for health states, not for changes in
health states)
What is being valued?
 Whom do we ask?
 What do we ask?
 How are health outcomes defined?




What is being valued?
Whom do we ask?
How are health outcomes defined?
Conventional QALYs allow for different
answers to these questions
 The answer depends on the question
being asked

Societal resource allocation: priority
setting across proposed programs or
interventions
 Societal (programmatic) audit:
evaluation of ongoing
activities/programs
 Personal clinical decisions or
decisions about insurance coverage


Personal clinical/insurance choice
 desirability of health outcomes to the
individual
 ex ante perspective

Societal/program audit
 current health of affected population
members, as valued by themselves
 ex post perspective (i.e. patient
preferences/experience utility)

Societal resource allocation
 individual health (aggregated) or
community health
 ex ante or ex post

Personal clinical/insurance
 the individual, +/- informed by
patients/disabled people

Societal/program audit
 members of the affected population

Societal/individual health
(aggregated)
or

Societal/community health
 representative sample of population
○ including patients/disabled people
○ informed by patients/disabled people







HUI-2
HUI-3
SF-6D
QWB
EQ-5D
15D
AQOL
 patient  health state
 community survey  value score
 US Panel and NICE reference case method
 different instruments give different results
 value scores may be population-specific

Health states
 valuation independent of duration or
sequence

Health states
 conventional QALY approach
 valuation independent of duration or sequence

Health paths (profiles)
 theoretically superior
 practical problem: large number of paths

Health changes
 can incorporate equity or fairness
 practical problem: large number of changes
 order of changes matters

QALYs should be discounted at the
same rate as costs (US Panel, NICE)
Erik Nord PhD
Norwegian Institute of Public Health, Oslo, Norway
Norman Daniels PhD
Department of Population and International Health, Harvard School
of Public Health, Boston, MA, USA
Mark Kamlet PhD
Heinz School of Public Policy and Management, Carnegie Mellon
University, Pittsburgh, PA,USA
The Conventional QALY

Defined as expressing the personal
utility of health outcomes as judged ex
ante, “on average,” by the general
public, from behind a veil of ignorance,
about future health, based on self
interest.
Issues:

(i) Substantial, empirical inter-method
variation in ex ante assessment
 SG yields higher values than TTO and greater
in turn than rating scale
 Which one is “right”?

(ii) Empirical unwillingness to trade-off
lifetime
 Means that less is invested in preventing the
outcome “confined to a wheel chair”
 Use of “experience” utility disfavors
prevention, use of ex-ante utility doesn’t
capture adaptation/foregone opportunities
iii. Concerns for fairness

No consideration for pretreatment health
state
○ At odds with ethical theory/public opinion that
suggests that in setting priorities societies
often emphasize how bad off individuals
would be without intervention
○ i.e. concern for “severity”
iii.Concerns for fairness

Conventional QALY model implies that the
value of an intervention is proportional to the
beneficiary’s capacity to benefit
 At odds with theory/public opinion that it should not
be held again people that they have conditions for
which there are no complete cures or whose
remaining lifetime is shorter

Similarly, life years gained for those at full
health valued more than life years gained for
those at less than full health
 Conflicts with equal right to protection of life by all
iv. Subtraction doesn’t “add up”

Standard QALYs measure differences in
health states, not gains in health
 Ex ante preference elicitation on health states and
subsequent subtraction of health state values from
one another
 Decreases data requirements, i.e. the number of
possible changes is much highter than the number
of possible states

Nonetheless, this is a proxy approach, yet to
be validated in the health economics literature
Incorporating concerns
for fairness

Count as “1” all gained life years if good
enough to be desired by affected persons
 Leads to inconsistencies with individual preferences


Place less weight on the duration of health
benefits in comparisons of programs for
patients with different life expectancies
Add explicit equity weights
 Overload the model?


Different “priority classes” for QALYS with
different ratio cut-offs
Treat “prevention” differently than “treatment”
Joseph Lipscomb PhD
Department of Health Policy and Management, Rollins School of
Public Health, Emory University, Atlanta, GA, USA
Dennis Fryback PhD
University of York, York, UK
Marthe Gold MD, MPH
University of Wisconsin, Madison, WI, USA
Dennis Revicki PhD
City University of New York Medical School, New York, NY, USA




For analyses requiring a summary measure of health
that integrates quantity of life and quality of life, QALY
is arguably the gold standard.
But should it truly be the coin of the realm?
Substantive concerns have been raised
Our conclusion (in preview): These conceptual and
methods issues signal opportunities for making
important incremental improvements in the QALY –
rather than abandoning the construct.

Conceptualization and construction of health states
- Which domains?
- Which health levels within each domain?

Psychometric approaches for eliciting preferences

Statistical strategies for deriving overall value weights
for QALYconv
For example……



Highly simple model structure: QALYconv linearly additive
function of time in health states, with an exponential
discounting factor to reflect time preference
Distributional and other ethical issues not formally
integrated into model
Some suggest that the value component of QALY should
be “experience-based” (from real-time perspective)
rather than “ex ante”

In fact, QALYconv plays important role in regulatory and
purchasing decisions in many industrialized nations
 But push back with NICE
 Non-QALY approaches being taken in France/Germany

Much less in U.S:





Not (yet) by FDA
Not (yet) by Center for Medicare & Medicaid Service
Not (yet) by most private payers
Recent studies suggest that resistance to CEA may be less than
suggested particularly given serious cost issues within public
programs and employers
Should we abandon the QALY?



QALYconv has proved to be serviceable vehicle for
quantifying joint mortality-morbidity impacts at
individual and population level
To abandon QALYconv now is to sever link to
hundreds of published studies & multiple ongoing
investigations – including many capitalizing on data
now collected in national surveys, clinical trials,
observational studies…..
A more productive pathway: pursue program of
research that takes QALYconv as starting point for
“continuous QALY improvement” over time


Prominent “health measurement systems” (e.g.,
HUI2/3, EQ-5D, QWB, and also SF-36) are the major
engines behind preference-based assessments,
including CEAs and population surveillance
Multiple applications now in national surveys of health
(e.g.,MEPS, U.S.-Canada Joint Survey, Medicare HOS)

But, basic issues about what exactly to measure
remain active topics for investigation
 All major systems view health as multidimensional concept, but
domains vary across systems
 Even for similar domains (e.g., Mobility/Ambulation) item content
differs across systems

Variations in domain structure allows selection of
particular QALYconv deemed best for application at
hand – but does not promote comparability across
studies. Solutions?
 Work toward “consensus domain structure” as one aspect of
community-based deliberative processes to identify and codify
citizen perspectives on health measurement, or else
 Cross-walk QALY scores between measurement systems

To appraise, and improve, item content (within a
domain), apply well-defined psychometric methods
 Mixed qualitative-quantitative approaches to improve content
and construct validity e.g. HUI2/3 and SF36v1/v2
 Promising development: application of item response theory
(irt) methods, e.g., Pickard et al. to study 3-Level vs. 5-Level
EQ-5D
Across the major health measurement systems, derivation
of the value component of QALYconv (the “Q part”) varies
in important ways:
 Method for eliciting preferences
- Standard gamble vs. time-tradeoff vs. visual analog scale
- Assumed duration of health state (1 day, 1 year, 10 years)

Deriving aggregate QALY score for a multidimensional
health state
- Multi-attribute utility modeling (HUI2/3)
- Econometric modeling (EQ-5D, QWB, SF-6D)

States worse than death
- EQ-5D and HUI2/3 allow for them
- QWB, SF-6D, HALex do not

Cross-walk scores across measurement
systems
 Predicting one set of QALY scores from another via
regression modeling (e.g., an EQ-5D to SF-6D mapping)
 Hierarchical irt modeling (a la Fryback et al) to map (for
example) from EQ-5D to irt-derived latent variable continuum
to SF-6D
or…….
 Initiate consensus process leading to
“Reference Case” QALY, establishing baseline
expectations about health state definition and
valuation
Additional issues raised about QALYconv
 No health state duration or sequencing
effects incorporated
 Investigate preferences over multi-state health profiles,
with states drawn from current measurement systems
(e.g., from the EQ-5D)

Value component of QALYconv based
(typically) on community-derived health state
preferences
 Instead, draw community preferences from those who’ve
experienced the states of health (Nord proposes SAVE)
 Instead, of using ex ante community preferences, use
experience-based valuations (Dolan and Kahneman)
 (The challenge is how to operationalize for efficient
application to health program evaluation)
Fairness matters – but it is not a matter that can be settled
by QALYconv
In response….

Equity Weighting: Factor fairness directly into the
preference weighting process (e.g., approaches advanced
by Nord, by Wagstaff, and by Johannesson)

Constrained Optimization Modeling: Maximizing QALY
improvement, subject to meeting equity conditions (e.g.,
as illustrated by Stinnett and Paltiel and by Chen and Bush)

Community-Based Deliberative Processes where the
implications of cost-effectiveness decisions based on
QALYconv can be examined for fidelity to public values
(e.g. Citizen’s Councils)
Must one decide between “building a bridge over
troubled QALYs” and “sailing off into the lesscharted waters” of non-QALY approaches?
False Choice!
Instead, cross that bridge and boldly set sail to new
lands,
(BUT treat the conventional QALY as the point of
departure for the development of new models in
order to capitalize on what has been learned
across many years. This will also allow us to
maintain continuity/comparability in tracking of
trends in population health and in CEAs.)
Paul Kind
University of York, York, UK
Jennifer Elston Lafata PhD
Center for Health Services Research, Henry Ford Health System,
Detroit, MI, USA
Karl Matuszewski MS, PharmD
Elsevier/Gold Standard, Tampa, FL, USA
Dennis Raisch BSPharm, MS, PhD
University of New Mexico, Albuquerque, MN, USA
 Who
are they?
 Faceless bureaucrats?
 Company management?
 Health plan CEOs?
 Mysterious, nameless committees?
 Doctors & hospitals?
 Patients?
 All of the above?

Matrix of potential users/uses of QALYs
 Patients, provider(s), employer/ins, govt.
 Observing health status, comparing to
reference norm, changes over time
 Aggregation levels (individual, groups, pop.)
Lack of consensus on HrQoL by
instrument developers, economists
 There is hope, there is no better
alternative

Pick a measure/methodology
 Incorporate into studies
 Report results
 Educate all constituencies

One health-based input to
decisions (of many)
2. Can be used at various levels in
health system
3. Reference method is required
1.
Michael Drummond DPhil
University of York, York, UK
Diana Brixner BSPharm, PhD
University of Utah, Salt Lake City, UT, USA
Marthe Gold, MD, MPH
City University of New York Medical School, New York, NY USA
Paul Kind
University of York, York, UK
Alistair McGuire PhD
London School of Economics, London, UK
Erik Nord PhD
Norwegian Institute of Public Health, Oslo, Norway
Context for consensus
Many “perspectives”
 Desire to move economic analyses
along in a manner that overcomes
outsider scepticism
 Therefore:

 Best agreement on high level principles
 Areas of departure described in the agenda
for further research
1. QALYs are but one input to
health care decision-making

There are always additional factors that society and
decision-makers must take into account, i.e.
 Equity, social justice
 Budget

These factors need to be incorporated within
healthcare decision making in a manner that allows:
 Transparency
 Legitimacy
2. QALYs can be used in different ways
at multi-levels of health care
Traditionally QALYs have been used for resource
allocation between groups in the population
 QALYs can serve additional purposes in systems
where there is no universal budget

 The context may be narrower when the
government is not the funder of health care e.g.,
○ different treatments for the same condition
○ focusing on health outcomes rather than $s.
3. Health is a determinant
of well-being
The QALY is a measure and valuation of
health, but intersectoral decision-making may
benefit from the broader context of the concept
of “well-being”
 (In the interim the QALY remains highly
serviceable for population-based health care
decision making)

4. Both ex-ante preferences
and experience should count
Health state valuations should reflect both
the experience of people who have familiarity
with them, as well as ex-ante preferences
that reflect forgone capabilities and
opportunities
 But, how do we take account of differences
between patients and the “inexperienced”
 Can differences be melded into a common
language to serve broader resource
allocation decisions?

5. Distributive issues must be
addressed satisfactorily
Cost/QALY is a measure of efficiency,
not of fairness
 Failure to attend to this distinction will
sink the credibility of economic
analyses
 (Distributive issues can be addressed
within the QALY measure itself, or
within the accompanying decisionmaking process)

6. Different methods for
valuing QALYs yield different
results
7. The QALY approach to
summing health gains over
time is simplistic
The Panel on Cost-Effectiveness in
Health and Medicine, 1996

Recommendations for Reference Case
analyses stopped short of endorsing a single
approach. Agreement:
 Generic HRQL
 0-1 scale preference based
 include domains important to the problem under
consideration;
 include effects of morbidity on productivity and
leisure

PCEHM research recommendations
 Support research that assesses the performance of
different measurement strategies in relationship to
others
 Compare valuation strategies
 Develop a catalogue of weights
8. The time has come for all good health
economists to rally behind a
Reference Case QALY

As an outcome measure for decision making,
the QALY continues to fall short of its potential
 CEAs remain non-comparable across diseases and
interventions
 Academic debate rolls on and distracts from progress
in moving economic analyses into mainstream
decision making

A process should be put in place to define a
reference case for QALY measurement
Reference Case QALY II: Ways
Forward
Acclaim a measure (without excluding
others)
 Use cross-walks that allow
interpretability between studies using
different measures
 Set up a consensus group to determine
criteria for a preferred approach

 An extant measure?
 Cross walks?
 New measure?
Michael Drummond DPhil
Professor of Health Economics
Centre for Health Economics
University of York
York, UK

Publication of the consensus workshop
proceedings in Value in Health (vol. 12,
suppl. 1, 2009) available free at:
http://www.ispor.org/meetings/Invitational/
WorkshopPhila1107.asp

Accompanying editorials, raising additional
challenges

Official pushback on QALYs, most notably
in Germany.

NICE’s supplementary guidance on ‘end of
life’ therapy.

Research agenda from the workshop.

Some additional thoughts.
The relevance of non-health objectives
to health-care decision-making.
 Case studies on the use of QALYs at
different levels in the health-care
system.
 Case studies on the use of QALYs in
decentralized, privately funded healthcare systems.

The impact of health on broader well-being.
 The role of an expanded QALY,
incorporating dimensions other than
health.
 The relationship between the valuations of
those experiencing, or having experience
of, health states and the valuations of the
general public.





Methods for briefing members of the public
on the experiences of those in particular
health states.
Comparisons between the main methods for
valuing health in respect of their
incorporation of distributional concerns.
Weighting schemes reflecting distributional
concerns.
Qualitative research into the community’s
views about distributional concerns.
Evaluation of health gains versus the
evaluation of health states.
 Research into the assumption of linearity of
preferences over time and the ways of
obtaining valuations of pathways or
profiles.
 Development of a reference case, or series
of reference cases, for estimating QALYs.

Any decision-making procedure for allocating
healthcare resources needs to weigh benefits
and costs, plus their distribution.
 All approaches are value-laden, whether the
values are made explicit or not.
 The main issue for health economics and
outcomes research is the extent to which
quantification and aggregation contributes to
the decision-making process.
