Annotated Bibliography
RESEARCH ARTICLE
What utility scores do mental health service
users, healthcare professionals and members
of the general public attribute to different
health states? A co-produced mixed methods
online survey
Chris FloodID 1☯*, Sally Barlow1☯, Alan Simpson1, Amanda Burls2, Amy Price3,
Martin CartwrightID 2 , Stefano BriniID
2 , Service User and Carer Group Advising on
Research (SUGAR) members 1¶
1 Centre for Mental Health Research, School of Health Sciences, University of London, London and East
London NHS Foundation Trust, London, United Kingdom, 2 Centre for Health Services Research, School of
Health Sciences, University of London, London, United Kingdom, 3 Department of Continuing Education,
University of Oxford, Oxford, United Kingdom
☯ These authors contributed equally to this work. ¶ Membership of the Service User and Carer Group Advising on Research (SUGAR) is provided in the
Acknowledgments.
Abstract
Background
Utility scores are integral to health economics decision-making. Typically, utility scores have
not been scored or developed with mental health service users. The aims of this study were
to i) collaborate with service users to develop descriptions of five mental health states (psy-
chosis, depression, eating disorder, medication side effects and self-harm); ii) explore feasi-
bility and acceptability of using scenario-based health states in an e-survey; iii) evaluate
which utility measures (standard gamble (SG), time trade off (TTO) and rating scale (RS))
are preferred; and iv) determine how different participant groups discriminate between the
health scenarios and rank them.
Design and methods
This was a co-produced mixed methods cross-sectional online survey. Utility scores were
generated using the SG, TTO and RS methods; difficulty of the completing each method,
markers of acceptability and participants’ preference were also assessed.
Results
A total of 119 participants (58%) fully completed the survey. For any given health state, SG
consistently generated higher utility scores compared to RS and for some health states
higher also than TTO (i.e. SG produces inflated utility scores relative to RS and TTO).
Results suggest that different utility measures produce different evaluations of described
health states. The TTO was preferred by all participant groups over the SG. The three
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 1 / 18
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Flood C, Barlow S, Simpson A, Burls A,
Price A, Cartwright M, et al. (2018) What utility
scores do mental health service users, healthcare
professionals and members of the general public
attribute to different health states? A co-produced
mixed methods online survey. PLoS ONE 13(10):
e0205223. https://doi.org/10.1371/journal.
pone.0205223
Editor: Takeru Abe, Yokohama City University,
JAPAN
Received: February 16, 2017
Accepted: September 21, 2018
Published: October 23, 2018
Copyright: © 2018 Flood et al. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: Data are from the
’What utility scores do mental health service users,
carers and professionals give to different health
states? A co-produced research study using a
mixed methods online survey and analysis’ study
whose authors may be contacted at c.flood@city.
ac.uk. At the time of submission the authors
expressed that they could not unfortunately make
publicly available our raw data for ethical and legal
reasons, as public availability could compromise
participant groups scored four (of five) health scenarios comparably. Psychosis scored as
the worst health state to live with while medication side-effects were viewed more positively
than other scenarios (depression, eating disorders, self-harm) by all participant groups.
However, there was a difference in how the depression scenario was scored, with service
users giving depression a lower utility score compared to other groups.
Conclusion
Mental health state scenarios used to generate utility scores can be co-produced and are
well received by a broad range of participants. Utility valuations using SG, TTO and RS
were feasible for use with service users, carers, healthcare professionals and members of
the general public. Future studies of utility scores in psychiatry should aim to include mental
health service users as both co-investigators and respondents.
Introduction
Mental ill health is a key contributor to the burden of disease [1] costing an estimated £70-
£100 billion per year in the United Kingdom (UK), equivalent to 4.5% of gross domestic prod-
uct (GDP) [2]. Over half of this cost relates to reduced quality of life [3]. There is a need to
prioritise interventions that are cost-effective and target health states that service users report
have the greatest impact on their lives.
Traditionally, health economists and policy-makers use health utilities to estimate treat-
ment cost-effectiveness and inform prioritizing decision-making [4]. Utility is a weighted and
scaled method of quantifying a person’s preference for an experienced or hypothetical health
state [5]. Utility scores are obtained by asking people to evaluate their preference for living in
particular health states (e.g., depression) or experiencing a health-related event (e.g., medica-
tion side effects). This evaluation may draw on a person’s current or past experience, or their
imagining of what it would be like to live with the health state in the future [6]. Service users
who live with mental illness/distress and receive mental health treatments are well-placed to
inform policymakers on the impact of mental ill health on their quality of life.
Utility scores can be elicited using various methodologies including time trade-off (TTO),
standard gamble (SG) and rating scales (RS) [7]. These methods ask people to consider hypo-
thetical health states and either trade an improvement in health for a reduction in time alive
(TTO) or a greater risk of death (SG), or to rate the health state on a scale (RS). Several factors
are important when choosing a utility method to use. These include their psychometric prop-
erties such as: validity (does the method elicit a true preference for the health state?); reliability
(does the method elicit reproducible scores?); feasibility (is the method practical for the target
population and setting?); and acceptability for the target population. In situations where there
is relatively little experience with making health state valuations it is desirable to employ sev-
eral methods in parallel to determine which is the most suitable [8]. However, when employed
in parallel, different methods can yield different utility scores raising legitimate concerns
about how patients, commissioners and policy-makers should use the evidence from different
utility methods to guide their decisions [9, 10].
The SG is considered the ‘gold standard’ method because it includes an element of uncer-
tainty, thought to reflect real world uncertainty over decisions about health and healthcare.
The TTO and RS do not involve uncertainty but rather derive utility values which may be
transformed to utility scores [11]. Some studies report that the SG method generates higher
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 2 / 18
patient confidentiality or participant privacy. In
terms of providing the original analysed data, they
have sought further advice from their research
centre’s Ethics Committee who originally gave their
study ethical approval. They have advised that as
the authors’ original ethics permissions did not
specify to respondents that their data would be
made available to third parties, that this could be
deemed an unethical practice and also break
permissions under the UK Data Protection Act and
the General Data Protection Regulation (GDPR)
which has just become law in the UK (May 2018).
In addition to ethical and legal concerns the
authors are also mindful that some of the
qualitative data – though for the most part unlikely
to identify mental health service users, has a
potential for vulnerable individuals who participated
in the study to recognise their responses that may
end up being available in the public domain,
(significantly without their consent), and thus
cause them distress. For further independent
verification of this please contact Dr Nick Drey on
[email protected], the Chair of the School of
Health Sciences Ethics Committee at City,
University of London.
Funding: The author(s) received no specific
funding for this work.
Competing interests: The authors have declared
that no competing interests exist.
scores (indicating more positive evaluations, or less negative evaluations, of the health state)
than TTO and RS [12]. Similarly some have found that TTO is scored higher than RS [7].
Whether societal preference (amongst the general population) or experience-informed
preference (amongst patients) should guide policy-making remains contested [4]. Gold et al.
(1996) propose that societal preferences should be used for macro-level decision-making and
patient preferences for meso-level (guideline development) decision-making [13]. Utility
scores derived from patients may differ from those of the general population or other specialist
groups such as healthcare professionals. For example, people experiencing multiple health
states give greater weight to mental health states than physical health states, compared to the
scoring of the general population [4, 14]. However, some authors have raised concerns over
the challenges of producing fair and balanced evaluations of health states for individuals who
have personally experienced the health state or symptoms described [15].
Utility scores are widely used for priority setting and resource allocation for physical health
states, but less frequently for mental health states [11, 16]. However, there is little evidence of
service users’ inclusion in the development or scoring of valuations in mental health states
[11]. Emphasis has been placed on the cognitive challenge that scoring health utilities poses
and how some mental illnesses may limit comprehension of the task [17]. Despite these con-
cerns, empirical studies demonstrate it is feasible to derive health utility scores from patients
with severe and enduring mental illnesses such as schizophrenia [15, 17–19], bipolar disorder
[20], depression [11] and affective and alcohol related disorders [15]. These studies have dem-
onstrated that service users can discriminate by disease severity and medication side effects
[20]. It has been recognised that the questions and procedures used to generate utility scores
are abstract and challenging [21] and there have been recommendations that methodology
should be refined to accommodate patients’ ‘mental status’ [17]. Another significant concern
around framing effect biases, can be reduced through the involvement of mental health service
users in developing the health state descriptions used to elicit utility scores, [7] and is central
to the research approach reported in this paper.
A comparative research design, with study materials co-produced with service users, may
reduce some of the limitations to help achieve more valid utility measures. Studies comparing
SG and TTO are usually exclusively quantitative. Our study also includes qualitative elements
to gain insight into the acceptability of the different utility methods and explore factors influ-
encing participants’ values and preferences in health state valuations [22]. We also examine
whether health utilities can be measured remotely using an e-survey.
Study objectives were:
• To co-develop descriptions of mental health states from which utility scores could be
derived, and co-produce utility questions that are understandable to service users.
• To assess the feasibility and acceptability of using scenario-based health states to measure
health utility.
• To determine which utility measure is preferred and how participant groups discriminate
between the scenarios.
• To compare utility scores provided by service users, carers, healthcare professionals and
interested members of the general public.
Materials and methods
Research design
The study used a cross-sectional online survey to collect quantitative and qualitative data.
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 3 / 18
Population, sampling and data collection
Mental health service users, carers, healthcare professionals and interested members of the
public were invited to take part in the survey. Service users and carers were recruited via a link
on the national Rethink Mental Illness charity website (www.rethink.org). The survey was pro-
moted by snowball emails and social networking sites Twitter and Facebook. Participants were
self-selecting and indicated which participant group they identified with. The electronic survey
was open for recruitment from March 2015 to July 2015. The study was conducted in the UK
but did not preclude participation from other countries.
Ethical approval
Ethical approval was granted by the School of Health Sciences’ Research Ethics Committee
City, University of London.
Instrument design
The survey was designed collaboratively with members of the Service User and Carer Group
Advising on Research (SUGAR) [23] and research academics at City, University of London.
The SUGAR group has 13 service users with lived experience of mental illness and three carers
and meets monthly to advise on research projects within the Centre for Mental Health
Research and East London NHS Foundation Trust. The original study design was presented at
the January 2013 SUGAR group meeting. Members were invited to become involved in the
study, to ask questions about the research and discuss how the research study could proceed.
Members of the SUGAR group agreed to work collaboratively on the study and contribute to
the design of instruments. Instrument design occurred in two stages: 1) developing the mental
health state scenarios; and, 2) designing the survey questions.
Stage 1: Development of the mental health state scenarios. The SUGAR group helped
write several short fictional scenarios describing the presentation and experience of specific
mental health conditions. Members worked in groups of two or three. We offered guidance to
the group by prompting members with questions such as ‘how would someone describe living
with that condition?’ ‘What would impact on their condition?’ Once complete, each scenario
was presented to the wider group for feedback. The scenarios went through several iterations
through group discussions over three months. An example hypothetical health state and its
description is given in Box 1.
A total of ten scenarios, focusing on different mental health states, were developed. The
final survey used five scenarios chosen by the SUGAR group. The example in Box 1 focusing
on psychosis, and another four scenarios on medication side-effects, self-harm, eating disor-
ders and severe depression were used (see S1 Table).
Stage 2: Development of the survey questions. Survey questions were designed using SG
and TTO methods for scoring the five health states. The survey questions were designed by the
authors and reviewed by the SUGAR group to ensure that they were comprehensible.
Box 1: Psychosis health state scenario
‘Joseph lives alone and is scared that people are out to kill him and says that these people are going to bomb his house. His neighbours also want him out because of what they see as strange behaviour on his part, his general oddity and the fact that he talks to himself. Joseph hears voices which reinforce his fears.’
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 4 / 18
The survey was housed on SmartSurvey (www.smartsurvey.co.uk) and included 29 questions
(some with several parts). We estimated it would take 20–25 minutes to complete. The first part
asked respondents to read the information sheet and consent to the study. Participants could
withdraw from the study at any stage by simply clicking out of the survey. For descriptive pur-
poses socio-demographic information (i.e., age, gender, ethnicity, level of education and marital
status) was requested and is summarised in Table 1. The main body of the survey included the
five scenarios with questions linked to each of the scenarios to assess health utilities using RS,
SG and TTO methods. After completing the three utility measures for all five scenarios, respon-
dents were asked questions about the acceptability of the measures. Ten-point Likert rating
scales assessed the perceived difficulty of each method. Preference for each method was reported
alongside free text response boxes so that participants could expand on their responses. A final
free-text response box at the end of the survey allowed for feedback on anything that may have
affected their response to the questions. The Checklist for Reporting Results of Internet E-Sur-
veys (CHERRIES) [24] was used to inform the development of the survey.
Utility measures
All the utility methods generate a score from 0–1 (0: worst possible health state– 1: best possible health state). The methods of eliciting utility scores for each measure are described below:
Rating scale questions. For each scenario participants were asked to score the health state
from 0–10 with lower values representing more negative appraisals of the health state (0: worst imaginable health state– 10: the best possible health state). In order to obtain a RS utility measure the responses given by the participant was divided by 10 to produce values between the ranges of 0–1.
Time trade-off questions. For each scenario respondents were asked to imagine making
a choice between spending the next ten years of life in the health state described (e.g., psycho-
sis), or ’trading’ some years of life to be completely free of symptoms for the rest of their life.
They were then asked to indicate the maximum number of years of their life they would be
willing to trade to have complete wellness. To help with comprehension the SUGAR group
suggested that some people might understand the term ’trade’ as swapping, surrendering or
sacrificing, this was incorporated into the description.
The choice of how many years to trade was offered incrementally, one year at a time (to a maxi-
mum of 10). A choice was required for each year to identify the point of indifference which was
reached when the participant could no longer choose. The utility for the health state was calculated
from the proportion of years traded at the point of indifference. For example, if someone trades 4
out of a possible 10 years of life to achieve full health, then the utility they ascribe to the health state
would be 0.6 (Utility = 1 –(years traded at the point of indifference/total possible years to trade)).
Standard gamble (SG) questions. For each scenario respondents could choose to remain
in the health state (e.g., psychosis) for the rest of their lives or take a gamble in which there was
a specified risk of dying but, if they did not die, they would be fully healthy. As part of this pro-
cess respondents were asked to score the maximum risk of death they would take in exchange
for guaranteed full health until a point of indifference. For example, if respondents find it hard
to choose whether or not they would risk a 10% chance of death for a 90% chance of full health,
then their utility for that state is 0.9. If they are indifferent when there is a 90% chance of death
and 10% chance of full health, then their utility for that health state is 0.1.
Analysis
Quantitative analysis
Utility scores were calculated in Microsoft Excel, imported into SPSS version 21 [25] and
checked and cleaned by two researchers. Descriptive statistics (e.g., means + standard
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 5 / 18
Table 1. Participant socio-demographic data.
Service
Users
Carers Interested member of the
Public
Healthcare
Professionals
Service Users & Health Care
Professionals
(n = 46) (n = 6) (n = 31) (n = 28) (n = 5)
Age in years: mean (s.d.) 32 (12) 49
(17)
36 (11) 39 (12) 40 (18)
Range 17–62 19–72 18–59 21–57 29–67
Gender:
Female 34 (74) 5 (83) 29 (93) 19 (68) 100 (5)
Male 12 (26) 1 (17) 2 (7) 9 (32)
Country
England 42 (91) 5 (83) 30 (97) 25 (89) 5 (100)
Wales 2 (4)
Scotland 1 (2) 1 (17) 1 (4)
Other country 1 (2) 1 (3) 2 (7)
Ethnicity:
English 30 (65) 4 (67) 18 (58) 17 (61) 3 (60)
Other British 7 (15) 1 (17) 4 (13) 2 (7) 1 (20)
Other White 2 (6) 1 (4)
Asian 4 (13) 3 (14)
Irish 2 (4) 1 (4)
African 1 (3) 3 (11)
Black/British 1 (2) 1 (17)
Black/Caribbean 1 (1)
Other ethnic group 3 (6) 1 (3)
Relationship Status:
Never married/formed a civil partnership 27 (59) 11 (35) 10 (36) 3 (60)
Married/in civil partnership 9 (20) 5 (81) 9 (29) 7 (25)
Cohabiting 7 (15) 1 (17) 3 (10) 9 (32) 2 (40)
Divorced/Separated 2 (4) 5 (16) 2 (7)
Widowed 1 (2) 3 (10)
Work Status:
in paid employment 25 (48) 3 (50) 17 (55) 22 (79) 3 (60)
temporarily off sick 4 (8)
Unemployed 2 (4) 1 (17) 3 (10)
Retired 1 (17) 2 (6)
looking after the family, home or dependents 2 (6)
Unable to work because of Long term disability
or ill health
6 (13)
In full time education or training 9 (20) 1 (17) 6 (19) 6 (21) 2 (40)
Other 3 (6) 1 (3)
Qualification:
Higher degree 17 (37) 2 (33) 14 (45) 17 (61) 1 (20)
Degree/degree level 10 (22) 12 (39) 10 (36)
Other higher education below degree 4 (9) 1 (3) 1 (4) 3 (60)
A-levels/similar 9 (17) 1 (17) 2 (6) 1 (20)
GCSE/O-level/similar 5 (11) 2 (33) 2 (6)
Trade Apprenticeships 1 (2)
No Qualifications 1 (2) 1 (17)
All data presented as N and (%) unless stated otherwise. Missing data for Age, N = 1, Service user and Health care professional
https://doi.org/10.1371/journal.pone.0205223.t001
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 6 / 18
deviations; frequencies; percentages) were used to summarise the sample characteristics and
the outcome measures (utility scores) using three different methods (RS, TTO, SG) for five
health states, across three participant groups (SU, HCP, MoP). These comparisons enable an
evaluation of discriminatory power. Discriminatory power is a function of three factors: the
description of the health state, the utility method, and the evaluative abilities of participants.
Discriminatory power is present when health states that would be expected to be scored differ-
ently are scored differently. Observing discriminatory power therefore implies that no influen-
tial biases are present (e.g., floor or ceiling effects, central tendancy bias). Conversely, a lack of
discriminatory power across all participant groups raises questions about the health state
descriptions and/or the utility method, whereas a lack of discriminatory power in only some
participant groups suggests a lack of evaluative abilities in those groups.
Understanding, acceptability and preference. Understanding, acceptability and prefer-
ence for the three utility scoring methods across the different participant groups was assessed
using (a) the proportion of successfully completed surveys, (b) a perceived difficulty Likert
scale, (c) the reported preference for the utility measures, and (d) the number of zero traders
and maximal traders.
Statistical analysis. Inferential statistics were based on three groups (n = 105): service
users (SUs; N = 46), healthcare professionals (HCPs; N = 28) and interested members of the
public (MoPs; N = 31) because there were insufficient participants in other groups (Carers;
N = 6, Service users and healthcare professionals; N = 5). A two-way mixedanalysis of variance
(ANOVA) was conducted to determine whether there were significant differences in the way
the three groups (SUs, HCPs, MoPs) scored the five health states using the three types of utility
measure (RS, TTO or SG). Due to multiple testing, the level of significance (α-level) was reduced to 0.01. Tukey post-hoc tests were conducted to ascertain differences in scoring for
the different scenarios, utility measures and participant groups.
One-way ANOVAs were conducted to explore differences in the perceived difficulty of
each utility measure and the percentage preference scores, across groups.
Analysis of zero-traders and maximal traders. Zero-traders, respondents who did not
trade any years of life for improved health (TTO) or gamble at any % risk of death (SG), and
maximal-traders, respondents who traded the maximum amount of time (10 years) or
accepted the maximum amount of risk to live in perfect health, were identified.
Qualitative data
The free text boxes enabled participants to provide qualitative information about factors that
may have influenced their responses and their preference of utility measure. A basic thematic
analysis was undertaken [26] line-by-line using constant comparisons. Identified themes were
independently checked by two researchers and disagreements resolved by a third reviewer.
Results
Sample
During the four month recruitment period 204 people accessed the survey: 85 were partially
completed and 119 (58%) fully completed. The mean time to complete the survey was 14 mins
(range from 4 mins to 120 mins), with 75% of respondents completing the survey within 9 and
57 minutes. Eight participants returned to the survey and time of completion could not be
obtained. Participant characteristics are given in Table 1. Of the 119 complete responders, 46
identified as service users; 6 were carers; 31 were interested members of the public; and 28
were mental health professionals. Five respondents described themselves as both a service user
and a healthcare professional. A further 3 participants that selected multiple identities were
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 7 / 18
excluded, leaving 116 in the descriptive data. Participants were between 17 and 72 years old.
More females completed the survey than males, ranging from 68%-100% across participant
groups. The majority of the respondents were based in England and a large percentage (73%)
identified themselves as English or “other British”. A high proportion had University-level
education, with 22% reporting having a degree (e.g., BA, BSc) and 37% a higher degree (e.g.,
MSc, PhD).
Non-completion of the survey. The surveys that were started but not completed (N = 85)
were not included in the analysis however we provide some further detail here. Seventy nine
completed the first stage of the survey allowing us to view the socio-demographics. The major-
ity left the survey after completing the first scenario questions. The demographics of the partic-
ipants completing the survey were similar to those who did not. The average age of non-
completers was 36, 45/79 were female, 44/79 had a higher degree and 24/79 were service users.
Comparative utility scores. Utility scores ranged from zero to one. Comparative mean
scores for the utility measures and participant groups are provided in Table 2. Similar patterns
of scoring were observed across participant groups and the SG consistently scored higher
(indicating a better health state) than the RS in all five scenarios, and for some health states
more than TTO.
There were no significant interactions between utility measure and the participant group
for any scenario. There was a substantial main effect of utility measure on utility score in all
scenarios, suggesting that different utility measures produce different scores. Table 2 summa-
rises the descriptive and inferential statistics. There were no significant differences in how par-
ticipant groups scored four scenarios (psychosis, side-effects, self-harm and eating disorders).
There was a significant main effect of participant group in the depression scenario, F (2, 102) =
4.80, p = 0.01, partial eta squared = 0.086, suggesting that there was a difference in the way that
service users, healthcare professionals and interested members of the public scored this sce-
nario. Tukey post-hoc tests suggested that service users gave depression a lower utility score
(RS: 0.30; TTO: 0.31; SG: 0.50) (perceived it as worse to live with) than healthcare professionals
(RS: 0.33; TTO: 0.49; SG: 0.70), p = 0.036 and interested members of the public (RS: 0.36;
TTO: 0.48; SG: 0.64), p = 0.025 (post hoc tests non-significant at reduced α level <0.01). Ranking the scenarios. There was considerable consistency in how the scenarios were
ranked and the type of utility measure used (see Table 3). Similarly, there was consistency
across the participant groups’ mean ranking of health states.
Across all groups and utilty measures, psychosis scored as the worst health state to live with
while medication side-effects were viewed most positively. MoPs and HCPs scored depression
as the second worst health state across all utility measures; SUs ranked depression equal to psy-
chosis using the RS and SG. Eating disorders and self-harm were mid-ranked across all groups
and utility measures.
Acceptability of the utility measures
Perceived difficulty in completing the questions. Participants were asked to measure on a
Likert scale how hard they thought it was to complete the questions. A score of zero referred to
‘not difficult at all’ and a score of 10 represented ‘very difficult’. A one-way ANOVA revealed no
significant differences in the perceived difficulty of the utility methods between SUs (mean =
5.35 (SD = 2.87), MoP (6.32, 2.86) and HCPs (6.75, 2.44) (F (2, 102) = 2.53, p = 0.085).
Further qualitative detail about the perceived difficulty in completing the questions was
derived from the free-text responses from 38 participants (12 SUs, 1 Carer (C), 14 HCPs, 7
MoP, and 4 people identifying with two or more of the population categories). These were col-
lated into five core themes, a summary is provided in Table 4 with illustrative quotes.
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 8 / 18
Utility measure preference. The majority of participants (N = 72, 60%) found the TTO
measure easier to complete than the SG (N = 47, 40%). Exploring differences across participant
groups revealed that preference for the TTO was held by SUs(63%, N = 29), MoP(58%,
N = 18), HCPs(61%, N = 17) and 3 out of 5 (60%) people identifying as both SUs andHCPs.
There were 22 free-text responses about how the preferred choice was made. Ten were
from SUs, six fromMoP, four from HCPs, and one each from a carer and a SU who was also
aHCP. Nine respondents indicated that neither TTO nor SG was easier to complete, three
reported that SG was easier, and four thought that the TTO was easier and gave reasons for
these.
Table 2. Comparative utility scores between utility measures used and respondents.
Participant Groups
Utility Measures by Scenario
(Mean ± SD) Mental Health Service User Interested member of the public Healthcare
Professional
(n = 46) (n = 31) (n = 28)
Psychosis
RS 0.30 (0.17) 0.26 (0.22) 0.26 (0.16)
TTO 0.24 (0.33) 0.28 (0.29) 0.34 (0.33)
SG 0.50 (0.37) 0.50 (0.26) 0.52 (0.34)
ANOVA:
by Group F (2, 102) = 0.189, p = 0.83
by Utility F (1.95, 204) = 24.65, p<0.0001
Medication Side-effects
RS 0.47 (0.18) 0.53 (0.19) 0.53 (0.15)
TTO 0.56 (0.36) 0.65 (0.31) 0.68 (0.33)
SG 0.68 (0.37) 0.74 (0.33) 0.78 (0.33)
ANOVA:
by Group F (2, 102) = 1.67, p = 0.19
by Utility F (2, 204) = 22.01, p<0.0001
Self-Harm
RS 0.38 (0.20) 0.36 (0.19) 0.39 (0.20)
TTO 0.47 (0.39) 0.53 (0.30) 0.58 (0.31)
SG 0.61 (0.36) 0.67 (0.32) 0.75 (0.29)
ANOVA:
by Group F (2, 102) = 1.32, p = 0.27
by Utility F (2, 204) = 41.05, p<0.0001
Eating disorders
RS 0.35 (0.22) 0.35 (0.20) 0.41 (0.19)
TTO 0.42 (0.34) 0.56 (0.33) 0.63 (0.31)
SG 0.60 (0.34) 0.69 (0.31) 0.75 (0.28)
ANOVA:
by Group F (2, 102) = 3.65, p = 0.03
by Utility F (2, 204) = 46.85, p<0.0001
Depression
RS 0.30 (0.21) 0.36 (0.23) 0.33 (0.18)
TTO 0.31 (0.35) 0.48 (0.30) 0.49 (0.34)
SG 0.50 (0.36) 0.64 (0.29) 0.70 (0.25)
ANOVA:
by Group
F (2, 102) = 4.80, p = 0.01
by Utility F (2, 204) = 35.67, p<0.0001
https://doi.org/10.1371/journal.pone.0205223.t002
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 9 / 18
Participants who preferred the TTO measure (N = 4) provided responses that fell into two
main categories:
• Lack of clarity (of SG): TTO was easier to understand than the SG because “[SG was] confus- ingly worded” and “a bit too arbitrary”.
• Personal meaning: “Would rather have quality of life over duration” [service user and
healthcare professional]. A service user expressed that the TTO was easier to relate to for
them “Because I know how much time in my life has been lost being ill”.
Participants who preferred the SG (N = 3) provided responses that could be grouped into
two key areas of concern.
• Uncertainty: not knowing the length of life years that they had left: ‘[finding] Balance between trading time for wellness is difficult to assess given none of us know how long we will live. Also is effected by age. The percentage risk is more immediate’ [SU].
• Complexity: A service user thought that the wording in the TTO was more difficult “I couldn’t figure out whether I would spend 10 years unwell and then be okay for the rest of my life or 10 years and then die straight away.” A mental health professional also stated that the SG was easier because they were “short questions and easy to select the percentage’.
Analysis of zero-traders and maximal-traders. Zero-traders are participants who want
the maximum length of life at whatever cost to quality of life. Maximal-traders want the maxi-
mum quality of life at whatever expense to length of life. There were zero-traders and maxi-
mal-traders in both of the utility scoring methods, these will be presented in turn.
Table 3. Health state scenarios ranked according to valuation by method and participant.
Mental Health Service Users (N = 46)
Rank Rating Scale Time Trade-Off Standard Gamble
1 Psychosis / Depression Psychosis Psychosis / Depression
2 Eating Disorders Depression Eating Disorders
3 Self-harm Eating Disorders Self-Harm
4 Medication side-effects Self-Harm Medication side-effects
5 Medication side-effects
Interested members of the public (N = 31)
Rank Rating Scale Time Trade-Off Standard Gamble
1 Psychosis Psychosis Psychosis
2 Eating Disorders Depression Depression
3 Self-Harm / Depression Self-Harm Self-Harm
4 Medication side-effects Eating Disorders Eating Disorders
5 Medication side-effects Medication side-effects
Healthcare Professionals (N = 28)
Rank Rating Scale Time Trade-Off Standard Gamble
1 Psychosis Psychosis Psychosis
2 Depression Depression Depression
3 Self-Harm Self-Harm Self-harm / Eating Disorders
4 Eating Disorders Eating Disorders Medication side-effects
5 Medication side-effects Medication side-effects
Rank: 1 = Worst Health State
https://doi.org/10.1371/journal.pone.0205223.t003
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 10 / 18
Time trade-off. Five participants were zero-traders across all scenarios using the TTO
method (2 service user, 2 healthcare professionals and 1 member of the public). Overall there
were 70 incidents of zero trading (12% of all responses) across the scenarios and participants.
The highest incidence of zero traders was for the medication side-effects scenario with 24 par-
ticipants (20%) choosing not to trade years. This contrasts with only 6 participants (5%) choos-
ing not to trade years in the psychosis scenario.
Seven participants who were maximal traders across all scenarios using the TTO method (4
service users, 1 carer, 2 healthcare professionals and 1 member of the public). There were 152
incidents of maximal trading (26% of all responses) across the scenarios. The highest incidence
of maximal traders was in the psychosis scenario with 58 participants (49%) choosing to trade
Table 4. Difficulty with scoring health utilities: Themes and illustrative quotes.
Moral and emotional reactions
Participants referred to how they reflected on their choices and spoke about emotional reactions to the questions
and moral dilemmas that they felt when completing the valuations.
• ‘frustrated as I couldn't explain my choices’ [C & HCP] •‘The questions which raised isolation as a factor made me more likely to trade years’ [HCP] • Another spoke about feeling ‘despair’ [C] • ‘I felt guilty rating things as less important as it seemed like I was belittling the condition’ [P] • ‘questions difficult in a moral sense’ [HCP]
Relevance to own experience
Some participants argued that lived experience could be advantageous in answering the questions. Concerns were
raised about difficulties in imagining what it would be like to live with some health state. This was acknowledged by
SUs and HCPs.
• Could relate more to own experience so rated them worse, which makes my answers subjective, rather than objective’ [SU]
• ‘Difficult to understand what those symptoms really feel like and be able to accurate make a judgment as to what you would do’ [HCP]
• ‘As only one section was even vaguely relevant to something I had experienced, I did not feel competent to make an assumption of what it would be like to experience most of the states described’ [SU]
• ‘finding it hard, to imagine being in the described situations’ [HCP] Standard Gamble Confusing
Several people found the wording in the risk question difficult
• ‘percentage questions confusing’ [HCP] • ‘finding the ‘risk % section quite hard to understand’[P] • ‘Risk % section quite hard to understand’ [SU]
Instructions unclear/ambiguous
Several responses were received around the wording and difficulty with interpreting what was expected when
completing the valuations.
• ‘I found the wording & the concept of the questions confusing’ [HCP]
• Another mentioned ‘not really understanding your instructions. It felt very abstract.’ [P] • ‘Instructions were complex’ [P] ‘Too complicated’ [P]
• ‘Instructions were not clear’ [HCP]
Conceptually challenging/uncertainty over choices
Some respondents found the methods conceptually challenging and making valuations philosophically difficult.
Some references were made to concerted efforts in thinking through the responses and making judgements.
• ‘This is not a questionnaire that could easily be completed by a lay person who does not have research training’ [P] • ‘hard to be consistent across questions’ [P]
• One respondent stated it was ‘hard to make a judgement between trading the end years of your life with the likelihood of dying by suicide in the next ten years’ [SU]
• ‘Life & death decisions are hard and not very realistic’ [SU]
https://doi.org/10.1371/journal.pone.0205223.t004
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 11 / 18
the maximum number of years (10 years) to live in a better health state. The lowest incidence
of maximal trading was for medication side effects with 16 people (13%) choosing to trade 10
years to live without side effects.
Standard gamble. Three participants were zero-traders across all scenarios using the SG
method and all were service users. Overall there were 69 incidents of zero trading (12% of all
responses) across the scenarios and participants. The highest incidence of zero traders was for
the medication side-effects scenario with 26 participants (22%) choosing not to accept any %
risk of death for a better health state. This contrasts with only 6 participants (5%) choosing not
to risk death in the psychosis scenario.
Six participants were maximal traders across the scenarios using the SG method (3 service
users, 1 carer and 2 members of the public). Overall there were 62 incidents of maximal trad-
ing (10%) across the scenarios. The psychosis scenario had the highest incidence of maximal
traders (17/119 (14%)), while medication side effects had the lowest incidence of maximal
trading (11/119 (9%)).
Did anything else affect participants’ responses?. We received 66 participant responses
to the open-ended question asking if anything had affected their responses. The majority of
the responses (N = 46, 70%) related to personal experiences of mental illness and identifying
with the person in the scenario. Twenty four of the responses about personal experience were
from service users, with seven from HCPs, six fromMoP, three from carers, and five responses
from people identifying with two or more of the participant type categories.
“Probably those ones that I can 'feel' the pain of relative to those ones that I have to imagine— I'm probably more willing to trade years on things that I can remember feeling.” [SU]
“Partner suffers from psychosis and I have seen this suffering straight on” [C]
“I have experienced severe depression myself & have also worked with people with the rest of the diagnoses discussed.” [HCP]
In contrast, others reflected on their lack of personal lived experience of mental illness and
how that bought challenges in completing valuations on the health states.
“Not having first-hand experience and therefore having to rely on impressions of my personal- ity to consider what my actions might be” [MoP]
Several respondents referred to their emotional state at that moment (N = 7), mentioning
feelings of sadness, tiredness and social isolation. Whilst two people reflected on the complex-
ity of the scenarios and others on challenges with moral decisions and two service user partici-
pants referred to negative images or stigma of mental health conditions.
Discussion
In this study, we sought to collaborate with service users to co-produce descriptions of mental
health states from which to generate utility scores and frame utility questions so that they are
comprehensible to service users. Another aim was to determine the feasibility of using differ-
ent utility methods via an online questionnaire. We compared utility scores provided by ser-
vice users, healthcare professionals, members of the public, and carers (descriptively). The
acceptability of the co-produced health states and the different utility methods to determine
health utility was also examined.
The results indicated that:
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 12 / 18
1. Mental health state scenarios used to generate utility scores can be co-produced and are
well received by a broad range of participants using an online survey.
2. Standard techniques used to elicit utility valuations (SG; TTO and RS) were feasible for use
with service users, carers, healthcare professionals and members of the general public.
3. Similar trends were seen in utility scores elicited by the different utility methods across all
participant groups. For a given health sceanrio, the SG was generally scored higher (indi-
cated a more preferred health state) compared the TTO and RS. Some differences between
participant groups emerged in the scenario on depression.
4. Participants ranked the scenarios comparably demonstrating equivalence in discrimination
and weighting of the scenarios.
5. The TTO was preferred over the SG.
Searching the literature we were unable to locate previous examples where mental health
state scenarios were co-produced with service users and carers for use within an e-survey.
In line with previous research [8], we found significant differences between the utility
scores when using different types of utility measure within each scenario. Similar patterns to
those found in other studies were identified, with respondents scoring the highest utility when
using the SG and lowest utility when using the RS methodology [27].
Of particular interest, service users gave a lower utility value (indicating a less preferred
health state) for the depression scenario than healthcare professionals and interested members
of the public. Isacson et al. (2005) found that people with depression rated their health state
utilities significantly lower than those without [28]. The literature to date suggests that “well-
informed” respondents (i.e., people who have experienced the condition) may score the sce-
nario as less threatening and therefore give a higher utility score than respondents who did not
share that experience. This is the converse of what is seen in this data, and therefore does not
fit with theories such as the disability paradox [29] or the stress-appraisal-coping paradigm
[30]. It is important to acknowledge that we do not know which of our respondents had expe-
rienced depression and therefore it is unclear whether these findings are due to direct experi-
ences or knowledge relating to a hypothetical health state. Stiggelbout (2008) provides a
thorough review on how scenarios are interpreted and judgements are made by people with
lived experience and those naïve to the lived experience during the process of scoring utilities [4]. Of particular interest to the field of mental health is the focus of the illness in the person’s
life and their constructed meaning. One study showed a recovery-focused approach to inter-
preting the illness where people with the human immodeficiency virus (HIV) reframed living
with the illness positively by focusing on how HIV fit in with the broader context of their life
rather than purely focusing on the impact on their health [31].
Ranking
With regard to how the scenarios were ranked, there was consistency across participant groups
in ranking the psychosis scenario as the most undesirable scenario to live with. This was irre-
spective of utility measure used and it may have implications for service users prioritising
treatments that could maximize preferences or health gain. Of course, prioritisation will
depend on the estimated gain from any actual intervention.
Acceptability
Completion of the survey. In this pilot study a high proportion of service users, health-
care professionals and members of the public successfully completed the utility scores for five
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 13 / 18
described healthcare scenarios. There were no substantial differences demographically
between participants who completed the survey and those that did not.
Difficulty. In terms of difficulty in completing the online survey, we found that there was
a suggestion that healthcare professionals and members of the general public perceive the util-
ity measures as more difficult than service users. However there were no significant between-
group differences. Arguably this demonstrates that mental health service users are just as capa-
ble of scoring utility scenarios as are members of the public and healthcare professionals. How-
ever, this interpretation should be treated cautiously as the qualitative data suggests that there
is some difficulties with the SG and TTO utility methods for all participant groups. Partici-
pants found the scenarios and scoring mechanisms difficult to understand and were uncertain
over how to score the scenarios overall. Some respondents also had concerns around accepting
the philosophical notions of trading, ‘giving up life years’ or ‘risking’, indicating that face valid-
ity within the scenarios remains a challenge. Consistent across the groups was a preference for
scoring the TTO. Participants found the TTO easier to understand as they were able to relate
to losing years of life more readily than accepting an increased risk of death (SG).The literature
also suggests that the TTO is preferred by some for the relative ease of use compared to the SG
and has been reported as consistent with individual preferences [9, 12] and the most frequently
used method [14].
Zero-traders. Only 5% of participants using both TTO and the SG for psychosis refused
to trade. Zero-traders were most prevalent in the scenario for medication side effects with 20%
refusing to trade time (TTO) and 22% unwilling to gamble on an increased risk of death (SG).
This may be a function of participants accepting the side effects as a necessary albeit an unde-
sirable aspect of treatment.
Limitations. Some participants started the survey but did not complete it (non-comple-
ters). The reason for this is unknown, although feedback from other respondents suggest it
may have been due to the format of the survey and complexity of questions.
Recruitment was voluntary using an internet link, some degree of self-selection bias is likely
and probably resulted in a less representative sample. Table 1 indicates that the sample of ser-
vice users is unusually well-educated, with 37% having a higher degree and another 22% a
degree. This is higher than the average in the UK, where 34.4% of the population is estimated
to have achieved a degree-level qualification or above [32]. Given the nature of the research it
may not be surprising that the sample is relatively well educated, and does limit the generaliz-
ability of the findings. The use of online surveys can also pose a challenge for people who do
not have access to a computer and this may have had an impact on recruitment. However it is
difficult to estimate the true impact of any potential selection bias when data on non-partici-
pants is unavailable [33].
Respondents identified themselves as being healthcare professionals, members of the pub-
lic, service users or carers, responses which cannot be verified by the researchers. Additionally
for many participants these categories are not exclusive and there will be overlap with people
identifying with more than one category. For those who identified as service users we have no
information about their clinical condition (e.g. diagnosis, severity, duration of time living with
the condition) and therefore associations with scoring disease specific-scenarios was not possi-
ble [17]. Because we did not use quality of life measures alongside the utility measures conver-
gent validity could not be assessed.
In this study we did not control for the order effects of scenario presentation and the poten-
tial that scores were moderated by anchoring.
There are also unresolved questions about how to measure health. Our measures informed
by our co-produced scenarios also included an element of social participation. This is an
important consideration when proposing to measure mental health with people whose
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 14 / 18
condition, recovery [34] and quality of life is affected by broader social considerations such as
housing or employment experiences and interventions. One of our scenarios included the
description of neighbours’ perceptions of the person with the illness, which may reflect real
issues around relating to inclusiveness, stigma and even the reality of experiencing paranoia,
but here there is also a danger of stretching the concept of social participation.
It may be argued that these types of analyses lend themselves more to moderate disorders, the
treatment for which is typically designed to ameliorate symtoms as part of improving mental
health. With a recovery model in prominence [34], symptom control may not always be the sole
concern for severe and enduring disorders such as schizophrenia, where many interventions
would seek to target quality of life much more broadly (including housing, employment and other
measures of recovery and social participation). Biases in the direction of understating the benefits
of these factors on the quality of life of individuals could arise and this may be a further limitation.
In addition, some conditions are not susceptible to adaptation, and they interrupt daily life
almost continually. By their very nature they draw attention to themselves (one cannot just
think about something else most of the time); for example, the pre-occupative nature of
depression or chronic pain. With this in mind, service users may give depression a lower utility
score (i.e. less preferred health state) than other groups, with a risk thereafter for utility weights
to be given that are too high. Additionally every preference elicitation question, by their
nature, focuses our attention on something, and so we will generally be led to overstate the rel-
ative importance to our lives of the things that we are asked to focus on [35]. For equal consid-
eration is the evidence that suggests that the strength of preference may also be a poor guide to
the intensity of experience [36–38] and a propensity for us to exaggerate the extent to which
we will attend to the state being valued (Dolan and Kahneman, 2008), with us all being mem-
bers of the ‘public’ and ‘patients’ and therefore susceptible to exaggeration [35].
Dolan et al. (2010) also point out, trade-off responses themselves are related to the fre-
quency and intensity of negative thoughts about health in ways that may not have been previ-
ously well captured by any of the proposed valuation methods [35]. Different values may also
capture “experience” rather than “preference.” Dolan and Kahneman (2008) cite Smith et al’s
(2006) work with the following example; a patient with a colostomy thinks they are happy with
the colostomy, and expects to be happy again without it. However, when it is removed they
remember their previous state (of having the colostomy) as being unacceptable and, in terms
of preferences, they report that they would be willing to pay a great deal, including life-years,
to get rid of that state [39], a reflection of the extremely negative prior experience.
Lastly it is worth considering in the treatment of mental health preferences the potential for
‘cognitive denial’, where patients may find it difficult to admit how poor their health really is,
or ‘suppressed recognition of full health’ where patients cease to realize what full health may be
like and a have ‘lowered expectation’ overall. [40].
Future development of utility measures. Future research of this type may provide a
more rigourous assessment of how health is being conceptualized in the development of such
scenarios, while finding ways of helping to create scenarios and scoring mechanisms that are
less complex. Possibly a greater challenge will be to create scenarios that do not lead to philo-
sophical objections.
Nord et al. [41] discuss the use of QALYs in terms of ex ante and ex post. Ex ante is the
more traditional approach and refers to health utility judgements made by the general public
from behind a ‘veil of ignorance’. There is merit in the ex post approach which refers to the uti-
lization of direct experience of the health state as “experienced utility”. The participants within
this study are a combination of both ex ante and ex post participants. It may be beneficial to
identify previous health experiences in respondents but conversely it may influence willingness
to participate if scenarios are felt less ‘hypothetical’.
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 15 / 18
Conclusion
This study involved service users and reports the initial steps towards developing and embrac-
ing a process of research co-production in a complex field [42, 43]. Additional studies involv-
ing service users in utility measurement are needed in the attempt to promote sensitive
measurement design, increase instrument validity, study feasibility and the acceptability of the
measures. Future studies may aim to build on more extensive involvement by developing
knowledge and understanding to include service users in the analysis of data and interpreta-
tion of results [44].
Traditionally there have been wide variations in the utility values reported contributing to
an overall lack of clarity in reporting methods used to elicit the utility values [45]. This study
offers data to compare different valuation methods in order to help assess their feasibility
whilst at the same time transparently reporting the methods and some of the difficulties and
limitations of our approach. It adds to the limited qualitative evidence reported alongside util-
ity scores for a range of health states and offers insights into factors that influenced respon-
dents’ decisions, the relative difficulty of and preferences for measures used. This will help
inform our future research and that of others to better prepare such utility design in the future.
Supporting information
S1 Table. Available as information file for the five scenarios used in the utility measure-
ments and offered to participants in the online questionnaire.
(DOCX)
Acknowledgments
We thank the participants of this study for taking the time to make contributions and Rethink
Mental Illness for supporting the promotion of the study. We particularly thank members of
the SUGAR group for shaping this research. Membership of SUGAR at the time of this study
is listed below, with their lead author and contact person:
Nelly Adongakulu, Mike Ahern, Mary Amuda, Mandy Bannister, Claudette Brandon,
Fadeke Coker, June Hanshaw, Jay Hudson, Richard Humm, Jagadish Jha, Marybel Moore,
Isaac Samuels, Jean Taylor.
All SUGAR members are Honorary Research Fellows, Centre for Mental Health Research,
School of Health Sciences. Lead author for the group and contact email address: Richard
Humm [email protected].
Author Contributions
Conceptualization: Sally Barlow, Amanda Burls, Stefano Brini.
Formal analysis: Chris Flood, Sally Barlow, Martin Cartwright.
Investigation: Chris Flood.
Methodology: Chris Flood, Sally Barlow, Amanda Burls, Amy Price.
Project administration: Alan Simpson.
Supervision: Chris Flood.
Writing – original draft: Chris Flood, Sally Barlow.
Writing – review & editing: Chris Flood, Sally Barlow, Alan Simpson, Amanda Burls, Amy
Price, Martin Cartwright, Stefano Brini.
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 16 / 18
References 1. Vos T, Barber RM, Bell B, Bertozzi-Villa A, Biryukov S, Bolliger I et al.: Global, regional, and national
incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in
188 countries, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. The
Lancet 2015, 386(9995):743–800.
2. Davis S: Annual Report of the Chief Medical Officer 2013: Public Mental Health Priorities-Investing in
the Evidence. London: Department of Health 2014.
3. Economic and social costs of mental health problems in 2009/10
4. Stiggelbout AM, Vogel-Voogt D: Health state utilities: a framework for studying the gap between the
imagined and the real. Value in Health 2008, 11(1):76–87. https://doi.org/10.1111/j.1524-4733.2007.
00216.x PMID: 18237362
5. Dolan P, Kahneman D: Interpretations of utility and their implications for the valuation of health. The
Economic Journal 2008, 118(525):215–234.
6. Gamelin A, Sastre MTM, Sorum PC, Mullet E: Eliciting utilities using functional methodology: People’s
disutilities for the adverse outcomes of cardiopulmonary resuscitation. Quality of life research 2006, 15
(3):429–439. https://doi.org/10.1007/s11136-005-2830-y PMID: 16547782
7. Doctor JN, Bleichrodt H, Lin HJ: Health utility bias: a systematic review and meta-analytic evaluation.
Medical Decision Making 2008.
8. Wee HL, Li SC, Xie F, Zhang XH, Luo N, Feeny D et al.: Validity, Feasibility and Acceptability of Time
Trade-Off and Standard Gamble Assessments in Health Valuation Studies: A Study in a Multiethnic
Asian Population in Singapore. Value in health 2008, 11(s1):S3–S10.
9. Bleichrodt H: A new explanation for the difference between time trade-off utilities and standard gamble
utilities. Health economics 2002, 11(5):447–456. https://doi.org/10.1002/hec.688 PMID: 12112493
10. Elkin EB, Cowen ME, Cahill D, Steffel M, Kattan MW: Preference assessment method affects decision-
analytic recommendations: a prostate cancer treatment example. Medical Decision Making 2004, 24
(5):504–510. https://doi.org/10.1177/0272989X04268954 PMID: 15358999
11. Bennett KJ, Torrance GW, Boyle MH, Guscott R: Cost-utility analysis in depression: the McSad utility
measure for depression health states. Psychiatric Services 2000.
12. Morimoto T, Fukui T: Utilities measured by rating scale, time trade-off, and standard gamble: review
and reference for health care professionals. Journal of Epidemiology 2002, 12(2):160–178. PMID:
12033527
13. Gold MR SJ, Russell LB,Weinstein MC.: Cost-Effectiveness in Health and Medicine. New York: Oxford
University Press; 1996.
14. Brazier J: Measuring and valuing mental health for use in economic evaluation. Journal of health ser-
vices research & policy 2008, 13(suppl 3):70–75.
15. König H-H, Günther OH, Angermeyer MC, Roick C: Utility Assessment in Patients with Mental Disor-
ders. Pharmacoeconomics 2009, 27(5):405–419. https://doi.org/10.2165/00019053-200927050-00005
PMID: 19586078
16. Flood C: Should" standard gamble" and‴time trade off" utility measurement be used more in mental health research? The journal of mental health policy and economics 2010, 13(2):65–72. PMID:
20919593
17. Voruganti LN, Awad AG, Oyewumi LK, Cortese L, Zirul S, Dhawan R: Assessing health utilities in
schizophrenia. Pharmacoeconomics 2000, 17(3):273–286. PMID: 10947302
18. Briggs A, Wild D, Lees M, Reaney M, Dursun S, Parry D et al.: Impact of schizophrenia and schizophre-
nia treatment-related adverse events on quality of life: direct utility elicitation. Health and quality of life
outcomes 2008, 6(1):1.
19. Lenert L, Morss S, Goldstein MK, Bergen M, Faustman W, Garber AM: Measurement of the validity of
utility elicitations performed by computerized interview. Medical care 1997, 35(9):915–920. PMID:
9298080
20. Revicki DA, Hanlon J, Martin S, Gyulai L, Ghaemi SN, Lynch F et al.: Patient-based utilities for bipolar
disorder-related health states. Journal of affective disorders 2005, 87(2):203–210.
21. Konig H-H: Measuring preferences of psychiatric patients-A review of the use of standard gamble, time
trade-off and contingent valuation in patients with depression or schizophrenia. Psychiatrische Praxis
2004, 31(3):118–127. https://doi.org/10.1055/s-2003-812598 PMID: 15042475
22. Papageorgiou K, Vermeulen KM, Leijten FR, Buskens E, Ranchor AV, Schroevers MJ: Valuation of
depression co-occurring with a somatic condition: feasibility of the time trade-off task. Health Expecta-
tions 2015, 18(6):3147–3159. https://doi.org/10.1111/hex.12303 PMID: 25393599
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 17 / 18
23. Simpson A, Jones J, Barlow S, Cox L: Adding SUGAR: service user and carer collaboration in mental
health nursing research. Journal of psychosocial nursing and mental health services 2013, 52(1):22–
30. https://doi.org/10.3928/02793695-20131126-04 PMID: 24305906
24. Eysenbach G: Improving the quality of Web surveys: the Checklist for Reporting Results of Internet E-
Surveys (CHERRIES). Journal of medical Internet research 2004, 6(3):e34. https://doi.org/10.2196/
jmir.6.3.e34 PMID: 15471760
25. Corp I: SPSS statistics for Windows. Version 21.0. In.: Author Armonk, NY; 2012.
26. Braun V, Clarke V: Using thematic analysis in psychology. Qualitative research in psychology 2006, 3
(2):77–101.
27. Arnold D, Girling A, Stevens A, Lilford R: Comparison of direct and indirect methods of estimating health
state utilities for resource allocation: review and empirical analysis. Bmj 2009, 339.
28. Isacson D, Bingefors K, von Knorring L: The impact of depression is unevenly distributed in the popula-
tion. European Psychiatry 2005, 20(3):205–212. https://doi.org/10.1016/j.eurpsy.2004.12.011 PMID:
15935418
29. Ubel PA, Loewenstein G, Schwarz N, Smith D: Misimagining the unimaginable: the disability paradox
and health care decision making. Health Psychology 2005, 24(4S):S57. https://doi.org/10.1037/0278-
6133.24.4.S57 PMID: 16045420
30. Lazarus RS, Folkman S: Stress, appraisal, and coping: Springer publishing company; 1984.
31. Tsevat J, Sherman SN, McElwee JA, Mandell KL, Simbartl LA, Sonnenberg FA et al.: The will to live
among HIV-infected patients. Annals of internal medicine 1999, 131(3):194–198. PMID: 10428736
32. Ball C: Most people in the UK do not go to university—and maybe never will. In: The Guardian. www.
theguardian.com/higher-education-network/blog/2013/jun/04/higher-education-participation-data-
analysis; 2013.
33. Bethlehem J: Selection bias in web surveys. International Statistical Review 2010, 78(2):161–188.
34. Anthony WA: Recovery from mental illness: The guiding vision of the mental health service system in
the 1990s. Psychosocial rehabilitation journal 1993, 16(4):11.
35. Dolan P: Thinking about it: thoughts about health and valuing QALYs. Health Economics 2011, 20
(12):1407–1416. https://doi.org/10.1002/hec.1679 PMID: 20967923
36. Schkade DA, Kahneman D: Does living in California make people happy? A focusing illusion in judg-
ments of life satisfaction. Psychological Science 1998, 9(5):340–346.
37. Wilson TD, Gilbert DT: Affective forecasting. Advances in experimental social psychology 2003, 35
(35):345–411.
38. De Wit GA, Busschbach JJ, De Charro FT: Sensitivity and perspective in the valuation of health status:
whose values count? Health economics 2000, 9(2):109–126. PMID: 10721013
39. Smith DM, Sherriff RL, Damschroder L, Loewenstein G, Ubel PA: Misremembering colostomies? For-
mer patients give lower utility ratings than do current patients. Health Psychology 2006, 25(6):688.
https://doi.org/10.1037/0278-6133.25.6.688 PMID: 17100497
40. Versteegh M, Brouwer W: Patient and general public preferences for health states: a call to reconsider
current guidelines. Social Science & Medicine 2016, 165:66–74.
41. Nord E, Daniels N, Kamlet M: QALYs: some challenges. Value in Health 2009, 12(s1):S10–S15.
42. Gillard S, Turner K, Lovell K, Norton K, Clarke T, Addicott R et al.: “Staying native”: coproduction in
mental health services research. International Journal of Public Sector Management 2010, 23(6):567–
577.
43. Clark M, Pinfold V, Szymczynska P, Hamilton S, Peacocke R, Dean S, et al. Co-production in mental
health research: reflections from the People Study. Mental Health Review Journal 2015, 20(4):220–
231.
44. Gillard S, Simons L, Turner K, Lucock M, Edwards C: Patient and public involvement in the coproduc-
tion of knowledge reflection on the analysis of qualitative data in a mental health study. Qualitative
Health Research 2012, 22(8):1126–1137. https://doi.org/10.1177/1049732312448541 PMID:
22673090
45. Mohiuddin S, Payne K: Utility Values for Adults with Unipolar Depression Systematic Review and Meta-
Analysis. Medical Decision Making 2014:0272989X14524990.
Service user involvement in scoring utility measures
PLOS ONE | https://doi.org/10.1371/journal.pone.0205223 October 23, 2018 18 / 18
Copyright of PLoS ONE is the property of Public Library of Science and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use.