Department of Clinical Epidemiology and Medical Technology Assessment, University Hospital Maastricht, Maastricht
Department of Medical, Clinical and Experimental Psychology, Maastricht University
Department of Medical, Clinical and Experimental Psychology, Maastricht University, and Department of Medical Psychology, University Hospital Maastricht, Maastricht
Department of Psychiatry, VU Medical Center/GGZ BuitenAmstel, Amsterdam
Department of Clinical and Health Psychology and Department of Psychiatry, Leiden University, Leiden
Department of Psychiatry, VU Medical Center/GGZ BuitenAmstel, Amsterdam
Department of Clinical and Health Psychology and Department of Psychiatry, Leiden University, Leiden
Department of Psychiatry, VU Medical Center/GGZ BuitenAmstel, Amsterdam
Department of Clinical Epidemiology and Medical Technology Assessment, University Hospital Maastricht, and Department of Health Organisation, Policy and Economics, Care and Public Health Research Institute, Maastricht University, Maastricht, The Netherlands
Correspondence: Thea van Asselt, University Hospital Maastricht, Department KEMTA, PO Box 5800, 6202 AZ Maastricht, The Netherlands. Email: avas{at}kemta.azm.nl
None. Funding detailed in Acknowledgements.
|
|
|---|
Schema-focused therapy (SFT) and transference-focused psychotherapy (TFP) for borderline personality disorder were recently compared in a randomised multicentre trial.
Aims
To assess the societal cost-effectiveness of SFT v. TFP in treating borderline personality disorder.
Method
Costs were assessed by interview. Health-related quality of life was measured using EQ–5D. Outcomes were costs per recovered patient (recovery assessed with the Borderline Personality Disorder Severity Index) and costs per quality-adjusted life-year (QALY).
Results
Mean 4-year bootstrapped costs were
37 826 for SFT and
46 795
for TFP (95% uncertainty interval for difference –21 775 to 3546); QALYs
were 2.15 for SFT and 2.27 for TFP (95% UI –0.51 to 0.28). The
percentages of patients who recovered were 52% and 29% respectively. The SFT
intervention was less costly and more effective than TFP (dominant), for
recovery; it saved
90 457 for one QALY loss.
Conclusions
Despite the initial slight disadvantage in QALYs, there is a high probability that compared with TFP, SFT is a cost-effective treatment for borderline personality disorder.
|
|
|---|
|
|
|---|
Resources used
At every assessment a structured cost interview was administered by an
independent research assistant. Formal registries such as hospital information
systems or insurer's databases are considered to be incomplete, since a
considerable amount of resource use is situated outside (mental) healthcare
institutions.5
Besides, individual patient data cannot be traced from registries. Therefore,
patient-reported prospective cost
diaries,6 or
retrospective cost
interviews,7 are the
preferred instruments covering all relevant events. We chose a 3-month recall
interview,8 since a
prospective cost diary was expected to lead to more missing items, given the
patient characteristics. From a societal perspective, the cost interview
covered work status and absence, sources of income, domestic activities,
informal care, medication use, alcohol and drugs, out-of-pocket expenses, and
consumption of healthcare and societal resources (including visits to general
practitioners, hospitals, psychiatrists and psychologists, crisis centres,
alternative healers, social work and drug rehabilitation centres). Informal
care is an important item in mental healthcare, since people with mental
illness can impose a heavy burden on their environment: those close to the
patient (family, friends or neighbours) take care of the patient and take over
domestic tasks. A key characteristic of informal care is that caregivers would
not want to care for someone outside their social environment for a similar
wage.9 Out-of-pocket
costs are actual expenses reported by the individual and are highly relevant
in people with borderline personality disorder. These costs relate to (for
instance) excessive smoking and shopping, binge eating and extremely high
telephone
bills.5
The cost interview (based on the cost diary described by Goossens et al)6 was judged by various experts and pilot tested by research assistants in patients, after which both face validity and content validity of the interview were improved. The questions constituting the cost interview were read to the participants, who indicated whether or not they had been absent from work, used a particular healthcare facility, etc. Only costs related to the person's personality disorder were considered relevant, because unrelated costs were not expected to differ between the treatment groups. Patients were explicitly asked whether they thought resource use was linked to problems arising from their personality disorder. In the rare cases in which resource use was probably disorder-related but the patient indicated that it was not, patients' judgements were overruled. For instance, if patients had regular contact with social services because of problems with raising their children and they rated this as not disorder-related, the contacts were still included in the analysis. Since for alcohol and drugs it is difficult to decide which part of the cost is related to the personality disorder, all alcohol and drug expenses were considered to be disorder-related.
The number of therapy sessions was recorded by the therapist. The number, duration and time (inside or outside office hours) of telephone contacts with the therapist were extracted from the standardised session form therapists filled out after each session. Training and supervision costs were not taken into account, because they are training costs and as such not part of the therapy.
Unit prices
Standardised Dutch unit prices were
used.10 When a
standardised unit price was not available, prices were based on tariffs.
Medication costs were obtained from the Dutch Pharmacotherapeutic
Compass.11
Productivity costs, incurred when patients are unable to perform paid work,
were calculated according to the human capital
approach,12 in
which the number of hours of absence from work was multiplied by the actual
gross wage per hour. For unpaid work and study, the number of hours absent was
multiplied by a shadow price. The shadow price was also applied to the
informal care. All unit prices were expressed in euros for the year 2000.
Outcome measures
Effectiveness was expressed as the proportion of recovered patients
according to the Borderline Personality Disorder Severity Index (BPDSI)
version
IV,3,13
a 70-item index with nine dimensions, representing the DSM–IV borderline
personality disorder criteria. Total scores range from 0 to 90. When the BPDSI
score was below 15 at follow-up, patients were considered to have recovered.
Additionally, effectiveness was expressed as quality-adjusted life-years
(QALYs). A QALY is a measure of life expectancy, weighted by the
health-related quality of life (HRQoL) represented by utility scores.
Health-related quality of life was measured with the EQ–5D, which
contains five dimensions: mobility, self-care, daily activities,
pain/discomfort, and
depression/anxiety.14
Each dimension is rated at three levels: no problem, some problems and major
problems. Based on preferences elicited from a general UK population,
EQ–5D health states can be converted into utility
scores.15 The
maximum possible number of QALYs within 4 years is 4 (4 years multiplied by 1,
the optimal health state).
Cost-effectiveness
Two incremental cost-effectiveness ratios (ICERs) were determined. These
ratios are calculated by dividing the difference in costs between the
treatments by the difference in effectiveness between the treatments, and
represents the extra amount of money that has to be invested or will be saved
to gain or lose one extra unit of effect. The first ICER was the cost per
recovered patient and the second ICER was the cost per QALY gained.
Data analysis
Costs and effects were discounted at
4%.10 Missing items
in otherwise completed assessments were imputed with SPSS (version 12 for
Windows) Missing Value Analysis, option regression. Completely missing
assessments because of patients who left the study or completed the treatment
before the 3-year limit were analysed by carrying the last observation
forward, in accordance with Giesen-Bloo et
al.3
For BPDSI recovery status results after 4 years, a logistic regression was performed with treatment group and baseline BPDSI score as covariates, again in accordance with Giesen-Bloo (further details available from the authors). Since cost data are generally highly skewed, and not distributed normally, bootstrap simulations with 1000 replications were performed to estimate uncertainty intervals around the mean costs.16,17 For the same reasons, QALY scores and both ICERs were bootstrapped with 1000 replications. The uncertainty interval is represented by the 2.5th and 97.5th percentiles. Results of ICER bootstraps are presented in cost-effectiveness planes and cost-effectiveness acceptability curves (CEACs). Cost-effectiveness planes show differences in costs on the vertical axis and differences in effect on the horizontal axis. Bootstrapped cost-effectiveness pairs located in the northwest quadrant indicate SFT to be inferior to TFP (more costly and less effective than TFP). Cost-effectiveness pairs located in the south-east quadrant show SFT to be dominant over TFP (more effective and less costly than TFP). With respect to the other two quadrants (higher costs for better effectiveness and lower costs for lower effectiveness) the choice of an intervention depends on the threshold value, i.e. what society is prepared to pay for an effectiveness gain, or willing to accept as savings for effectiveness loss. The CEAC represents the probability that, given a certain threshold for the willingness to pay for a QALY or for recovering a patient, the intervention is cost-effective.18
Secondary analyses
Secondary analyses were performed to assess robustness of the results,
initially using the last observation carried forward method for patients who
completed the therapy early with consent of the therapist but did not complete
later assessments, and imputing baseline values for true `drop-outs', i.e.
patients who stopped therapy without consent of the therapist. We then
conducted a `completers' analysis, analysing data for only those patients who
completed treatment. Patients who completed early with consent of their
therapist were included in this analysis, those who dropped out were not.
As costs at baseline were not comparable between both groups, a correction was performed for the following measurements by means of a regression analysis as recommended by Manca et al.19 A correction was also performed for utility and BPDSI scores. An analysis per recovered patient with a healthcare perspective, meaning that only direct healthcare costs (including costs of therapy) were considered, was followed by an investigation of the impact of the unit price of a therapy session, the unit price of informal care and out-of-pocket costs on total costs. We separately doubled and halved the costs of these items. The results of the first four secondary analyses are again represented in a CEAC, combined with the CEAC of the main analysis.
|
|
|---|
|
View this table: [in a new window] |
Table 1 Baseline characteristics of study participants (n=86)
|
Costs
Baseline assessment
At baseline, costs were not distributed normally (P=0.029,
Kolmogorov–Smirnov test. The bootstrapped total costs at baseline were
4324 for TFP and
3331 for SFT, a difference of about
900.
However, neither of the cost categories nor the total societal cost was
significantly different between conditions.
After start of treatment
Bootstrapped societal costs over 4 years were
46 795 for TFP and
37 826 for SFT (non-significant difference;
Table 2). The direct
non-healthcare costs were significantly different between the two intervention
therapies, mainly owing to informal care costs. With respect to direct
healthcare costs, treatment costs (including costs of telephone contacts with
the therapist for SFT) were about
2200 higher for SFT, whereas the costs
of visits to mental healthcare centres and other healthcare facilities were
about
3500 higher for TFP. Table
3 presents mean societal costs, EQ–5D utility scores, BPDSI
scores and percentage recovered patients per measurement. Costs in both groups
decreased with time. In addition, aside from the second assessment when costs
were comparable between the two groups, the TFP costs were continuously
higher.
|
View this table: [in a new window] |
Table 2 Total costs per cost item and bootstrapped costs for subtotals per
patient for the schema-focused therapy group (n=44) and the
transference-focused psychotherapy group (n=42)
|
|
View this table: [in a new window] |
Table 3 Costs, EQ–5D utility scores, disorder severity scores and patient
recovery over the course of the study
|
Effectiveness
BPDSI score
The proportion of patients who had recovered after 4 years was 52% for the
SFT group and 29% for the TFP group (Table
3). Logistic regression analysis with treatment group and BPDSI
baseline score as covariates showed a significant effect in favour of SFT
(Wald test 4.45, P=0.035; OR=2.64, 95% CI 1.07–6.49). Without
the baseline BPDSI the group effect was similar. (For details of the BPDSI
results, see Giesen-Bloo et
al;3 further
details are available from the authors.)
Quality-adjusted life-years
For one participant no EQ–5D score was available, therefore QALY
results and ICERs for cost per QALY are based on 85 patients.
Table 3 shows that baseline
utility scores were not distributed normally (P=0.037,
Kolmogorov–Smirnov test). Baseline utility scores were 0.49 for the TFP
group and 0.46 for the SFT group (P=0.734, Mann–Whitney test).
In TFP, utility scores increased early in treatment and more or less remained
stable, whereas in SFT utility scores continued to increase. As a result,
total QALYs over the 4-year period were 2.15 for SFT and 2.27 for TFP. The
bootstrapped difference in QALYs between SFT and TFP of –0.12 (95% UI
–0.51 to 0.28) was not statistically significant.
Cost-effectiveness
Societal costs per recovered patient
As the societal costs in the SFT group were lower and recovery rate was
higher compared with the TFP group, SFT was dominant over TFP
(Table 4). The bootstrapped
results (Fig. 1) show that 91%
of the cost-effectiveness pairs are in the south-east quadrant, where SFT is
dominant. Another 8% lie in the north-east (i.e. more costly, more effective)
quadrant, and 1% is located in both west quadrants, where SFT is less
effective than TFP. The CEAC (Fig.
2, main analysis) indicates that, regardless of the threshold
value, the chance that SFT is cost-effective is over 90%.
|
View this table: [in a new window] |
Table 4 Results of main and secondary analyses
|
![]() View larger version (17K): [in a new window] [as a PowerPoint slide] |
Fig. 1 Bootstrapped costs and effects for cost per recovered patient, showing 1000
bootstrap replications for incremental cost-effectiveness of schema-focused
therapy compared with transference-focused psychotherapy. Costs are plotted on
the y axis and effects on the x axis, so a bootstrap
replication in the south-east quadrant means that schema-focused therapy is
less costly and more effective than transference-focused psychotherapy for
that replication.
|
![]() View larger version (20K): [in a new window] [as a PowerPoint slide] |
Fig. 2 Cost-effectiveness acceptability curves for secondary analyses of cost per
recovered patient. BPDSI, Borderline Personality Disorder Severity Index;
ICER, incremental cost-effectiveness ratio. The curves indicate the
probability (y axis) of schema-focused therapy being cost-effective
compared with transference-focused psychotherapy, given the threshold value
(x axis) for recovering a patient.
|
Societal cost per QALY
The incremental cost-effectiveness ratio based on the QALY shows that
90 457 is saved when one QALY is sacrificed, as SFT was less costly and
less effective than TFP (Table
4). The bootstrapped results
(Fig. 3) show that 68% of the
cost-effectiveness pairs are in the south-west quadrant, where SFT is less
costly and less effective. The south-east quadrant (i.e. the dominance
quadrant) contains 28% of the replications, and the final 4% are located in
the inferiority quadrant. The cost-effectiveness acceptability curve
(Fig. 4, main analysis)
indicates that the probability of SFT being cost-effective decreases with an
increasing threshold value. Assuming that society's maximum willingness to pay
is
20 000 for a QALY gain (or accepts a minimum compensation of
20
000 for a QALY loss), the probability that SFT is cost-effective is 84%.
![]() View larger version (18K): [in a new window] [as a PowerPoint slide] |
Fig. 3 Bootstrapped costs and effects for cost per quality-adjusted life-year,
showing 1000 bootstrap replications for incremental cost-effectiveness of
schema-focused therapy compared with transference-focused therapy. Costs are
plotted on the y axis and effects on the x axis, so a
replication in the south-west quadrant means that schema-focused therapy is
less costly but also less effective than transference-focused psychotherapy
for that replication.
|
![]() View larger version (18K): [in a new window] [as a PowerPoint slide] |
Fig. 4 Cost-effectiveness acceptability curves for secondary analyses of cost per
quality-adjusted life-year. ICER, Incremental cost-effectiveness ratio. The
curves indicate the probability (y axis) of schema-focused therapy
being cost-effective compared with transference-focused psychotherapy, given
the threshold value (x axis) for a quality-adjusted life-year.
|
Secondary analyses
Costs per recovered patient
Table 4 also shows the
secondary analyses. For costs per recovered patient, results were robust,
meaning that SFT dominates TFP. The cost-effectiveness acceptability curves
demonstrate that, regardless of the threshold value, the probability that SFT
is cost-effective is consistently over 90%
(Fig. 2). From a healthcare
perspective, the CEAC is less favourable for SFT. With respect to varying the
unit prices and the out-of-pocket costs, total costs of SFT stayed below costs
of TFP in all cases.
Costs per QALY
With respect to costs per QALY (Table
4), results were robust when using the completers analysis and the
regression corrected analyses. However, using the baseline values for
imputation of missing values, SFT became dominant. The cost-effectiveness
acceptability curves (Fig. 4)
show that baseline value imputation leads to more favourable results for SFT
compared with the main analysis. However, regression correction and a
completers analysis both lead to less favourable results for SFT. The
descending CEACs mean that the probability of SFT being cost-effective
decreases if the value society attaches to a QALY increases, dropping from 96%
(best case) to 53% (worst case).
|
|
|---|
QALYs
Utility scores increased from 0.49 to 0.61 in the TFP group and from 0.46
to 0.63 in the SFT group, improvements of 0.12 and 0.17 respectively. The
minimally important difference for EQ–5D utility scores is considered to
be 0.03 points;20
both therapies thus had a clinically relevant effect on quality of life.
Therefore, in this sample, the EQ–5D can be considered sensitive to
change. However, the resulting QALY was unable to discriminate significantly
between the two intervention therapies, although the proportion of recovered
patients was higher in the SFT group. Quality-adjusted life-years based on the
4-year period were slightly lower for the SFT group, resulting in an ICER
indicating a saving of over
90 000 for a QALY loss.
To date there is no consensus about a reasonable threshold value for
cost-effectiveness. Most studies with the explicit goal of determining a
threshold found values ranging from
10 000 to
27
000.21–23
However, in practice, thresholds used for appraisal of new interventions may
be higher, around
40 000 for the UK and
80 000 for The
Netherlands.24,25
The CEACs show a range of thresholds, varying from
0 to
40 000. In
this study, when applying a
20 000 per QALY threshold, SFT in the main
analysis would have an 84% probability of being cost-effective. In the
secondary analyses, the worst-case scenario had a probability of 68% and the
best-case scenario had a probability of 94%. Given this range, the probability
that SFT is more cost-effective than TFP in terms of QALYs can be considered
moderate to high, in addition to the fact that this probability was already
high for the analysis per recovered patient.
BPDSI v. EQ–5D
It is difficult to explain what the reason for the divergence between BPDSI
and QALY results might be. When a patient recovers from borderline personality
disorder, one would expect the HRQoL to improve correspondingly to at least
some extent. In the long run, this was indeed the case: the slopes in increase
of HRQoL were significantly steeper in the SFT group than in the TFP
group.3 However,
during the first half-year of treatment, QALY and BPDSI scores diverged
markedly, and as a consequence, during the first 2 years of treatment, the
HRQoL in the SFT patients was lower than that in TFP patients. This difference
is difficult to understand as it was not paralleled in any of the clinical
outcome measures. At follow-up at 4 years, the HRQoL in the SFT group was
slightly higher than the HRQoL in the TFP group, but not enough to offset the
total QALY difference. Maybe SFT is initially experienced as more burdensome
by patients, and it may take time for clinical improvement to be translated
into an increase in HRQoL. Another reason for the discrepancy might be a
difference in responsiveness between the EQ–5D, which has 5 items each
with three-point scale responses, and the BPDSI, which has 70 items with
mostly ten-point scale responses. There is little published information
concerning the use of the EQ–5D in a population with borderline
personality disorder. For depression, the EQ–5D seems useful and
sensitive to
change;26,27
in schizophrenia, results are
contradictory.28,29
Limitations
A number of limitations should be addressed. First, we used the last
observation carried forward method to impute missing assessments. Although
this is in accordance with the clinical study, this method is regarded as
naïve.30 We
therefore also performed a multiple
imputation.31
However, probably because of the high cumulative drop-out at the last three
measurements, point estimates resulting from the multiple imputation tended
heavily towards extreme (and even impossible) values. Because this tendency
could not be justified from the trends in the observed data, we have not
included the multiple imputation results in this paper. Second, the fact that
we only included disorder-related costs might have led to bias in the estimate
of the total costs, since patient judgement of what is and what is not
disorder-related may be flawed. However, as we do not expect patients'
judgements in the SFT group to differ systematically from those in the TFP
group, this presumably did not affect the incremental costs. Third, the time
horizon of the analysis was 4 years; more long-term information should be
collected to confirm the results now presented. Fourth, the economic
evaluation was limited to the comparison of only two possible treatments.
Other alternatives, such as natural course and care as usual, have not been
considered. The use of a single randomised controlled trial as a vehicle for
economic evaluation is often an insufficient basis for decision-making,
because decision-makers need to be informed about costs and effects for the
full range of alternative
interventions.32 In
that respect, our study has contributed evidence to the broader case of
cost-effectiveness of treatment for borderline personality disorder.
Implications of the study
As this is the first cost-effectiveness analysis of psychotherapy in the
field of borderline personality disorder from a comprehensive societal
perspective, it is difficult to place the results in a broader perspective.
Even for the much better-documented case of treatment for depression,
comparisons between studies are complicated because of vastly different
interventions, outcome measures and cost
perspectives.33 The
same applies to personality disorders in general and borderline personality
disorder in particular. However, our study shows that long-term psychotherapy
for borderline personality disorder improves HRQoL and decreases societal
costs. In terms of recovered patients, there is a high probability that SFT is
more cost-effective than TFP. In terms of QALYs, the probability that SFT is
cost-effective is moderate to high.
Future research should further investigate the reason for the discrepancy between the BPDSI and QALY scores, and the relation between the EQ–5D and other measures of quality of life among people with borderline personality disorder. Furthermore, we agree with Giesen-Bloo et al that different comparators such as care as usual and natural course should be studied,3 including economic evaluations with a longer period of follow-up.
|
|
|---|
|
|
|---|
Related articles in BJP:
Read all eLetters
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||