Stavanger University Hospital, Division for Psychiatry, and Department of Psychiatry, University of Bergen, Norway
Ullevaal University Hospital, Norway
Stavanger University Hospital, Division for Psychiatry, Norway
Roskilde County Psychiatric Hospital Fjorden, Roskilde, Denmark
Departments of Behavioural Sciences in Medicine, University of Oslo, Norway
Department of Psychiatry, Yale University, New Haven, Connecticut, USA
Correspondence: Tor K. Larsen, Stavanger University Hospital, Psychiatric Clinic, Armauer Hansensv. 20, pb 8100, N-4068 Stavanger, Norway. Email: tklarsen{at}online.no
|
|
|---|
Aims To study the effect of early intervention in first psychosis on one-year outcome using an historical quasi-experimental design.
Method We compare the outcome of two samples of first-episode psychosis from the same healthcare district at different time periods. The historical control sample was assessed during 1993–1994, before the establishment of a system for early detection of psychosis. The experimental sample is the early detection sample in the Early Treatment and Intervention in Psychosis study assessed during 1997–2000.
Results At 1-year follow-up, the early detection group was younger, had a smaller fraction of individuals with schizophrenia, had less severe negative and general symptoms and had more friends in the past year than the historical control group. No differences were found in clinical course (remission, relapse, continuously psychotic) or positive symptoms, but more patients in the early detection sample were treated as outpatients without hospitalisation.
Conclusions Early detection of schizophrenia in one healthcare sector is associated with less severe deterioration at 1 year.
|
|
|---|
The Early Treatment and Intervention in Psychosis study (TIPS) is the first to reduce DUP with an early detection programme involving the creation of easy access psychosis-detection teams and a massive and persistent educational campaign about the first signs of psychosis aimed at the general public, the primary health services and the school system (Melle et al, 2004). The study also compared the effect of reducing DUP on the severity of first-episode psychosis using a parallel-control, quasi-experimental design. Such a design compares the clinical presentation and course of patients with first-episode schizophrenia in different health-care sectors, an experimental sector with an early detection programme (Rogaland, Norway) and two control sectors without such a programme (Ulleval, Norway and Roskilde, Denmark). The term parallel control derives from the fact that the experimental and control sectors collect their samples over the same (parallel) time period; for the TIPS this was 1997–2000 (Melle et al, 2004).
In TIPS DUP was reduced significantly in the experimental early detection sector compared to no-early detection control sectors (5 weeks median compared to 16 weeks median). At intake (baseline), patients in the early detection sector had significantly less severe positive, negative and general symptoms (P<0.01 for all comparisons) (Melle et al, 2006). At 1-year follow-up the differences in positive and general symptoms disappeared but the negative symptom differences remained significant (P<0.005) (Larsen et al, 2006).
The focus of this paper will be a comparison of the TIPS 1997–2000 early detection or experimental sector sample with a different no-early detection control group. This control group consists of all patients with first-episode non-affective psychosis who came to treatment in the same healthcare sector, i.e., Rogaland County, but at an earlier time period, the years 1993–1994, before any educational campaigns. Before the TIPS programme began, this study was carried out to measure DUP in first-episode psychosis in the middle and southern sections of Rogaland County. The investigation used essentially the same inclusion criteria, assessment battery and follow-up procedures as in TIPS (Larsen et al, 2001). This sample, coming from the same early detection sector but from a different time before the early detection campaign, provides a no-early detection historical control sample to the 1997–2000 early detection experimental sample.
We have previously reported a comparison between the TIPS early detection sample from the first 2 years of recruitment (1997–1998, n=66) with the 1993–1994 historical control sample regarding baseline characteristics (n=43) (Larsen et al. 2001). We found that the patients with early detection at baseline had a significantly shorter DUP, were younger, misused substances more often, had better premorbid adjustment and had less symptoms. In this paper we compare the 1-year outcome of the full 4-year early detection sample in TIPS (1997–2000, n=133) with the 1993–1994 historical control sample at baseline and 1-year outcome. We aim to see if the 1-year outcome findings are similar to those of the TIPS parallel-control study (as reported in Larsen et al, 2006) using a different no-early detection historical control group. The null hypothesis is that the findings of the parallel control 1-year outcome will not be replicated.
|
|
|---|
Assessments
The Structured Clinical Interview for DSM–IV (SCID)
(Spitzer et al, 1995)
was used in 1997–2000. SCID for DSM–III–R was used in the
1993–1994 period; the diagnosis were converted into DSM–IV
diagnosis by T.K.L. Premorbid functioning was measured by the Premorbid
Adjustment Scale (PAS), which describes four premorbid periods in life:
childhood (
11 years), early adolescence (12–15 years), late
adolescence (16–18 years), and adulthood (
19 years)
(Cannon-Spoor et al,
1982). A previous analysis identified two premorbid dimensions (a)
social, consisting of PAS items, social isolation and peer relationships; and
(b) academic, containing school performance and school adaptation. Two
parameters for each dimension are rated: (a) childhood level of adjustment;
and (b) degree of change of level of adjustment over post-childhood
developmental phases (for details about this modification see
Larsen et al,
2004).
The DUP was measured as the time from onset of psychosis until the start of adequate treatment (for details see Melle et al, 2004). Level of symptoms were measured by the Positive and Negative Syndrome Scale (PANSS; Kay et al, 1987). Misuse of alcohol and other drugs was measured by the Drake Scale (Drake et al, 1990). Social functioning was measured with the Strauss–Carpenter Scale (Strauss & Carpenter, Jr, 1974). At 1 year all assessments were repeated, including the SCID.
All major baseline assessments such as diagnosis, PANSS, Global Assessment of Functioning (GAF), drug misuse and DUP underwent tests of intra- and inter-site reliability with satisfactory results within the early detection period (for details see Friis et al, 2003). For the historical control period reliability tests were carried out for diagnosis (20 written case vignettes rated by T.K.L. and S.O.) with a kappa of 0.89. For the PANSS and GAF a similar study was carried out with intraclass correlations (ICC 1.1) between 0.65 and 0.90. A test–retest study was carried out for PAS and DUP with satisfactory results (for details see Larsen et al, 1996).
Data analysis
Statistical procedures
Analyses were performed with the statistical package SPSS (version 12.0 for
Windows). The applied methods are reported for all group comparisons. All
tests were two-tailed. We used nonparametric tests for data without normal
distribution. As noted in several other studies, the DUP does not seem to have
a normal distribution, whereas its natural logarithm does. In multiple linear
regression analyses DUP was transformed to its natural logarithm.
For the multivariate analysis the selection of variables to be included in the regression model was based on their assumed clinical importance through reviews of relevant studies and a set of variables comprising measures of differences between the areas. The variables were entered hierarchically with from the early detection sample on the last step. The final model was examined for interaction effects and the effects of outliers and influential observations, including leverages.
|
|
|---|
|
View this table: [in a new window] | Table 1 Comparison between historical control sample and early detection sample with first-episode psychosis at baseline |
Follow-up findings (Table 2)
At 1-year follow-up we reassessed 100% of the historical control sample for
all variables, for the early detection sample we were able to reassess 99% for
illness-course, 97% for diagnosis and 88% with PANSS. Fewer of the patients in
the early detection group were hospitalised, but the 1-year clinical course
was not different between either of the two samples. The historical sample had
a higher fraction of individuals with schizophrenia, more negative and general
symptoms on the PANSS, and also had fewer friends in the past year.
|
View this table: [in a new window] | Table 2 Comparison between historical control and early detection samples with first episode psychosis at 1 year follow-up |
The multivariate analysis revealed that negative symptoms were related significantly to social deficits in childhood, to increasing social deficits during subsequent premorbid development, and to developing a narrow schizophrenia disorder. Reducing DUP, i.e. coming from the early detection period (1997–2000), remained significant after controlling for the other variables.
|
|
|---|
At baseline, the early detection sample was larger in number, younger in age and displayed healthier premorbid social functioning. The fraction of this sample meeting criteria for DSM–IV schizophrenia was smaller and symptomatically the sample had less severe positive, negative and general symptoms. The only way the early detection sample may have been worse than the historical control sample was in scoring higher on drug misuse.
By 1-year follow-up significantly more of the patients in the early detection group were treated as out-patients, but the clinical course between early detection and historical control was comparable, with similar sample fractions being in remission, in relapse or continuously psychotic. The historical control sample still had a significantly higher fraction of patients with schizophrenia, but the difference between the two groups was less than at baseline, owing primarily to a larger shift in diagnosis from schizophreniform to schizophrenia within the early detection sample. Symptomatically at 1 year there were no longer differences in positive symptoms, but the differences in negative and general symptoms remained robust, particularly the former. Individuals with early detection at 1 year scored higher for having friends, a new finding given that they were not different on this parameter at baseline.
Overall, psychopathologically the differences between the the two sample groups samples were more attenuated at 1 year than they were at baseline. Although this suggests that with more time the early detection sample might catch up with the historical control sample vis à vis level of deterioration and chronicity, many of the initial differences remained robust at 1 year. This was especially true for the difference in negative symptoms between the groups, which was virtually the same at 1 year as it was at baseline. Furthermore, when we analysed the difference for negative symptoms for a subsample with core and narrow schizophrenia, the difference persisted even for those with poor premorbid social adjustment, suggesting that early detection may attenuate primary as well as secondary negative symptoms (Kirkpatrick et al, 1989).
The 1-year differences in favour of the early detection sample in this study do not support the null hypothesis. On the other hand we cannot say our findings replicate the parallel-control studys 1-year outcome (Larsen et al, 2006) because the experimental samples are not completely different between studies. Nevertheless we can say our early detection–historical control differences in this historical control study are more numerous and quantitatively more robust than they are in the parallel-control study. This may reflect that the experimental-control differences in DUP in this study versus the parallel-control study are also more robust, e.g. 6 v. 26 weeks (median) in this study compared to 5 v. 16 weeks (median) in the parallel-control study (Melle et al, 2004).
The patients in the historical control group clearly began first treatment for psychosis on average much later in their psychosis than the early detection patients. The patients in this control sample may, in effect, be further along in the window of deterioration that characterises the longitudinal course of schizophrenia (McGlashan, 1988; McGlashan & Fenton, 1993). If so, they may also be closer to the plateau in deficit formation that determines the end of this window. Given sufficient time and more follow-ups, the early detection sample of this study may eventually reach the same level of deficit as the historical control sample if early detection does nothing to reduce the course of deterioration in psychosis. If such is the case, then the differences we are reporting here at 1 year could be an example of a lead-time bias as described by Black & Welch (1993), that is, an artifact of early detection of disorder. If, however, secondary prevention is conferred by early intervention, then the early detection baseline advantage should persist through the window of deterioration and emerge 1–3 years later as a permanent difference or advantage that lasts a lifetime. In both this historical control study and the TIPS parallel-control study, secondary prevention may be happening, especially with negative symptoms, but further follow-up is necessary to rule out the competing hypothesis that what we are seeing is the consequence of lag-times between samples in their progression through a common window of deterioration to an equivalent deficit plateau.
Eearly detection in this study may or may not confer secondary prevention
in psychosis, but it clearly provides several examples of tertiary prevention.
The early detection group was younger, better functioning premorbidly, and
less symptomatic, with fewer individuals meeting criteria for
core schizophrenia. If early detection works by reducing the
threshold at which the signs and symptoms of psychosis are recognised, then we
would expect to see a greater capture of younger and less severe
cases who escape detection until later when they are more obviously
psychopathologic. Indeed, one consequence of effective early detection may be
to appear to increase the incidence of schizophrenia in a sector when, in
effect, all cases are legitimate but a higher fraction of all possible cases
are being detected.
|
View this table: [in a new window] | Table 3 Multiple linear regression analysis of the effect of independent variables on negative symptoms at 1-year follow-up1 |
Almost 1 in 4 of the patients in the early detection group was treated for first psychosis as out-patients. Another positive result of early detection may be identifying people as psychotic at a milder stage of disorder and avoiding hospitalisation, i.e. reducing the collateral damage and expense that usually attends a first psychotic episode.
Strengths of the study
A strength of the study is the well-organised, nationalised mental
healthcare system of Rogaland County, Norway. The samples are representative
and are drawn consecutively from the same catchment area separated by only 4
years. No other services treat patients with first-episode psychosis within
this region, and the samples have a high follow-up rate.
Limitations
A major limitation of the study is the (necessary) quasi-experimental
design and the probability of cohort effects, i.e. sample differences emerging
because of changes over time in the populations studied. For example, our
early detection sample had more substance misuse than the historical control
sample. We deduce that this difference is real and reflects an epidemic of
drug use that arose in Scandinavia around the time of the TIPS project. This
finding illustrates how rapidly cohort effects (e.g. differences) can happen,
even in cohorts from the same catchment area separated by only 4 years.
Another limitation is that some of the designs and methods were not identical across the early detection and historical control cohorts. For example, time to remission was not rated in the latter cohort. Also, treatment was provided to the patients in the historical control sample during the year but it was not standardized or recorded, so we cannot estimate how much the differences between the samples at 1 year derive from treatment differences. All of the clinical ratings in the historical control cohort were performed by one person (T.K.L.) and, while he was adequately trained, standard interrater reliability between the historical control and the early detection sample was never established. The detection of cases was also more intensive during the TIPS period; in particular, cases from outpatients clinics might have been overlooked in the historical control sample. Finally, the historical control cohort was assessed diagnostically prior to DSM–IV, consequently, requiring that the DSM–III–R diagnoses be retrospectively reassessed as to whether or not they met DSM–IV criteria. This was done by one person (T.K.L.) without subjecting the process to reliability testing or consensus determination.
|
|
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||