SPECIAL ARTICLES |
Glasgow Caledonian University and Douglas Inch Centre, Glasgow, UK
Glasgow Caledonian University, Glasgow, UK
University of California, Irvine, California, USA
Correspondence: Dr David J. Cooke, Forensic Psychology Services, Douglas Inch Centre, 2 Woodside Terrace, Glasgow G3 7UY, UK. Email djcooke{at}rgardens.vianw.co.uk
Funding detailed in Acknowledgements.
1 The PCL-R is a 20-item rating scale of traits and behaviours intended for
use in a range of forensic settings. Definitions of each item are provided and
evaluators rate the lifetime presence of each item on a 3-point scale (0,
absent; 1, possibly or partially present; 2, definitely present) on the basis
of an interview with the participant and a review of case history
information. ![]()
2 Although we give a broad verbal account of each of the chief models that
have been proposed for the PCL-R (two-, three- and four-factor; see
Fig. 2), given the imprecision
of natural language, we stress the importance of consulting the mathematical
code provided in the data supplement to the online version of this article to
specify each model. ![]()
3 It is noteworthy that this is only one possible interpretation of this set
of covariances: this model has six equivalent models, all of which are equally
tenable statistically but which lead to different theoretical
interpretations. ![]()
4 Another problem in the literature is the proliferation of underpowered
studies. Confirmatory factor analysis requires moderate-to-large samples
(Kline, 1998). Many of the
attempts to explore the structure of the PCL measures have been seriously
underpowered in terms of sample size, with samples at, or even well below,150
individuals (e.g. Jackson et
al,2002; Hill et
al,2004; Salekin et
al, 2006; Vitacco et
al, 2006). Kline
(1998) provides guidance on
the issue and indicates that 20 cases per free parameter is desirable, 10:1 is
just acceptable and the statistical stability with a 5:1 must be regarded as
suspect. The three-factor hierarchical model with testlets has 36 free
parameters, suggesting a minimum sample size of between 360 and 720. The
four-factor hierarchical model has 40 free parameters (minimum
n=400-800); the four-factor correlated model has 42 free parameters
(minimum n=420-840) and the two-factor, four-facet hierarchical model
has 41 free parameters (minimum n=410-820). Underpowered studies will
mislead (Floyd & Widaman,
1995). In addition to the problem of lack of stability is the
problem of Heywood cases. Small samples are prone to improper solutions in
which estimated correlations are greater than 1 or estimated error variances
are less than 0. Solutions may also fail to converge. ![]()
5 The level of fit achieved on the development sample using this method is
excellent. S-B
2=167, d.f.=56, AIC=55, NFI=0.98, NNFI=0.98,
CFI=0.99, RSMEA=0.04. ![]()
6 Some commentators have advocated the use of MPlus; the rationale for their
preference is unclear. The same pattern of results was achieved using MPlus.
The level of fit achieved with the three-factor model with testlets was good:
2=181, d.f.=40, CFI=0.95, RSMEA=0.06; for the three-factor
model without testlets it was fair:
2=261, d.f.=0.43,
CFI=0.92, RSMEA=0.08; and for the four-factor hierarchical model the fit was
poor:
2=692, d.f.=73, CFI=0.82, RSMEA=0.10. Results for all
models are available from the authors. ![]()
|
|
|---|
|
|
|---|
|
|
|---|
The structure of the PCLR and its antecedents has been the subject of some debate. The original PCLR manual (Hare, 1991) lacked clarity about the structure of the test (Cooke & Michie, 2001). For a number of years a two-factor model dominated the literature (Harpur et al, 1988). Unfortunately, the support for this model was over-reliant on congruence coefficients; these provide inadequate tests of the similarity of factor solutions across samples. Cooke & Michie (2001), using item response theory, confirmatory factor analysis and cluster analytical methods, argued that 13 of the 20 PCLR items are conceptually distinct and psychometrically non-redundant indicators of psychopathy. Since they were found to be relatively poor indicators of psychopathy, items that tapped antisocial behaviour largely were excluded. Cooke & Michie (2001) developed a well-fitting hierarchical structure in which the superordinate trait, psychopathy, overarched three highly correlated symptom factors: arrogant and deceitful interpersonal style, deficient affective experience and impulsive and irresponsible behavioural style (see Fig. 1). The first factor was specified by glibness/superficial charm, grandiose sense of self-worth, pathological lying, and conning/manipulative, the second factor by lack of remorse or guilt, shallow affect, callous/lack of empathy and failure to accept responsibility for own actions, and the third factor by need for stimulation/proneness to boredom, irresponsibility, impulsivity, parasitic lifestyle and lack of realistic, long-term goals. This model, despite being described by some as `controversial' (Salekin et al, 2006), is conceptually coherent (Skeem & Cooke, 2007) and consistent with clinical tradition (Cooke & Michie, 2001). Moreover, it has been replicated in a number of independent samples and by independent researchers using both the PCL (Jackson et al, 2002; Skeem et al, 2003) and other measures of psychopathic traits (Andershed et al, 2002). The three-factor model also has been shown to relate to external correlates in a theoretically coherent manner (Hall et al, 2004).
![]() View larger version (17K): [in a new window] [as a PowerPoint slide] |
Fig. 1 Structure of the Psychopathy Checklist - Revised. Hierarchical three-factor
model with testlets. PCL, Psychopathy Checklist - Revised.
|
There can be little doubt that the three-factor model has stimulated a number of researchers to reconsider the structure of the PCLR measure and its implications for our understanding of the construct of psychopathic personality disorder. This must be regarded as positive: the definition and validity of constructs must be revisited as knowledge advances (Smith et al, 2003).
|
|
|---|
More recently, Hare & Neumann (2005) have argued that PCLR items that capture antisocial tendencies, including criminality, are indicators of important psychopathic traits, asserting that the `real core of psychopathy has yet to be uncovered' (p. 62). They observe that the exclusion of antisocial behaviour in the three-factor model decreases the utility of the PCLR for predicting violence and aggression (see Skeem et al, 2003). Furthermore, they assert that `current findings suggest that the four-factor model has incremental validity over the three-factor in predicting important external correlates of psychopathy' (Neumann et al, 2007: p. 22). This logic is confused. Adding variables, for example, gender, age or a history of substance misuse, would also improve prediction. However, such an improvement would not imply that these characteristics are core to psychopathic personality disorder. A measure's validity in representing the construct of psychopathy should not be confused with its utility in predicting deviant behaviour (Skeem et al, 2003).
We have argued elsewhere that there are good reasons to reject the contention that criminal behaviour should play a central role in diagnosing psychopathy; instead, such behaviour is best viewed as a secondary feature, or sequela, of the disorder (Cooke et al, 2004, 2006). In a companion paper (Skeem & Cooke, 2007), we present conceptual (logical, theoretical) directions for resolving the debate about whether antisocial behaviour is an essential component or `downstream correlate' of psychopathy. In the current paper, we focus on empirical (analytical) directions, demonstrating how the application of appropriate statistical methods is necessary to help advance understanding of psychopathy.
To inform the debate, we consider the appropriateness of various analytical strategies and demonstrate their impact on the model selected to describe the PCLR. In the interests of transparency, we append as data supplements to the online version of this paper the code for all models tested (data supplement 1) together with the correlation matrix for the dataset we used (data supplement 2). This will allow other researchers to replicate or reject our conclusions. Our goal is to address three of the difficulties that confront the field in this debate about the structure of psychopathy. First, never are the competing models compared directly using the same dataset and the same approach to modelling. Second, the verbal descriptions of the models considered are often imprecise and thus it is hard for independent researchers to parameterise these models accurately. Third, contentious analytical approaches such as parcelling are adopted. We begin by describing the competing models, then consider key issues of method and conclude by presenting analyses to illustrate these issues of method.
|
|
|---|
The three-factor model
The three-factor model is illustrated in
Fig. 1. There are perhaps four
points of emphasis regarding this three-factor model. First, the structure is
hierarchical, with a superordinate construct `psychopathy' that is
sufficiently unidimensional to be regarded as a coherent psychopathological
construct or syndrome (Cooke & Michie,
2001; Cooke et al,
2005a,
b). This hierarchical
structure reflects a common model of personality and personality disorder in
which traits of different levels of generality, from general to more specific,
are structured in a hierarchical manner
(McCrae & Costa, 1995).
Second, the three factors can be regarded as having reliable general variance
as a consequence of the influence of the broad psychopathy construct shared
with the other factors. In addition, however, there is reliable specific
variance unique to each particular factor. The value of refining the broad
construct into specific factors has advantages in that the specificity between
aspects of the disorder and external variables may be clearer
(Raine et al, 2000;
Soderstrom et al,
2002; Dolan & Anderson,
2003; Hall et al,
2004). Thus, this hierarchical model highlights `differential
relations between the psychopathy factors and a variety of important criteria'
(Neumann et al, 2007:
p. 24), but requires that the factors investigated are actually components of
the general disorder of psychopathy.
Third, although some variants of the original three-factor model exclude testlets for the sake of parsimony (Skeem et al, 2003; Odgers, 2005; see Fig. 2), below the level of specific factors, and between the items, are testlets. Testlets occur when items are more highly associated than can explained by their relationship with the underlying latent trait; thus, a pair of items that form a testlet can be construed as being somewhere between one and two items (Chen & Thissen, 1997). Indeed, the use of item response theory demonstrated that all PCLR items other than poor behavioural controls form testlets (Cooke & Michie, 2001). Testlets do not merely capture shared error variance, instead testlets are conceptually meaningful. Testlets combine specific indicators to form higher-order facets within the hierarchy of personality features.
![]() View larger version (14K): [in a new window] [as a PowerPoint slide] |
Fig. 2 Structure of the Psychopathy Checklist Revised. Degraded three-factor model
(without testlets). PCL, Psychopathy Checklist - Revised.
|
The four-factor models
The current debate is frequently described as a choice between a three- and
a four-factor model. This is misleading as there are two three-factor models
and many four-factor models (Hare,
2003; Hare & Neumann,
2005). Frequently, authors fail to distinguish between these
models and this creates conceptual confusion. We can identify at least ten
four-factor or equivalent models in the literature. We
describe these as: (a) a four-factor hierarchical model; (b) a two-factor,
four-facet hierarchical
model;3 (c) a
four-factor correlated model. Since each of these models can be `parcelled',
we also describe a four-factor parcelled model.
A four-factor hierarchical model
Hare (2003) implied that
four factors (i.e. the three factors from the Cooke & Michie
(2001) model together with a
criminality `factor' specified by five items that tap criminal behaviours) are
in a hierarchical relationship with the superordinate psychopathy factor
(Fig. 3). Although Hare
(2003) argued that the model
`envelopes' the three-factor model, to date no results have been provided to
support this position.
![]() View larger version (14K): [in a new window] [as a PowerPoint slide] |
Fig. 3 Structure of the Psychopathy Checklist Revised. Hierarchical four-factor
model. PCL, Psychopathy Checklist - Revised.
|
![]() View larger version (16K): [in a new window] [as a PowerPoint slide] |
Fig. 4 Structure of the Psychopathy Checklist Revised. Hierarchical two-factor,
four-facet model. PCL, Psychopathy Checklist - Revised.
|
A four-factor correlated model
A number of researchers (e.g., Hare,
2003; Hill et al,
2004; Hare & Neumann,
2005) have presented correlated factor models in which the three
factors from the three-factor model, together with the fourth criminality
`factor', are all inter-correlated (Fig.
5). Hence, each factor (e.g. factor 1) is correlated with all of
the other factors (e.g. factors 2, 3 and 4). Neumann et al
(2007) contend that correlated
factor models are superior to the hierarchical models previously offered.
![]() View larger version (18K): [in a new window] [as a PowerPoint slide] |
Fig. 5 Structure of the Pscyhopathy Checklist Revised. Correlated four-factor
model. PCL, Psychopathy Checklist - Revised.
|
![]() View larger version (8K): [in a new window] [as a PowerPoint slide] |
Fig. 6 Structure of the Psychopathy Checklist Revised. Two-factors, four-facet
parcelled model. PCL, Psychopathy Checklist - Revised.
|
|
|
|---|
Correlated v. hierarchical models
The work of Hill et al
(2004) highlights the emerging
difficulty in distinguishing between hierarchical and non-hierarchical models
(Hare & Neumann, 2005;
Neumann et al, 2007).
A key feature of a hierarchical model is the demonstration that the higher
order construct of interest is sufficiently unidimensional to be regarded as a
coherent psychopathological syndrome. For two- or three-factor models, all
correlated models are inherently hierarchical in that they are mathematically
equivalent to models with a superordinate factor overarching subordinate
factors. For models with four or more factors, this is no longer the case. In
terms of statistical modelling, a PCLR three-factor correlated model
has three correlations among the factors, and the hierarchical model also has
three loadings on the superordinate psychopathy factor. In contrast, the
PCLR four-factor correlated model has six correlations among the
factors, whereas the hierarchical model has four loadings on the superordinate
psychopathy factor. The hierarchical model is more parsimonious, more
constraints being placed on the model, and thus it is a more demanding model
to fit.
Nevertheless, proponents of the four-factor model strongly favour nonhierarchical models: `we recommend using first-order models with correlated factors in future research' (Neumann et al, 2007). The assumption is that `the strong correlations between the factors... reveal that they are indicators for a second-order latent variable' (Neumann et al, 2007). This assumption that correlated and hierarchical models are the same is misleading. It is necessary to explicitly compare a four-factor hierarchical model with a four-factor correlated model.
This issue has fundamental conceptual importance. The three-factor hierarchical model implies that psychopathic personality disorder (the superordinate factor) is underpinned by distinct constellations of interpersonal, affective and lifestyle traits (the first-order factors): the expression of these trait constellations is caused both by the overarching disorder and specific variance associated with the factor. The four-factor correlated model does not imply the presence of an overarching disorder that produces particular symptoms. Instead, these symptoms could be merely a hodge-podge of domains that co-occur. Essentially, the correlated model implies a compound trait composed of distinct constructs without a common cause (Smith et al, 2003). For example, measures of psychopathy are often associated with indices of alcohol and drug misuse and/or addiction (Rutherford et al, 2000). Most authorities, however, would not construe substance misuse and addiction as central to psychopathy, but rather view them as associated features of the disorder (Cleckley, 1988; World Health Organization, 1992; American Psychiatric Association, 1994). This can be tested empirically by comparing the relative fit of a hierarchical v. a correlated model, each with four factors: the interpersonal, lifestyle and affective factors of the PCLR and a fourth `addiction' factor. If the authorities are right about the lack of centrality of addiction to psychopathy, the hierarchical model will not fit whereas the correlated factor model will.
This is equally true for the inclusion of items that essentially are counts of antisocial behaviours in the model. If a hierarchical model fits, one could argue that antisocial behaviours are a core part of the disorder. If the correlated model fits, one can only assume that antisocial behaviours are correlated with psychopathy; perhaps a self-evident, if not trivial, observation. The distinction between correlated and hierarchical models is thus core to this debate.
There are no reasonable arguments for neglecting this distinction. Proponents of the four-factor model argue that if the four factors have differential associations with external correlates, then it would be unwise to employ a superordinate model to seek out such differential associations. This argument conflates the scoring and application of a measure (PCLR total scores or PCLR factor scores) with the understanding of a construct (psychopathy, with specific trait constellations). Hierarchical models represent specific factors that underpin superordinate constructs. Unlike correlated models, hierarchical models require that the specific factors included in the model be part of a coherent construct. Thus, hierarchical models have great potential for understanding both psychopathic personality disorder and its specific components.
The use of testlets v. correlated errors
Local dependency occurs when there is consistency among item responses that
cannot be explained by individual differences on the latent trait being
measured. Testlets are groups of items that show local dependence (a testlet
formed of two items may be viewed as somewhere between one and two items).
Although local dependence can emerge for a variety of reasons, with a rating
scale the most common reason is the overlap of item definitions. PCLR
definitions are often overlapping. For example, `lack of remorse or guilt' and
`failure to accept responsibility for own actions' both require consideration
of whether the individual externalises blame. In creating the screening
version of the PCLR (the PCLSV), Hare and colleagues recognised
this issue and grouped conceptually related PCLR items to produce
distinct PCLSV items (Hart et
al, 1995; Cooke et
al, 1998).
If there is consistent evidence that testlets exist within a scale this indicates local dependence. Local dependence is an undesirable property of a scale for three reasons. First, local dependence complicates the structure underpinning the data and can incorrectly challenge the assumption that a unidimensional trait underpins the test. This is crucial if data are to be subjected to parcelling. Second, local dependence leads to an overestimation of the true amount of information provided by the test. That is, the test appears to be more accurate than it actually is because information is double-counted. Third, the ratings do not allow clinicians to adequately distinguish between conceptually distinct symptoms. Although testlets can be confused with correlated errors, the two concepts are conceptually and mathematically distinct. Conceptually, unlike correlated errors, testlets explicitly describe the measurement model, specifying theoretically meaningful sub-facets within the hierarchical description of the disorder. For example, `x=pathological lying' and `conning/manipulative' combine to describe a deceptive interpersonal style. This was implicitly acknowledged when these two PCLR items were combined to create the one PCLSV item called `deceitful' (Hart et al, 1995). Correlated errors are more opaque they do not provide this additional level of description. Mathematically, testlets are more elegant than correlated errors. A model with a three-item testlet is more parsimonious that a model with three items with correlated errors that load on the same factor: the former requires two parameters, the latter three parameters. We are criticised for including testlets in our three-factor model (Neumann et al, 2007). In our view, any attempt to provide an accurate model of the structure of the PCLR should consider testlets, even if only to reject the need for their inclusion in any model.
The use of parcelling
In structural equation modelling a parcel is an aggregate-level indicator
derived by combining individual items (e.g. adding individual PCLR
items to derive a new manifest variable;
Little et al, 2002).
This is a controversial technique
(Bandalos, 2002;
Little et al, 2002).
Proponents of the technique argue that parcelling has two broad advantages.
First, combining items results in composite variables with better psychometric
properties than item variables (e.g. greater reliability, a higher ratio of
common-to-unique factor variance, smaller and more equal intervals between
scale points and distributions that are less likely to violate distributional
assumptions; Little et al,
2002; Kim & Hagtvet,
2003). Second, parcelling results in models with better fit
indices. This is because they reduce sources of sampling error, require fewer
parameters and are less likely to be affected by correlated residuals or dual
loadings. Broadly, the number of variances and covariances that the model must
account for is reduced (Bandalos,
2002; Little et al,
2002; Kim & Hagtvet,
2003; Martens,
2005).
Opponents of parcelling point out five problems. First, parcelling can obscure the multidimensional nature of the items. Bandalos (2002) noted that when the assumption of unidimensionality is not met (an assumption that is rarely tested) `the use of parcels can obscure rather than clarify the factor structure of the data' (p. 80). This is clearly a problem when there is evidence of local dependency, as there is with the PCLR items (Cooke & Michie, 2001). Second, the improvement in fit is more apparent than real; models that do not fit at an item level can be made to appear to fit with parcelling. Bandalos & Finney (2001) noted that parcelling improves model fit, irrespective of whether the model is specified correctly or not; this also reduces our ability to detect mis-specified models. Kim & Hagtvet (2003) demonstrated empirically that when parcelled and item models were compared, the parcelled models yielded better fit statistics. Unlike the item models, the parcelled models pointed to the acceptance of mis-specified models. Third, Bandalos (2002) reported that parcelling can bias estimates of structural parameters (e.g. path coefficients; `factor loadings'). Fourth, comparisons of factor structure across groups for parcelled variables vary considerably from those based on individual items (Bagozzi & Edwards, 1998). This will affect our understanding of important issues, including cross-cultural variation in psychopathy and variation across gender, age and race. Fifth, even if the use of parcelling is defensible statistically, from a clinical perspective it can result in the loss of important information about the condition being considered (Little et al, 2002). When one sums across several items and then examines the relation between that sum (e.g. parcelled interpersonal facet) and an external variable (e.g. dominant behaviour), it is impossible to know that one aspect of the sum (e.g. grandiosity/charm) strongly predicts the external variable, whereas the other (e.g. deception) is unrelated. Data aggregation results in a loss of information.
When is it legitimate to use parcelling to analyse PCLR data? Justification of the approach depends on (a) the purpose of the analysis and (b) the analytical strategy adopted before parcelling is undertaken. Little et al (2002) noted that `careful delineation of the goals of the study is clearly the paramount issue' (p. 6). If the purpose of the analysis is to explicate the interrelations among items on a test for construct validation purposes, then parcelling is inappropriate (Rogers & Schmitt, 2004). However, if the goal is to examine the interrelations among well-established measures of latent traits then parcelling may be appropriate. In the latter case it is assumed that the structure of the latent traits is well established (i.e. is not the subject of debate) and the primary interest is in putative causal relations among the latent traits rather than the measurement model (Little et al, 2002; Rogers & Schmitt, 2004; Martens, 2005). Rogers & Schmitt (2004) warned, however, that even when a measure has been validated at the item level in terms of a measurement model, parcelling of these items can still result in `undesirable or unpredictable effects on estimates and fit when testing the theoretical model' (p.380).
If parcelling is to be attempted, an essential prerequisite is an analysis of the dimensionality of the parcel (Bandalos & Finney, 2001; Bandalos, 2002; Little et al, 2002; Rogers & Schmitt, 2004). Unfortunately, this is more honoured in the breach than the observance. Kim & Hagtvet (2003) demonstrated that the unidimensionality of a parcel must be established before it is entered into a more complex model. In the absence of unidimensionality the structural relations among latent traits cannot be interpreted (Little et al, 2002). Kim & Hagtvet (2003) propose a method that explicitly models the single item and parcel indicators simultaneously, allowing a comprehensive evaluation of the unidimensionality of the parcel. Interestingly, this approach is formally equivalent to the testlet approach adopted by Cooke & Michie (2001). The three-factor approach with testlets provides greater understanding of the PCLR items than a four-factor parcelled approach.
In summary, our understanding of the structure of the PCLR, and to some degree our understanding of psychopathy, may be adversely influenced by the application of inappropriate methods for specifying the basic framework of the measurement model (correlated rather than hierarchical factors) and specific components (parcels rather than items and/or testlets).
|
|
|---|
Participants
The sample comprised a total of 1212 adult male
offenders.4 The
largest subsample comprised 608 adult male offenders from seven prisons in Her
Majesty's Prison Service (HMPS) in England and Wales, selected to be
representative of the HMPS population. Additional subsamples included a
representative sample of 246 offenders from the Scottish Prison Service
(Cooke & Michie, 1999), a
stratified random sample of 253 offenders from Scotland's largest prison
(Michie & Cooke, 2006) and
a sample of 105 incarcerated Scottish offenders who volunteered to participate
in a study of early childhood experiences
(Marshall & Cooke, 1998).
Only complete cases were used (n=827) to ensure that the same data
were used irrespective of the model being tested.
The procedure
The PCLR ratings were made according to instructions in the test
manual (Hare, 1991). All
PCLR evaluations were conducted by trained raters.
Confirmatory factor analysis
Confirmatory factor analysis permits quantification of a particular factor
structure's fit within a particular sample. We assessed quality of fit using
multiple indices, as each index has limitations
(Kline, 1998;
MacCallum & Austin, 2000).
Different aspects of fit were evaluated, including absolute fit (
2), fit
adjusted for model parsimony (non-normed fit index, NNFI) and fit relative to
a null model (comparative fit index, CFI), and root mean square error of
approximation (RMSEA). Following convention, the criterion for adequate fit
was defined as CFI and NNFI
0.90 and RMSEA
0.08
(Byrne, 1994). Following Kim
& Hagvtet (2003) we
classified RMSEA values into four categories: close fit (0.000.05),
fair fit (0.060.08), mediocre fit (0.080.10), and poor fit
(>0.10).
Confirmatory factor analysis was performed using EQS for Windows (Bentler & Wu, 1995). Participants with missing data were deleted listwise for these analyses. Maximum likelihood estimation with robust-fit statistics and standard errors was used. The correlations were polychorics. Recommendations in the electronic help manual for the EQS 6 software suggested that this estimation approach is the best EQS approach for data of this type. This differs from the approach used in Cooke & Michie (2001).5 We also ran the analyses using the MPlus program with robust-weighted least-squares estimation and the same pattern of results was obtained.6
Comparison of models
We started our analysis by estimating a one-factor model with all 20 items
loading on a single latent trait. Fit statistics
(Table 1) indicate that this is
not a plausible model. We then tested the traditional two-factor model, which
contains 8 items that load on the factor described as `the selfish, callous
and remorseless use of others' and 9 items that load on a factor termed `the
chronically unstable and antisocial lifestyle; social deviance' factor
(Harpur et al, 1988).
Fit statistics (Table 1)
indicate that this too is not a plausible model. We then tested the amended
two-factor model (Hare, 2003),
i.e. we added the item `criminal versatility' to the second factor. Fit
statistics (Table 1) again
indicate that this is not a plausible model.
|
View this table: [in a new window] |
Table 1 EQS categorical variables
|
We then fitted the full three-factor hierarchical model with testlets (Cooke & Michie, 2001). This model achieves a close fit with a CFI of 0.96 and an RSMEA of 0.05 (Table 1).
Examining the impact of testlets
We then fitted a three-factor model with the testlet level removed. Such a
model has been advocated by others on the grounds of parsimony. Using the
criteria presented above this model achieves a fair and acceptable fit.
However, the model with testlets achieves superior fit; direct comparison
indicated that its fit is significantly better (
2(4,
n=827) =456, P<0.001). These results indicate that
testlets provide a more comprehensive account of the measurement model
underpinning the PCLR. Despite the clear superiority of the original
three-factor model, in the remaining analysis we excluded testlets to provide
a more rigorous test of the three-factor model of the PCLR. We call
this model without testlets the degraded three-factor model.
Examining the fit of the fourth criminality `factor'
We then tested the three unparcelled four-factor models described in the
literature; the four-factor hierarchical model, the two-factor, four-facet
hierarchical model and the four-factor correlated model. None of these models
achieve conventionally acceptable levels of fit
(Table 1). Their level of fit
is poorer than the level of fit achieved by even the degraded three-factor
model.
Exploring the impact of parcelling
Parcelling, or adding individual items together prior to model fitting, was
used to achieve the fit indices presented for the four-factor models in the
PCLR manual (Hare,
2003: Figs 7.1, 7.3, 7.4). A prerequisite to parcelling is to
demonstrate unidimensionality for the items being parcelled
(Bandalos, 2002;
Kim & Hagtvet, 2003;
Rogers & Schmitt, 2004).
The presence of testlets in PCLR data
(Cooke & Michie, 2001)
means that this assumption is not met; that is, multiple latent constructs are
tapped by items that are parcelled within individual factors of the
four-factor models.
To examine the effects of parcelling on our dataset, we first estimated the
two-factor, four-facet hierarchical model. Fit statistics
(Table 1) indicate that this
does not provide an adequate fit. We then parcelled the items by adding them
together within their respective factor. We tested the fit of this model and
fit statistics (Table 1) reveal
a very good fit, with a non-significant
2 value, and a CFI of
1.0.
We next tested the potential of parcelling to mislead. To do so, we compared the fit of an incorrect model that did, and did not, involve parcelling. The incorrect model involved swapping two item pairs within the two-factor, four-facet model: `pathological lying' with `poor behavioural controls' and `irresponsibility' with `failure to accept responsibility for own actions'. Therefore, in this incorrect model, 4 of the 18 items loaded on the wrong factors. We then parcelled these 4 items into the same wrong factors.
Not surprisingly, the fit statistics for the unparcelled model indicate
that swapping items substantially degraded the model's fit
(Table 1). Indeed, the fit
statistics suggest that this is an incorrect model. In contrast, the fit
statistics for the parcelled model indicate an extremely good fit with a
non-significant
2 value, a CFI of 1.00 and a RMSEA of 0.00. We
concluded that parcelling is an inappropriate technique when the intent is to
understand the interrelations among PCLR items.
|
|
|---|
Appropriate methods for modelling PCL-R scores
On grounds of empirical results, and on the grounds of theory, we argue
that a comprehensive evaluation of the measurement model underpinning the
PCLR requires the specification of three broad factors that are
underpinned by more specific testlets. On empirical grounds, this three-facet
hierarchical model with testlets achieves a close fit. Although the model is
not under-identified, as there are more observations than free model
parameters (Kline, 1998), the
model is not ideal because there are not enough indicators per testlet. The
model is, however, the best that can be achieved within the psychometric
limitations posed by the PCLR (e.g. limited item coverage, local
dependency). Even when the model is degraded by removing the testlet level,
the fit achieved is fair. On theoretical grounds, the three-facet hierarchical
model with testlets describes the broad interpersonal, affective and
behavioural features that have traditionally been linked to the clinical
construct of psychopathy. In addition, this model describes more specific
facets at the testlet level, including, for example, deceptive interpersonal
style and behavioural instability.
Is criminal behaviour central to the construct of psychopathy?
Proponents of the four-factor model(s) embrace the three Cooke & Michie
factors within their models. The point at dispute is the role of additional
items that essentially enumerate antisocial acts. None of the non-parcelled
PCLR models that add these antisocial behaviour items achieves
acceptable fit. This indicates that these models do not provide adequate
measurement models for the PCLR. This is true whether the models
involve hierarchical factors or (the less demanding) correlated factors. In
our view there is no compelling empirical evidence to support the conclusion
that antisocial behaviour is a central feature of psychopathy.
In addition, there are good conceptual (logical, theoretical) reasons for considering antisocial behaviour to be causally downstream from psychopathic personality disorder (Cooke et al, 2004; Skeem & Cooke, 2007). First, classical clinical descriptions of psychopathy do not give a central role to antisocial behaviour (Schneider, 1950; Karpman, 1961; Arieti, 1963; Cleckley, 1988). As Blackburn (2005) noted `criminal behavior was not intrinsic to Cleckley's concept' (p. 279). Indeed, Cleckley (1988), referring to the propensity to be antisocial in general, and seriously criminal in particular, indicated that `such tendencies should be regarded as the exception rather than as the rule, perhaps, as a pathologic trait independent, to a considerable degree, of the other manifestations which we regard as fundamental' (p. 262). The critical feature for Cleckley was not the occurrence of criminal behaviour in itself but rather the occurrence of criminal behaviour for which the motivation is obscure. Simple counts of criminal acts cannot address this subtlety.
Second, it is plausible that the characteristic traits of psychopathy have a functional link with antisocial behaviour. Individuals who are grandiose frequently have a strong sense of entitlement that permits them to steal from, rape and exploit others. Those who lack empathy and anxiety may fail to inhibit violent thoughts and urges. Impulsivity increases the likelihood that these individuals will carry out criminal acts without considering the consequences (Cooke et al, 2004; Skeem & Cooke, 2007).
Third, specific socially deviant acts are qualitatively different from the pervasive and persistent personality traits that underpin the three factors within the PCLR (Blackburn, 1988). As Lilienfeld (1994) noted it is important not to conflate `basic tendencies' (traits) and `characteristic adaptations' (specific acts).
Fourth, antisocial behaviour has been linked to a number of mental disorders (e.g. psychotic disorders, learning disability, substance misuse and other personality disorders). It is thus a non-specific indicator (Blackburn, 1988; Skeem & Mulvey, 2001). Theories of crime have implicated a multitude of factors in relation to antisocial behaviour (Gottfredson & Hirschi, 1990).
We would argue that there are thus strong theoretical and empirical reasons for excluding measures of criminal and antisocial acts from attempts to measure the construct of psychopathy; not least because it represents significant construct drift (Blackburn, 2005).
The importance of understanding the structure of a measure
Some commentators have argued that the three-factor model differs very
little from the two-factor model and it is of little importance whether two-,
three-, or four-factor models are used. For example, Jones, et al
(2007) expect the three- and
four-factor models to `perform alike' because the `models are quite similar'.
They opine that the decision to use either model will hinge on personal
preference or `researchers' underlying conceptualisation of psychopathy'. We
disagree. Given that the set of symptoms being modelled is the same, the
content of any derived structure will inevitably be similar. This does not
mean that the underlying structures are the same. Obtaining greater
understanding of the structural properties of a disorder can yield many
advantages (Watson et al,
1994).
First, it can serve as a starting point for the identification of fundamental psychological structures or processes. Structural research on IQ tests revealed distinct verbal and spatial factors; subsequent neuropsychological research indicated that these factors measured separate neural sub-systems (Watson et al, 1994).
Second, understanding structure can inform theories of causation: are the distinct facets products of some common underlying tendency towards psychopathy or are they not true facets but merely a number of distinct constructs without a common cause?
Third, explication of the structure can improve investigations of construct validity. If items are not grouped into unidimensional constructs, their relation to other variables in the nomological net may not be readily apparent. Associations with cognate variables based on a broad measure of a construct effectively average the associations underpinned by distinct factors; it is not clear which part of the measure drives the association, with the average association frequently being weaker than that of the strongest factor.
Fourth, an appreciation of structure can improve scales; and can provide direction on where new variables should be added to improve construct representation or removed to reduce construct-irrelevant variance (Lilienfeld, 1994; Floyd & Widaman, 1995; Little et al, 2002). Construct under-representation occurs when a measure fails to capture key aspects of the latent construct: it has been argued elsewhere, for example, that the PCLR fails to adequately assess problems of self, attachment and interpersonal style which are central to the construct of psychopathy (Cooke et al, 2006). Construct-irrelevant variance occurs when the measure captures aspects of latent constructs other than the target latent construct. It is our contention (Cooke et al, 2004; Skeem & Cooke, 2007) that the inclusion of counts of criminal and other antisocial behaviour in the PCLR represents construct-irrelevant variance.
Conclusion
The validation of a construct is never complete. Validation is important
for reasons of theory and for reasons of practice. The field is in danger of
falling into the trap of operationalism: conflating a fallible
measure of psychopathy (PCLR) with the construct of
psychopathy. Psychopathy and criminal behaviour are distinct constructs. If we
are to understand their relationships and, critically, whether they have a
functional relationship, it is essential that these constructs are
measured separately. This is particularly critical within the context of the
DSPD project, where individuals are detained because of the assumption of a
functional link between their personality disorder and the risk that they
pose. Recently, we have endeavoured to develop a more comprehensive model of
the construct of psychopathy. Using clinical informants and a
trait-descriptive adjectival approach we have identified after
numerous iterations a list of 33 symptoms that are grouped rationally
into six domains of functioning (interpersonal attachment;
behavioural; cognitive; interpersonal dominance; emotional; and self).
This model is currently being subjected to empirical evaluation.
This study is not without limitations. First, we were not able to test the various models on the data from the PCLR manual (Hare, 2003). Second, we were unable to demonstrate the chief problem inherent in correlated (rather than hierarchical) PCLR model; that any correlate, whether essential to psychopathy or not, will fit. In future research, we will determine whether adding a non-psychopathic factor (e.g. addiction) to core PCLR factors yields adequate fit indices in correlated factor models and (appropriately) poor fit indices in hierarchical models. Third, the results focus only on adult males prisoners; the generalisability of the results to other groups, including female offenders, remains unclear (Forouzan & Cooke, 2005).
|
|
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
G. Ponce, J. Hoenicka, M. A. Jimenez-Arriero, R. Rodriguez-Jimenez, M. Aragues, N. Martin-Sune, E. Huertas, and T. Palomo DRD2 and ANKK1 genotype in alcohol-dependent patients with psychopathic traits: association and interaction study The British Journal of Psychiatry, August 1, 2008; 193(2): 121 - 125. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Vitacco Psychopathy The British Journal of Psychiatry, October 1, 2007; 191(4): 357 - 357. [Full Text] [PDF] |
||||
![]() |
C. Neumann Psychopathy The British Journal of Psychiatry, October 1, 2007; 191(4): 357 - 358. [Full Text] [PDF] |
||||
![]() |
P. E. Mullen Dangerous and severe personality disorder and in need of treatment The British Journal of Psychiatry, May 1, 2007; 190(49): s3 - s7. [Abstract] [Full Text] [PDF] |
||||
Read all eLetters
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||