Ethical and Professional Issues in Psychology Testing

profileEverleigh
KossonetalFactorStructureoftheHarePsychopathyChecklist.pdf

Title:

Database:

Factor structure of the Hare Psychopathy Checklist: Youth Version (PCL:YV) in adolescent females. By: Kosson, David S., Neumann, Craig S., Forth, Adelle E., Salekin, Randall T., Hare, Robert D., Krischer, Maya K., Sevecke, Kathrin, Psychological Assessment, 10403590, 20130301, Vol. 25, Issue 1

APA PsycArticles

Factor Structure of the Hare Psychopathy Checklist: Youth Version (PCL:YV) in Adolescent Females

By: David S. Kosson Department of Psychology, Rosalind Franklin University of Medicine and Science; Craig S. Neumann Department of Psychology, University of North Texas Adelle E. Forth Department of Psychology, Carleton University Randall T. Salekin Department of Psychology, University of Alabama Robert D. Hare Department of Psychology, University of British Columbia, Vancouver, British Columbia, Canada Maya K. Krischer Department of Child and Adolescent Psychiatry, University of Cologne, Cologne, Germany Kathrin Sevecke

!Listen"

UAGC Library

Department of Child and Adolescent Psychiatry, University of Cologne, Cologne, Germany Acknowledgement: Adelle E. Forth, David S. Kosson, and Robert D. Hare are coauthors of the Psychopathy Checklist: Youth Version (PCL:YV), published by Multi-Health Systems, 3770 Victoria Park Avenue, Toronto, Ontario, Canada M2H 3M6 and receive royalties from its sale.

We thank Henrik Andershed, Jacqueline Das, Heather Gretton, Derek Indoe, Malin Hemphälä, Sheilagh Hodgins, Kathleen Lewis, Roy O'Shaughnessy, R. Rowe, Corine de Ruiter, Fred Schmidt, Anders Tengström, Anita Thapar, and Todd Willoughby for providing much of the data analyzed in these studies. Several of the samples examined here are described in greater detail in the PCL:YV manual (Forth, Kosson, & Hare, 2003). The collection of some of the data examined in this study was supported by National Institute of Mental Health Grant MH49111 to David S. Kosson; by funding from the William H. Donner Foundation to Craig S. Neumann; by grants from the Centers for Disease Control, the Department of Justice, and the Department of Youth Services to Randall T. Salekin; and by a grant from the Alexander Humboldt Foundation to Kathrin Sevecke.

Models of the factor structure of psychological tests play a critical role in understanding and validating the constructs they are designed to assess. Scores on subsets of items for a measure that cohere similarly in diverse and independent samples provide evidence for the generalizability of the construct being measured. Evidence that a pattern of covariances is consistent with theoretical expectations makes an important contribution to construct validation (Strauss & Smith, 2009). Factor models that generalize across different kinds of samples provide a foundation for subsequent scientific studies that examine whether these dimensions are characterized by similar nomological networks across samples. Such studies, in turn, can be used to test hypotheses about the mechanisms underlying the components of a syndrome.

Psychopathy is a severe syndrome of personality pathology that is widely associated with callous and manipulative interpersonal behavior as well as impulsive and irresponsible antisocial behavior. The standard clinical measures of the psychopathy construct are the Hare Psychopathy Checklist (PCL) scales that ask raters to make inferences about underlying dispositions by integrating information from interviews, behavioral observations, and file or other collateral material (Hare & Neumann, 2009). These scales include the Psychopathy Checklist— Revised (PCL-R; Hare, 2003), the Psychopathy Checklist: Screening Version (PCL:SV; Hart, Cox, & Hare, 1995), and the Psychopathy Checklist: Youth Version (PCL:YV; Forth, Kosson, & Hare, 2003).

Several factor models of the PCL scales have been proposed. The four-factor model suggests

that individual differences in the dispositions that comprise psychopathy are underlain by differences in one or more of four correlated dimensions that reflect specific interpersonal, affective, lifestyle, and antisocial features. Evidence corroborating this model comes from confirmatory factor analyses (CFAs) of PCL scores in a variety of forensic, clinical, and community populations (e.g., Babiak, Neumann, & Hare, 2010; Hare & Neumann, 2008; Neumann & Hare, 2008). This pattern of strong correlations has been explained by a second- order general factor (Neumann, Hare, & Newman, 2007; Neumann, Kosson, Forth, & Hare, 2006) said to reflect the superordinate syndrome of psychopathic personality. This interpretation is consistent with behavior genetic research that has shown that four psychopathy factors similar to the PCL factors can all be accounted for by a common genetic trait (Larsson, Andershed, & Lichtenstein, 2006).

The three-factor model (Cooke & Michie, 2001) is identical with respect to the first three dimensions of the four-factor model but omits the antisocial features dimension (and the five items that load on that component). Because tests of both the three- and four-factor PCL models of psychopathy often yield acceptable fit in adult and adolescent males (Cauffman, Kimonis, Dmitrevia, & Monahan, 2009; Neumann, Hare, & Newman, 2007; Neumann et al., 2006; Salekin, Brannen, Zalot, Leistico, & Neumann, 2006), these models are currently the dominant models for the internal structure of psychopathy based on clinical measures.

In contrast, the fit of these factor models in female samples is more controversial. Among adult women, Warren et al. (2003) reported good fit for the two- and three-factor models. Similarly, Neuman, Hare, and Newman (2007) reported good fit for the four-factor model in both male and female adult inmates, whereas Salekin, Rogers, and Sewell (1997) suggested the factor structure is somewhat different in women than men. Bolt, Hare, Vitale, and Newman (2004) conducted item analyses in large samples of adult male and female offenders and reported that scalar equivalence may hold, at least approximately, for male and female offenders, in spite of some evidence for differential test functioning and for differential item functioning on some lifestyle and antisocial dimension items.

Vitale and Newman (2001b) noted that most prior factor analytic studies have involved small samples that may have provided inadequate power for examining factor structure. They emphasized the need for researchers to conduct studies with larger samples of females. Examining adolescents, Forth et al. (2003) reported acceptable fit for the three-factor model in a sample of female adolescents, whereas the four-factor model achieved acceptable fit only on the absolute fit indices examined. However, their sample (based on six different subsamples) included only 147 girls. Consequently, analyses were likely underpowered for evaluating both

models. In addition, Forth et al. did not subdivide the sample to examine fit separately for incarcerated versus probation samples of girls or for samples from different parts of the world.

Subsequent studies have also yielded conflicting findings. Jones, Cauffman, Miller, and Mulvey (2006) reported reasonable fit for the three- and four-factor models in girls but only after making minor changes to the factor structures that have not been evaluated in other studies. In contrast, Sevecke, Pukrop, Kosson, and Krischer (2009) reported that neither the three-factor nor the four-factor model yielded generally acceptable fit among incarcerated German adolescent females. In an item response theory analysis, Schrum and Salekin (2006) reported that some of the same items that are most discriminating in male samples were most discriminating in a sample of female youths. However, they noted that some items were more or less discriminating in girls than in boys. Despite a few recent studies the relative dearth of research in this area is of concern because of the potential differences in measurement structure and in the correlates of constructs that can occur across sex. Although prior studies provide some information about the factor structure of PCL:YV psychopathy in specific settings and locations, the small size of these samples is likely to work against obtaining good fit for both the three- and four-factor models.

The possibility of a different factor structure for girls than for boys is especially interesting in light of evidence that some of the correlates of scores (on clinical measures of psychopathy) in males do not consistently generalize to female samples. For example, associations between psychopathic traits and response modulation deficits (Vitale & Newman, 2001a; cf. Vitale, Brinkley, Hiatt, & Newman, 2007) and affective modulation of startle reflexes (Sutton, Vitale, & Newman, 2002) appear less consistent in adult female than male samples. Although some studies have yielded relatively similar patterns of correlations for psychopathy ratings in females and in males (Kennealy, Hicks, & Patrick, 2007; Stockdale, Olver, & Wong, 2010) or patterns of correlations in females similar to those previously reported for males (Bauer, Whitman, & Kosson, 2011; Penney & Moretti, 2007), other studies have cast doubt on the construct validity of PCL:YV scores among female adolescents (Odgers, Reppucci, & Moretti, 2005; Vincent, Odgers, McCormick, & Corrado, 2008).

It is important to keep in mind that all of the findings on the factor structure of psychopathy reviewed above are based on PCL measures of psychopathy. Factor analytic studies can only provide evidence on the structure of a construct as assessed using a specific measure. Even so, evidence that the factor structure differs for girls and boys when psychopathy is assessed with the PCL:YV would suggest the possibility that some of the differences in behavioral and physiological correlates of psychopathy ratings may reflect differences in the nature of the psychopathy construct in girls. In brief, evidence that the symptoms of psychopathy (as

assessed by clinical measures) cohere differently for girls than boys would suggest that different features may be critical to the expression of psychopathy in girls and would increase the plausibility of the perspective that different mechanisms may account for the appearance of these symptom dimensions.

In contrast, evidence for a similar underlying factor structure would suggest that a psychopathy measure is performing similarly in boys and girls. To the extent that the symptoms examined cohere in similar ways across sex, it becomes more likely that a pattern of similar correlations between psychopathy ratings and external criteria reflects similar underlying mechanisms. Although evidence for similarity in internal structure does not invalidate the differences reported in correlational studies, it would be consistent with the possibility that similar mechanisms may account for those relationships between psychopathy and external criteria that are similar in males and females.

As noted above, one of the chief limitations of prior factor analytic investigations in females has been the use of small samples. Small samples can lead to poor fit, even though the fit might be quite good when examined within large enough samples. Another limitation of prior studies is that none of the above mentioned studies compared factor models across youth in different countries or continents, and no prior studies have compared the fit of the different models in different kinds of settings.

The Current Study

The primary goal of the present study was to test the factor structure underlying PCL:YV-based psychopathy in a large sample of adolescent females. We assembled data from a large number of prior published studies that used the PCL:YV with adolescent girls. This provided us with a relatively large data set of 776 adolescent females (646 with no missing values). In examining this large sample, we hoped to provide greater clarity on the factor structure of psychopathy in adolescent females.

A secondary goal of this study was to evaluate the fit of the best fitting models in more homogeneous subgroups of participants and to assess whether the models demonstrated invariance for subsamples of participants assessed in different continents and participants assessed in different settings. We conducted both separate confirmatory factor analyses (CFAs) for North American and European subsamples as well as multiple-group CFAs to evaluate whether the models provided good explanations for the pattern of item-to-factor relationships in samples in different continents; we conducted a parallel set of separate CFAs and multiple-

group CFAs to address the same issues for incarcerated adolescent females versus those in less restrictive settings, including probation and short-term detention/evaluation centers.

We realized that there are important cultural differences between the different countries within North America and within Europe. However, an analysis of factor loadings in North American versus European samples provides an initial examination of whether the factor models are characterized by structural invariance across geographic region. Similarly, because youths who commit more frequent and more serious crimes and perform poorly under conditional release are likely to be sent to more restrictive settings, it is likely such settings will include a higher proportion of youths with many psychopathic features. Consequently, an analysis of the fit of the models for incarcerated adolescents versus adolescents in less restrictive (short-term detention, community probation, and clinic) settings provides a preliminary assessment of invariance across setting.

Method

Participants Data on 776 adolescent females were made available to the authors. These participants had participated in 14 different studies with independent samples; findings for a combination of the data from five of these samples were previously reported in the PCL:YV technical manual (Forth et al., 2003). Basic descriptive information about the participants from each sample is listed in Table 1.

Participants

Measure Psychopathy Checklist: Youth Version (PCL:YV)

The PCL:YV is a multi-item rating scale that assesses interpersonal and affective characteristics as well as overt behaviors associated with psychopathy. The measure is designed to be completed by trained observers who rate the presence of each trait disposition on the basis of a semistructured interview and a review of case history information or other collateral source(s). Ratings based on both interview and collateral data are described as obtained using the standard assessment method. The PCL:YV manual also permits the use of files only to complete the instrument but suggests caution in interpreting file-only scores, as the file-only method commonly provides substantially less information for scoring several of the interpersonal and affective items. Even so, prior factor analytic studies indicate acceptable fit for both PCL-R scores and PCL:YV scores completed solely on the basis of institutional files (Bolt et al., 2004; Forth et al., 2003). Scores of 0 (consistently absent), 1 (inconsistent), or 2 (consistently present) for each item of the PCL:YV reflect inferences about the consistency of the specific tendency or disposition across different situations and sources of information.

Scores on the PCL:YV have demonstrated internal consistency, with alpha coefficients ranging from .79 to .94 for total scores and mean interitem correlations ranging from .44 to .63 (Forth et al., 2003; Vitacco, Neumann, & Caldwell, 2010). Alphas for factor scores have ranged from .68 to .77 in the validation sample (Forth et al., 2003) and from .50 to .82 in smaller samples (Andershed, Hodgins, & Tengström, 2007; Vitacco et al., 2010; Vitacco, Neumann, Caldwell, Leistico, & Van Rybroek, 2006) with one exception (Skeem & Cauffman, 2003). Lower alphas for factor scores are expected in light of the number of items that contribute to each factor. Researchers have obtained good to excellent interrater reliability for total scores (ICCs range from .82 to .98; see Andershed et al., 2007; Cauffman et al., 2009; Das, de Ruiter, Doreleijers, & Hillege, 2009; Forth et al., 2003). The interrater reliability for factor scores is more variable, ranging from .43 to .86 (Forth et al., 2003; Skeem & Cauffman, 2003). PCL:YV scores correlate moderately with indices of externalizing psychopathology, instrumental violence, criminal activity, and antisocial behavior and predict recidivism in male adolescents (Flight & Forth, 2007; Kosson, Cyterski, Steuerwald, Neumann, & Walker-Matthews, 2002; Kubak & Salekin, 2009; Murrie, Cornell, Kaplan, McConville, & Levy-Elkon, 2004; Salekin, 2008; Salekin, Neumann, Leistico, DiCicco, & Duros, 2004; Schmidt, McKinnon, Chattha, & Brownlee, 2006; Vitacco et al., 2006, 2010).

Data Analysis

Confirmatory factor analyses test the fit of specific models for the latent structure underlying variation on observed indicators. Such analyses require that investigators first specify the number of latent factors, the relationships between indicators and factors, and the factor variances and covariances within a model and then statistically test the adequacy of their model in terms of standard model fit criteria. To test alternative latent structures of the PCL:YV, we carried out CFAs with Mplus (Version 5; Muthén & Muthén, 1998–2007), using the robust (mean- and variance-adjusted) weighted least squares (WLSMV) estimator recommended for use with ordinal data such as PCL:YV items (Flora & Curran, 2004; Neumann, Kosson, & Salekin, 2007).

Because each fit index has limitations, and there are no agreed upon methods for definitively determining quality of fit (Kline, 1998), adequacy of fit for each model was estimated using several measures. Because the chi-square is usually significant with large samples, investigators typically rely on other fit indices to assess the adequacy of a model. We calculated two widely validated relative fit indices: the comparative fit index (CFI) and the Tucker–Lewis index (TLI). The TLI and the CFI are incremental fit measures comparing the estimated model with a null or independence model; the TLI tends to be more adversely affected by the estimation of additional parameters that do not improve model fit and is less sensitive to sample size than many other relative fit indices (Marsh, Balla, & McDonald, 1988). For these indices, larger values indicate better fit of the hypothesized model (a conventional standard is 0.9 or above for acceptable fit, 0.95 or above for excellent fit; Hu & Bentler, 1995; Kline, 1998).

We also examined two absolute fit indices: the root-mean-square error of approximation (RMSEA) and the standardized root-mean-square residual (SRMR). Absolute indices gauge how well the model-generated covariance matrix reproduces the sample covariance matrix. Smaller absolute fit indices, and thus smaller residual error values, indicate better fit. The RMSEA is an index that also rewards model parsimony (T. A. Brown, 2006), whereas the SRMR appears to be an especially sensitive indicator of poor fit. Moreover, because the SRMR and RMSEA provide relatively different approaches to estimating absolute fit, they provide somewhat more independent assessments of fit than some indices (Hu & Bentler, 1995). For these indices, Hu and Bentler (1995) suggested that scores below .05 indicate good fit, whereas scores between .05 and .08 indicate acceptable fit, and values above 1.0 indicate poor fit (Browne & Cudeck, 1993; Hoyle, 1995; Marsh, Hau, & Wen, 2004; MacCallum, Browne, & Sugawara, 1996).

An important caveat to the use of multiple fit indices is that, as model complexity increases, so does the size of the sample needed to test the model and the difficulty of achieving conventional

levels of model fit (Marsh et al., 2004). Put another way, if a more complex model displays approximately similar fit to a less-complex model, then the former is said to have survived a riskier test (Vitacco, Neumann, & Jackson, 2005; Vitacco, Rogers, Neumann, Harrison, & Vincent, 2005). However, given the adequate size of our full sample for all models examined, models were considered to fit adequately only if they indicated at least fair to acceptable fit on all four primary fit indices examined. Because the subsamples were only about half as large as the full sample from which they were drawn, we required at least fair to acceptable fit on three or more fit indices to indicate acceptable fit for subsample analyses.

As in most recent tests of the three-factor model, we did not test the original three-factor model that uses testlets, because prior studies conducted in several samples have shown that the model with testlets results in untenable solutions with impossible values (i.e., negative variance; Kosson et al., 2002; Salekin et al., 2006). For this reason, we examined a three-factor model without testlets (i.e., allowing the latent variables to load directly onto the PCL:YV items). It is this three-factor model that has achieved good fit in prior CFAs of adolescents (e.g., Neumann et al., 2006; Sevecke et al., 2009). Because the three-factor model is based on a different set of items and therefore a different covariance matrix than is the four-factor model, direct statistical comparison of these models is not possible (T. A. Brown, 2006; Kline, 1998).

From a mathematical modeling perspective, the three-factor model is less parsimonious than the four-factor model because it requires estimating 29 parameters to model 91 data points (df = 91–29 = 62), whereas the four-factor model uses only 42 free parameters to explain 171 data points (df = 171–42 = 129). However, this study was not designed to examine the relative merits of these two factor models but to examine their fit in a large sample of adolescent females and to examine whether there was evidence for invariance of these two models across different settings and for samples from Europe versus North America. Moreover, because the three-factor model is contained within the four-factor model, the two models are quite similar in most respects. The chief difference between the models concerns the nature of the antisocial dimension of the psychopathy construct (and the five items that load on this latent factor; this issue is addressed elsewhere, e.g., Cooke, Michie, Hart, & Clark, 2004; Hare & Neumann, 2008, 2010).

Results

Preliminary Analyses Principal analyses were based on participants with complete data. To ensure that the same participants were included in three-factor and four-factor analyses, only participants with

complete data for the 18 items needed for the four-factor analyses were included in these analyses. Complete data for the 18 items were available for 646 adolescent females (369 incarcerated females and 277 females drawn from less restrictive settings, i.e., probation, detention centers, and clinics).

Supplementary analyses were also conducted including cases containing missing values. These analyses included 776 adolescent females (423 incarcerated and 353 probation/detention/clinic adolescents). To test the assumption of full information CFAs that missing data were missing at random (i.e., that there are no systematic reasons why some items were not scored for some participants), we first conducted analyses to ascertain whether missing data covaried with differences on demographic variables. Chi-square analyses revealed that missing values were more prevalent in North American than in European data sets, χ (1) = 94.35, p < .001, and were more prevalent among incarcerated youth than among youth in less restrictive settings, χ (1) = 10.60, p = .001. Missingness was also much more likely for file-only than for standard (interview plus file) data, χ (1) = 37.56, p < .001. Although ethnicity was only available for 342 cases, missing values were also more prevalent among Caucasian and Native North American adolescents than among African or Latina adolescents, χ (3) = 87.22, p < .001. Based on these analyses, we focus on analyses for samples including no missing values.

Confirmatory Factor Analyses of the PCL:YV in the Full Sample of Female Adolescents We first examined the fit of the various factor models in the full sample. The one-factor and two- factor models were examined using the same 18 items as in the four-factor model. The CFA for the one-factor model indicated unacceptable fit for two of the four primary fit indices examined, χ (78) = 699.37, p < .001, CFI = .80, TLI = .90, RMSEA = .111, SRMR = .087. The CFA for the two-factor model also indicated unacceptable fit for the CFI and only fair to acceptable fit for the RMSEA, with adequate fit on the other two indices examined, χ (79) = 498.36, p < .001, CFI = 0.86, TLI = 0.93, RMSEA = 0.091, SRMR = 0.074.

In contrast, both the three-factor and four-factor model yielded adequate fit for both relative fit indices and for both absolute fit indices. Figure 1 displays the standardized parameters for these two models. The CFA results for the three- and four-factor models for the full sample and each of the subsamples are shown in Table 2. The CFAs also showed that the factors were correlated as expected. The latent correlations between the interpersonal and the affective, lifestyle, and antisocial dimensions were .68, .63, and .56; the correlations between the affective and lifestyle and antisocial dimensions were .73 and .66; and the correlation between the lifestyle and antisocial dimensions was .83.

2

2

2

2

2

2

Figure 1. Factor loadings, factor covariances, and fit indices for the four-factor model, full sample analysis in adolescent females (N = 646). behav. = behavior; CFI = comparative fit

index; TLI = Tucker–Lewis index; RMSEA = root-mean-square error of approximation; SRMR = standardized root-mean-square residual.

Confirmatory Factor Analysis Model Fit Results

Confirmatory Factor Analyses of the PCL:YV in Subsamples of Female Adolescents Both models also yielded consistently adequate fit in the North American subsample. In fact,

model fit was in the good to excellent range for both of the relative fit indices and in the reasonable range for both absolute fit indices for both models, despite a substantial drop in sample size from 646 to 285 cases (which makes the size of the subsamples suboptimal for assessing the fit of the four-factor model). In contrast, only the three-factor model yielded consistently acceptable fit in the European subsample. The fit of the four-factor model was in the fair to acceptable range for three of four indices examined but slipped barely below acceptable levels for the CFI.

Both models also yielded generally acceptable fit for the subsample of youth in less restrictive (i.e., probation, detention, and clinic) settings. In brief, the fit was acceptable to good for both models for both measures of relative fit and for the SRMR. However, the RMSEA yielded only fair fit for both models in this subsample. In contrast, only the three-factor model yielded generally acceptable fit for the subsample of incarcerated girls. Only the CFI slipped below acceptable levels for this model. For the four-factor model, both relative fit indices were unacceptably low, and both indices of absolute fit indicated only fair fit.

In summary, both the three- and four-factor models had good fit to the data (i.e., were able to reproduce the observed item covariance structure with adequate precision) in the full sample, and both models provided reasonable fit among North American girls. Even among European girls, both models generally provided at least fair to reasonable fit. Similarly, with the exception of the RMSEA, both models provided adequate levels of fit among girls on probation and in detention. The only subsample in which the pattern of findings suggested fit below conventional levels of acceptability was the incarcerated subsample, for which the four-factor model yielded unacceptable fit on two indices and only fair to acceptable fit on the other two indices examined.

Tests of Model Invariance Between North American and European Samples To test whether the three-factor model fit equally well in North American and European adolescent females, we conducted two multiple-group CFAs (MGCFAs) and compared the fit for the two models using a chi-square difference test. First, we allowed Mplus to freely estimate all model parameters separately by sample (i.e., factor loadings and item thresholds), fixing scale factor values at 1.0, factor means at 0, and factor variances at 1, as is the default in Mplus in multigroup analyses when using the default delta parameterization. Under these conditions, the model yielded evidence of acceptable fit on all three indices other than the chi-square, χ (80, N = 646) = 216.89, p < .001, CFI = 0.938, TLI = 0.964, RMSEA = 0.073. Next, we repeated the analysis constraining the loadings (but not the thresholds) to be equal in the two groups. This model yielded similar evidence of good fit across the two subsamples on relative fit indices and acceptable fit on an absolute fit index, χ (78, N = 646) = 289.82, p < .001, CFI = 0.941, TLI =

2

2

0.963, RMSEA = 0.074. However, the model yielded poorer fit than the less constrained model, as evidenced by a significant chi-square difference test, χ (9) = 26.65, p = .002. To examine whether the lack of invariance could be attributable to a single item loading differently, we reconducted the MGCFA allowing different loadings on PCL:YV Item 9, Parasitic Orientation, the item for which the multigroup CFA estimating the loadings separately had suggested the most disparity in item loadings (see Table 3). This analysis yielded a nonsignificant chi-square difference test, χ (8) = 13.16, p = .11. In sum, the generally reasonable fit for the three-factor model with constraints on the loadings in the two samples provides evidence of structural invariance. Moreover, it was possible to obtain evidence of structural invariance as indicated by good fit for a model requiring equal loadings on 12 of the 13 item indicators across the North American and European subsamples.

Factor Loadings and Thresholds for the Four-Factor Model in Multigroup Analyses in Which Item Loadings Are Estimated Separately in Different Subsamples

2

2

We also examined whether the four-factor model fit equally well in the North American and European samples. Again, the unconstrained model yielded evidence of acceptable fit on all indices, χ (132, N = 646) = 411.24, p < .001, CFI = 0.908, TLI = 0.950, RMSEA = 0.081. Once again, the model constrained to have equal loadings (but not thresholds) also yielded acceptable fit on both relative and absolute fit indices, χ (117) = 362.15, p < .001, CFI = 0.919, TLI = 0.951, RMSEA = 0.081. However, as for the three-factor model, the chi-square difference test indicated poorer fit for the constrained model, χ (13) = 39.16, p < .001. As above, we examined whether we could achieve structural invariance on all but one item by allowing the loadings to vary for one indicator. Once again, the MGCFA suggested the most disparate loadings were for PCL:YV Item 9. In this case, the MGCFA allowing separate thresholds in the two samples but equal loadings (except on Item 9) yielded a chi-square difference test that was still significant, χ (12) = 23.19, p = .03. Similarly, allowing different loadings on two items (Items 9 and 20) also did not eliminate the lack of invariance, χ (11) = 21.59, p = .03. In short, allowing for different loadings on one or two of 18 items did not result in a nonsignificant chi-square for the four-factor model.

In order to examine whether estimated factor means differed in the European versus North American subsamples, we also conducted multigroup analyses using mean structures by allowing Mplus to hold the item indicator thresholds equal across groups and, by setting mean levels of each factor to 0 for the North American subsample, estimating mean levels of the latent factors separately in the European subsample. Similar to the analyses summarized above, these analyses indicated a lack of invariance across the samples, reflecting in part the fact that the items are discriminating at different levels of the underlying factors in each set of subsamples examined (e.g., European vs. North American). As expected, these analyses also indicated higher latent mean levels of several PCL factors in the North American sample (means by default set to zero) relative to the European subsample: European subsample Ms = −0.36 and −0.43 for the affective and antisocial factors, respectively (zs = 4.01, 4.98, ps < .001), with a similar difference approaching significance for the lifestyle factor, European subsample latent M = −0.17 (z = 1.79, p = .07). Interestingly, the European latent mean for the interpersonal factor was nonsignificantly higher than that for the North American subsample (M = +0.13, z = 1.26, ns).

Tests of Model Invariance Between Incarcerated and Detention/Probation/Clinic Samples A similar set of analyses was conducted to assess measurement invariance as a function of setting, although, as noted above, the sample size for these comparisons was suboptimal for assessing the four-factor model. Once again, the MGCFA for the three-factor model allowing separate estimation of parameters across the two settings demonstrated acceptable fit on both

2

2

2

2

2

2

relative and absolute fit indices, χ (79) = 251.20, p < .001, CFI = 0.912, TLI = 0.943, RMSEA = 0.082. Once again, the analysis that required equal loadings also provided a generally reasonable fit to the data, χ (76) = 226.60, p < .001, CFI = 0.923, TLI = 0.949, RMSEA = 0.078, although its slightly poorer fit was confirmed by a significant chi-square difference test, χ (9) = 19.03, p = .02. Allowing the two subsamples to differ in the loading for one item, Item 5, was sufficient to yield a nonsignificant chi-square difference test, χ (8) = 10.21, p = .25. In summary, these analyses indicated not only configural invariance but a moderate degree of metric invariance for the three-factor model across setting.

Results were similar for the four-factor model. The unconstrained model suggested generally acceptable fit across the two settings, χ (129) = 469.53, p < .001, CFI = 0.874, TLI = 0.922, RMSEA = 0.090. However, it is noteworthy that this was the only unconstrained model in which an absolute or relative fit index (in this case, the CFI) fell below conventional levels of acceptability. In this case, the model requiring equal loadings (but not thresholds) yielded generally fair to acceptable fit, χ (121) = 449.46, p < .001, CFI = 0.878, TLI = 0.920, RMSEA = 0.092, again with the exception of the CFI. In addition, the chi-square difference test demonstrated that the fit was poorer when the loadings were required to be equal across the two settings, χ (13) = 53.71, p < .001. Table 3 shows that the indicator loadings for the two subsamples had appeared relatively discrepant across items on several factors when the loadings and thresholds were estimated separately in the two samples. In this case, allowing separate loadings on one or two items was not sufficient to produce invariance, suggesting that the differences in the loadings between the two subsamples are more numerous than for the other models and subsamples examined, albeit within the context that the samples were somewhat smaller than is recommended for conducting such analyses.

An additional MGCFA using mean structures was conducted to examine whether estimated factor means differed in the incarcerated versus the clinic/detention subsamples. In this analysis, mean levels of each factor were set to 0 for the incarcerated subsample. This analysis yielded estimated factor means in the clinic/detention/probation sample as follows: for the interpersonal factor, −0.86, z = 8.07; for the affective factor, −0.62, z = 6.84; for the lifestyle factor, −0.70, z = 7.48; for the antisocial factor, −0.86, z = 9.01 (all ps < .001).

Discussion

The principal aim of this study was to examine the fit of the three- and four-factor models of psychopathy as assessed with the PCL:YV in a large sample of adolescent females. Because no large-scale analyses have previously been reported, this study was designed to provide

2

2

2

2

2

2

2

greater clarity on the factor structure of the PCL:YV in adolescent females. Analyses revealed consistently acceptable fit for both models in this large sample. More specifically, the TLI and SRMR suggest good to excellent fit for both models, and the CFI and RMSEA indicate acceptable, although not excellent, fit for both models. Given that no prior studies had included as many participants as is recommended given the number of parameters to be estimated, these findings demonstrate that, when sample size and power are adequate, these models provide a good explanation for the pattern of intercorrelations among PCL:YV item scores.

Integrating these findings with large-scale tests of factor structure using PCL-based measures in other kinds of samples, there is now evidence that both the three- and four-factor models achieve acceptable fit in adult males, in most studies of adult females, in adolescent males, and in adolescent females (Babiak et al., 2010; Cauffman et al., 2009; Hare & Neumann, 2008; Jackson, Neumann, & Vitacco, 2007; Neumann & Hare, 2008; Neumann et al., 2006; Salekin et al., 2006; Vitacco, Neumann, & Jackson, 2005). Taken together, these findings indicate that the factor structure of psychopathy as assessed by PCL measures is relatively robust. These results are important because some have argued, based on small sample studies, that the four- factor model does not fit. Moreover, it is important to keep in mind that, in contrast to studies that pit theoretical predictions against a null hypothesis, increases in statistical power in model-fitting analyses do not substantially increase the likelihood of obtaining results that corroborate a theory (e.g., see Rodgers, 2010). In other words, good fit is not a simple function of sample size. Rather, adequate sample size simply ensures a powerful test of the adequacy of a model.

As noted earlier, factor analytic studies cannot provide direct tests of the superiority of one of these factor models to the other (T. A. Brown, 2006). Moreover, the indirect evidence regarding model fit indicates that, at this level, both models provide adequate fit. Thus, the relative value of one versus the other model must be decided based on other criteria (cf. Neumann, Hare, & Newman, 2007; Neumann, Vitacco, Hare, & Wupperman, 2005; Vitacco, Neumann, & Jackson, 2005, 2006, 2010). Hare and Neumann (2010) and Vitacco, Neumann, & Jackson (2005) have raised questions about the methods reported by Cooke and Michie (2001) for selecting the 13 items in the three-factor model and the criteria for excluding “antisocial” items. Given that the four-factor model subsumes the three-factor model, these findings demonstrate substantial evidence that it provides a powerful conceptual architecture for understanding the correlates and mechanisms underlying psychopathy as well as for testing the predictive validity of psychopathy scores. However, research should also examine the CFA results of subsamples to determine whether there are differences across subsamples of psychopathic youth. We discuss these issues below.

The Internal Structure of PCL:YV Psychopathy in Subsamples The subsample analyses suggest that both the three-factor and four-factor models also provide a reasonable representation of PCL:YV item score intercorrelations in most of the smaller subsamples examined. It must be emphasized that although these subsamples were larger than those in prior factor analytic studies of the PCL:YV in girls, they were smaller than is recommended for testing the four-factor model. Because the three- and four-factor models estimate 29 and 42 free parameters, tests of these models should include at least approximately 300 and 420 subjects, respectively (using a 10:1 ratio of subjects-to-free parameters; Bentler, 1980). Therefore, our subsamples of approximately 300 (Ns = 277 to 369) should have been adequate to evaluate the three-factor model but may have been somewhat underpowered with respect to the four-factor model.

That the models yielded evidence of adequate fit in the less restrictive (probation/detention/clinic) sample and in the North American sample and generally adequate fit in the European sample provides evidence for the robustness of these models. There was also some evidence for invariance of the three-factor model (i.e., allowing only one item loading to freely vary), but this same situation was not evident for the four-factor model. It is important to keep in mind that these analyses nevertheless allowed the groups to differ in their thresholds. Whereas differences in the pattern of indicator-to-factor loadings are commonly interpreted as indicating differences in factor structure, differences in thresholds refer to distinctions in the levels of the underlying latent constructs at which the items are maximally discriminating. Overall, the findings here showed that levels of psychopathy tend to be higher in prisons (i.e., settings involving long-term incarceration) than in community and detention (short-term incarceration) settings (Forth et al., 2003). Similarly, levels of psychopathic traits appear to be higher in North American than in European settings (Sullivan & Kosson, 2006; Verona, Sadeh, & Javdani, 2010). In other words, higher levels of psychopathic traits must be present, on average, in individuals from North American and prisons samples before the items provide information (discrimination) on those with (vs. without) psychopathic personality features.

In spite of the generally acceptable fit for both models for subsamples in different continents and across settings, the multigroup analyses also demonstrated that the fit was better when the different subsamples were allowed to differ in some item loadings. Even allowing the indicator- to-factor loadings to differ for one or two items was sufficient to render the chi-square difference test (regarding constrained vs. unconstrained models) nonsignificant for the three-factor model. Consequently, some of the item loadings are not identical across geographical region and setting. Yet most of the differences in item-to-factor loadings that we observed are small enough that they do not result in significant differences in overall model fit. However, in each case, there

was at least one PCL:YV item for which the difference in loadings was substantial enough to produce a significant chi-square difference test, unless the loadings on this item were permitted to vary across groups. In summary, in most cases there were apparent differences in some loadings (as shown in Table 3), but these differences were not sufficient to produce a lack of invariance.

In contrast, even allowing the loadings on one or two items to vary across groups was not sufficient to obtain evidence of structural invariance for the four-factor model. In this case, the multigroup analysis continued to demonstrate a lack of invariance, even when loadings were estimated separately for several indicators. For example, as shown in Table 3, there appear to be important differences in the loadings of PCL:YV items associated with several different psychopathy dimensions in settings involving long-term incarceration versus those involving short-term incarceration.

In spite of the suboptimal size of the subsamples for assessing the fit of the four-factor model, the relatively weaker fit for the four-factor model in the incarcerated subsample than in the other subsamples examined (i.e., acceptable on absolute fit indices but not relative fit indices) and the lack of invariance for the four-factor model across continents and across more restrictive and less restrictive settings merit discussion. The lack of invariance in the multigroup analysis appears to reflect the fact that there were several items with different factor loadings in the long- term versus shorter-term incarceration and community samples. It is possible that both these findings reflect the relatively poor fit of the four-factor model in a single but relatively large sample (i.e., the incarcerated sample of Sevecke et al., 2009). Alternatively, it is possible that the four-factor model does not fit as well among incarcerated girls as among probation and detention and clinic girls or among samples that collapse across settings. However, this latter possibility appears to us less parsimonious given the general consistency of large sample analyses discussed above. Alternatively, the lack of invariance may reflect the possibility of fundamental differences in how well some antisocial and lifestyle items discriminate in different groups of individuals (for more information, see Mokros et al., 2010). Given that the subsamples were smaller than recommended for evaluating the fit of the four-factor model, the only way to resolve these issues would be to conduct an analysis of a large sample of incarcerated female adolescents (i.e., including 420 or more incarcerated girls).

Limitations As discussed in the Introduction, factor analytic studies can make a valuable contribution to the construct validation enterprise (Strauss & Smith, 2009). Evidence that a structural model accounts for the pattern of covariances among item scores in a new population and, thus, can

be generalized to the pattern observed in other populations provides powerful evidence that the larger construct that a given measure assesses is similar across the two populations. Thus, our results highlighting that scores on PCL factor indicators covaried in similar ways in adolescent girls versus what has been found with other samples suggests that the PCL-based conceptualization of psychopathy is likely similar in adolescent girls, compared with other diverse samples. At the same time, factor analytic studies have notable limitations with respect to construct validity research. Evidence for patterns of similar coherence among item scores and similar covariance among factor scores does not ensure that the underlying construct is the same. Consequently, it remains possible that the nomological network surrounding psychopathy and the four (or three) components of psychopathy is different in critical ways in adolescent females than in adolescent males.

Other types of studies and research designs are necessary to evaluate relationships between psychopathy (and psychopathy components) and the quasi-criteria linked to psychopathy in adults and in adolescent males. Even so, the existence of similar internal structure in adolescent females assessed with the PCL:YV provides a foundation that permits clearer interpretation of the pattern of correlations with theoretically informed criteria in studies of adolescent females. To the extent that item scores on the different PCL:YV items and composite scores of the factors themselves cohere in similar ways in adolescent girls and in other samples, it becomes unlikely that the construct of psychopathy is wholly different in adolescent girls than in other groups, and it becomes more likely that the similarities that are observed in the patterns of correlations between PCL psychopathy (total and composite facet) scores and scores for external criteria reflect similar mechanisms.

Conversely, in the context of similar underlying internal structure, differences in the pattern of observed correlations are likely to reflect true differential relationships between PCL-based psychopathy and other constructs (cf. Odgers et al., 2005). Consequently, in light of current findings, future studies examining the convergent validity and discriminant validity of scores on the four dimensions of psychopathy in adolescent girls are especially important. In this context, studies examining construct validity that compute correlations between scores on latent variables (instead of manifest indicators) and external criteria have the advantage that they permit modeling of variance in these indicators separately from unique and error variance (Bentler, 1980).

At least one additional important limitation of our use of factor analysis is noteworthy. As noted in the introduction, this study did not address the internal structure of other kinds of measures of psychopathy. It remains possible that studies employing self-report measures and informant

(i.e., parent and teacher) measures of psychopathic traits will ultimately yield evidence of a different internal structure in adolescent females than in adolescent males and adults. As discussed by Strauss and Smith (2009), the nomothetic span of a construct refers to the extent to which different measures of a construct provide evidence for similar patterns of relations with external criteria. As we mentioned earlier (see Footnote 2), findings obtained using different kinds of psychopathy measures often do not converge with respect to the nature of the internal structure of psychopathy. Absent evidence that psychopathy is underlain by similar components across different kinds of measures (but see Williams, Paulhus, & Hare, 2007, for one exception), we could not expect to see similar patterns of relationships across distinct measures. Yet attempts to understand and overcome these methodological limitations are important to overcome a monomethod bias in psychopathy research.

In general, there has been little research examining the possibility of differences in item loadings on underlying psychopathy dimensions in different samples. The evidence for differences across subsamples in the loadings of one or two items on a latent factor suggests that there may be differences in rater behavior across settings, or, more substantively, differences in the way psychopathic traits are manifested across diverse groups of individuals. Only additional research can address the replicability of these differences and the reasons why items reflecting certain features of psychopathy appear to function differently across setting and in subsamples of youth in different continents. One possibility discussed by Hare and Neumann (2006) is that items reflecting early, persistent, and versatile antisociality become increasingly important when examining nonincarcerated and community samples. However, this possibility cannot explain the current pattern of disparate loadings for items on all four dimensions in long-term incarcerated versus shorter-term detention and community samples.

Finally, one additional direction for future research is the examination of sex differences in the latent structure of psychopathy as assessed with the PCL:YV. In light of the finding that the three-factor and four-factor models yield acceptable fit in a large samples of adolescent females, it becomes possible to ask if there is invariance in the latent structure of PCL:YV psychopathy across sex. Only a multigroup analysis with large samples of males and females can address this issue.

Footnotes Some researchers have argued that the antisocial dimension of the four-factor model should

be excluded based on conceptual grounds and have specifically argued that antisociality is not central enough to psychopathy to justify its inclusion. This argument is beyond the scope of the current study, and we encourage interested readers to see Skeem and Cooke (2010) and Hare

1

and Neumann (2008, 2010) for recent discussions of relevant issues. We note here only that some of these discussions do not make clear that the five items comprising the dimension commonly referred to as the antisocial dimension are not scored on the basis of participation in antisocial behavior per se. Rather, these items are designed to assess early, persistent, and versatile expressions of antisocial behavior that distinguish some individuals who commit antisocial behavior from other individuals who commit antisocial behavior.

The internal structure of self-reports and observer ratings of psychopathic features depends upon the instruments used. For example, analyses of mother and teacher ratings of psychopathic traits in pre-adolescents, as measured by the Antisocial Process Screening Device (Frick & Hare, 2001), suggest a slightly different three-factor structure than has been identified using PCL-based measures. Factor structures resulting from analyses of self-report scores are variable across instruments, with studies reporting evidence for three-factor and four-factor structures similar to those seen in the PCL measures for scores on the Self-Report Psychopathy Scale and the Youth Psychopathy Inventory (Larsson et al., 2006; Mahmut, Menictas, Stevenson, & Homewood, 2011; Williams et al., 2007) but reporting very different factor structures for some other self-report measures (e.g., Brinkley, Diamond, Magaletta, & Heigel, 2008). In some cases, different studies using the same instrument suggest different internal structures. For example, different factor analytic studies of scores on the Psychopathic Personality Inventory suggest disparate solutions involving two versus three dimensions (Benning, Patrick, Hicks, Blonigen, & Krueger, 2003; Neumann, Malterer, & Neumann, 2008; Uzieblo, Verschuere, Van den Bussche, & Crombez, 2010).

Forth et al. (2003) also did not use an optimal model estimation strategy for conducting their analyses. EQS (Version 5.6) and LISREL (Version 8.30) were not designed for use with ordinal item indicators. Mplus has advantages in analyses of ordinal indicators. When one of us reconducted the CFAs on the 147 females examined by Forth et al. using Mplus, results indicated acceptable fit for both the three- and four-factor models.

Of course, even if a psychopathy measure functions similarly in boys and girls, it remains possible that the phenotypic similarities reflect different underlying mechanisms. Conversely, it could be argued that the same mechanisms operate in females and males but that these mechanisms are associated with different constellations of features in girls and boys; however, this latter perspective does not appear very parsimonious.

Results for full-sample analyses including missing values indicated generally acceptable fit for the three-factor and four-factor models; however, the CFI slipped to .90 and .89 for the three-

2

3

4

5

and four-factor models. Both models were also generally acceptable for the North American and European subsamples, although the CFI slipped to .89 for the four-factor model in both subsamples. For analyses limited to incarcerated girls, the CFI was low for the three-factor model (CFI = 0.89), and, as in principal analyses, both relative fit indices were unacceptable for the four-factor model (CFI = 0.82, TLI = 0.87). For analyses of girls on probation, under detention, or at clinics, both models yielded low CFIs (0.89 and 0.88 for the three- and four- factor models, respectively) and borderline-unacceptable RMSEAs (0.099 and 0.097). Additional multigroup analyses demonstrated that fit was also poorer when both loadings and thresholds were constrained to be equal in the two groups. Results of these analyses are available upon request.

References Andershed, H., Hodgins, S., & Tengström, A. (2007. Convergent validity of the Youth Psychopathic Traits Inventory (YPI): Association with the Psychopathy Checklist: Youth Version (PCL:YV). Assessment, 14, 144–154. doi:10.1177/1073191106298286

Babiak, P., Neumann, C. S., & Hare, R. D. (2010. Corporate psychopathy: Talking the walk. Behavioral Sciences & the Law, 28, 174–193.

Bauer, D. L. A., Whitman, L., & Kosson, D. S. (2011. Reliability and construct validity of Psychopathy Checklist: Youth Version scores in incarcerated adolescent girls. Criminal Justice and Behavior, 38, 965–987. doi:10.1177/0093854811418048

Benning, S. D., Patrick, C. J., Hicks, B. M., Blonigen, D. M., & Krueger, R. (2003. Factor structure of the Psychopathic Personality Inventory: Validity and implications for clinical assessment. Psychological Assessment, 15, 340–350. doi:10.1037/1040-3590.15.3.340

Bentler, P. M. (1980. Multivariate analysis with latent variables: Causal modeling. Annual Review of Psychology, 31, 419–456. doi:10.1146/annurev.ps.31.020180.002223

Bolt, D., Hare, R. D., Vitale, J., & Newman, J. P. (2004. A multigroup item response theory analysis of the Hare Psychopathy Checklist-Revised. Psychological Assessment, 16, 155–168.

Brinkley, C. A., Diamond, P. M., Magaletta, P. R., & Heigel, C. P. (2008. Cross-validation of Levenson's Psychopathy Scale in a sample of federal female inmates. Assessment, 15, 464– 482. doi:10.1177/1073191108319043

Brown, T. A. (2006). Confirmatory factor analysis for applied research. New York, NY: Guilford Press.

Browne, M. W., & Cudeck, R. (1993. Alternative ways of assessing model fit. In K. A.Bollen & J. S.Long (Eds.), Testing structural equation models (pp. 136–162). Newbury Park, CA: Sage.

Cauffman, E., Kimonis, E. R., Dmitrieva, J., & Monahan, K. C. (2009. A multimethod assessment of juvenile psychopathy: Comparing the predictive utility of the PCL:YV, YPI, and NEO PRI. Psychological Assessment, 21, 528–542. doi:10.1037/a0017367

Cooke, D. J., & Michie, C. (2001. Refining the construct of psychopath: Towards a hierarchical model. Psychological Assessment, 13, 171–188. doi:10.1037/1040-3590.13.2.171

Cooke, D. J., Michie, C., Hart, S. D., & Clark, D. A. (2004. Reconstructing psychopathy: Clarifying the significance of antisocial and socially deviant behavior in the diagnosis of psychopathic personality disorder. Journal of Personality Disorders, 18, 337–357.

Das, J., de Ruiter, C., Doreleijers, T., & Hillege, S. (2009). Reliability and construct validity of the Dutch Psychopathy Checklist: Youth Version: Findings from a sample of male adolescents in a juvenile justice treatment institution. Assessment, 16, 88–102. doi:10.1177/1073191108321999

Flight, J. I., & Forth, A. E. (2007. Instrumentally violent youths: The roles of psychopathic traits, empathy and attachment. Criminal Justice and Behavior, 34, 739–751. doi:10.1177/0093854807299462

Flora, D. B., & Curran, P. J. (2004. An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychological Methods, 9, 466–491. doi:10.1037/1082-989X.9.4.466

Forth, A. E., Kosson, D., & Hare, R. (2003). The Hare Psychopathy Checklist: Youth Version. New York, NY: Multi-Health Systems.

Fowler, T., Langley, K., Rice, F., van den Bree, M. B., Ross, K., Wilkinson, L., …Thapar, A. (2009). Psychopathy trait scores in adolescents with childhood ADHD: The contribution of genotypes affecting MAOA, 5HTT and COMT activity. Psychiatric Genetics, 19, 312–319.

Frick, P. J., & Hare, R. D. (2001). The Antisocial Process Screening Device. Toronto, Ontario, Canada: Multi-Health Systems.

Gretton, H., & Hare, R. D. (2002). [Hare Psychopathy Checklist: Youth Version scores]. Unpublished raw data.

Hare, R. D. (2003). Manual for the Hare Psychopathy Checklist—Revise. (2nd ed.). Toronto, Ontario, Canada: Multi-Health Systems.

Hare, R. D., & Neumann, C. S. (2006. The PCL-R assessment of psychopathy: Development, structural properties, and new directions. In C.Patrick (Ed.), Handbook of psychopathy (pp. 58– 88). New York, NY: Guilford Press.

Hare, R. D., & Neumann, C. S. (2008. Psychopathy as a clinical and empirical construct. Annual Review of Clinical Psychology, 4, 217–246. doi:10.1146/annurev.clinpsy.3.022806.091452

Hare, R. D., & Neumann, C. S. (2009. Psychopathy and its measurement. In P. J.Corr & G.Matthews (Eds.), Cambridge handbook of personality psychology (pp. 660–686). Cambridge, England: Cambridge University Press.

Hare, R. D., & Neumann, C. S. (2010. The role of antisociality in the psychopathy construct: Comment on Skeem and Cooke (2010). Psychological Assessment, 22, 446–454. doi:10.1037/a0013635

Hart, S. D., Cox, D. N., & Hare, R. D. (1995). Manual for the Psychopathy Checklist: Screening Version (PCL:SV). Toronto, Ontario, Canada: Multi-Health Systems.

Hoyle, R. H. (1995). Structural equation modeling: Concepts, issues and applications. Thousand Oaks, CA: Sage.

Hu, L., & Bentler, P. M. (1995. Evaluating model fit. In R. H.Hoyle (Ed.), Structural equation modeling: Issues, concepts, and applications (pp. 76–99). Newbury Park, CA: Sage.

Indoe, D. (2002). [Hare Psychopathy Checklist: Youth Version scores]. Unpublished raw data.

Jackson, R. L., Neumann, C. S., & Vitacco, M. L. (2007. Impulsivity, anger, and psychopathy: The moderating effect of ethnicity. Journal of Personality Disorders, 21, 289–304. doi:10.1521/pedi.2007.21.3.289

Jones, S., Cauffman, E., Miller, J. D., & Mulvey, E. (2006. Investigating different factor structures of the Psychopathy Checklist: Youth Version: Confirmatory factor analytic findings. Psychological Assessment, 18, 33–48. doi:10.1037/1040-3590.18.1.33

Kennealy, P. J., Hicks, B. M., & Patrick, C. J. (2007. Validity of factors of the Psychopathy Checklist—Revised in female prisoners: Discriminant relations with antisocial behavior, substance abuse, and personality. Assessment, 14, 323–340. doi:10.1177/1073191107305882

Kline, R. B. (1998). Principles and practice of structural equation modelin. New York, NY: Guilford Press.

Kosson, D. S., Cyterski, T. D., Steuerwald, B. L., Neumann, C., & Walker-Matthews, S. (2002. The reliability and validity of the Psychopathy Checklist: Youth Version in non-incarcerated adolescent males. Psychological Assessment, 14, 97–109. doi:10.1037/1040-3590.14.1.97

Kosson, D. S., McBride, C. K., Whitman, L. A., Riser, R. E., Greco, J., & Tercek, R. (2012). Preliminary evidence for negative affectivity and maladaptive emotion regulation strategies in youth with psychopathic traits. Manuscript submitted for publication.

Krischer, M. K., & Sevecke, K. (2008. Early traumatization and psychopathy in female and male juvenile offenders. International Journal of Law and Psychiatry, 31, 253–262. doi:10.1016/j.ijlp.2008.04.008

Kubak, F. A., & Salekin, R. T. (2009. Psychopathy and anxiety in children and adolescents: New insights on developmental pathways to offending. Journal of Psychopathology and Behavioral Assessment, 31, 271–284. doi:10.1007/s10862-009-9144-2

Larsson, H., Andershed, H., & Lichtenstein, P. (2006. A genetic factor explains most of the variation in the psychopathic personality. Journal of Abnormal Psychology, 115, 221–230. doi:10.1037/0021-843X.115.2.221

Lewis, K., & O'Shaughnessy, R. (1998, June). Predictors of violent recidivism in juvenile

offenders. Poster presented at the annual convention of the Canadian Psychological Association, Edmonton, Alberta, Canada.

MacCallum, R. C., Browne, M. W., & Sugawara, H. M. (1996. Power analysis and determination of sample size for covariance structure modeling. Psychological Methods, 1, 130–149. doi:10.1037/1082-989X.1.2.130

Mahmut, M. K., Menictas, C., Stevenson, R. J., & Homewood, J. (2011, April25). Validating the factor structure of the Self-Report Psychopathy Scale in a community sample. Psychological Assessment. Advance online publication. doi:10.1037/a0023090

Marsh, H. W., Balla, J. R., & McDonald, R. P. (1988. Goodness-of-fit indexes in confirmatory factor analysis: The effect of sample size. Psychological Bulletin, 103, 391–410. doi:10.1037/0033-2909.103.3.391

Marsh, H. W., Hau, K. T., & Wen, Z. (2004. In search of golden rules: Comment on hypothesis- testing approaches to setting cutoff values for fit indices and dangers in overgeneralizing Hu and Bentler's (1999) findings. Structural Equation Modeling, 11, 320–341. doi:10.1207/s15328007sem1103_2

Mokros, A., Neumann, C. S., Stadtland, C., Osterheider, M., Nedopil, N., & Hare, R. D. (2010. Assessing Measurement Invariance of PCL-R Assessments from file reviews for North- American and German offenders. International Journal of Psychiatry & Law, 34, 56–63.

Murrie, D. C., Cornell, D. G., Kaplan, S., McConville, D., & Levy-Elkon, A. (2004. Psychopathy scores and violence among juvenile offenders: A multi-measure study. Behavioral Sciences & the Law, 22, 49–67. doi:10.1002/bsl.573

Muthén, L. K., & Muthén, B. O. (1998–2007). Mplus user's guide (5th ed.). Los Angeles, CA: Muthén & Muthén.

Neumann, C. S., & Hare, R. D. (2008. Psychopathic traits in a large community sample: Links to violence, alcohol use, and intelligence. Journal of Consulting and Clinical Psychology, 76, 893–899. doi:10.1037/0022-006X.76.5.893

Neumann, C. S., Hare, R. D., & Newman, J. P. (2007. The super-ordinate nature of the

Psychopathy Checklist—Revised. Journal of Personality Disorders, 21, 102–117. doi:10.1521/pedi.2007.21.2.102

Neumann, C. S., Kosson, D. S., Forth, A. E., & Hare, R. D. (2006. Factor structure of the Hare Psychopathy Checklist: Youth Version (PCL:YV) in incarcerated adolescents. Psychological Assessment, 18, 142–154. doi:10.1037/1040-3590.18.2.142

Neumann, C. S., Kosson, D. S., & Salekin, R. T. (2007. Exploratory and Confirmatory factor analysis of the psychopathy construct: Methodological and conceptual issues. In H.Hervé & J. C.Yuille (Eds.), The psychopath: Theory, research, and practice (pp. 79–104). Mahwah, NJ: Erlbaum.

Neumann, C. S., Malterer, M. B., & Newman, J. P. (2008. Factor structure of the Psychopathic Personality Inventory (PPI): Findings from a large incarcerated sample. Psychological Assessment, 20, 169–174. doi:10.1037/1040-3590.20.2.169

Neumann, C. S., Vitacco, M. J., Hare, R. D., & Wupperman, P. (2005. Reconstruing the “reconstruction” of psychopathy: A comment on Cooke, Michie, Hart, and Clark. Journal of Personality Disorders, 19, 624–640. doi:10.1521/pedi.2005.19.6.624

Odgers, C. L., Reppucci, N. D., & Moretti, M. M. (2005. Nipping psychopathy in the bud: An examination of the convergent, predictive, and theoretical utility of the PCL:YV among adolescent girls. Behavioral Sciences & the Law, 23, 743–763. doi:10.1002/bsl.664

Penney, S. R., & Moretti, M. M. (2007. The relation of psychopathy to concurrent aggression and antisocial behavior in high-risk adolescent girls and boys. Behavioral Sciences and the Law, 25, 21–41.

Rodgers, J. L. (2010. The epistemology of mathematical and statistical modeling: A quiet methodological revolution. American Psychologist, 65, 1–12.

Rowe, R. (2002). Predictors of criminal offending: Evaluating measures of risk/needs, psychopathy, and disruptive behavior disorder. (Unpublished doctoral dissertation). Carleton University, Ottawa, Ontario, Canada.

Salekin, R. T. (2008. Psychopathy and recidivism from mid-adolescence to young adulthood:

Cumulating legal problems and limiting life opportunities. Journal of Abnormal Psychology, 117, 386–395.

Salekin, R. T., Brannen, D. N., Zalot, A. A., Leistico, A. R., & Neumann, C. S. (2006. Factor structure of psychopathy in youth: Testing the applicability of the new four factor model. Criminal Justice and Behavior, 33, 135–157.

Salekin, R. T., Leistico, A. R., Trobst, K. K., Schrum, C. L., & Lochman, J. E. (2005. Adolescent psychopathy and personality theory—the interpersonal circumplex: Expanding evidence of a nomological net. Journal of Abnormal Child Psychology, 33, 445–460.

Salekin, R. T., Neumann, C. S., Leistico, A. R., DiCicco, T. M., & Duros, R. L. (2004. Psychopathy and comorbidity in a young offender sample: Taking a closer look at psychopathy's potential importance over the disruptive behavior disorders. Journal of Abnormal Psychology, 113, 416–427.

Salekin, R. T., Neumann, C. S., Leistico, A. R., & Zalot, A. A. (2004. Psychopathy in youth and intelligence: An investigation of Cleckley's hypothesis. Journal of Clinical Child and Adolescent Psychology, 33, 731–742.

Salekin, R. T., Rogers, R., & Sewell, K. W. (1997. Construct validity of psychopathy in a female offender sample: A multitrait–multimethod evaluation. Journal of Abnormal Psychology, 106, 576–585.

Schmidt, F., McKinnon, L., Chattha, H. K., & Brownlee, K. (2006. Concurrent and predictive validity of the Psychopathy Checklist: Youth Version across gender and ethnicity. Psychological Assessment, 18, 393–401.

Schrum, C. L., & Salekin, R. T. (2006. Psychopathy in adolescent female offenders: An item response theory analysis of the Psychopathy Checklist: Youth Version. Behavioral Sciences and the Law, 24, 39–63.

Sevecke, K., Pukrop, R., Kosson, D. S., & Krischer, M. K. (2009. Factor structure of the Hare Psychopathy Checklist: Youth Version in German female and male detainees and community adolescents. Psychological Assessment, 21, 45–56.

Skeem, J. L., & Cauffman, E. (2003. Views of the downward extension: Comparing the Youth Version of the Psychopathy Checklist with the Youth Psychopathic Traits Inventory. Behavioral Sciences & the Law, 21, 737–770.

Skeem, J. L., & Cooke, D. J. (2010. Is criminal behavior a central component of psychopathy? Conceptual directions for resolving the debate. Psychological Assessment, 22, 433–445.

Stockdale, K. C., Olver, M. E., & Wong, S. C. P. (2010. The Psychopathy Checklist: Youth Version and adolescent and adult recidivism: Considerations with respect to gender, ethnicity, and age. Psychological Assessment, 22, 768–781.

Strauss, M. E., & Smith, G. T. (2009. Construct validity: Advances in theory and methodology. Annual Review of Clinical Psychology, 5, 1–25.

Sullivan, E. A., & Kosson, D. S. (2006. Ethnic and cultural variations in psychopathy. In C. J.Patrick (Ed.), Handbook of psychopathy (pp. 437–458). New York, NY: Guilford Press.

Sutton, S. K., Vitale, J. E., & Newman, J. P. (2002. Emotion among women with psychopathy during picture perception. Journal of Abnormal Psychology, 111, 610–619.

Uzieblo, K., Verschuere, B., Van den Bussche, E., & Crombez, G. (2010). The validity of the Psychopathic Personality Inventory—Revised in a community sample. Assessment, 17, 334– 346.

Verona, E., Sadeh, N., & Javdani, S. (2010. The influences of gender and culture on child and adolescent psychopathy. In R. T.Salekin & D. R.Lynam (Eds.), Handbook of child and adolescent psychopathy (pp. 317–342). New York, NY: Guilford Press.

Vincent, G. M., Odgers, C. L., McCormick, A. V., & Corrado, R. R. (2008. The PCL:YV and recidivism in male and female juveniles: A follow-up into young adulthood. International Journal of Law and Psychiatry, 31, 287–296.

Vitacco, M. J., Neumann, C. S., & Caldwell, M. F. (2010. Predicting antisocial behavior in high- risk male adolescents: Contributions of psychopathy and instrumental violence. Criminal Justice and Behavior, 37, 833–846.

Vitacco, M. J., Neumann, C. S., Caldwell, M. F., Leistico, A., & Van Rybroek, G. J. (2006). Testing factor models of the Psychopathy Checklist: Youth Version and their association with instrumental aggression. Journal of Personality Assessment, 87, 74–83.

Vitacco, M. J., Neumann, C. S., & Jackson, R. L. (2005. Testing a four-factor model of psychopathy and its association with ethnicity, gender, intelligence, and violence. Journal of Consulting and Clinical Psychology, 73, 466–476.

Vitacco, M. J., Rogers, R., Neumann, C. S., Harrison, K. S., & Vincent, G. (2005. A comparison of factor models on the PCL-R with mentally disordered offenders: The development of a four- factor model. Criminal Justice and Behavior, 32, 526–545.

Vitale, J. E., Brinkley, C. A., Hiatt, K. D., & Newman, J. P. (2007. Abnormal selective attention in psychopathic female offenders. Neuropsychology, 21, 301–312.

Vitale, J. E., & Newman, J. P. (2001a. Response perseveration in psychopathic women. Journal of Abnormal Psychology, 110, 644–647.

Vitale, J. E., & Newman, J. P. (2001b. Using the Psychopathy Checklist—Revised with female samples: Reliability, validity, and implications for clinical utility. Clinical Psychology: Science and Practice, 8, 117–132.

Warren, J. I., Burnette, M. L., South, S. C., Chauhan, P., Bale, R., Friend, R., & Van Patten, I. (2003). Psychopathy in women: Structural modeling and comorbidity. International Journal of Law and Psychiatry, 26, 223–242.

Williams, K. M., Paulhus, D. L., & Hare, R. D. (2007. Capturing the four-factor structure of psychopathy in college students via self-report. Journal of Personality Assessment, 88, 205– 219.

Submitted: October 11, 2010 Revised: March 11, 2012 Accepted: March 13, 2012

This publication is protected by US and international copyright laws and its content may not be copied without the copyright holders express written permission except for the print or download capabilities of the retrieval software used for access. This content is intended solely for the use of the individual user.

Source: Psychological Assessment. Vol. 25. (1), Mar, 2013 pp. 71-83) Accession Number: 2012-16771-001 Digital Object Identifier: 10.1037/a0028986

Google Book Preview

Goodreads- User Reviews

Plum Print

EBSCO Connect Privacy Policy A/B Testing Terms of Use Copyright Cookie Policy Contact Us

powered by EBSCOhost

© 2021 EBSCO Industries, Inc. All rights reserved.