Assessment Guide
Empirically Supported Forensic Assessment
Robert P. Archer, Eastern Virginia Medical School
Elizabeth M. A. Wheeler and Rebecca A. Vauter, Central State Hospital
The field of Forensic Psychology has greatly expanded
over the past several decades, including the use of psy-
chological assessment in addressing forensic issues. A
number of surveys have been conducted regarding the
tests used commonly by forensic psychologists. These
surveys show that while tests specifically designed to
address forensic issues have proliferated, traditional
clinical assessment tests continue to play a crucial role
in many forensic evaluations. The current article identi-
fies some of the most salient characteristics of empiri-
cally supported forensic tests and provides examples of
tests felt to meet each of these five criteria. These crite-
ria include adequate standardization, acceptable relia-
bility and validity, general acceptance within the
community of forensic evaluators, availability of test
data from cross-cultural and cross-ethnic samples, and
comparison data relevant to specific forensic popula-
tions. Although the guidelines provided in this article
provide a helpful framework for evaluating the useful-
ness of forensic tests, the establishment of a national
review panel or workgroup to address this issue would
be highly useful, particularly in the potential controver-
sial task of identifying those tests that meet reasonable
guidelines to be identified as empirically supported
forensic assessment instruments.
Key words: forensic, forensic assessment, survey,
testing. [Clin Psychol Sci Prac 23: 348–364, 2016]
BRIEF REVIEW OF THE HISTORY AND DEFINITION OF
FORENSIC PSYCHOLOGY
The genesis of what would later be termed Forensic Psy-
chology can be traced back to the early 20th century
when psychologists first became involved in the attempt
to understand the limitations of eyewitness testimony
and, in particular, Hugo M€unsterberg’s advocacy for an
increased role for psychologists within the legal system
(Vaccaro & Hogan, 2004). Indeed, the scope of the
psychologist’s role in addressing psycholegal issues both
inside and outside of the courtroom has expanded
greatly as a result of several court rulings. For example,
the landmark 1962 ruling in Jenkins v. United States
helped to facilitate psychologists’ ability to testify inside
the courtroom as expert witnesses, by asserting that psy-
chologists could be qualified to give expert testimony
on issues of mental health. Over the past several dec-
ades, Forensic Psychology has established recognized
training models, board certification requirements, and
an American Psychological Association (APA) division
focused on this area of practice (Division 41, American
Psychology-Law Society).
For many laypeople, the term Forensic Psychology may
invoke dramatic images from CSI (Crime Scene Inves-
tigation). For many psychologists not actively involved
in Forensic Psychology, the idea of testifying in court
may invoke a strong anxiety response. While inaccurate
views of the role of forensic psychologists can easily be
dismissed as a product of the popular media, and the
testimony anxiety of those not actively involved in
Forensic Psychology can be effectively addressed by
Address correspondence to Robert P. Archer, Eastern Vir-
ginia Medical School – Psychiatry & Behavioral Sciences, 825 Fairfax Avenue, Norfolk, VA 23507. E-mail: robertarcher2@
cox.net.
doi:10.1111/cpsp.12171
© 2016 American Psychological Association. Published by Wiley Periodicals, Inc., on behalf of the American Psychological Association. All rights reserved. For permissions, please email: [email protected]
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
training and experience in the courtroom, there
remains a need for a clear and concise definition of the
term Forensic Psychology (Huss, 2009). In this regard, rel-
atively recent guidelines by the APA have helped to
clarify the term and define when psychologists are
working within the scope of Forensic Psychology.
The most recent edition of the APA’s Specialty
Guidelines for Forensic Psychology (APA, 2011) defines
Forensic Psychology as “professional practice by any
psychologist working with any sub-discipline of psy-
chology (e.g., clinical, developmental, social, cognitive)
when applying the scientific, technical, or specialized
knowledge of psychology to the law to assist in
addressing legal, contractual, and administrative mat-
ters” (p. 1). Thus, Forensic Psychology can encompass
not only direct legal issues (e.g., competency and sanity
evaluations) but also administrative/contractual issues
such as fitness for duty and disability evaluations.
The APA Guidelines also helped to define when a
psychologist is serving as a “forensic practitioner” in
the following statement:
Forensic practitioner refers to a psychologist when
engaged in the practice of Forensic Psychology.. . .
Such professional conduct is considered forensic
from the time the practitioner reasonably expects to,
agrees to, or is legally mandated to, provide exper-
tise on an explicitly psycholegal issue. (2011, p. 1)
The APA Guidelines do not rely on individuals’
typical area of practice in order to determine whether
they are providing forensic services, but instead focus
on what they are doing in the specific case. The
Guidelines state that simply being involved in a court-
related matter does not make one a forensic psycholo-
gist (e.g., someone testifying only on his or her treat-
ment of an individual would not be considered
forensic practice). In contrast, a neuropsychologist
retained to assess and subsequently testify regarding a
psycholegal issue (e.g., whether an individual suffered a
brain injury as the result of a car accident for which he
or she was seeking compensation) would be practicing
as a forensic psychologist regardless of her activities in
other areas of her practice (i.e., the rest of her caseload
was not court-involved individuals). Thus, anyone pro-
viding a psycholegal opinion in a legally mandated
matter is providing services as a forensic psychologist,
regardless of the nature of his or her typical practice
area.
DIFFERENCE BETWEEN OBJECTIVES OF CLINICAL
ASSESSMENT AND FORENSIC ASSESSMENT
There are some important differences between assess-
ments undertaken in the forensic context and clinical
assessment. Ackerman (1999), in his book Essentials of
Forensic Psychological Assessment, provided a summary
table that presents some of the salient differences
between clinical and forensic relationships. The key
components from that table will be reviewed here.
The first difference between clinical and forensic
assessments has to do with who is identified as the cli-
ent. In clinical assessment, the individual being assessed
is typically the identified client, whereas in a forensic
setting, an attorney or the court is usually the client of
record. This is a crucial distinction, and one from
which many of the other differences are derived
between the clinical and forensic evaluations.
A second major difference between the two types of
assessment concerns the rules that govern the disclosure
of information. In clinical assessments, patient–therapist privilege and the Health Insurance Portability and
Accountability Act (HIPAA) provide the guidelines
that cover disclosures. In forensic assessment, the scope
of disclosures may either be mandated by statute (e.g.,
in competency and sanity evaluations, the state or fed-
eral statutes dictate who has access to the report) or
covered under attorney–client privilege (e.g., in immi- gration cases or privately retained personal injury evalu-
ations in which the psychologist is typically retained by
one side). Overall, it is likely that the report produced
at the conclusion of a forensic evaluation will be more
widely distributed than a clinical assessment report, and
in some cases, the report can become part of a case file
that could be open to the public (e.g., in a sanity case
in a court of record such as a circuit court). Further, a
forensic psychologist may find parts of their report
quoted in case law, or even in the local or national
newspapers, an event much less likely to occur when
practicing clinical assessment.
A third significant difference is the stance the psy-
chologist takes toward the client/examinee in the eval-
uation process. In a clinical assessment, the psychologist
FORENSIC ASSESSMENT � ARCHER ET AL. 349
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
typically assumes an empathetic, supportive, and non-
judgmental stance. In contrast, in forensic assessment,
the evaluator is often advised to present in a more neu-
tral and objective manner. Indeed, presenting oneself as
empathetic or supportive during a forensic evaluation
may be viewed as being potentially misleading and
inappropriate. At the heart of the differences between
clinical and forensic assessment is the adversarial nature
of the legal system. While in clinical assessment, a rela-
tionship with a client would rarely be adversarial, it is
much more frequently the case in forensic assessment
that at least part of the process may be viewed as
adversarial by the examinee.
A fourth area of difference is that forensic psycholo-
gists are often skeptical of the accuracy of the exami-
nee’s self-report. Thus, the forensic psychologist is
more likely to rely on a variety of sources of informa-
tion to confirm the examinee’s self-report, while a
clinical psychologist conducting an assessment is more
likely to rely more heavily on the subject’s self-report.
This difference is related to the level of scrutiny and
collaboration applied to the self-report of the exami-
nee. Individuals participating in forensic evaluations
may have a wide variety of reasons to provide inaccu-
rate information during the evaluation. For example,
examinees may over-report their psychological stability
and parenting skills in order to obtain custody of their
children, while other examinees may exaggerate or
malinger mental health problems in order to avoid
criminal responsibility or to obtain monetary gain in
personal injury cases. Further, threats to validity in
forensic assessment may also come from a lack of client
self-awareness rather than a conscious intent to be dis-
honest or malinger. Thus, psychologists in forensic
assessments often rely on collateral sources of informa-
tion, including medical records; police reports; victim
statements; reports from employers, family members, or
friends; and even taped phone calls, e-mails, or text
messages to confirm or disconfirm the self-report infor-
mation provided by the examinee.
Finally, the goals of the clinical and forensic assess-
ments are typically quite different, and may even be in
conflict. Specifically, the goal of clinical assessment is
to assist the client (patient). This is typically done by
answering broader diagnostic questions to assist clients
in understanding more about themselves and to
facilitate treatment planning. In contrast, the goal of
the forensic assessment is to assist the court, or retain-
ing party such as an attorney, in providing opinions
regarding a psycholegal question. Thus, forensic assess-
ment may or may not be helpful to the individual
being assessed, and may not even provide a diagnosis
(e.g., it may not be necessary to provide a diagnosis in
a competency evaluation if it does not impact the
defendant’s abilities in court). Indeed, it is possible that
the outcome of a forensic assessment could be harmful
to an individual’s case (e.g., an individual wants to
plead not guilty by reason of insanity, but the evaluator
does not find that there is evidence to support such a
plea).
BROADNESS OF THE FIELD OF FORENSIC ASSESSMENT
As previously noted, Forensic Psychology encompasses
a large practice area. Huss (2009) reviewed the major
areas of Forensic Psychology and included various
topics, such as risk assessment at the time of sentencing,
insanity (criminal responsibility), competency to stand
trial, sex offender evaluations, juvenile transfers to adult
court, child custody, civil commitment, personal injury
cases, worker’s compensation, and competency to make
medical decisions. We would also add to this list the
additional practice areas of fitness for duty evaluations,
capital sentencing mitigation, immigration evaluations,
jury selection consultation, guardianship/conservator-
ship evaluations, and juvenile evaluations conducted at
the court’s request, although it is important to note
that this is not an all-inclusive list. It is quite clear that
two psychologists may report that they both practice
Forensic Psychology without engaging in similar evalu-
ations (e.g., one might conduct child custody and par-
enting evaluations while another might build a practice
on competency and sanity evaluations). Further, being
competent in one area of forensic practice does not
make a practitioner competent in other forensic prac-
tice areas.
Most practice areas in Forensic Psychology fall
under two broad categories: civil and criminal. Civil
areas of practice typically involve the relationship
between members of the community, and the goal is
not to punish a wrongdoer but to prevent or compen-
sate for a wrong. Examples of civil cases would be
evaluations of custody evaluations and personal injury
350 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
evaluations. In contrast, criminal practice typically
involves cases in which it is alleged that criminal laws
have been broken, and the goal is to appropriately
punish the wrongdoer. Examples of Forensic Psychol-
ogy involvement in criminal law would include com-
petency to stand trial evaluations and sanity evaluations,
as well as capital sentencing mitigation.
APA GUIDELINES (DIVISION 41 GUIDELINES)
Because Forensic Psychology encompasses such a vast
array of practice areas, there are a variety of specific
pathways through which psychologists can become
competent and proficient in their specialty area. The
Specialty Guidelines for Forensic Psychology (APA, 2011)
provide some general guidance regarding the relevant
issue in conducting forensic assessments. The following
is a brief summary of these guidelines:
• Forensic psychologists seek to focus on the legally relevant factors (i.e., they understand and are guided
by relevant legal statutes and case law).
• Forensic practitioners seek the appropriate use of assessment procedures (i.e., they understand the
strengths and limitations of the tests they select as
applied to the relevant forensic issues/populations).
These guidelines state that forensic practitioners also
seek to consider the strengths and limitations of
employing traditional assessment procedures in
forensic examinations.
• Given the stakes involved in forensic contexts, forensic practitioners recognize the need to take spe-
cial care to ensure the integrity and security of test
materials and results.
The guidelines also state that when test score valid-
ity has not been firmly established in a forensic con-
text, the petitioner should seek to describe test score
strengths and limitations and to explain these issues in
the forensic context. This explanation may include the
observation that the context in which the assessments
have been given may warrant adjustments in test score
interpretation.
The guidelines also suggest that forensic psycholo-
gists should
• take into account individual examinee differences, including “situational, personal, linguistic, and
cultural differences that might affect their judge-
ments or reduce the accuracy of their interpreta-
tions” (APA, 2011, p.15);
• take reasonable steps to explain, in an understandable manner, the results of the test to either the examinee
or his or her representative. If this feedback is not pos-
sible, this restriction should be explained in advance;
• document all the data sources they considered as part of the evaluation. Further, this documentation
should be made available based on proper subpoenas
or legal consent; and
• maintain careful and detailed records of the evalua- tion process.
IMPACT OF THE DAUBERT SUPREME COURT DECISION
Psychologists’ involvement in court-related matters has
long been governed, at least in part, by rules regarding
the admissibility of evidence. For over 50 years, this
involvement in federal courts and in many state courts
was governed primarily by the Supreme Court stan-
dard, which was developed out of the Frye v. United
States (1923) ruling. This ruling established that evi-
dence could be admissible if it were based on a tech-
nique or method generally accepted in the field. Thus,
for example, psychologists’ testimony may be found to
be admissible if based on a practice, test, or technique
generally accepted by other psychologists within that
field. For obvious reasons, however, determining what
method is “generally accepted” may be difficult when
applied to a specific test used within a specific context.
In 1993, the U.S. Supreme Court modified the
admissibility standard in its ruling in the case of Daubert
v. Merrell Dow Pharmaceuticals. The Daubert case
involved the admissibility of testimony based on tech-
nical or scientific data. The Supreme Court heard the
case because it felt that there were differences regarding
how lower courts had been determining the proper
standard for admitting expert testimony. The Court
noted that the Frye standard had been superseded by
the Federal Rules of Evidence. Indeed, the Federal
Rules of Evidence have one section for the “ordinary
witness” (701) and one for expert witnesses (702). The
Court noted that the additional requirements in 702
were not surprising given that “an expert is permitted
wide latitude in offering opinions, including those that
are not based on firsthand knowledge or observation.”
FORENSIC ASSESSMENT � ARCHER ET AL. 351
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
They observed that this relaxation of the requirement
for firsthand knowledge, a central part of common law,
is “premised on an assumption that the expert’s opin-
ion will have a reliable basis in the knowledge and
experience of his discipline.” Thus, the Court then
outlined the following guidelines for Rule 702 expert
witness testimony:
. . .If scientific, technical, or other specialized knowl-
edge will assist the trier of fact to understand the
evidence or to determine a fact in issues, a witness
qualified as an expert by knowledge, skill, experi-
ence, training, or education, may testify thereto in
the form of an opinion or otherwise.
The Court also noted that nothing in this rule
specifically required “general acceptance” for admissi-
bility. Therefore, the Court stated, “That austere stan-
dard [Frye], absent from and incompatible with the
Federal Rules of Evidence, should not be applied in
federal trials.” The Court went on to outline the limits
which were put into place by the Rules of Evidence,
including that “any and all scientific testimony or evi-
dence admitted is not only relevant, but reliable.”
The justices noted that many factors could benefi-
cially influence a judge’s decision-making process, and
therefore, they were not attempting to create a rigidly
defined checklist or test, but they did provide the fol-
lowing considerations in evaluating admissibility issues:
• Is the evidence/opinion based on a technique or method that has been tested and has established stan-
dards controlling its use and operation?
• Has the theory or technique undergone peer review and publication?
• Does the technique have a known and established error rate?
• Finally, “the general acceptance of a technique” still has a bearing on admissibility of testing, but no
longer serves as the sole or exclusive criterion.
Functionally, the Court developed a flexible stan-
dard to determine admissibility of expert testimony and
opinions. Two crucial Supreme Court cases followed
Daubert. In the 1997 ruling in General Electric Company
v. Joiner and the 1999 ruling in Kumho Tire Company v.
Carmichael, the Supreme Court clarified aspects of the
Daubert ruling regarding the admission of expert wit-
ness and testimony. Generally, criteria for admissibility
under this series of cases support the view that testi-
mony based on psychometrically reliable and valid
instruments, with test conclusions based on empirical
support, is more likely to meet admissibility guidelines
in federal courts and in the numerous states that follow
Daubert criteria.
Goodman-Delahunty (1997) argued that the effects
of Daubert have been to cause forensic psychologists
“to be more explicit about the scientific foundations of
their opinions” (p. 121), and Underwager and Wake-
field (1993) discussed the influence of the Daubert deci-
sion on psychological testimony in an article entitled
“A Paradigm Shift for Expert Witnesses.” A growing
number of articles have reviewed specific psychological
tests such as the MMPI-2-RF (Ben-Porath, 2012; Sell-
bom, 2012) and MCMI (Rogers, Salekin, & Sewell,
1999) in terms of their ability to meet Daubert criteria,
and there seems little doubt that any test used to form
the basis of an expert’s opinion in court is potentially
subject to review based on Daubert factors.
Many states have adopted the Daubert standard, but
several continue to use Frye or other admissibility crite-
ria. It is incumbent on forensic practitioners to know
which standard their state uses and how it might
impact admissibility of psychological evidence. For pur-
poses of the current article, however, it is most impor-
tant to note that various psychological tests, or
components of tests, have been submitted to a “Daubert
Challenge” or “Daubert Motion,” that is, a hearing
conducted before a judge where validity and admissi-
bility of expert testimony (in this case based on a speci-
fic test or tests) are challenged by opposing counsel as
failing to meet Daubert standards of methodology, relia-
bility, validity, and general acceptance.
WHAT DO WE KNOW ABOUT THE “GENERAL ACCEPTANCE”
OF TESTS USED IN FORENSIC EVALUATIONS?
A growing body of research literature has addressed the
issue of the types of assessment instruments typically
used in a variety of forensic settings. This research is an
important aspect of determining those psychological
tests that are generally accepted by practitioners to
address varied forensic issues, in turn contributing to
identifying tests that are generally accepted within the
352 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
field of psychology. We shall review some of the more
prominent of these studies that have focused on the
tests used across a variety of forensic settings, and then
briefly review some of the surveys of assessment instru-
ments used for specific forensic purposes, such as cus-
tody evaluations, violence risk assessment, and criminal
forensic evaluations.
SURVEYS OF TEST USAGE IN GENERAL FORENSIC SETTINGS
Boccaccini and Brodsky (1999) noted that psychologists
have become increasingly involved in forensic assess-
ments. The authors conducted a survey among 80
members of APA Divisions 12 (Clinical) and 40 (Neu-
ropsychology) to evaluate what instruments these prac-
titioners used in emotional injury assessments and to
see whether practitioners used these tests in a manner
consistent with the Daubert (1993) Supreme Court rul-
ing on the admissibility of expert witness testimony.
The 80 psychologists who completed surveys in this
study had conducted over 10,500 emotional injury
evaluations during their career, and listed a total of 67
different assessment instruments they had employed in
emotional injury evaluations during the past year.
However, only 11 of these tests were used by five or
more survey respondents, and no practitioners used
exactly the same combination of tests in their standard
battery for injury evaluations. The five most frequently
employed tests reported in this study included the
Minnesota Multiphasic Personality Inventory (MMPI)
or MMPI-2 (94%), the Wechsler Adult Intelligence
Scale–Revised (WAIS-R) or WAIS-III (54%), the Mil- lon Clinical Multiaxial Inventory-II (MCMI-II) or
MCMI-III (Millon, 1994; 50%), the Rorschach Inkblot
Technique (41%), and the Beck Depression Inventory
(31%). The authors noted that respondents indicated
that Daubert-related criteria such as general acceptance
of the test within the field and the presence of inde-
pendent research validation played an important role in
their selection of instruments. However, Boccaccini
and Brodsky (1999) reported that respondents also cited
test selection factors unrelated to the Daubert standard,
such as their personal clinical experience with a partic-
ular test, as popular reasons for test selection. Thus, in
response to the question concerning whether psycholo-
gists selected tests in forensic evaluations based on cri-
teria outlined in the Daubert decision, the authors
concluded, “Our findings indicate that the answer is
yes and no” (p. 257).
Since the survey by Boccaccini and Brodsky (1999),
several other surveys have been conducted on test
usage by psychologists in forensic settings. Table 1 pro-
vides a summary of the results from these studies.
Lally (2003), for example, surveyed 64 diplomates in
Forensic Psychology concerning the frequency with
which they used various tests, and their opinions con-
cerning the acceptability of a wide variety of specific
psychological tests, in six areas of forensic practice,
including mental state at the time of an offense, risk for
future violence, risk for future sexual violence,
Table 1. Result summary of test usage in general forensic settings
Study Sample
Most Frequently Used/Recommended Tests
Boccaccini and Brodsky (1999)
80 APA Division 12 or Division 40 members
MMPI/MMPI-2 (94%) WAIS-R/WAIS-III (54%) MCMI-II/MCMI-III (50%) Rorschach (41%) Beck Depression Inventory (31%)
Lally (2003) 64 Forensic psychology diplomates
MMPI-2 WAIS-III PCL-R Luria-Nebraska Halstead-Reitan
Archer et al. (2006)
152 APA Division 41 members or AAFP diplomates
Cognitive Assessment Wechsler Intelligence Scales
Single-Scale Tests Beck Depression Inventory Beck Anxiety Inventory
Multiscale Tests MMPI-2
Children/Adolescents MMPI-A
Projective Tests Rorschach
Viljoen et al. (2010) 215 psychologists who perform forensic evaluations of adults or children
Wechsler Intelligence Scales (75.3%) MMPI-2/MMPI-A (66%) Structured Assessment of Violence Risk in Youth (SAVRY) (Borum, Bartel, & Forth, 2003; 35.1%)
MCMI-III or MACI (31.2%) PCL-R or PCL:YV (24.7%)
Austin and Wygant (2012)
284 psychologists conducting forensic evaluations
MMPI-2 (59%) PAI (38%) MMPI-2-RF (29%) MCMI-III (26%) Rorschach (22%)
FORENSIC ASSESSMENT � ARCHER ET AL. 353
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
competency to stand trial, competency to waive Mir-
anda rights, and evaluation of malingering. Archer,
Buffington-Vollum, Stredny, and Handel (2006) evalu-
ated the survey responses of 152 doctoral-level psychol-
ogists who were members of Division 41 (The
American Psychology-Law Society) of the American
Psychological Association or diplomates of the Ameri-
can Academy of Forensic Psychology or both. These
respondents identified themselves as forensic psycholo-
gists, and a majority typically spent over 50% of their
time in forensic practice. Survey respondents were
asked to report their test usage within specific cate-
gories, such as competency to stand trial, custody eval-
uations, and sex offender risk assessments. Further,
survey results were also gathered for specific types of
testing, such as cognitive/intellectual tests, multiscale
inventories, neuropsychological tests, and single-scale
tests. Viljoen, McLachan, and Vincent (2010) con-
ducted a survey of psychologists involved in violence
risk assessments and psychopathy evaluations among
juveniles and adults. Their results were based on 85
psychologists who completed the survey related to
child assessments, and an additional 130 respondents
who had completed the survey related to their evalua-
tions of adults. Austin and Wygant (2012) utilized a
web-based survey of clinicians, including members of
the American Psychology-Law Society (AP-LS), the
Society for Personality Assessment (SPA), and a Foren-
sic Psychology email listserv. Four hundred seventy-
nine respondents completed the survey, of which 284
reported practicing in one or more forensic areas,
including the categories of competency/sanity evalua-
tions, criminal risk assessment evaluations, and medi-
cal–legal (non-neuropsychological) evaluations.
SURVEYS IN SPECIFIC FORENSIC AREAS
Surveys of tests used in forensic settings have also been
conducted in relation to specific types of evaluations or
areas of practice. For example, in a widely cited survey
of tests used in forensic neuropsychological evaluations,
Lees-Haley, Smith, Williams, and Dunn (1996)
reported that the Wechsler Adult Intelligence Scales
and the MMPI or MMPI-2 were found to be the two
most popular tests among neuropsychologists, followed
by the Wechsler Memory Scale (Wechsler, 1997), Trail
Making Tests A and B (Army Individual Test Battery,
1944), the Finger Oscillation Test (Halstead, 1947), the
Bender-Gestalt Visual-Motor Test (Bender, 1946), the
Category Test (Halstead, 1947), and the Wisconsin
Card Sorting Test (Berg, 1948). More recent surveys
of forensic neuropsychological test selection have been
provided by Rabin, Barr, and Burton (2005) and
others.
A number of surveys have been conducted regarding
the psychological test instruments used by psychologists
performing parenting capacity evaluations or custody
evaluations (e.g., Ackerman & Ackerman, 1997; Bow
& Quinnell, 2001, 2002). In particular, Bow, Gould,
Flens, and Greenhut (2006) conducted a survey based
on the responses of 87 psychologists regarding their test
selection, usage frequency, and opinions concerning
which psychological tests did, and did not, meet Dau-
bert criteria. The top factors influencing psychologists’
selection of test instruments included the availability of
adequate reliability and validity research, an adequate
normative sample, and the acceptability of the test
within the child custody evaluation field. Specifically,
75% of all respondents rated these factors as being
“highly” important or “extremely” important. The
MMPI-2 received the highest clinician ratings in terms
of being recommended as a test meeting Daubert stan-
dards (95.2%), followed by the Wechsler Adult Intelli-
gence Scale–Third Edition (WAIS-III; 86.1%), the MMPI-A (87.3%), the Wechsler Intelligence Scale for
Children-III/IV (Wechsler, 2003; 86.1%), and the Mil-
lon Clinical Multiaxial Inventory-III (MCMI-III;
77.6%). Of these tests, only the MMPI-2 and the
MCMI-III were recommended as meeting Daubert
standards and also reported to be used by the majority
of respondents in custody evaluations. The Personality
Assessment Inventory (PAI; Morey, 2007), the Psy-
chopathy Checklist–Revised (PCL-R; Hare, 1991), and the Beck Depression Inventory-II (BDI-II; Beck,
Steer, & Brown, 1996) were all endorsed as meeting
the Daubert standard, but were also more infrequently
used. Notably, the Rorschach technique, administered,
scored, and interpreted with the Exner (1993) Com-
prehensive System, was the only projective personality
measure seen to meet the Daubert standard, although it
received relatively infrequent use in evaluations. The
authors concluded that their findings provided some
preliminary guidance for psychologists in the selection
354 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
and usage of psychological testing in child custody
evaluations.
FORENSIC-SPECIFIC TESTS AND CLINICAL ASSESSMENT
INSTRUMENTS
As seen in the survey research reviewed earlier, forensic
evaluators routinely employ standardized assessment
measures. These can include both traditional clinical
assessment measures applied to answer forensic ques-
tions and test instruments specifically designed for use
in forensic evaluations to address a specific psycholegal
issue within a particular population. As noted by Otto
and Heilbrun (2002), there has been a proliferation of
instruments designed specifically to assess psycholegal
capacities, abilities, or knowledge. Such instruments
can enhance the quality of a forensic assessment by
providing standardized assessment procedures and
methods. These tests may serve to reduce examiner
bias or error, and may allow for meaningful compar-
isons over time or between evaluators. These instru-
ments can range from simple, structured, but
nonstandardized interviews regarding the legal issue at
hand, to instruments that are empirically constructed
and validated and have an extensive associated research
base. Forensic assessment instruments are available for
many types of forensic assessment; however, as cau-
tioned by Melton, Petrilla, Poythress, and Slobogin
(2007), valid instruments are not available for many of
the psycholegal questions forensic evaluators are com-
monly asked to address. Therefore, it is necessary for
evaluators to remain well informed on the literature
and psychometric properties of any test they are con-
sidering using in a forensic context.
One example of an instrument designed specifically
for use in forensic evaluations is the MacArthur Com-
petency Assessment Tool for Criminal Adjudication
(MacCAT-CA; Poythress et al., 1999), which is com-
posed of three scales: Understanding, Reasoning, and
Appreciation. The MacCAT-CA uses a hypothetical
case about two men (Fred and Reggie) and their
involvement in a serious assault. Examinees respond to
questions about Fred and Reggie’s case after being pre-
sented with the vignette, and these responses are
scored. Those deemed to have minimal or no impair-
ment show assessed deficits within one standard devia-
tion of a competent normative group. Mild and
clinically significant impairment levels for Understand-
ing and Reasoning are defined by further degrees of
standard deviation. For the Appreciation scale, cutoff
scores were assigned based on clinical reasoning regard-
ing delusional thinking. While it was derived from an
earlier test that assessed decisional competence in treat-
ment, the MacCAT-CA was specifically designed for
individuals involved in the criminal justice system
whose competence to stand trial has been questioned.
The Rogers Criminal Responsibility Assessment
Scale (R-CRAS; Rogers, 1984) is another example of
an instrument designed solely to address a specific
forensic construct. The R-CRAS is a structured deci-
sion model for quantifying psychological variables rele-
vant to the retrospective evaluation of insanity. It was
specifically validated to address the American Law
Institute (ALI) insanity standard, which requires an
assessment of a defendant’s cognitive and volitional
impairment at the time of the alleged offense. In addi-
tion, the R-CRAS provides clinical data relevant to
the M’Naghten insanity standard. The R-CRAS com-
bines an evaluation of general diagnostic categories
with an assessment of cognitive and volitional abilities
at the time of the offense. Thirty items are scored in
gradated severity against an anchor construct, and three
rationally constructed scales evaluate Patient Reliability,
Organicity, and Psychiatric Disorders. Finally, two
scales assess Cognitive Control and Behavioral Control.
The R-CRAS was designed to standardize clinical
judgments made regarding a legal construct that does
not readily link to traditional bodies of knowledge on
human behavior such as psychiatric diagnoses.
The array of available forensic assessment instru-
ments is wide-ranging. For example, the Inwald Per-
sonality Inventory (Inwald, 1992) was developed to
assess police officer candidates for behaviors and emo-
tional issues that might negatively affect their perfor-
mance as police officers. Instruments also include
measures of parenting capacity, daily decision making,
and competency to manage health-care decisions (e.g.,
MacArthur Competence Assessment Tool for Treat-
ment; Grisso, 1998). Forensic assessment instruments
are perhaps most commonly seen in adult criminal
court settings and are more prevalent in the area of
competency to stand trial (e.g., Fitness Interview Test– Revised, Roesch, Zapf, & Eaves, 2006; and Evaluation
FORENSIC ASSESSMENT � ARCHER ET AL. 355
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
of Competence to Stand Trial–Revised, Rogers, Till- brook, & Sewell, 2004), although measures also exist
for other areas of criminal forensic assessment (e.g.,
Miranda Rights Comprehension Instruments; Gold-
stein, Zelle, & Grisso, 2011).
There are both advantages and disadvantages to the
use of forensic assessment instruments. One clear
advantage is the ability to assess a circumscribed area of
legal concern. This allows the evaluator, if necessary,
to attend to the issue of interest to the court in a more
focused manner. In addition, most such tests have nor-
mative data available from forensic populations, which
allows for individual comparison with appropriate nor-
mative groups. Some tests have also developed an inde-
pendent body of research that allows a forensic
evaluator to testify with increased confidence regarding
the strengths and limitations of test findings. As noted
by Melton et al. (2007), many forensic assessment
instruments are also more likely than clinical assessment
instruments to be face valid in reference to the legal
issue at hand, which may cause them to be viewed as
more comprehensible by examinees.
However, many specialized forensic tests have a very
limited research literature, and their reliability and valid-
ity are therefore not always well understood. As Otto
and Heilbrun (2002) have noted, numerous instruments
have been marketed without adequate research and
development. Some tests have been published after con-
ducting only minimal research, and occasionally pub-
lished in the absence of a comprehensive test manual.
Even some of the more popular instruments, for exam-
ple, those designed to assess competence to stand trial,
have demonstrated mixed research findings, in particular
regarding their construct validity. Many of the less com-
monly used instruments have little independent research
associated with them, thus severely limiting the confi-
dence with which hypotheses can be established and test
inferences can be drawn. Another potential disadvantage
is the temptation to substitute a single narrowly focused
instrument (such as one that measures constructs rele-
vant to competence to stand trial) for a multimethod
approach that also includes nonquantitative methods.
Thus, the potential benefits of forensic assessment
instruments (especially their overt relevance to psy-
cholegal constructs) can also contribute to results being
misused or misconstrued.
Although forensic assessment instruments have pro-
liferated rapidly, traditional clinical assessment instru-
ments continue to play an integral role in forensic
evaluations. The surveys we reviewed earlier have
demonstrated that instruments such as the Minnesota
Multiphasic Personality Inventory–2 (MMPI-2; Butcher et al., 2001), the Millon Clinical Multiaxial
Inventory–Third Edition (MCMI-III; Millon, Millon, Davis, & Grossman, 1997), and the Wechsler Intelli-
gence Scales are all frequently used in forensic evalua-
tions, while more and more research has provided
support for the appropriate use of these instruments in
forensic contexts. As this literature cautions, however,
these tests were developed for clinical rather than
forensic purposes, and thus applying them in a different
context requires careful consideration of contextual
variables, and awareness of the guidance provided by
research findings. In contrast to forensic assessment
instruments, the traditional clinical tests most com-
monly used in forensic evaluations tend to have better-
established reliability and validity based on a more
extensive body of research. For example, there is very
extensive research on forensic applications of the
MMPI-2 (e.g., Pope, Butcher, & Seelen, 2006), as well
as many neuropsychological tests and, to a lesser
degree, instruments such as the MCMI-III (e.g.,
McCann & Dyer, 1996) and the Rorschach (e.g.,
Gacono & Evans, 2008).
Traditional clinical assessment tools may be of bene-
fit in forensic assessment due in part to features such as
validity indicators, which can be particularly helpful for
evaluating response style and defensiveness. Clinical
measures can also provide a broad understanding of
multiple areas of functioning, for evaluations in which
an overall assessment of functioning in multiple
domains may be helpful. In addition, a particular
advantage to the use of traditional clinical assessment
instruments is the well-established reliability and valid-
ity data available for these tests.
Despite the ever-deepening research support for
forensic applications of traditional clinical assessment
instruments, there are inherent challenges associated
with using them for this purpose. First, these tests were
not designed for forensic purposes but rather for diag-
nostic and treatment planning purposes in therapeutic
settings, and relatively few of them have appropriate
356 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
forensic norms or comparison groups. The most rele-
vant question in many forensic evaluations may not be
the extent to which an individual’s scores differ from
the normative population, but rather the extent to
which an individual’s responses differ from a specific
comparison population. For example, to what extent
does this police officer applicant’s test results differ
from those typically found in this population, or to
what extent does this parent’s test scores differ from
those produced by other parents in custody evalua-
tions? In addition, the available research addressing
forensic applications of traditional clinical assessment
instruments varies significantly depending on the indi-
vidual test.
As noted by Melton et al. (2007), another challenge
associated with traditional clinical tests is that the results
are best viewed as hypotheses, with refinement and
confirmation to occur over time and with additional
data. In forensic contexts, where examinee contact may
be limited, hypothesis evaluation may require extensive
third-party or archival information. In addition, the
typical computer-generated reports associated with
many such tests are ripe for misuse in the courtroom
setting, where results can easily be taken out of con-
text. On the other hand, this challenge related to the
brief examinee contact that often occurs in forensic
assessment may also be viewed as an advantage in that
forensic evaluators obtain much more extensive inde-
pendent data concerning their examinee from sources
such as medical and legal records and collateral inter-
views. These sources of information can often prove
more useful in testing hypotheses than the data gener-
ated from the more extensive self-report information
provided by the examinee across repeated contacts in
typical clinical settings.
WHAT EMPIRICALLY SUPPORTED CHARACTERISTICS ARE
IMPORTANT IN EVALUATING FORENSIC TESTS?
In this section, we will discuss the broad characteristics
of empirically supported forensic assessment instru-
ments, noting at the onset that the features sought in
tests used in evaluation are generally similar to those
required for any clinical assessment test, for example,
evidence of adequate standardization, reliability, and
validity. However, the specific standards used for estab-
lishing these characteristics in forensic testing may, at
points, differ from those required of general clinical
assessment instruments. We will, therefore, propose
five criteria that we believe to be particularly useful in
evaluating the empirical support of forensic tests:
• The first characteristic required of an empirically supported forensic test is that the test has adequate
standardization. By this term, we mean a clear speci-
fication of the conditions under which the test is to
be administered, scored, and interpreted, typically
provided in a comprehensive test manual. The test-
ing manual should also clarify the required test user
qualifications, test administration instructions, and
available scoring methods. The goal of this type of
standardization is to ensure, as much as possible, that
individual scores on the instrument actually reflect
differences on the traits or characteristics being mea-
sured, and are unrelated to the particular examiner
characteristics or geographic circumstances of the
evaluation. While this goal is, of course, an ideal that
is not entirely achievable, adequate standardization
does minimize the extent to which the test scores
produced by an instrument reflect characteristics
such as error variation due to differences in adminis-
tration, scoring, or interpretation procedures.
• A second characteristic important in the evaluation of tests used in a forensic context is established evi-
dence in peer-reviewed journals of acceptable levels
of reliability and validity in test scores when the test
is used for forensic purposes. This criterion includes
data relevant to establishing the error rate of the
instrument, which may be expressed as a standard
error of measurement and/or in terms of sensitivity
and specificity in predicting an individual’s member-
ship in forensically relevant groups, for example,
correctly identifying individuals who do and do not
have a high risk of sexual offense recidivism. Evi-
dence of reliability, of course, would also include
indices of situational stability across time for tests
measuring trait characteristics (e.g., test–retest relia- bility) as well as internal consistency measures such
as alpha coefficient reliability estimates for tests
focused on more homogeneous constructs. In this
latter regard, a forensic test would be considered
reliable if the coefficient measures demonstrate that
items are measuring a single or homogeneous
FORENSIC ASSESSMENT � ARCHER ET AL. 357
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
variable. Further, factors influencing this type of
internal reliability include the test length, homo-
geneity of the target population, and the standard
error of measurement of test scores. Validity, on the
other hand, refers to the extent to which a test use-
fully measures the construct or trait it was designed
to evaluate. There are several kinds of test validity,
including face validity (the extent to which test
items appear appropriate to the individual taking the
test), content validity (the extent to which items in
the test sample the relevant domain of interest), and
criteria-related validity (the extent to which the test
score produces evidence of concurrent and predic-
tive validity). Concurrent validity estimates are pro-
duced when test scores and external criteria
measures are evaluated at the same time and may be
divided into convergent and discriminant validity,
that is, the degree to which test scores are strongly
related to other measures of the same construct, and
the degree that test scores differ substantially from
scores obtained on measures of unrelated constructs,
respectively. The concept of discriminant validity is
often of particular importance in forensic testing.
For example, test scores that are elevated in the
presence of PTSD-related symptoms, but also ele-
vated as a result of symptoms related to many other
disorders, are of little etiological or diagnostic value
in diagnosing PTSD. The concept of predictive
validity is based on circumstances in which test
scores are obtained, and the individual’s performance
on relevant criteria is evaluated at some later date.
All of these measures of reliability and validity may
be seen as establishing a known error rate for a test,
that is, the degree to which the accumulated body
of research on the test establishes the scores is use-
fully measuring the target trait or characteristic ver-
sus the extent to which test scores reflect variance
due to sources of error unrelated to the trait or con-
struct of interest.
• A third characteristic of particular importance in evaluating forensic tests involves the “general accep-
tance within the field” criterion retained within the
Daubert Supreme Court decision. In this regard, the
results of the numerous surveys that have been con-
ducted concerning tests used for forensic purposes,
as well as surveys that have specifically solicited the
opinions of forensic psychologists regarding instru-
ments that meet Daubert standards (e.g., Bow et al.,
2006), are particularly relevant in addressing this cri-
terion.
• The fourth characteristic of empirically supported forensic testing relates to the importance of cross-
cultural or cross-ethnic evaluations of the forensic
test to provide evidence concerning the extent to
which test scores might be expected to vary as a
result of cultural or ethnic differences. Perhaps more
importantly, this issue also includes the extent to
which test results require a different interpretation
approach because test correlates have been shown to
vary as a function of cultural or ethnic factors. For
example, do the test scores of individuals evaluated
within a criminal context, including such issues as
competency to stand trial and sanity evaluations,
show evidence of systemic bias related to the racial,
ethnic, or cultural background of the examinee?
Further, evidence about cross-cultural stability is also
critically important in forensic instruments that may
be used in noncriminal evaluations such as personal
injury litigation, parenting capacity, or custody
evaluations. To what extent, for example, is an
MMPI-2 or MCMI-III evaluation of a Chinese-
born immigrant mother in a U.S.-based custody
evaluation likely to be influenced by cultural factors
in that individual’s background? The size of the
research literatures on this topic shows enormous
variability across test instruments, with a relatively
massive research literature spanning 50 years on
cross-cultural and cross-national influences on
MMPI and MMPI-2 test scores, to much more lim-
ited data available for more recent test instruments
and more specialized test instruments.
• The fifth and final factor proposed as a criterion for evaluating the empirical support for forensic tests
relates to the extent to which comparison or norma-
tive data are available to help interpret test scores in
the forensic population for which the test is being
employed. For example, if a self-report multiscale
inventory is being used to evaluate an individual in
a custody litigation context, are there comparison
data available through which to establish the typical
or mean performance of other individuals in this
context (beyond the data generated from the
358 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
nonclinical normative sample)? Further, what do we
know about the typical or mean profile characteris-
tics of personal injury litigants for a given multiscale
inventory? For many forensic tests, particularly those
designed for a specific forensic purpose, comparison
data are readily available for forensic populations of
interest. In contrast, many clinical assessment instru-
ments that have been extended for use in forensic
populations vary widely in terms of the availability
of adequate comparison group data. Among the bet-
ter instruments in this latter regard, the MMPI-2-RF
Technical Manual (Tellegen & Ben-Porath, 2008)
provides comparison data from large samples of men
and women in various forensic settings, including
criminal defendants, prison inmates, personal injury
and disability claimants, and child custody litigants.
EMPIRICALLY SUPPORTED TESTS USED IN FORENSIC
ASSESSMENT SETTINGS
A key characteristic required of empirically supported
forensic tests is adequate standardization. Based on this
criterion alone, it is possible to eliminate a number of
popularly used assessment techniques, including a vari-
ety of sentence completion tests, projective drawings
tests, and the Thematic Apperception Test (TAT). In
contrast, tests that exemplify a desirable degree of stan-
dardization include the various forms of the MMPI
(MMPI-2, MMPI-2-RF, and MMPI-A) and the
Wechsler Intelligence Scales. The Wechsler Intelligence
Scales, for example, present standardization samples for
these instruments that are large and diverse. The test
manuals for both the adult and child versions of the
Wechsler Intelligence Scales clearly specify the condi-
tions under which tests are to be administered, scored,
and interpreted, and the specific methods for doing so.
The testing manual provides clear administration and
scoring instructions that ensure a highly standardized
testing experience and scoring process for all exami-
nees. The required test user qualifications are also
clearly described. In addition, there are clear standards
of measurement error to assist the forensic evaluator in
appropriately framing results. The test does have appli-
cability limits with the forensic population (as it does
with any population), and the forensic evaluator should
be acutely aware, for example, of its limitations with
color-blind individuals, those with vision or hearing
impairments, or those with a history of traumatic brain
injury. On the whole, however, the Wechsler scales
provide a highly standardized approach to assessing the
intelligence construct in a diverse range of children and
adults.
A second desirable characteristic in the evaluation of
forensic tests is evidence of acceptable levels of reliabil-
ity and validity as established in research studies pub-
lished in peer-reviewed journals. While numerous
instruments could be used to illustrate acceptable levels
of reliability and validity, the Minnesota Multiphasic
Personality Inventory–Adolescent (MMPI-A) was selected for purposes of this article because of the dom-
inance of this test among self-report instruments used
with adolescents. Baum, Archer, Forbey, and Handel
(2009) reviewed the literature on the MMPI-A and
identified 277 articles, books, chapters, dissertations,
and monographs published on this test between the
time of its release in 1992 until 2007. As a point of
comparison, the Millon Adolescent Clinical Inventory
(MACI), the second most frequently investigated
objective self-report measure for adolescents, produced
84 publications or dissertations across the same 15-year
time frame. The information provided in this body of
literature has clearly established the test–retest reliabil- ity, internal consistency, and factor structure of the
MMPI-A, in addition to providing extensive correlate
information for MMPI-A scales in normal settings, as
well as in clinical and forensic settings. The standard
error measurements for the MMPI-A clinical scales
have been well documented to typically fall within a
range of 2–3 raw score points or 5 T-score points. While Hathaway and McKinley (1943) did not develop
the original form of the MMPI with the specific inten-
tion of utilizing this test in forensic evaluations, there
have been extensive studies conducted with the
MMPI-2 and MMPI-A in forensic settings sufficient to
establish the usefulness of the extension of these tests
for forensic evaluations. Indeed, the first study of the
MMPI in a juvenile offender population was con-
ducted by Capwell (1945a, 1945b), who examined the
effectiveness of scale Pd in accurately identifying delin-
quent and nondelinquent adolescents. The extensive
literature on the use of the MMPI-A in forensic set-
tings has been summarized in Archer (2005), who
noted that the MMPI-A has been used with
FORENSIC ASSESSMENT � ARCHER ET AL. 359
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
adolescents in the assessment of emotional damages
related to personal injury litigation, child custody eval-
uations, evaluations of sexually abused adolescents, and
presentencing and postsentencing evaluations of adoles-
cents in the juvenile justice system.
Baum et al. (2009) noted that of the 277 publica-
tions and dissertations on the MMPI-A between 1992
and 2007, 28 publications and 26 dissertations were in
the forensic area. These studies, for example, have
established the validity of the MMPI-A in predicting
external correlates among male juvenile delinquents
(Archer, Bolinskey, Morton, & Farris, 2003), boys and
girls referred for court-ordered predispositional evalua-
tions (Handel, Archer, Elkins, Mason, & Simonds-Bis-
bee, 2011), male adolescent sex offenders (Freeman,
Dexter-Mazza, & Hoffman, 2005), and incarcerated
adolescent substance abusers (Stein & Graham, 2005).
The mean and standard deviations typically found for
the MMPI-A in juvenile justice samples, as reflected
from the findings of 14 studies, were also reported in
Baum et al. (2009). These latter authors concluded that
testimony based on the MMPI-A is likely to meet the
guidelines established under Daubert for the admissibil-
ity of evidence.
As previously noted, a third characteristic of particu-
lar importance in evaluating forensic tests involves the
“general acceptance within the field” criterion as origi-
nally established in Frye, and subsequently retained
within the expanded guidelines in the Daubert Supreme
Court decision. Numerous instruments would clearly
meet this standard, such as the Psychopathy Checklist– Revised (PCL-R) in the measurement of psychopathy,
the Millon Clinical Multiaxial Inventory-III (MCMI-
III) in the assessment of personality disorders, the Test
of Malingered Memory (TOMM; Tombaugh, 1996),
and the Structured Interview of Reported Symptoms
(SIR-S; Rogers, Bagby, & Dickens, 1992) in the
assessment of malingering. It is our belief that the Per-
sonality Assessment Inventory (PAI) offers an interest-
ing example of a relatively new self-report inventory
that has become increasingly popular in forensic evalu-
ations. Much of the earlier survey research regarding
the most commonly used tests in forensic evaluations
was conducted when the PAI was still quite new, and
it was therefore not commonly cited among the most
popular tests for forensic work. However, the PAI has
quickly become popular in forensic and correctional
settings, perhaps in part due to its shorter length in
comparison with other instruments such as the MMPI-
2 (e.g., Piotrowski, 2007). This gave rise to a great deal
of independent research on the validity of the PAI with
forensic populations, with generally favorable results
(e.g., Douglas, Hart, & Kropp, 2001; Edens, Cruise, &
Buffington-Vollum, 2001). Its continued popularity,
combined with strong research support from highly
esteemed members of the forensic research community,
lends it the credibility to now be viewed as generally
accepted in the field.
The fourth characteristic of empirically supported
forensic tests relates to the availability of empirical
data supporting the cross-cultural or cross-ethnic use
of a forensic instrument. While several widely used
tests have an extensive literature in this regard, it is
doubtful that any assessment measure can currently
rival the extensive research literature that has been
developed on cross-cultural, cross-ethnic, and cross-
national characteristics of the MMPI and MMPI-2.
The MMPI and MMPI-2 have been extensively stud-
ied among American minority populations, with sub-
stantial data available regarding African American (e.g.,
Arbisi, Ben-Porath, & McNulty, 2002), Asian Ameri-
can (e.g., Dong & Church, 2003), and Hispanic
American (e.g., Butcher, Cabiya, Lucio, & Garrido,
2007) respondents. Indeed, Zapata, Kreuch, Landers,
Hoyt, and Butcher (2009) state that “strong support
for the use of American norms to assess Latino clients
living in the United States has been provided by a
number of studies.” Further, several studies have
addressed the issue of response patterns found among
American Indians (e.g., Greene, Robin, Albaugh,
Caldwell, & Goldman, 2003; Tinius & Ben-Porath,
1993). Several books have been dedicated to the pro-
cess of developing international adaptations of the
MMPI-2 and the research in that area (e.g., Butcher,
1996). Butcher (2011) reviewed some of the high-
lights in the cross-cultural research on the MMPI and
MMPI-2, which goes back as far as the 1940s, when
the items began being translated for use in other
countries (e.g., Italy, Cuba, Germany, and Norway, to
name a few of the early translations). Indeed, in 1957,
Nencini and Banissoni published an official version of
the MMPI, along with norms, for use in Italy.
360 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
Further, by the late 1950s, translations were also being
developed for Asian countries (e.g., Japan in 1959 and
Korea in 1963). Today, the MMPI-2 has been translated
into 32 foreign languages, and many of the MMPI-2 test
translations have been carefully evaluated in terms of test
equivalency with the English-language form of the
MMPI-2.
The fifth and final criterion proposed for establish-
ing a forensic test as empirically supported involves the
availability of comparison data to evaluate test scores in
relation to a specific forensic population for which the
test is employed. For example, as previously noted, the
MMPI-2-RF, a broad-gauge measure of psychopathol-
ogy normed on a sample of normal individuals, has a
variety of comparison groups to support its use in pop-
ulations of personal injury claimants, custody evaluation
litigants, and prison populations. Among tests designed
to address a specific forensic issue, the STATIC-99
(and STATIC-99R, the more recent revision of this
test) is an actuarial instrument designed to assess the
probability that a male sex offender will commit
another sex offense following his release. The STA-
TIC-99, developed by Hanson and Thorton (1999), is
the most widely used sex offender risk assessment
instrument in the world. Their database used in pre-
dicting risk of recidivism incorporated large samples of
male sex offenders from North America, Europe, New
Zealand, and the United Kingdom. Another outstand-
ing example of forensic-specific norms is the Mas-
sachusetts Youth Screening Instrument–Version 2 (MAYSI-2) by Grisso and Barnum (2000), which is a
52-item self-report questionnaire used to screen for
emerging mental health disorders in youth (ages 12– 17 years) at intake into the juvenile justice system.
Normative data are available for the MAYSI-2 based
on a combined sample of 70,695 youths from juvenile
justice systems in 19 states, representing over 200 agen-
cies, including detention, corrections, and intake pro-
bation agencies.
SUMMARY AND CONCLUSIONS
The area of forensic assessment is a quickly growing
and crucial area of Forensic Psychology. The practice
of Forensic Psychology is guided by several sources,
including the APA Specialty Guidelines for Forensic Psy-
chology (2011) and the legal standards that provide
practice guidelines in relevant federal and state statutes
and case law. One of these legal standards is the Dau-
bert (1993) decision by the Supreme Court, which
helped to provide guidance to judges regarding what
types of testimony (and by extension, assessment mea-
sures) may be admissible in court.
One of the main ways of establishing whether a test
meets the standard of general acceptance within the
field may be based on the results of practitioner sur-
veys regarding test usage. Research from a variety of
studies has indicated that there is a core group of tests
that are commonly used and recommended by psy-
chologists in forensic settings. However, while these
surveys may establish that a test is commonly used for
a particular evaluation purpose, this does not, by itself,
necessarily indicate that those tests meet other criteria
that might be established for empirically supported
tests. In terms of these criteria, the authors have rec-
ommended five key guidelines or criteria that they
consider when determining whether a test meets the
criteria necessary for designation as an empirically sup-
ported test used in forensic evaluations. Numerous tests
appear on the cusp of being empirically supported, and
additional focused research could help to clarify and
aid practitioners in making informed choices regarding
the composition of their assessment batteries.
While the guidelines in this article may provide a
helpful framework for evaluating the usefulness of these
new measures, the establishment of a national review
panel or workgroup to review research findings and
offer well-documented recommendations concerning
which tests do, and do not, provide sufficient evidence
to be classified as empirically supported forensic tests
could be very useful to practitioners in the field. This
type of approach would be similar to the evidence-
based treatment panels that have been established in
medicine, psychotherapy, and numerous other fields to
review the empirical research literature on treatment
effectiveness and offer conclusions and recommenda-
tions. Given the potentially contentious nature of eval-
uating test instruments for this purpose, it would be
essential to the credibility of such a group that it be
composed of a membership representative of Forensic
Psychology and assessment psychology professional
groups and organizations, and independent of the influ-
ence of commercial interests.
FORENSIC ASSESSMENT � ARCHER ET AL. 361
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
REFERENCES
Ackerman, J. M. (1999). Essentials of forensic psychological
assessment. West Sussex, UK: John Wiley & Sons.
Ackerman, M. J., & Ackerman, M. C. (1997). Custody
evaluation practices: A survey of experienced professionals
(revisited). Professional Psychology: Research and Practice, 28,
137–145. doi:10.1037/0735-7028.28.2.137 American Psychological Association. (2011). Specialty
guidelines for forensic psychology. Retrieved from http://
www.apa.org/practice/guidelines/forensic-psychology.pdf
Arbisi, P. A., Ben-Porath, Y., & McNulty, J. (2002). A
comparison of MMPI-2 validity in African American and
Caucasian psychiatric inpatients. Psychological Assessment.,
14(1), 3–15. doi:10.1037/1040-3590.14.1.3 Archer, R. P. (2005). MMPI-A: Assessing adolescent
psychopathology (3rd ed.). New York, NY: Routledge Press.
Archer, R. P., Bolinskey, P. K., Morton, T. L., & Farris, K.
L. (2003). MMPI-A characteristics of male adolescents in
juvenile justice and clinical treatment settings. Assessment,
10, 400–410. doi:10.1177/1073191103256128 Archer, R. P., Buffington-Vollum, J. K., Stredny, R. V., &
Handel, R. W. (2006). A survey of psychological test
use patterns among forensic psychologists. Journal of
Personality Assessment, 87(1), 84–94. doi:10.1207/ s15327752jpa8701_07
Army Individual Test Battery. (1944). Manual of directions
and scoring. Washington, DC: War Department, Adjutant
General’s Office.
Austin, K., & Wygant, D. B. (2012). A survey of adult
personality and psychopathology test usage. Poster presented at
the Kentucky Psychological Association Spring Academic
Conference, Lexington, KY.
Baum, L. J., Archer, R. P., Forbey, J. D., & Handel, R. W.
(2009). A review of the Minnesota Multiphasic
Personality Inventory–Adolescent (MMPI-A) and the Millon Adolescent Clinical Inventory (MACI) with an
emphasis on juvenile justice samples. Assessment, 16,
384–400. doi:10.1177/1073191109338264 Beck, A. T., Steer, R. A., & Brown, G. K. (1996). Manual
for the Beck Depression Inventory-II. San Antonio, TX:
Psychological Corporation.
Bender, L. (1946). Instructions for the use of visual motor
Gestalt test: Cards and manual of instructions. Denver,
CO: American Orthopsychiatric Association.
Ben-Porath, Y. (2012). Addressing challenges to MMPI-2-RF-
based testimony: Questions and answers. Archives of Clinical
Neuropsychology, 27, 691–705. doi:10.1093/arclin/acs083 Berg, E. A. (1948). A simple objective technique for
measuring flexibility in thinking. Journal of General
Psychology, 39, 15–22. doi:10.1080/00221309.1948. 9918159
Boccaccini, M. T., & Brodsky, S. L. (1999). Diagnostic test
usage by forensic psychologists in emotional injury cases.
Professional Psychology: Research and Practice, 30, 253–259. doi:10.1037/0735-7028.30.3.253
Borum, R., Bartel, P., & Forth, A. (2003). Manual for the
Structured Assessment of Violence Risk in Youth (SAVRY).
Tampa, FL: University of South Florida.
Bow, J. N., Gould, J. W., Flens, J. R., & Greenhut, D.
(2006). Testing in child custody evaluations—Selection, usage, and Daubert admissibility: A survey of
psychologists. Journal of Forensic Psychology Practice, 6,
17–38. doi:10.1300/J158v06n02_02 Bow, J. N., & Quinnell, F. A. (2001). Psychologists’ current
practices and procedures in child custody evaluations: Five
years post American Psychological Association guidelines.
Professional Psychology: Research and Practice, 32, 261–268. doi:10.1037/0735-7028.32.3.261
Bow, J. N., & Quinnell, F. A. (2002). A critical review of
child custody evaluation reports. Family Court Review, 40,
164–176. doi:10.1111/j.174-1617.2002.tb00827.x Butcher, J. N. (1996). International adaptations of the MMPI-2:
Research and clinical applications. Minneapolis, MN:
University of Minnesota Press.
Butcher, J. N. (2011). A beginner’s guide to the MMPI-2 (3rd ed.).
Washington, DC: American Psychological Association.
Butcher, J. N., Cabiya, J., Lucio, E. M., & Garrido, M.
(2007). Assessing Hispanic clients using the MMPI-2 and
MMPI-A. Washington, DC: American Psychological
Association. doi:10.1037/11585-000
Butcher, J. N., Graham, J. R., Ben-Porath, Y. S., Tellegen,
A., Dahlstrom, W. G., & Kaemmer, B. (2001). Minnesota
Multiphasic Personality Inventory–2 (MMPI-2): Manual for administration, scoring, and interpretation (rev. ed.).
Minneapolis, MN: University of Minnesota Press.
Capwell, D. F. (1945a). Personality patterns of adolescent
girls: I. Girls who show improvement in IQ. Journal of
Applied Psychology, 29, 212–228. doi:10.1037/h0062853 Capwell, D. F. (1945b). Personality patterns of adolescent
girls: II. Delinquents and non-delinquents. Journal of
Applied Psychology, 29, 284–297. doi:10.1037/h0054701 Daubert V. Merrell Dow Pharmaceuticals, 113 S. Ct. 2786
(1993).
Dong, Y. L. T., & Church, A. T. (2003). Cross-cultural
equivalence and validity of the Vietnamese MMPI-2:
Assessing psychological adjustment of Vietnamese
refugees. Psychological Assessment, 15, 370–377. doi:10.1037/1040-3590.15.3.370
362 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
Douglas, K. A., Hart, S. D., & Kropp, P. R. (2001). Validity
of the Personality Assessment Inventory for forensic
assessments. International Journal of Offender Therapy and
Comparative Criminology, 45, 183–197. doi:10.1177/ 0306624X01452005
Edens, J. F., Cruise, K. R., & Buffington-Vollum, J. K.
(2001). Forensic and correctional applications of the
Personality Assessment Inventory. Behavioral Sciences and
the Law, 19, 519–543. doi:10.1002/bsl.457 Exner, J. E. (1993). The Rorschach: A comprehensive system:
Vol. 1. Basic foundations (3rd ed.). New York, NY: Wiley.
Freeman, K. A., Dexter-Mazza, E. T., & Hoffman, K. C.
(2005). Comparing personality characteristics of juvenile
sex offenders and non-sex offending delinquent peers: A
preliminary investigation. Sexual Abuse: Journal of Research
and Treatment, 17, 3–12. doi:10.1177/1524838006292519 Frye v. United States, 392 F. 1013 (D.C. Cir. 1923).
Gacono, C. L., & Evans, L. A. (2008). The handbook of
forensic Rorschach assessment. New York, NY: Erlbaum.
General Electric Company v. Joiner, 118 S. Ct. 512 (1997).
Goldstein, N. E., Zelle, H., & Grisso, T. (2011). The Miranda
Rights Comprehension Instruments-II. Sarasota, FL:
Professional Resource Press.
Goodman-Delahunty, J. (1997). Forensic psychological
expertise in the wake of Daubert. Law and Human Behavior,
21, 121–140. doi:10.1023/A:1024874228425 Greene, R. L., Robin, R. W., Albaugh, B., Caldwell, A., &
Goldman, D. (2003). Use of the MMPI-2 in American
Indians: II. Empirical correlates. Psychological Assessment,
15, 360–369. doi:10.1037/1040-3590.15.3.360 Grisso, T. (1998). Forensic evaluations of juveniles. Sarasota, FL:
Professional Resource Exchange.
Grisso, T., & Barnum, R. (2000). Massachusetts Youth
Screening Instrument-2nd version. Worcester, MA: University
of Massachusetts Medical School.
Halstead, W. C. (1947). Brain and intelligence: A quantitative
study of the frontal lobes. Chicago, IL: University of Chicago.
Handel, R. W., Archer, R. P., Elkins, D. E., Mason, J. A.,
& Simonds-Bisbee, E. C. (2011). Psychometric properties
of the Minnesota Multiphasic Personality Inventory-
Adolescent (MMPI-A) clinical, content, and
supplementary scales in a forensic sample. Journal of
Personality Assessment, 93, 566–581. doi:10.1080/ 00223891.2011.608752
Hanson, R. K., & Thorton, D. (1999). Static-99: Improving
actuarial risk assessments for sex offenders. Ottawa, Ontario,
Canada: Department of the Solicitor General of Canada.
Hare, R. D. (1991). The Hare Psychopathology Checklist-
Revised. Toronto, Canada: Multi-Health Systems.
Hathaway, S. R., & McKinley, J. C. (1943). The Minnesota
Multiphasic Personality Inventory (rev ed.). Minneapolis, MN:
University of Minnesota Press.
Huss, M. (2009). Forensic psychology: Research, clinical practice,
and applications. West Sussex, UK: John Wiley & Sons.
Inwald, R. E. (1992). Inwald Personality Inventory technical
manual (revised). New York, NY: Hilton Research.
Jenkins v. United States, 307 F. 2d 637 (1962).
Kumho Tire Company v. Carmichael, 526 U.S. 137 (1999).
Lally, S. J. (2003). What tests are acceptable for use in
forensic evaluations? A survey of experts. Professional
Psychology: Research and Practice, 34, 491–498. doi:10.1037/ 0735-7028.34.5.491
Lees-Haley, P. R., Smith, H. H., Williams, C. W., & Dunn,
J. T. (1996). Forensic neuropsychological test usage: An
empirical survey. Archives of Clinical Neuropsychology, 11(1),
45–51. doi:10.1016/0887-6177(95)00011-9 McCann, J., & Dyer, F. J. (1996). Forensic assessment with the
Millon inventories. New York, NY: Guilford Press.
Melton, G. B., Petrilla, J., Poythress, N. G., & Slobogin, C.
(2007). Psychological evaluations for the courts: A handbook for
mental health professionals and lawyers (3rd ed.). New York,
NY: Guilford Press.
Millon, T. (1994). Millon Clinical Multiaxial Inventory-III:
manual. Minneapolis, MN: Pearson Assessment.
Millon, T., Millon, C., Davis, R., & Grossman, S. (1997).
Millon Clinical Multiaxial Inventory-III manual (4th ed.).
Minneapolis, MN: Pearson Assessment.
Morey, L. C. (2007). The Personality Assessment Inventory
professional manual. Lutz, FL: Psychological Assessment
Resources.
Nencini, R., & Banissoni, P. (1957). MMPI (former ridotta):
Edizona Italiana [MMPI short form: Italian edition].
Florence, Italy: Organizzazioni Speciali.
Otto, R., & Heilbrun, K. (2002). The practice of forensic
psychology: A look toward the future in light of the past.
American Psychologist, 57, 5–18. doi:10.1037/0003-066X.57.1.5 Piotrowski, C. (2007). Forensic psychological testing as a
function of affiliation and organizational setting.
Organization Development Journal, 25, 94–98. Pope, H. S., Butcher, J. N., & Seelen, J. (2006). The MMPI,
MMPI-2, and MMPI-A in court: A practical guide for expert
witnesses and attorneys (3rd ed.). Washington, DC:
American Psychological Association.
Poythress, N., Nicholson, R., Otto, R. K., Edens, J. F.,
Bonnie, R. J., Monahan, J., & Hoge, S. K. (1999). The
MacArthur Competence Assessment Tool-Criminal
Adjudication: Professional manual. Odessa, FL: Psychological
Assessment Resources.
FORENSIC ASSESSMENT � ARCHER ET AL. 363
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .
Rabin, L. A., Barr, W. B., & Burton, L. A. (2005).
Assessment practices of clinical neuropsychologists in the
United States and Canada: A survey of INS, NAN, and
APA Division 40 members. Archives of Clinical
Neuropsychology, 20, 33–65. doi:10.1016/j.acn.2004.02.005 Roesch, R., Zapf, P. A., & Eaves, D. (2006). Fitness Interview
Test-Revised: A structural interview for assessing competency to
stand trial. Sarasota, FL: Professional Resource Press.
Rogers, R. (1984). Rogers Criminal Responsibility Assessment
Scales (R-CRAS). Odessa, FL: Psychological Assessment
Resources.
Rogers, R., Bagby, R. M., & Dickens, S. E. (1992).
Structured Interview of Reported Symptoms (SIRS) and
professional manual. Odessa, FL: Psychological Assessment
Resources.
Rogers, R., Salekin, R. T., & Sewell, K. W. (1999).
Validation of the Millon Clinical Multiaxial Inventory for
Axis II disorders: Does it meet the Daubert standard? Law
and Human Behavior, 23, 425–443. doi:10.1023/ A:1022360031491
Rogers, R., Tillbrook, C. E., & Sewell, K. W. (2004).
ECST-R: Evaluation of Competency to Stand Trial–Revised: Professional manual. Lutz, FL: Psychological Assessment
Resources.
Sellbom, M. (2012). The MMPI-2-RF is ready for the
Daubert challenge: Evidence, implications, and
recommendations for use in court testimony. Journal of
Psychological Practice, 17, 151–179. Stein, L. A. R., & Graham, J. R. (2005). Ability of substance
abusers to escape detection on the Minnesota Multiphasic
Personality Inventory-Adolescent (MMPI-A) in a juvenile
correctional facility. Assessment, 12, 28–39. doi:10.1177/ 1073191104270838
Tellegen, A., & Ben-Porath, Y. S. (2008). Minnesota
Multiphasic Personality Inventory-2: Restructured Form
(MMPI-2-RF) technical manual. Minneapolis, MN:
University of Minnesota Press.
Tinius, T., & Ben-Porath, Y. S. (1993). A comparative study of
Native and Caucasian Americans undergoing substance abuse
treatment. Twenty-Eighth Annual Symposium on Recent
Developments in the MMPI-2/MMPI-A, St. Petersburg, FL.
Tombaugh, T. N. (1996). The test of memory malingering.
Toronto, Canada: Multi-Health Systems.
Underwager, R., & Wakefield, H. (1993). A paradigm shift
for expert witnesses. Issues in Child Abuse Accusations, 5(3),
156–167. Vaccaro, T. P., & Hogan, J. D. (2004). The origins of
forensic psychology in America: Hugo M€unsterberg on
the witness stand. New York State Psychologist, 16, 14–17. Viljoen, J. L., McLachan, K., & Vincent, G. M. (2010).
Assessing violence risk and psychopathy in juvenile and
adult offenders: A survey of clinical practices. Assessment,
17, 377–395. doi:10.1177/1073191109359587 Wechsler, D. (1997). The Wechsler Adult Intelligence Scale. (3rd
ed.). San Antonio, TX: The Psychological Corporation.
Wechsler, D. (2003). Wechsler Intelligence Scale for Children. (4th
ed.). San Antonio, TX: The Psychological Corporation.
Zapata, A., Kreuch, T., Landers, R. N., Hoyt, T., &
Butcher, J. N. (2009). Personality assessment in personnel
settings using the MMPI-2: A cross-cultural comparison.
International Journal of Clinical Health and Psychology, 9,
287–298. Retrieved from http://www.redalyc.org/ articulo.oa?id=33712028007
Received January 12, 2016; revised May 3, 2016; accepted
May 3, 2016.
364 CLINICAL PSYCHOLOGY: SCIENCE AND PRACTICE � V23 N4, DECEMBER 2016
T hi
s do
cu m
en t i
s co
py ri
gh te
d by
th e
A m
er ic
an P
sy ch
ol og
ic al
A ss
oc ia
tio n
or o
ne o
f i ts
a lli
ed p
ub lis
he rs
. T
hi s
ar tic
le is
in te
nd ed
s ol
el y
fo r t
he p
er so
na l u
se o
f t he
in di
vi du
al u
se r a
nd is
n ot
to b
e di
ss em
in at
ed b
ro ad
ly .