Psych
Psychological Bulletin 2000, Vol. 126, No. 5, 703-726
Copyright 2000 by the American Psychological Association, Inc. 0033-2909/00*5.00 DOI; 10.1037/TO33-2909.126.5.703
Gender Differences in Moral Orientation: A Meta-Analysis
Sara Jaffee and Janet Shibley Hyde University of Wisconsin—Madison
C. Gilligan's (1982) critique of L. Kohlberg's theory of moral reasoning and her assertion that two modes
of moral reasoning (justice and care) exist have been the subject of debate within the field of psychology
for more than 15 years. This meta-analysis was conducted to review quantitatively the work on gender
differences in moral orientation. The meta-analysis revealed small differences in the care orientation
favoring females (d = -.28) and small differences in the justice orientation favoring males (d = .19).
Together, the moderator variables accounted for 16% of the variance in the effect sizes for care reasoning
and 17% of the variance in the effect sizes for justice reasoning. These findings do not offer strong
support for the claim that the care orientation is used predominantly by women and that the justice
orientation is used predominantly by men.
The 1982 publication of Carol Gilligan's In a Different Voice
marked one of those rare moments when social science research
breaches the ivied walls of academia and captures the public's
imagination. Gilh'gan's assertion that females and males speak in
different moral voices—a care voice characterized by the desire to
maintain relationships and to respond to others' needs and a justice
voice characterized by considerations of fairness and equity—
resonated with readers' experience (Mednick, 1989). Ms. maga-
zine named Gilligan its 1984 Woman of the Year, lauding her for
research that "created a new appreciation for a previously uncata-
logued female sensibility, as well as possibilities for new under-
standing between the genders" (Van Gelder, 1984, p. 37). In 1996,
Time magazine included Gilligan among its "Time 25"—25 inno-
vative Americans with the ability "to show us the world anew, to
educate and entertain us, to change the way we think about
ourselves and others" ('Time 25," 1996, p. 54). Wrote the editors,
How likely is it that a single book could change the rules of psychol-
ogy, change the assumptions of medical research, change the conver-
sation among parents and teachers and developmental professionals
about the distinctions between men and women, boys and girls? (p. 66)
More than 15 years after the publication of this influential work,
are its theses supported by available scientific data?
Gilligan's (1982) research on female moral reasoning chal-
lenged cognitive-developmental stage theories of moral develop-
ment on two fronts. First, she argued for a broader conceptualiza-
tion of moral reasoning that encompassed a care orientation as well
as a justice orientation. The care orientation is characterized by a
focus on maintaining relationships, responding to the needs of
others, and a responsibility not to cause hurt. The justice orienta-
Sara Jaffee and Janet Shibley Hyde, Department of Psychology, Uni-
versity of Wisconsin—Madison.
We would like to acknowledge the contributions of Rose Jadack and
MaryBeth Nolan to this project. The Graduate School of the University of
Wisconsin provided financial support.
Correspondence concerning this article should be addressed to Janet
Shibley Hyde, Department of Psychology, University of Wisconsin—
Madison, 1202 W. Johnson Street, Madison, Wisconsin 53706.
tion is concerned with principles of fairness and equity such as
those assessed in conventional measures of moral reasoning (Gil-
ligan, 1982). Although acknowledging that males and females
could use either a justice or a care perspective, Gilligan asserted
that care reasoning was used predominantly by females and justice
reasoning was used predominantly by males (Gilligan, 1982; Gil-
ligan & Attanucci, 1988). Second, she argued that Kohlberg's
Moral Judgment Interview (MJI; Colby et al., 1987)—the most
widely used measure of moral reasoning—was gender biased
because it was validated on an all-male sample and because its
scoring scheme characterized considerations of care and response
as less sophisticated than considerations of justice and fairness.
Over the last 15 years, Gilligan's (1982) work has inspired a
wealth of empirical research and criticism, some of which has
supported her claims and some of which has not. At the same time,
researchers have seldom agreed on how Gilligan's care and justice
orientations should be defined or how they should be measured.
Consequently, it has been difficult to compare the results of these
studies. The goal of this article is to evaluate Gilligan's assertions
that (a) there are gender differences in Kohlbergian moral stage
and (b) the care and justice orientations are gender related. The
former assertion has been reviewed meta-analytically by several
investigators, and those results are summarized here. Meta-
analysis was used in the present paper to evaluate the latter
assertion. In addition, a number of complexities surrounding the
study of gender and moral reasoning are reviewed. The following
sections present Gilligan's theory of moral reasoning, as well as
the criticisms of this work.
Gilligan's Theory of Moral Reasoning
Gilligan's (1977, 1982) theory of moral reasoning stemmed
from Kohlberg's cognitive-developmental stage theory of moral
development. Kohlberg (1969, 1976, 1984) proposed that individ-
uals progress through a sequence of invariant and universal stages
of moral reasoning. These six stages are grouped into three levels,
each of which represents a qualitative advance in the individual's
ability to understand and integrate diverse points of view (Kohl-
berg, 1976).
703
704 JAFFEE AND HYDE
Early research on Kohlbergian moral development reported that
Stage 3 was the modal stage for females and Stage 4 was the modal
stage for males (Fishkin, Keniston, & MacKinnon, 1973; Haan,
Smith, & Block, 1968; Holstein, 1969; Kohlberg & Kramer, 1969;
Poppen, 1974). Stage 3 reasoning is characterized by the desire to
maintain relationships and to meet others' expectations, and
Stage 4 is characterized by a law-and-order mentality in which
laws are upheld so as to maintain the social order. These findings
led some researchers to accuse Kohlberg's theory of gender bias
(Gilligan, 1977, 1982; Haan, 1978). Specifically, Gilligan (1977,
1982) argued that because Kohlberg derived his theory of moral
development from an all-male sample, he neglected to recognize a
distinctively female mode of moral reasoning—one that is char-
acterized by a desire to maintain relationships and a responsibility
not to cause hurt. This care orientation contrasts with a distinc-
tively male mode of moral reasoning—the justice orientation—
that is based on the abstract principles of justice, fairness, and
individualism captured by Kohlberg's MJI. Unlike Kohlberg's
moral stages, the care and justice orientations do not represent
cognitive structures that develop in a stagelike sequence. Instead,
they represent frameworks that can be modified by experience and
through which individuals interpret and resolve moral problems.
However, Gilligan argued that the care orientation's emphasis on
maintaining relationships led to its classification as a Stage 3
response in Kohlberg's scoring scheme.
Origins and Development of Moral Orientations
Gilligan and colleagues asserted that the care and justice orien-
tations are rooted in early childhood experiences of attachment and
inequality that foster, respectively, a relational and an individual-
istic self-concept (Gilligan & Attanucci, 1988; Gilligan & Wig-
gins, 1987). Because children are born into a position of inequality
and cannot survive without an attachment figure, all children are
exposed to the conditions that form the basis of both moral
orientations. Thus, all individuals have the capacity to understand,
experience, or implement either moral orientation. The reason that
males and females come to use one mode of moral reasoning over
the other is that these experiences of attachment and inequality are
differentially reinforced in a society dichotomized by gender.
Working from Chodorow's (1978) neopsychoanalytic account of
gender identity development, Gilligan proposed that because
women are the primary caretakers in most cultures, girls' self-
concepts are based on a definition of themselves as similar to and
connected with their mothers whereas boys' self-concepts are
rooted in their difference and separation from their mothers. More-
over, boys identify with their fathers, who may be perceived as
authority figures who hold power over them. Thus, the experience
of attachment and connection is more salient to girls, whereas the
experience of inequality and separation is more salient to boys.
These initial experiences of attachment and inequality may be
confirmed in later childhood and adolescence, resulting in an
association between gender and moral orientation (Gilligan &
Wiggins, 1987). Gilligan and Wiggins (1987) concluded:
The sex difference question, when framed in this way, does not carry
the implication that one sex is morally superior, nor does it imply that
moral behavior is biologically determined. Instead, it draws attention
to two perspectives on morality. To the extent that biological sex, the
psychology of gender, and the cultural norms and values that define
masculine and feminine behavior affect the experience of equality and
attachment, these factors presumably will influence moral develop-
ment, (p. 282)
Empirical Tests of Gilligan's Developmental Theory
These claims about the origins and development of moral voice
are largely untested. Benenson, Morash, and Petrakos (1998) ob-
served 41 mother-child dyads (children were 4 and 5 years old) in
a play setting and found that, compared with boys, girls were
physically closer to their mothers, engaged in more mutual eye
contact with their mothers, and were rated higher on global enjoy-
ment. Although the authors cited these data as supportive of
Chodorow's (1978) theory, it is not clear whether emotional close-
ness, as measured in this study, matches Chodorow's concept of
parental identification. Moreover, because emotional closeness
between fathers and their children was not studied, it is impossible
to determine whether the nature of children's relationships with
their fathers differs from their relationships with their mothers.
Other researchers have interpreted Gilligan to mean that gender
differences in moral orientation stem from parental socialization
practices that foster a relational self-concept in females and an
individualistic self-concept in males (Lollis, Ross, & Leroux,
1996; Walker, 1997). Lollis and colleagues (1996) observed par-
ents' interventions in property conflicts among 2-4-year-old sib-
lings. They found that mothers intervened more often and used
more care reasoning than fathers, although parents did not differ in
their use of justice reasoning. However, there was no evidence that
girls received more care-oriented interventions nor that boys re-
ceived more justice-oriented interventions from either parent.
Thus, the authors concluded that boys and girls were not being
socialized differently in this respect. This argument, however,
cited a social learning explanation for the development of gender
differences in moral reasoning, whereas Gilligan posited a neo-
psychoanalytic account in which the origin of gender differences
in moral reasoning lies in the child's sense of identification with
each parent.
Various researchers have called into question the testability of
Gilligan's claims (Walker, 1995), citing the difficulties inherent in
measuring constructs like parental identification, equality/inequal-
ity, and attachment/detachment. Empirical tests of Gilligan's
claims would indeed prove challenging. Such studies might de-
scribe how the connected and individuated self-concepts are dif-
ferentially reinforced for girls and boys and how the self-concept
changes over time. They also might elucidate the mechanisms by
which parental identification leads to gender differences in the
self-concept and, ultimately, to gender differences in moral
orientation.
Levels of Care Reasoning
Originally, Gilligan (1982) proposed that the care orientation
developed in three phases, the first characterized by an exclusive
focus on one's own needs (i.e., caring for oneself only), the second
characterized by self-sacrifice and a focus on others' needs, and
the third characterized by the ability to balance others' needs and
one's own needs. This sequence was derived from her interviews
with 29 women, diverse in age, ethnic background, and socioeco-
nomic status (SES), who were considering whether to undergo an
GENDER DIFFERENCES IN MORAL ORIENTATION 705
abortion. Gilligan (1982) conducted a follow-up study 1 year after
the initial abortion interview. Although many of the women had
traversed at least two of the levels, none of the 21 women had
progressed across all three, and many showed regression. Gilligan
seems to have dropped this developmental sequence in her more
recent work (Gilligan, Brown, & Rogers, 1990).
Skoe and Marcia (1991) developed and validated the Ethic of
Care Interview (ECI) to assess Gilh'gan's levels of care reasoning.
The ECI is negatively related to measures of authoritarianism and
positively related to measures of ego identity, cognitive complex-
ity, role-taking, and MJI scores (Skoe & Diessner, 1994; Skoe,
Pratt, Matthew, & Curror, 1996). Although some evidence exists
that females score higher than males on the ECI (Skoe & Gooden,
1993; Skoe et al., 1996), this finding has not been replicated
consistently (Skoe & Diessner, 1994; Sochting, Skoe, & Marcia,
1994). Unfortunately, the ECI has not been used to answer impor-
tant questions about the developmental progression of care rea-
soning or mechanisms of change. Although Skoe and Diessner
(1994) reported a positive association between ECI scores and age,
this finding was based on cross-sectional data. Skoe et al. (1996)
reported moderately stable levels of care reasoning across a 4-year
period in mid- to late adulthood, but it would be informative to
explore longitudinally the development of care reasoning at earlier
ages.
There has been little discussion of how or why individuals move
from one level of care to the next. Although Gilligan (1982)
suggested that individuals progress through these levels in times of
crisis, it is not clear if one reaches the highest level and remains
there or if each new crisis causes the cycle to start anew (Sichel,
1985). The latter would suggest that the levels of care reasoning
represent a process by which individuals resolve moral problems
rather than a developmental sequence in which each level repre-
sents an advance in moral maturity. Skoe and Marcia (1991)
suggested that advances in moral thought are based on questioning
previously held beliefs and formulating new, more inclusive po-
sitions, and they compared this to cognitive-developmental pro-
cesses of disequilibration and accommodation. However, such a
cognitive-developmental account of care reasoning has not been
assessed empirically.
Measures of Moral Orientation and Methods of Coding
A number of measures have been developed to assess moral
orientation. These generally fall into two categories: interview
measures and objective measures. The majority of these measures
were designed to assess Gilligan's conceptualization of the care
and justice orientations.
Interview Measures
Interview measures usually ask participants to describe a moral
dilemma from their own lives. Responses are coded for the pres-
ence of justice and care reasoning. A number of coding schemes
have been developed. For instance, Lyons's (1982) coding scheme calculates the number of care and justice considerations (each idea
the participant presents in discussing the dilemma) and classifies
moral orientation as care or justice on the basis of the modal
response. Moral orientation is recorded as "split" if the participant
voices as many care considerations as justice considerations. Other
researchers have modified this coding scheme to better differenti-
ate care and justice reasoning (Gilligan & Attanucci, 1988; Krebs,
Vermeulen, Denton, & Carpendale, 1994). Responses are classi-
fied as care only or justice only if all the considerations can be
categorized as care- or justice-based. If at least 75% of the con-
siderations can be categorized as care- or justice-based, the re-
sponse is classified as care focus or justice focus, respectively. If
less than 75% of the considerations can be categorized as care- or
justice-based, the response is categorized as mixed. Finally, Brown
and colleagues (1988) developed a coding system in which a
response narrative is read four separate times, each time from a
different standpoint. In the first reading, attention is focused on the
story being told. In the second reading, all references to the self are
underlined. In the third and fourth readings, attention is focused on
moral voice: fust the care perspective, then the justice perspective.
Finally, the reader completes a summary worksheet in which
his/her interpretation and summary of the text are substantiated by
quotes from the interviews. These summaries allow the reader to
code for the presence of care and justice reasoning and the pre-
dominance of one or the other moral orientation. Brown et al.
maintained that by reading the narrative from many different
standpoints, the reader remains open to the possibility that the
same statement can be interpreted in different ways.
The Fable Interview (Johnston, 1988) is another interview mea-
sure that taps Gilligan's moral orientation construct. However,
instead of asking participants to discuss dilemmas from their own
lives, participants respond to dilemmas embedded within two of
Aesop's fables. The first solution to each fable is considered the
spontaneous solution. Respondents are then probed to determine if
they can provide an alternative solution to the dilemma. If, for
instance, the spontaneous solution is a care solution, respondents
are encouraged to offer a justice solution. If they do so, they are
then asked to decide which is the better solution. A modified
version of Lyons's (1982) coding scheme is used to classify
respondents as care, justice, or split reasoners. The unit of analysis
in the fable coding is the entire solution offered by the respondent
(as opposed to the individual considerations in each response). One
advantage of the Fable Interview is that comparisons can be made
between individuals because all are responding to the same
dilemmas.
Eisenberg and colleagues (Eisenberg, Lennon, & Roth, 1983;
Eisenberg-Berg, 1979) and Kohlberg and his colleagues (Colby et
al., 1987) have developed coding schemes for individuals' re-
sponses to prosocial moral dilemmas and MJI dilemmas, respec-
tively. Eisenberg's scheme calculates the frequency with which an
individual uses the various types of prosocial reasoning in re-
sponding to a prosocial dilemma. In Kohlberg's scheme, orienta-
tion scores are generated on the basis of the content of an indi-
vidual's response to a dilemma. These interview measures are not
meant to tap Gilligan's conceptualization of care or justice
reasoning.
Self-Administered Objective Measures
Objective tests of moral reasoning and moral orientation have
also been developed. Some of these, including the Care/Response
Orientation Scale (CROS; Atunzu, 1986), the Measure of Moral
Orientation (MMO; Liddell, 1990), and the Moral Orientation
Scale (Yacker & Weinberg, 1990), were designed to assess the
706 JAFFEE AND HYDE
care and justice orientations as conceptualized by Gilligan. Others,
such as the Prosocial Moral Reasoning Objective Measure (Carlo,
Eisenberg, & Knight, 1992), were designed for other purposes,
although they have been used to assess care and/or justice reason-
ing. In these tests, respondents are asked to generate their own
moral dilemma or to respond to a hypothetical one. They are then
presented with multiple responses representing examples of care
and justice reasoning and asked to pick the one they would use or
prefer in resolving the dilemma. These tests produce continuously
distributed care and/or justice orientation scores.
Finally, as discussed above, the ECI (Skoe & Marcia, 1991)
assesses Gilligan's levels of care reasoning. The ECI consists of
four dilemmas administered in a structured interview format. One
of these dilemmas is generated by the respondent, and the other
three are standard interpersonal dilemmas. The ECI can be scored
according to level or according to the total score across the four
dilemmas.
Criticisms of the Moral Orientation Construct
Gilligan has been accused of oversimplifying Kohlbergian
moral reasoning in her description of the justice orientation (Puka,
1991; Walker, 1989; Walker, de Vries, & Trevethan, 1987). Critics
have argued that the justice orientation comes closest to the rigid
law-and-order reasoning of Kohlberg's Stage 4, and Gilligan has
been criticized for ignoring the contention that justice and rights
exist in the context of social responsibilities and obligations. Thus,
at the higher stages of moral reasoning, individuals follow rules
only if those rules benefit the common good (Kohlberg, Levine, &
Hewer, 1983).
Critics have also argued that too little work has been done to
characterize or validate the existence of Gilligan's care and justice
orientations other than to demonstrate that these modes of moral
reasoning are present in individuals' responses to moral dilemmas
(Walker, 1989; Walker et al., 1987). Consequently, it is not clear
whether moral orientations are used consistently across situations
and over time or how moral orientations relate to each other or to
moral behavior.
Intraindividual Consistency
Gilligan's assertion that most individuals prefer one mode of
reasoning over the other implies a high level of intraindividual
consistency in their use (Gilligan & Attanucci, 1988; Langdale,
1986). Indeed, Gilligan and Attanucci (1988) reported that two
thirds of their participants focused on only one orientation in their
discussion of a self-generated moral dilemma. Langdale (1986)
demonstrated that, within a single dilemma, approximately 87% of
participants used a single orientation and that the number of
individuals with a predominant justice or care orientation (as
opposed to mixed orientation) was significantly greater than
chance. The number of people who showed a predominant justice
or care orientation across different dilemmas was also significantly
greater than chance.
Other researchers, however, have been unable to replicate these
results (Krebs et al., 1994; Pratt, Golding, & Hunter, 1983; Pratt,
Golding, Hunter, & Sampson, 1988; Walker et al., 1987; Wark &
Krebs, 1996). For example, Wark and Krebs (1996) reported that
only 9% of their college-age participants obtained the same moral
orientation score across three types of dilemmas and only 29%
obtained the same or an adjacent score. Walker and colleagues
(1987) asserted that consistency would be demonstrated by indi-
viduals who used the same orientation 75% of the time or more.
Less than 20% of participants met this consistency criterion across
self-generated and hypothetical dilemmas. Within a single di-
lemma, only about half met the criterion. Similarly, Pratt and
colleagues (1988) reported that only 60% of their participants used
the same orientation across two self-generated dilemmas. In sum-
mary, there is little evidence that moral orientations are used
consistently within or between dilemmas.
There is mixed evidence regarding the extent to which moral
orientations are used consistently over time. Walker (1989) fol-
lowed children and adults over a 2-year interval and found that half
the respondents evidenced a different orientation at the follow-up
than at the initial interview. However, Skoe and colleagues (1996)
collected two waves of data, 4 years apart, from a sample of
middle-aged and elderly adults and reported that care reasoning
levels were moderately stable within mid- to late adulthood. It
should be noted that these longitudinal analyses addressed differ-
ent issues. Whereas the Walker analysis addressed the question of
whether individuals use different moral orientations over time, the
Skoe et al. analysis assessed changes in level of care reasoning
over time.
Validity
There is qualified evidence for the construct validity of Gilli-
gan's moral orientations. Pratt, Diessner, Hunsberger, Pancer, and
Savoy (1991) reported that the justice and care orientations were
associated with variations in the self-concept of the sort described
by Gilligan (1982) and Lyons (1983), such that those who had
more individuated self-concepts tended to discuss justice-oriented
personal dilemmas. However, their results did not clarify whether
males were more likely to report an individuated self and females
were more likely to report a connected self. Lyons, however,
reported that women were more likely than men to report a
connected self, men were more likely than women to report an
individuated self, and that men and women also differed in pre-
dicted ways in their use of the care and justice orientations.
Finally, Liddell and colleagues (LiddeU, 1998; Liddell, Halpin, &
Halpin, 1992) reported that scores on a measure of connected
self-concept were positively related to a standardized measure of
care reasoning and scores on a measure of a rights-oriented self-
concept were positively related to a standardized measure of
justice reasoning.
There is limited evidence for the convergent validity of the
moral orientation constructs. Liddell (1998) compared the MMO
(Liddell, 1990) with semistructured interviews coded according to
Lyons's (1982) protocol. Care and justice scores, as measured by
the MMO, showed positive but nonsignificant correlations with,
respectively, care and justice as measured by the semistructured
interview.
Other evidence does not support predictions derived from Gil-
ligan's work. Because Gilligan argued that justice reasoning is
favored at the highest stages of Kohlberg's framework, it is ex-
pected that the care orientation would be negatively associated and
the justice orientation would be positively associated with Kohl-
bergian moral stage. Although Krebs and colleagues (1994) re-
GENDER DIFFERENCES IN MORAL ORIENTATION 707
ported that moral stage scores were negatively correlated with the
care orientation for men, Pratt and colleagues (1988) reported a
positive relationship between care reasoning and moral stage for
women and no association between the two for men. Walker and
colleagues (1987) found no relationship between moral stage and
moral orientation when the latter was assessed with hypothetical
dilemmas but found a positive relationship between the care ori-
entation and moral stage when orientation was assessed with
self-generated dilemmas. Moreover, individuals at the highest
stages of moral reasoning were more likely to use both care and
justice reasoning in their response to dilemmas.
In summary, there is little evidence to support the notion that
individuals use a particular moral orientation consistently over
time and situations. Although there is some evidence that moral
orientations are associated with related constructs and measures,
these predicted relationships are not obtained consistently. Finally,
there is virtually no evidence regarding the extent to which moral
orientations develop over time. This pattern of evidence contrasts
sharply with Kohlberg's account of moral reasoning and develop-
ment, for which there is ample evidence of reliability, validity, and
stage sequence. Arguably then, these are not comparable phenom-
ena and, if so, accounts of one need not encompass the other, nor
can they be criticized for disregarding the other (Puka, 1991).
What Constitutes Moral Maturity?
As various critics have pointed out, it is not always clear what
Gilligan believes is the relationship between the justice and care
orientations or what constitutes moral maturity (Auerbach, Blum,
Smith, & Williams, 1985; Flanagan & Jackson, 1987; Mason,
1990; Puka, 1991; Sichel, 1985; Walker, 1995). Most commonly,
Gilligan has drawn on the metaphor of the ambiguous figure to
illustrate her discussion of moral reasoning. The care orientation
and the justice orientation are framed as two ways of seeing a
moral problem. Choosing to see the problem from one perspective
may lead one to neglect the ways in which the problem might be
solved from the other perspective (Gilligan & Wiggins, 1987). At
times, Gilligan has asserted that these perspectives are incompat-
ible alternatives to one another but are both adequate from a
normative point of view (Gilligan, 1986b, 1987). In other places,
she has argued that they complement one another and that each is
deficient without the other (Gilligan et al., 1990). Thus, it is not
clear whether moral maturity is characterized by the ability to
integrate and balance the justice and care perspectives, to maintain
them in a complementary tension, or whether there is a morally
mature care orientation in the absence of a morally mature justice
perspective. Notably, her more recent discussions favor the notion
that the justice and care perspectives are maintained in comple-
mentary tension (Gilligan et al., 1990).
Even granting that moral maturity is characterized by the ability
to maintain both the justice and the care perspective in some sort
of tension, this formulation does not resolve the question of how
individuals should solve moral problems—the question of whether
and when either moral voice should take precedence (Sichel,
1985). Although it is possible and, Gilligan might suggest, neces-
sary to perceive moral problems from multiple perspectives, moral
action requires choice, and Gilligan's framework offers few guide-
lines as to which moral voice is more adequate in which situations.
This, however, is an ethical issue and not one that empirical
research can address.
Evidence for and Against Gender Differences
in Moral Reasoning
Gender Differences in Kohlbergian Moral Stage
Gilligan's claim that the MJI is gender biased has been con-
vincingly debunked. Critical reviews of the moral development
literature have failed to find evidence that Kohlberg-based mea-
sures yield gender differences in moral reasoning scores. Instead,
these reviews have found that gender differences in moral reason-
ing are small to nonexistent (Rest, 1979; Thoma, 1986; Walker,
1984, 1991, 1995). Rest (1979) conducted a critical review of 17
studies with 20 independent comparisons of male and female
participants on the Defining Issues Test (BIT; Rest, 1979) and
found only two significant gender differences, both favoring
females.
Walker's (1984) critical review and meta-analysis of 79 studies
that measured moral reasoning with the MJI found small gender
differences favoring males only in adulthood. Sex accounted for
only one twentieth of one percent of the variability in moral
reasoning development. Walker noted that gender differences ap-
peared most frequently in studies that confounded gender with
education or occupational status and in studies that used the earlier,
less reliable versions of the Kohlberg scoring manual (see Baum-
rind, 1986, for a critique of Walker and Walker, 1986b, for a reply;
see also Walker, 1991, 1995). Walker (1991) updated this review
and obtained similar results.
Thoma (1986) conducted a meta-analysis of gender differences
in moral reasoning in 56 samples that were administered the DIT
(Rest, 1979) and found a small effect favoring females. Gender
differences accounted for less than 0.5% of the variance in DIT
scores, whereas age and education accounted for nearly 53% of the
variance in DIT scores. Thoma concluded that females are not at
a disadvantage when measured by the DIT.
Finally, Colby and Damon (1983) pointed out that Kohlberg's
model has now been validated on a sample of males and females.
They found that females passed through the same stages in the
same order as males (Colby, Kohlberg, Gibbs, & Lieberman,
1983). When occupation and education were controlled, gender
differences in moral reasoning level disappeared. Thus, they con-
cluded that Gilligan's allegations of gender bias in Kohlberg's
theory are unwarranted.
Gilligan's (1986a) response to critics has been to suggest that a
lack of gender differences on the MJI may simply demonstrate that
females learn to use the justice orientation as effectively as males.
Because the MJI codes only those data that fit within its stage
definitions, considerations of care may still be ignored. Although
this is a valid point, it begs the question of what sort of data would
disconfirm Gilligan's hypothesis.
It is important to point out that the studies cited above do not test
Gilligan's assertion that males are more likely than females to use
the justice orientation. The research cited above demonstrates that
there are minimal gender differences injustice reasoning stage, but
does not speak to the question of whether there are gender differ-
ences in the use of the justice orientation.
708 JAFFEE AND HYDE
Gender Differences in Moral Orientation
Although the extant literature fails to find gender differences in
stage scores on Kohlberg-based measures, the research on gender
differences in moral orientation is less conclusive. Most research-
ers have now acknowledged that more than one mode of moral
reasoning exists (Kohlberg et al., 1983), but there is considerable
controversy as to whether those moral orientations can be reliably
associated with gender. Whereas some researchers have found
evidence for Gilligan's claim that care reasoning is used predom-
inantly by females and justice reasoning predominantly by males
(Gilligan & Attanucci, 1988; Johnston, 1988; Yacker & Weinberg,
1990), other researchers have found gender differences in care
reasoning only (Galotti, Kozberg, & Farmer, 1991; Garmon, Bas-
inger, Gregg, & Gibbs, 1996; Gibbs, Arnold, & Burkhart, 1984;
Liddell, Halpin, & Halpin, 1993; Wark & Krebs, 1996), and still
others have failed to find gender differences in the use of either
care or justice reasoning (Beal, Garrod, Ruben, & Stewart, 1997;
Friedman, Robinson, & Friedman, 1987; Walker et al., 1987).
Moreover, many researchers have found evidence that gender
differences in moral orientation are moderated by other variables,
such as dilemma content (Walker et al., 1987; Wark & Krebs,
1996,1997) and social class (Beal et al., 1997; Puka, 1989; Tronto,
1987).
Given the widely disparate findings regarding gender differ-
ences in the use of justice and care reasoning, there is a need for
a review of this body of research. Meta-analysis provides an
appropriate tool for synthesizing the research on gender differ-
ences in moral orientation and for determining the direction and
the actual magnitude of the effect. Importantly, meta-analysis also
allows the researcher to examine how the size of the effect might
be moderated by other variables. We hypothesized that the effect
size for gender differences in moral orientation might be moder-
ated by the following variables, each of which is addressed in turn:
(a) age, (b) SES, (c) moral orientation construct, (d) type of
dilemma, (e) coding scheme, (f) scale, (g) gender of the protago-
nist, and (h) publication status. Although Walker (1995) provided
an excellent narrative review of some of these moderators, he did
not subject the studies to quantitative meta-analysis.
Moderator Variables
Age
Gilligan and colleagues (Gilligan, 1982; Gilligan & Wiggins,
1987) proposed that gender differences in moral orientation
emerge in early childhood and persist across the life course.
However, some cross-sectional research has suggested that gender
differences in moral orientation are moderated by age. Walker and
colleagues (Walker, 1989; Walker et al., 1987) reported gender differences in care reasoning favoring women among adults who
discussed self-generated dilemmas but failed to find such gender
differences among 1st-, 4th-, 7th-, and lOth-grade children. Sim-
ilarly, in a study of young, middle, and older adults, Pratt and
colleagues (1988) reported that gender differences in moral orien-
tation were significant only for the middle-age group. Pratt and
colleagues proposed that family organization during the period of
active parenting (i.e., middle age) leads to increased gender-role
polarization (Guttman, 1985) and thus heightens gender differ-
ences in moral orientation. Consistent with this hypothesis, they
found that mothers were significantly less justice-oriented than
fathers, but this gender difference failed to emerge among nonpar-
ents. However, other cross-sectional studies have failed to find
evidence that gender differences in moral orientation are moder-
ated by age (Craft, 1992; Galotti et al., 1991; Garrod, Beal, & Shin,
1990; Langdale, 1986; Pratt et al., 1991).
In a longitudinal study of prosocial moral reasoning that fol-
lowed a cohort of children from age 4.5 to age 20, Eisenberg and
colleagues (1987) reported that gender differences in sympathetic
and role-taking reasoning, both of which have been likened to
Gilligan's care orientation, emerged at approximately age 11 or 12.
However, Walker (1989) failed to find evidence among children of
gender differences in moral orientation at either of two assess-
ments in a short-term longitudinal study. Gender differences in
moral orientation emerged at both time points for adults who
described dilemmas from their own lives. Meta-analysis is needed
to sort out the inconsistencies in these cross-sectional and longi-
tudinal findings.
Socioeconomic Status
Some researchers have proposed that gender and subordinate or
minority status are confounded in studies of moral orientation
(Puka, 1989; Tronto, 1987). They have argued that subordinate
status and the condition of powerlessness promote an inherent
concern with others because those others, in part, determine one's
outcomes (Tronto, 1987). However, other researchers have argued
that low social status promotes a concern with fairness and rights
because these rectify the social inequalities experienced by minor-
ity or subordinate groups (Beal, et al., 1997). Garrod and Beal
(1993) found that children from a rural, working-class community
were somewhat more likely to emphasize rights considerations
than were children from a more affluent university suburb. Simi-
larly, Gilligan and Attanucci (1988) reported that minority students
were more likely than nonminority students to adopt a rights
perspective.
These hypotheses might be interpreted to mean that SES exerts
a main effect on moral reasoning, in which case gender differences
in both care and justice reasoning should be smallest for lower
class groups. Alternatively, they might be interpreted to mean that
gender and social class interact, such that gender differences in
care reasoning (but not justice reasoning) should be greatest
among lower class groups because low-SES females will be more
likely to use the care orientation as a function of both their gender
and their social class.
Moral Orientation Construct
Most measures of moral orientation base their definition of the care and justice orientations on Gilligan's description of these
constructs. However, some researchers have taken a different
approach to measuring moral orientation. For instance, Eisenberg
and her colleagues (Eisenberg et al., 1983; Eisenberg-Berg, 1979) have focused on prosocial moral reasoning, which is defined as
"reasoning about conflicts in which the individual must choose
between satisfying his or her own wants or needs and those of
others in contexts in which laws, punishments, authorities, formal
obligations, and other external criteria are irrelevant or deempha-
sized" (Eisenberg-Berg, 1979, p. 128). Eisenberg and her col-
GENDER DIFFERENCES IN MORAL ORIENTATION 709
leagues have characterized types of prosocial moral reasoning,
several of which (e.g., concern with other's physical, material, or
psychological needs; role-taking; sympathetic orientation) load on
a single factor that represents other-oriented reasoning and are
conceptually similar to Gilligan's description of the care orienta-
tion (Eisenberg, Fabes, & Shea, 1989).
Other researchers claim that moral orientation can be measured
at the aspect level or the orientation level in Kohlberg's scoring
scheme (de Vries & Walker, 1986; Garmon et al., 1996). Unlike
Kohlberg's stages, these moral orientations are more closely re-
lated to the content rather than the structure of moral reasoning. It
has been argued that the perfectionism orientation (emphasizing
dignity and autonomy, good conscience and motives, and harmony
with self and others) and the utilitarian orientation (emphasizing
welfare or happiness consequences for oneself and others) are
consistent with the care orientation, whereas the normative orien-
tation (emphasizing duties and rights) and the fairness orientation
(emphasizing justice) are consistent with the justice orientation
(Walker, 1986a, 1995). Other researchers have focused on the
aspects that characterize each stage of moral reasoning (Garmon et
al., 1996). Specifically, they have pointed to similarities between
empathic role-taking, which is a Stage 3 aspect that is character-
ized by empathic references to another's psychological or emo-
tional welfare, and the care orientation.
There is some disagreement as to whether Kohlberg's orienta-
tions are adequate measures of the care orientation. Smetana
(1984) pointed out that stage and aspect scores are confounded in
Kohlberg's scoring system. For instance, the perfectionist orien-
tation can be scored only at the higher levels of moral reasoning.
Nevertheless, because a number of studies have assessed gender
differences in care and justice reasoning using Kohlberg's moral
orientations, these studies were included in the meta-analysis.
Type of Dilemma
Some researchers have suggested that gender differences in the
use of care and justice reasoning might be accounted for by the
kinds of dilemmas used to elicit moral reasoning (Clopton &
Sorell, 1993; Ford & Lowery, 1986; Gilligan, 1982; Langdale,
1986; Mednick, 1989; Walker, 1989; Walker et al., 1987; Wark &
Krebs, 1996, 1997). One possibility is that when dilemmas are
self-generated, females discuss dilemmas that deal with personal
concerns (e.g., dilemmas dealing with relationships between the
participant and close others) and males discuss impersonal dilem-
mas (e.g., conflicts between others or involving generalized others,
such as clients or students). Because personal dilemmas are inher-
ently concerned with relationships, they may elicit care reasoning,
whereas impersonal dilemmas may elicit considerations of fairness
and reciprocity (Walker, 1989; Walker et al., 1987). Indeed, sev-
eral researchers have found that women generate more personal
real-life dilemmas than men do and men generate more impersonal
real-life dilemmas (Pratt et al., 1988, 1991; Skoe & Diessner,
1994; Skoe et al., 1996; Walker et al., 1987; Wark & Krebs, 1996,
1997). When type of dilemma is controlled, gender differences in
moral orientation are eliminated, suggesting the important role of
situational factors in moral orientations (Walker, 1989; Walker et
al., 1987; Wark & Krebs, 1996). Consistent with this possibility,
Wark and Krebs (1997) reported that prosocial dilemmas evoked
more care-based moral judgments than Kohlberg dilemmas or
dilemmas dealing with transgressions or rule violations and these,
in turn, evoked more justice-based moral judgments than prosocial
dilemmas did. Other researchers, however, have found no evidence
of gender differences in dilemma content (Ford & Lowery, 1986;
Peter & Gallop, 1994). Given these conflicting findings, meta-
analysis is needed to determine whether there are systematic trends
in research outcomes.
Coding Scheme
As discussed in the section on measures of moral orientation,
several coding schemes have been developed to code for justice
and care reasoning in individuals' responses to moral dilemmas.
These schemes differ on the basis of the unit of analysis (e.g.,
considerations vs. entire solution) and whether they are objective
or interpretive schemes. Given this heterogeneity, coding scheme
was included as a moderator variable.1
Scale
Measures of moral reasoning can also be distinguished by their
scale of measurement. In the coding schemes mentioned above,
moral orientation can be analyzed as a categorical variable (i.e.,
respondents can be classified as having a care or justice orienta-
tion) or as a continuous variable (e.g., a "percent care" score is
computed for each participant on the basis of the percentage of
care-based considerations in each response). Some continuous
measures rate respondents along a care continuum and a justice
continuum, yielding independent measures of each orientation.
Because the effect sizes that form the basis of a meta-analysis
are computed with means and standard deviations, moral orienta-
tion measures that yield continuously scaled scores may provide
more reliable effect size estimates than measures that describe the
proportion of respondents who fall into one or the other moral
orientation category. We wanted to be able to estimate the mag-
nitude of the effect size for gender differences in moral orientation
when only the most reliable estimates of the effect size were
included in the analysis, as well as when all effect sizes were
included (this issue is elaborated in the Results section). Conse-
quently, scale was included as a moderator variable.
Gender of the Dilemma's Protagonist
Most of the research on gender of the protagonist has dealt with
its role as a potential moderator of gender differences in moral
stage. These studies have addressed the possibility that women
may score lower on Kohlberg's MJI (Colby et al., 1987) because
they are unable to identify completely with the male protagonists
of the dilemmas (Holstein, 1976; however, it is important to note
1 It is important to note the distinction between coding scheme and moral
orientation measure. The same measure (e.g., the MJI) might be scored
according to any of several different coding schemes. The moderator
analyses do not explore how the magnitude of the gender difference in
moral orientation differs as a function of the measures themselves (e.g., the
Fable Interview or the MMO) because of heterogeneity in the coding
schemes used to score the same measure across studies and because there
were too few of certain measures (e.g., the CROS) to allow for meaningful
interpretation of the moderator analyses.
710 JAFFEE AND HYDE
that not all the MJI dilemmas feature male protagonists). Such
studies have provided inconsistent evidence that gender of the
protagonist has an effect on moral judgment level (Bussey &
Maughan, 1982; Freeman & Giebink, 1979; Garwood, Levine, &
Ewing, 1980; Krebs et al., 1994; Lonky, Roodin, & Rybash, 1988;
Orchowsky & Jenkins, 1979; Turiel, 1976).
Relatively few studies have examined the extent to which the
gender of the protagonist moderates gender differences in moral
orientation. Albrecht (1989) found that dilemmas with male pro-
tagonists elicited significantly more justice reasoning than dilem-
mas with female protagonists. However, Beal et al. (1997) reported
a preference for care reasoning regardless of the protagonist's
gender. Similarly, Krebs et al. (1994) failed to find differences in
moral orientation depending on whether participants responded to
Kohlberg's dilemmas from their own perspective or from a third-
person perspective. Because of these inconsistencies, gender of the
protagonist was included as a moderator variable.
Publication Status
Because of a bias toward pubk'shing significant findings, it is
possible that an overall effect size based only on published data
would overestimate the population effect size for gender differ-
ences in moral orientation (Rosenthal, 1979). To address this
possibility, unpublished studies were included in the meta-
analysis, and publication status was included as a moderator
variable.
Goals
The overall goal of this meta-analysis was to determine whether
the much-heralded claims of Gilligan are supported by empirical
evidence. To this end, we evaluated Gilligan's (1982) assertion
that there are gender differences in the use of care and justice
reasoning. Furthermore, we investigated whether potential gender
differences in moral orientation might be moderated by other
variables.
Method
Sample of Studies
The sample of studies came from two sources: (a) The PsycLIT, ERIC,
and Dissertation Abstracts computerized databases were searched simul-
taneously for the years 1966-1998, using the key words (care reason*) or
(care orient*) or (ethic* of care) or (prosocial moral*) or (moral orient*),
which yielded 741 citations, and (b) the reference sections of those studies
that were drawn from the databases and included in the meta-analysis were
searched for citations that did not appear in the database search. This
strategy yielded an additional 16 citations. It is important to note that the
computerized database search did not use the search term (gender and care
reason*) as this might have led to a selective sampling of studies that found
significant gender differences.
In the case of computerized literature searches, abstracts were inspected
and included if they met the following criteria: (a) The study was empirical,
(b) there were at least five males and five females in each sample, (c) the
study did not report data that had already been reported elsewhere, (d) the
measure assessed moral orientation, and (e) age and gender were not
confounded (e.g., gender differences in moral reasoning between mothers
and sons).
Of the 757 citations elicited by the database and reference section
searches, 180 met inclusion criteria. Although this may seem like a small
proportion of the total number of citations, it is important to underscore that
our search term resulted in a very large number of citations that were
nonempirical commentaries on Carol Gilligan's work or citations regarding
the ethic of care in nursing. Of the PsycLIT and ERIC studies, 25 were not
empirical, 2 included data that were reported elsewhere, 371 did not assess
moral orientation, 10 reported on same-sex samples, and 37 were dupli-
cates (i.e., the same study appeared in both the PsycLFT and ERIC
databases). Of the dissertations, 132 were excluded because they did not
assess moral orientation or because they were nonempirical.
In the second phase of the search, copies of the 114 papers and 66
dissertations were obtained to ensure that they met inclusion criteria and
included enough information to compute an effect size. In those cases in
which an author reported that moral orientation had been measured but
failed to report enough statistical information to compute an effect size, a
letter or e-mail was sent to the author at the address specified for reprints
or at a more recent address found in the American Psychological Associ-
ation 1997 Membership Directory or in the Society for Research in Child
Development 1996 Directory of Members.2 There were 29 cases in which
this was necessary, and we received 23 responses to these requests for
additional data, 11 of which included the requested data. In the 18 cases in
which no data were available (12 cases in which the author responded that
data were not available plus 6 cases in which the author did not respond at
all), effect sizes were estimated as zero for 10 of the studies on the basis
of the fact that the author reported a nonsignificant gender difference in
moral orientation.
Of the 180 studies and dissertations that were deemed eligible on the
basis of the abstracts, 113 yielded enough information to compute an effect
size or met inclusion criteria once a copy of the study itself had been
obtained and examined. This sample comprised 70 published and 43
unpublished studies. Of the 67 studies that were excluded, 7 could not be
located, 2 used same-sex samples, 9 did not assess moral orientation, 1 was
not empirical, 17 were unpublished studies that did not include enough
information to compute an effect size, 8 were published studies for which
an effect size could not be computed or estimated even after contacting the
author, 15 were studies in which the data were reported elsewhere, and in
one case, there was not enough information to compute an effect size and
the author's address could not be located. In addition, 7 dissertations were
unavailable from UMI.
It is possible to obtain several independent effect sizes from a single
article if, for example, data from several age groups are reported (e.g., in
a cross-sectional design). These groups can be regarded as separate sam-
ples (L. V. Hedges, personal communication, 1987). The result was 113
usable sources, yielding 160 independent effect sizes for gender differences
in care orientation and 95 independent effect sizes for gender differences in
justice orientation. In the case of the care orientation analysis, this repre-
sented the testing of 5,783 males and 6,654 females. In the case of the
justice orientation analysis, this represented the testing of 3,831 males
and 4,307 females. This compares favorably with other reviews of gender
differences in moral reasoning, namely, Walker's (1984) critical review of
gender differences in moral stage in which he analyzed gender differences
from 66 studies involving 6,780 participants.
Five studies that used the ECI and measured levels of care reasoning
were analyzed separately because the level at which one reasons about a
given dilemma is conceptually distinct from the amount of care or justice
reasoning used in solving the dilemma.
2 Letters were mailed only to the authors of published studies as it was
too difficult to locate the authors of dissertations.
GENDER DIFFERENCES IN MORAL ORIENTATION 711
Coding the Studies
For each study, the following information was recorded: (a) all statistics
on gender differences in moral orientation, including means and standard
deviations or t, F, or r and (b) the number of male and female participants.
The following potential moderator variables were also coded. It is impor-
tant to note that some of the categories within each moderator variable
were collapsed in the analyses because of insufficient numbers of effect
sizes per category.
Age. The age(s) of the participants were recorded. If the article re-
ported no age but reported that participants were undergraduates or stu-
dents in an introductory college course, age was set equal to 19. If a grade
level was reported, 5 years were added to that level to yield the age (e.g.,
third graders were recorded as 8-year-olds). If the sample comprised
nonadults and the age range exceeded 10 years, the sample age was
recorded as mixed. Otherwise, the midpoint of the age ranges was
recorded.
Socioeconomic status. The SES of the participants was recorded as
lower class, lower middle class, middle class, upper middle class, upper
class, or unreported or mixed, on the basis of how participants were
classified in the studies themselves. SES for university students and faculty
was estimated as middle class.
Moral orientation construct. We recorded whether researchers mea-
sured moral orientation as conceptualized by Gilligan (1982), by Kohlberg
(Colby et al., 1987), or by Eisenberg (Eisenberg, et al., 1983; Eisenberg-
Berg, 1979).3
Type of dilemma. Dilemmas were coded as standard hypothetical
dilemmas if they were hypothetical dilemmas from traditional measures of
moral reasoning (e.g., the Heinz dilemma) or if they were dilemmas that
were likely to be unfamiliar to the participants or not relevant to their
everyday lives. Hypothetical dilemmas that involved issues of direct rele-
vance to participants (e.g., dilemmas about cheating that were administered
to university students) were coded as real-life hypothetical dilemmas.
These were dilemmas that participants might expect to face at some point
hi their lives. Dilemmas that the participants generated from their own lives
were coded as self-generated dilemmas. Finally, measures in which dilem-
mas were not used or participants were asked to rate how much they used
care or justice reasoning to solve moral problems in the absence of any
specific dilemma were coded as no dilemma. Hypothetical dilemmas
(standard or real-life) that were designed to elicit care reasoning were
coded as hypothetical care dilemmas, and hypothetical dilemmas (standard
or real-life) that were designed to elicit justice reasoning were coded as
hypothetical justice dilemmas. For instance, Eisenberg's prosocial moral
reasoning dilemmas (Eisenberg et al., 1983; Eisenberg-Berg, 1979) were
coded as hypothetical care dilemmas because the issue of caring is salient
and is in conflict with self-interest or responsibility to the self. Finally,
self-generated dilemmas were coded as self-generated care or self-
generated justice if they were designed to elicit care or justice reasoning,
respectively.
Coding scheme. We recorded whether moral orientation was scored
according to (a) the coding scheme developed by Brown and colleagues
(Brown et al., 1988), (b) Lyons's coding scheme (Lyons, 1982), (c) a
modified version of Lyons's coding, (d) Kohlberg's coding scheme (Colby
et al., 1987), (e) Eisenberg's coding scheme (Eisenberg et al., 1983;
Eisenberg-Berg, 1979), (f) close-ended objective responses, or (g) any
other coding scheme.
Scale. It was also noted whether these coding schemes yielded con-
tinuous care and/or justice scores or whether participants were categorized
as using a care or justice orientation on the basis of their responses to the
dilemmas.
Gender of the protagonist. The protagonists of the various dilemmas
were male, female, or the same sex as the participant, or the dilemmas
included both male and female protagonists. It was also noted if the
participant was the protagonist of the dilemma (e.g., in self-generated
dilemmas), if there was no protagonist (e.g., cases where participants were
asked to rate the extent to which they used care and justice reasoning to
solve moral dilemmas), or if the gender of the protagonist was not indicated
in the dilemma.
Publication status. It was noted whether studies were published or
unpublished.
Interrater Agreement
All studies were coded by Sara Jaffee. A random sample of 30% of the
studies was rated by Janet Hyde to obtain estimates of interrater agreement.
Interrater agreement was computed for ratings of the eight moderator
variables (age, SES, moral orientation construct, type of dilemma, coding
scheme, scale, gender of protagonist, and publication status) and ranged
from 85% to 100%. Discrepancies were resolved by discussion and
consensus.
Computation of Effect Size
The effect size computed was d, defined as the mean for males minus the
mean for females, divided by the mean within-sex standard deviation
(Hedges & Becker, 1986). Thus, positive values of d indicate that males
used more justice or care reasoning than females, and negative values of d
indicate that females used more justice or care reasoning than males.
Depending on the statistics available for a given study, formulas pro-
vided by Hedges and Becker (1986) and Morris and DeShon (1997) were
used for the computation of d and the homogeneity statistics. The exception
was studies that reported the proportion of males and females who were
classified as care reasoners, justice reasoners, or some combination of the
two (e.g., studies that used Lyons coding or followed the coding scheme of
Brown et al., 1988). In these cases, chi-square statistics were used to test
for the association of gender and moral orientation. However, chi-square
statistics are not easily translated into effect sizes. So that we could include
these studies in the meta-analysis, the moral orientation classifications
were converted into percent care scores. Care reasoners were assigned a
percent care score of 75%, justice reasoners were assigned a percent care
score of 25%, and mixed justice and care reasoners were assigned a percent
care score of 50%. This strategy allowed us to derive an effect size from
the mean percent care scores. These percent care cutoffs roughly match
those used by researchers who have modified Lyons's (1982) coding
scheme (Gilligan & Attanucci, 1988; Krebs et al., 1994). This strategy has
also been used before by Krebs and colleagues (1994) to create a contin-
uous measure of care reasoning. Because the studies were coded according
to whether they yielded a continuous score or a categorical score that
needed to be converted by means of the strategy described above, we were
able to test explicitly whedier the magnitude of the effect size for gender
differences in care orientation differed as a function of our own coding
strategy.
In 39 of the independent samples that measured care reasoning and in 15
of the independent samples that measured justice reasoning, the author did
not include enough information to compute an effect size and did not or
could not respond to written solicitations for additional data. In all these
cases, the authors reported that the gender difference in moral orientation
was nonsignificant. In these cases, the effect size d was estimated as zero.
Consequently, analyses were conducted twice, first with the entire sample
of effect sizes and then with the subsample of effect sizes estimated as zero
excluded.
3 Prosocial moral reasoning is scored at a number of levels. Because
young children generally do not use higher levels of moral reasoning,
gender differences in needs-oriented reasoning were computed for the
prosocial moral reasoning studies of children (11 years or younger). Gen-
der differences in Level 4 reasoning (sympathetic, role-taking, positive or
negative affect regarding the consequences to others) were computed for
the prosocial moral reasoning studies of adolescents and young adults.
712 JAFFEE AND HYDE
A random sample of 30% of the effect sizes was computed indepen-
dently by Sara Jaffee and Janet Hyde. There were discrepancies in 5% of
the d values; these were resolved. All values of d were corrected for bias
in estimation of the population effect size, using the formula provided by
Hedges (1981). The complete listing of all studies, with effect sizes and
moderator variable codes, is provided in Table 1.
Results
The results are divided into three sections. The first reports the
results for a meta-analysis of gender differences in the care orien-
tation. The second reports the results for a meta-analysis of gender
differences in the justice orientation. The third reports the results
for a meta-analysis of gender differences in level of care reasoning
as measured by the ECI (Skoe & Marcia, 1991). Whereas the first
two meta-analyses assess how much males and females differ in
the extent to which they used or endorsed the care and justice
orientations respectively, the third analysis examines whether gen-
der differences exist in the level of care reasoning attained by male
and female participants.
Magnitude of Gender Differences in Care Orientation
For 73% of the 160 independent samples, the gender difference
in care reasoning as reported in the study was not statistically
significant. A power analysis revealed that a sample size of 31
would be required to detect a moderately sized effect of .50 for
gender differences in care reasoning with a = .05. Seventy-three
percent of the independent samples included samples sizes of that
magnitude. Thus, it is unlikely that the nonsignificant gender
differences in care reasoning were the result of low power to detect
differences.
The overall effect size d for gender differences in care orienta-
tion was -.28, indicating a small gender difference favoring fe-
males. A homogeneity analysis using procedures specified by
Hedges and Becker (1986) indicated that the effect sizes were
nonhomogeneous, #T = 438.43, p < .001. Therefore, analyses
were conducted to determine the extent to which the magnitude of
the effect size was moderated by other variables. The results of the
moderator analyses are presented in Table 2.
Age. For purposes of the moderator analysis, participants were
categorized into five age groups. Participants who were 11 years
old or less were classified as children, and participants who were
older than 11 years and less than or equal to 19 years (but were not
university students) were classified as adolescents. University stu-
dents were coded as such. Participants who were aged 20 to 49 (but
were not university students) were classified as younger adults,
and participants who were 50 years or older were classified as
older adults. In cases where the sample included nonadults and the
reported age range exceeded 10 years, participants were classified
as mixed age.
For all age groups, the effect size indicated that females used
more care reasoning than males. However, the magnitude of the
effect size differed among age groups, HK = 70.91, p < .001. For
children and university students, the overall effect sizes indicated
a small gender difference (d = -.08 and d = -.18, respectively), but
for adolescents, the effect was moderate in size (d = -.53). For
younger adults, the overall effect size d was -.33. The small
number of samples within the older adult and mixed-age groups
suggests that these effect sizes should be interpreted with caution.
Socioemnomic status. As there were relatively few effect
sizes at certain levels of the SES variable, SES was collapsed into
four categories: lower class (comprising lower class and lower
middle class), middle class (comprising middle class and upper
middle class), upper class, and mixed/unreported. The homogene-
ity analysis revealed that the magnitude of the effect sizes differed
significantly as a function of SES category, HB = 9.05, p < .05.
Although females consistently scored higher than males on care
reasoning across the SES groups, the magnitude of the effect size
increased from -.08 to -.42 as SES increased from the lower-class
group to the upper-class group.
Moral orientation construct. The homogeneity analysis re-
vealed that the magnitude of the effect sizes differed significantly
as a function of the moral orientation construct, HB = 17.29, p <
.001. The magnitude of the effect size d was small to moderate in
size for studies that defined the care orientation according to
Gilligan's description or defined it as the orientation or aspect
level in Kohlberg's scheme (-.32 and -.25, respectively). However,
for studies that conceptualized care reasoning as prosocial moral
reasoning, the magnitude of the effect size was close to zero.
Type af dilemma. As there were relatively few of certain types
of dilemmas, these types were collapsed into six categories: (a)
standard hypothetical dilemmas, (b) real-life hypothetical dilem-
mas, (c) self-generated dilemmas, (d) moral reasoning measures
that did not contain a dilemma, (e) dilemmas meant to elicit care
reasoning (both hypothetical and self-generated), and (f) dilemmas
meant to elicit justice reasoning (both hypothetical and self-
generated). The homogeneity analysis revealed that the magnitude
of the effect sizes differed significantly as a function of type of
dilemma, HB = 63.07, p < .001.
For both standard and real-life hypothetical dilemmas, the mag-
nitude of the effect sizes was small and indicated that females used
more care reasoning than males (-.19 and -.20, respectively).
Similarly, the effect sizes for dilemmas designed to elicit care
reasoning and dilemmas designed to elicit justice reasoning were
-.17 and -.18, respectively. The magnitude of the effect size for
self-generated dilemmas was moderate in size (d = -.37), and
measures that did not include a dilemma yielded the largest dif-
ference favoring females (d = -.57).
Coding scheme. The homogeneity analysis revealed that the
magnitude of the effect sizes differed significantly as a function of
coding scheme, HB = 61.49, p < .01. The effect size was close to
zero for studies that used Eisenberg's (Eisenberg et al., 1983;
Eisenberg-Berg, 1979) coding scheme, and the largest effect size
(d = -.61) was noted for studies that used a modified version of the
Lyons coding scheme (Lyons, 1982). Other measures produced
effect sizes that were small to moderate in magnitude and favored
females.
Scale. The homogeneity analysis revealed that the magnitude
of the effect sizes differed significantly as a function of scale,
HB = 7.46, p < .01. For the studies that used continuously scaled
measures, the magnitude of the effect size d was -.26. For studies
that used categorical measures, the magnitude of the effect d was
-.38, indicating a somewhat larger difference favoring females.
Gender of the protagonist. Because there were only two stud-
ies in which the protagonists were all female and one study in
which the gender of the protagonist was not specified, these were
not included in the moderator analysis. Homogeneity analyses
revealed that the magnitude of the effect sizes differed signifi-
GENDER DIFFERENCES IN MORAL ORIENTATION 713
cantly as a function of the protagonist's gender, HB = 84.88, p <
.001. When the protagonist was male or when the participant was
the protagonist, the effect size was small to moderate in magnitude
and favored females (d = -.36 and d = -.27, respectively). When
there were both male and female protagonists, however, the gender
difference in care reasoning was virtually nonexistent (d = -.03).
Similarly, when the protagonist was the same gender as the par-
ticipant, the magnitude of the effect size indicated a small differ-
ence (d = -.11). However, in cases where there was no protagonist,
the magnitude of the effect size d was -.57.
Publication status. The magnitude of the effect size for gender
differences in care orientation did not differ significantly as a
function of whether or not the study was published, HB = .13, m.
Regression Analysis for Care Orientation
Because the overall homogeneity of variance analysis indicated
that the effect sizes were nonhomogeneous, a multiple regression
analysis was conducted to determine how much of the variance in
effect sizes was accounted for by the moderator variables. More-
over, because many of the moderator variables are highly corre-
lated, it is important to control for the effect of other moderators
when determining the extent to which any given one accounts for
variance in the magnitude of the effect sizes. The corrected effect
size for gender differences in care reasoning was the criterion
variable (Hedges, 1981). The eight moderator variables (age, SES,
moral orientation construct, type of dilemma, coding scheme,
scale, gender of protagonist, and publication status) were entered
in stepwise fashion. Because of their categorical nature, all vari-
ables except age and SES were contrast coded, and the contrasts
are specified in Table 3.
Overall, the moderator variables accounted for a significant 16%
of the variance in the effect sizes for gender differences in care
reasoning, F = 7.25, p < .001. The effect of neither age nor SES
was significant. Two contrasts were specified to determine
whether moral orientation construct was a significant predictor of
the magnitude of the effect sizes for gender differences in care
reasoning. The first contrast compared studies that used Eisen-
berg's construct of prosocial moral reasoning with those that
characterized care reasoning according to Gilligan's or Kohlberg's
conceptualization of the care construct. Because prosocial dilem-
mas are designed to elicit care reasoning, it was expected that the
magnitude of the effect size for gender differences in care reason-
ing would be smaller for this group compared with the other two.
However, the regression analysis revealed that this contrast was
not a significant predictor of the magnitude of the effect size for
gender differences in care reasoning. A second contrast was con-
structed to compare studies that defined care reasoning according
to Gilligan with those that defined care reasoning at the element or
aspect level in Kohlberg's scheme. It was predicted that the mag-
nitude of the effect size for gender differences in care reasoning
would be greater for the former group because, according to
Gilligan (1982), Kohlberg's scoring scheme does not adequately
capture the care orientation and encourages use of the justice
orientation instead. The regression analysis revealed that this con-
trast was not a significant predictor of the magnitude of the effect size for gender differences hi care reasoning.
Two contrasts were specified to determine whether type of
dilemma was a significant predictor of the magnitude of the effect
sizes for gender differences in care reasoning. The first contrast
compared self-generated dilemmas with those that effectively stan-
dardized the dilemma content for participants (i.e., standard hypo-
thetical dilemmas, real-life hypothetical dilemmas, and dilemmas
designed specifically to elicit care or justice reasoning). This
contrast tested the hypothesis that the magnitude of the effect size
for gender differences in care reasoning would be greater when
participants were allowed to generate their own dilemmas. This
contrast was a significant predictor of the magnitude of the effect
sizes for gender differences in care reasoning, |3 = -.21, p < .05.
A second contrast compared studies that did not include a dilemma
with those that did to explore whether the presence of a dilemma
accounted for variation in the magnitude of the effect size. This
contrast was also significant, fi = -.29, p < .001.
Two contrasts were specified to determine whether coding
scheme was a significant predictor of the magnitude of the effect
sizes for gender differences in care reasoning. The first contrast
compared measures that are scored objectively with those that are
not and hypothesized that the magnitude of the effect size would
be smaller for objectively scored studies. A second contrast com-
pared coding schemes in which responses are matched to criterion
responses (see, e.g., Colby et al., 1987; Eisenberg et al., 1983;
Lyons, 1983) with those in which interpretive techniques are used
(see, e.g., Brown et al., 1988; modified Lyons, 1983). Neither
contrast was significant.
The contrast comparing studies that yielded a continuous moral
orientation score with studies that yielded a categorical moral
orientation score was significant, |3 = .18, p < .05.
Two contrasts were specified to test the effect of gender of the
protagonist on the magnitude of the effect sizes for gender differ-
ences in care reasoning. The first contrast compared studies in
which the participant served as the protagonist in the dilemma with
those in which there was a fictional protagonist. This contrast
tested whether a difference in the perspective from which the
participant responded to the dilemma influenced the magnitude of
the effect size. A second contrast was specified to compare studies
that used dilemmas with male protagonists with studies that used
dilemmas in which the protagonists were not exclusively male or
were the same gender as the respondent. Neither contrast was a
significant predictor of the magnitude of the effect size for gender
differences in care reasoning.
Finally, the regression analysis revealed that publication status
did not have a significant effect on the magnitude of the effect size
for gender differences in care reasoning.
Because the moderator variables were highly correlated, squared
semipartial regression coefficients were computed to determine
how much of the variance in the magnitude of the effect sizes was
uniquely accounted for by each of the moderator variables when
the other moderators were controlled. Table 3 indicates that the
contrast comparing self-generated dilemmas with standard hypo-
thetical dilemmas, real-life hypothetical dilemmas, or dilemmas
designed to elicit care or justice reasoning accounted for 4% of the
variance in the effect sizes for care reasoning. The contrast com-
paring studies in which there was no dilemma with those in which there was a dilemma uniquely accounted for 8% of the variance in
the effect sizes for care reasoning. Finally, the contrast comparing
studies that yielded categorical versus continuous outcomes
uniquely accounted for 3% of the variance in the effect sizes for care reasoning.
714
Table 1
Effect Size Estimates and Moderator Variable Codes
JAEFEE AND HYDE
Study
Abaris (1990)
Abide (1994)
Abide (1994)
Akman (1991)
Albrecht (1989)
Arvizu (1995)
Atunzu (1986)
Barnett, Quackenbush, & Sinisi (1995)
Bamett, Quackenbush, & Sinisi (1995)
Beal, Garrod. Ruben, & Stewart (1997)
Beal, Garrod, Ruben, & Stewart (1997)
Bollerud (1987)
Brown, Tappan, Gilligan, Miller, & Argyris
(1989)
Carlo, Eisenberg, & Knight (1992)
Carlo, Roller, & Eisenberg (1998)
Carlo, Roller, & Eisenberg (1998)
Carlo, Roller, Eisenberg, DaSilva, & Frolich
(19%)
Carney (1992)
Carney (1992)
Carney (1992)
Cassidy, Chu, & Dahlsgaard (1997)
Castor-Scheufler (1994)
Clopton & Sorell (1993)
Clopton & Sorell (1993)
Cole (1987)
Conley, Jadack, & Hyde (1997)
Craft (1992)
Craft (1992)
Crown & Heatheringlon (1989)
Curror (1994)"
Dekovic & Gerris (1994)
Dekovic & Gerris (1994)
Dekovic & Gerris (1994)
Derry (1989)
de Vries & Walker (1986)
Dezoll (1992)
Dezolt(1992)
Dezolt (1992)
Diamonti (1993)
Dickey, Rroll, & Jenkins (1987)
Diederichs (1993)
Dohrenwend (1995)
Donenberg & Hoffman (1988)
Dossetl (1989)
Dossett(1989)
Dossetl (1989)
Eisenberg, Boehnke, Schuler, & Silbereisen
(1985)
Eisenberg, Boehnke, Schuler, & Silbereisen
(1985)
Eisenberg, Boehnke, Schuler, & Silbereisen
(1985) Eisenberg. Boehnke, Schuler, & Silbereisen
(1985)
Eisenberg, Boehnke, Schuler, & Silbereisen
(1985)
Eisenberg, Boehnke, Schuler, & Silbereisen
(1985)
Eisenberg, Carlo, Murphy, & Van Court
(1995)
Eisenherg, Hertz-Lazarowitz, & Fuchs (1990)
Eisenberg, Hcrtz-Lazarowitz, & Fuchs (1990)
Eisenberg, Lennon, & Roth (1983)
Eisenberg, Miller. Shell, McNalley, & Shea
(1991)
Eisenberg, Pasternack, Cameron, & Tryon
(1984)
Eisenberg & Shell (1986)
Eisenberg et al. (1987) Eisenberg-Berg & Hand (1979)
Eisenberg-Berg & Mussen (1978)
Eisenberg-Berg & Neal (1981)
N
males
69 29 46
19 56 28 92
244
81 32 24
22 37
25 21 13
57
19 20 26 13
205 20 20 12 26
15 19 20 28 20 19 24
20 36 10
10 10 39 10 12 51 34
7 7 7
14
16
16
14
18
17
20
15 10 17 16
19
28 6
17 37 12
N females
112 59
100 19 68 67
110 273 73 32 24 17 43
38 17
27
79
18 20 27 18
270 20 20 12 33 20 29 20 34
20 19 23 20 36 10 10 10 96
10 12 28 35 7
7 7
16
13
17
16
12
22
34
13 14 16 16
25
28 7
16 35 10
4,
.14 -.02
-.40
.33
-.33
-.28
-.29
-.16
0"
0' -2.05
-.48
-.29
-.67
0 0
-1.62
-.56
-2.58
.59
0" .06
-.07
.34 -.07
-.47
,42
0" -.37
0" 0"
0" 0' .06"
0" 0" 0'
.13 -1.26
-.18
-.34
-.89"
-.73
-.94
-.11
0'
0"
0'
o-
0'
0"
-.28
.40 -.10
.63 -.37
0'
0" -.50
-.17
-.69
.12
<*i A8=
.04 3
-.12 3
-.30 3
-.67 3
.18 3
-.24 3
4 .03 3
.10 3
1 1
2.05 2
.48 2
2 2
2 2
.53 1
1.11 1
.63 2
1
0" 3
4 4
-.36 2
4 .46 1
-.42 1
3 2
1
1 1 4
-.13' 3
0" 1
0« 2
0" 2
-.13 2
.62 4
.18 1
-.04 2
2 .73 4
.94 4
.11 4
1
1
1
1
1
1
2
1
1 1 2
1
1
1 1 2 1
Type of
SES Construct dilemma
3 1 2
3 2
3 2
3 1
3 1
3 2
4 4
3 3
3 3
2 1
2 1
4.5 7
6 3
3 3 7
2 3 7
2 3 7
3 3 7
6
6 6 3.5
3 2
2 2 6 6 6 3 6 6 6
6 3
3
3 3 3
6 4 1 3
3 7 7 3
4 4 4
2 1
} S 3 2 2 2 2
2 & 3 7 7 7
6 4 1 1
1 3 3 1 4 7 3
3 3
2 3 7
2 3 7
2 3 7
6 3 7
6 3 7
6 3 7
3 3 7
3.5 3 7
6 3 7
3 3 7
3 3 7
3 3 7
3 3 7
3 3 7
3.5 3 7
4 3 7
3 3 7
Coding
6 6 6 1
2 6 6 6 6 3 3 2 1
6 6 6 6
3 3 3 7
6 1
1 2 2 6 6 2 7
5 5 5 2 4 2 2 2 2 7
3 6 2
1 1 1
5
5
5
5
5
5
5
5 5 5 5
5
5 5 5 5 5
Scale
1 1
1 0 1 1 1
1
1 1 1
0 0
1 1 1 1
1 1 1 1 1 1 1 0 0
1 1 0 1 1 1 1 0 0 0 0 0 0 1 1 1 1 1 1 1 1
1
1
1
1
1
1
1 1 1 1
1
1 1 1 1 1
Protagonist
gender
5
5 5 2
3 5 4 5
5 3
3 5 5
7 7 7 7
4 4
4 3 3 5
5 5 3
5 5 3 7 7 7
7
5 4 1
1 1 5
5 1
4 2
5 5 5
7
7
7
7
7
7
7
7 7 7 7
7
7 7 7 7 5
Published
0
0 0 0 0 1 0 1 1 1 1 0 1
1 1 1 1
0
0 0 1 0 1 1 0 1 0 0 1 0 1 1 1 1 1 0 0 0 0 0 0 0 1 0 0 n 1
1
1
1
1
1
1
1 1 1 1
1
1 1 1 1 1
GENDER DIFFERENCES IN MORAL ORIENTATION 715
Table 1 (continued)
Study
Eisenberg-Berg 4 Neal (1981)
Eisenberg-Berg & Neal (1981)
Eisenberg-Berg & Neal (1981)
Eisenberg-Berg & Roth (1980)
Estes (1994)
Ford & Lowery (1986)
Forsyth, Nye, & Kelley (1988)
Friedman, Robinson, & Friedman (1987)
Fuchs, Eisenberg, Hertz-Lazarowitz, &
Sharabany (1986) Fuchs, Eisenberg, Hertz-Lazarowitz, &
Sharabany (1986) Fuchs, Eisenberg, Hertz-Lazarowitz, &
Sharabany (1986)
Galotti, Kozbcrg, & Fanner (1991)
Galotti, Kozberg, & Farmer (1991)
Galotti, Kozberg, & Farmer (1991)
Garmon, Basinger, Gregg. & Gibbs (1996)
Garrod. Beal, & Shin (1990)
Garrod, Beal, & Shin (1990)
Garrod, Beal, & Shin (1990)
Gibbs, Arnold, & Burkhart (1984)
Gilligan & Attanucci (1988)
Graham (1995)
Hagar (1990)
Hopkins Baker (1995)
Hosick (1994)
Hosick (1994)
Jadack, Hyde, Moore, & Keller (1995)
Janoff(1991)
Janssens & Dekovic (1997)
Janssens & Dekovic (1997)
Janssens & Dekovic (1997)
Johnston (1988)
Kalkoskc (1992)
Korschgen (1988)
Krebs, Vermeulen, Denton, & Carpendale
(1994)
Krebs, Vermeulen, Denton, & Carpendaie
(1994)
Lamm (1992)
Langdale (1986)
Langdale (1986)
Langdale (1986)
Lantsberger (1993)
Liddell (1998)
Liddel & Davis (1996)
Liddell, Halpin, & Halpin (1993)
Logan (1990)
Lourenco (1991)
Lyons (1983)
Maqsud (1998)
Maqsud (1998)
Maqsud (1998)
Miller, Eisenberg, Fabes, & Shell (1996)
Moore (1988)
Norris (1996)
Orenstein (1991)
Perez (1994)
Peter & Gallop (1994)
Pratt, Diessner, Hunsberger. Pancer, & Savoy
(1991)
Pratt, Diessner, Hunsberger, Pancer, & Savoy
(1991)
Pratt, Golding, & Hunter (1983)
Pratt, Golding, & Hunter (1983)
Pratt, Golding. & Hunter (1983)
Pratt, Golding, Hunter, & Sampson (1988)
Pratt, Golding, Hunter, & Sampson (1988)
Pratt, Golding, Hunter, & Sampson (1988)
Pratt & Royer (1982)
Pratt & Royer (1982)
Pratt & Royer (1982)
Pratt & Royer (1982) Pratt & Royer (1982)
N
males
12 19 18 17 28
101 5S 47 12
21
33
27 34 27
166 10 i
13 22 46 27 20
204 20
15 38
100 20 19 24 28 93 19 10
10
12 16 24 30
95 12
133 185 30 16 14 40 40 40
34 35
126 60 97 26 13
16
10 10 10 11 12 11 24 20 12 12 12
N
females
11 22 23 16 28
101 158 54 17
15
22
34 39 25
162 8
10 8
53 34 26 20
312 15
15 36
100 20 19 23 30 79 19 10
10
12 16 22 29 51 25
248 174 60 16 16
40 40 40 40 52
151 60
187 25 15
20
10 10 10 8 8
12 24
20 12 12 12
4=
-.38
.39
.10
.31
.15 -.18
0" .17
-.29
-.56
.01
-.42
-.92
.04 -.39
.59 -1.17
.17 -.68
-1.04
-.17
-.26
0" -.62
-.42
.03
-1.16
-.30
-.50
-.23
-.56
-.57
.67
.44
-.15
-.69
-.83
-2.09
-1.24
-.38
.06 -.35
-.50
-.60
-.07
-1.40
-.41
.13 -.42
0" -.26
-.21
-.36
0" -.63
-.50
-.65
0" 0° 0" .15
-2.72
-.85
o- o- 0" 0" 0"
4
-.15
.07
.31 -.12
-.09
1.04
-.15
.26
.62
.42 -.03
1.19
.56
-.23
-.44
.15
.69
.83 2.09 1.24
-.28
O1
. .16
.60
.07 1.40
-.33
-.15
-.02
-.36
-.34
.05
.50
.65
0" 0" 0"
-.15 2.72
.85 0" 0' 0" 0" 0'
Age
1 1
1 1
3 3 3 3 1
1
1
2
2 3 6 1
1 1
6 6 2 4 3 4
4 3 4
1 1 1
2 2 4 3
3
1 I 2
4
3 3 3
3 6 1
6 2 2
2
1 3
4 4 4 4
4
5
3 4 5 3 4 5 3 4 2 3 4
SES
3 3 3
3 3 3
3 3
3.5
3.5
3.5
6 6 3 6 4 4 4
3 6 4 3.5 3 6
6 3 3 6 6 6 3
6 6 3
3
6
6 6 6 6 3 3 3 3
3 4
6 6 6 6 6 6 3.5
6 3 6
6
3 6 6 3 3
3 3 6 6 3 6
Construct
3 3 3
3 1 1
1
1 3
3
3
1
1 1
2 1 1
2 1 1
1 1 1
1 1
1 3
3 3
-1
1 1 1
1
1
1
1 1
1 1
1 2
1
1 1
1 3 1 1 1
1 1 1
1
2 2 2
Type of
dilemma
7 7
7 7 i 3
4 1 7
7
7
4 4
4 4 1 1
1 1 3
4 3 2 3
3 2 4
7 7 7 1
1 3 1
1
1
1 1
1 2 2 2 2
3 8 3 2 2 2 7 1 7 4
2 2 3
3
1 1
1 3 3 3 1 1
1 1 1
Coding
5 5 5 5
2 6
7 6 5
5
5
6 6 6 4 3
3 3 4 3
6 2 6 2
2 2 6 5
5 5 3
3 1 3
3
3
2 2 2 6 6 6 6
2 4 2 6 6 6 5
6 6 6 6 2 2
2
4 4 4 2 2 2 6 6 6 6 6
Scale
1 1
1 1
0 1
1 1 1
1
1
1 1
1 0 1 1 1 0 0 1 0 1 0 0 0 1 1 1 1 0 1 1 1
1
1
0 0 0 1 1 1 1 0 0 0 I 1 1 1 1 1 1 1 I 0
0
0 0 0 0 0 0
Protagonist
gender
7 5 7 7 1 J
4 3 1
7
7
4 4 4
4 1 1
1 3 5 4
5 5 5 5 3 4 7
7 7 1
1 5 1
5
1
1 1
1 5
5 5 i 5 7 5 5 5 5 7 3 5 4 5 5 5
5
1 1
1 5 5 5
3 3 3 3 3
Published
1 1
1 1
0 1
1 1
1
1
1 1
1 1
1 1
1 1 1
0 0
0 0 0 1 0
1 1 1 0 0 1
0
0 0 0 0 1 1
1
1 1 1
1
1 1 1 1 1
(table continues)
716 JAFFEE AND HYDE
Table 1 (continued)
Study
Roberscm (1988) Rogers (1987) Rothbart. Hanley, & Albert (1986) Saloner (1988) Schiller (1991) Schiller (1991) Schiller (1991) Schlesinger (1987) Self, Skeel, & Jccker (1993) Sitzer (1990) Skoe & Diessner (1994)" Skoe & Gooden (1993)b
Skoe, Pratt, Matthew, & Curror (1996)' Skoe, Pratt. Matthew, & Curror (1996)" Skoe, Pratt, Matthew, & Curror (1996)" Sochting, Skoe, & Marcia (1994)" Slander & Jensen (1993) Stiller & Forrest (1990) Stookey (1994) Tietjen (1986) Tietjen (1986) Tietjen (1986) Tietjen (1986) Vera & Levin (1989) Walker (1986a) Walker, de Vries, & Trevethan (1987) Walker, de Vrics, i Trevethan (1987) Walker & Moran (1991) Wark (1993) Wark & Krebs (1996) Wark & Krebs (1997) Webster (1996) Webster (1996) White (1994) White & Manolis (1997) Wilson (1995) Yacker & Weinberg (1990) Yacker & Weinberg (1990)
N males
20 1% 25 10 10
10 10 20 59 43 58 23
30 6 5
45 78 32 38
11 I I 10 11 40 24 54
66 26
51 55 40 19 13
124 138 52
20 29
N females
26 198
25 10 10 10 10 20 28
55 76 23 30 8 8
45
75 45 67 11 7 8
13 39 38 67 53 26 46 55
40 26 24
110 120 68 28
22
d.
-.41 -1.0
.06 -.81 -.63 -.10 -.46 -.22
-.36 -.24
-1.02 -.84
-1.19 -.62
.28
-.76" -1.18
.10
0" 0" 0" 0"
-.11 -.03 -.42 -.30 0"
-.30 -.07 -.11 -.35 -.94
-.43 -.40 -.38 -.42
-.16
Type of Protagonist dj Age SES Construct dilemma Coding Scale gender Published
.41 2 2
.49 2 3.5
.70 3 6 1 1
.81 2 4
.63 2 4
.10 2 4
.46 4 3
.22 4 6
.36 3 3 3 3 2 6 5 6 4 6 5 6 3 3 3 3
.67 3 3
4 3
3 2 4 3 3 2
7 5
5 0
4 0 5 1 7 0
1 1 0 1 0 7 1 0 1 0 8 1 0 1 0
3 2 0 5 0 3 7 1 2
2 & 3 7 2&3 7 2 & 3 7 2 £3 7 2 & 3 7 2 & 3 7
4 6
3 3 2 6
1 6 3 7 5
1 6 3 7 5 2 6 3 7 5 4 6 3 7 5
.11 3 3 1 2 2
) 5 1 1 0 7
7 •
7 7 7
7 4 5
5 7 7 7
7 6
. 0 3 4 6 2 1 4 0 1
.42 6 6
.30 6 6 0 ' 6 6 .30 3 3 .07 3 4 .11 3 3 .35 4 6 .94 5 6
.49 4 3
.51 4 3 4 3 4 3
5 2 0 5 6 2 0 5 1 4 0 1 6 3 0 5 6 3 0 5 1 3 3 1
1 ) 5
3 1 0 5
2 7 2 2 4 6 2 6
4 3 2 6
5 ) 5
4
5
}
3
) ) 3
5
Note. de — care reasoning effect size; d^ — justice reasoning effect size; Age: 1 = child; 2' = adolescent; 3 — university; 4 — adult; 5 — elderly; 6 =
mixed age; SES (socioeconotnic status): 1 = lower class; 2 = lower middle class; 3 = middle class; 3.5 = middle class to upper middle class; 4 = upper
middle class; 4.5 — upper middle class to upper class; 5 = upper class; 6 = mixed/unreported; 7 — male & female participants not matched on SES;
Construct: 1 = Gilligan care or justice; 2 = Kohlberg care or justice; 3 = Eisenberg prosocial moral reasoning; Type of dilemma: 1 = standard hypothetical;
2 = real-life hypothetical; 3 = self-generated; 4 = no dilemma; 5 = self-generated care; fi = self-generated justice; 7 = hypothetical care; 8 = hypothetical
justice; Coding: 1 = Brown et al. (1988); 2 = Lyons (1982); 3 = modified Lyons (1982); 4 = Colby et al. (1987); 5 = Eisenberg, Lennon, & Roth (1983);
6 = objective coding; 7 = other; Scale: 0 = categorical score; 1 = continuous score; Protagonist gender: 1 = male; 2 = female; 3 = both male & female
protagonists; 4 — no protagonist; 5 = participant is protagonist; 6 ~ gender not specified; 7 = protagonist is same gender as participant; Published: 0 =
unpublished; 1 = published. 3 Estimated effect size. b Ethic of Care Interview studies included in meta-analysis of gender differences in level of care reasoning.
Magnitude of Gender Differences in Justice Orientation
The gender difference in justice orientation as reported in the
studies was not statistically significant for 72% of the 95 indepen-
dent samples. Again, it is unlikely that the nonsignificant gender
differences in justice reasoning were due to low power to detect
differences. Seventy-six percent of the independent samples had
sufficiently large sample sizes to detect a moderate effect size for
gender differences in justice reasoning.
The overall effect size d for gender differences in justice orien-
tation was .19, indicating a small gender difference favoring males.
A homogeneity analysis using procedures specified by Hedges and
Becker (1986) indicated that the effect sizes were nonhomoge-
neous, HT = 307.44, p < .001. Therefore, analyses were conducted
to determine the extent to which the magnitude of the effect size
was moderated by other variables. The results of the moderator
analyses are presented in Table 2.
Age. Homogeneity analyses revealed that the magnitude of
the effect sizes differed significantly as a function of the re-
spondent's age, HB = 52.56, p < .001. As there were so few
independent samples in the child and older adult age groups, the
effect sizes for gender differences in justice reasoning within
these groups must be interpreted with caution. For university
students, the magnitude of the effect size d was close to zero.
For adolescents, the magnitude of the effect size was small and
favored males (d — .22), but for younger adults, the magnitude
of the effect size d was .40.
Socioeconotnic status. Homogeneity analyses revealed that
the magnitude of the effect sizes did not differ significantly as a
function of the respondent's SES, /?„ = 4.30, ns.
Moral orientation construct. A homogeneity analysis revealed
that the magnitude of these effect sizes did not differ significantly
as a function of whether measures based their definition of justice
GENDER DIFFERENCES IN MORAL ORIENTATION 717
Table 2
Magnitude of Gender Differences in Moral Orientation as a Function of Moderator Variables
Care reasoning
Moderator variables
Age group Children ( 11 or younger) Adolescents (12-19) University Younger adults (20-49) Older adults (50 or older) Mixed age
Socioeconomic status group Lower class Middle class Upper class Mixed/unreported
Moral orientation construct Gilligan Kohlberg Eisenberg
Type of dilemma Standard hypothetical Real-life hypothetical Self-generated No dilemma Designed to elicit care Designed to elicit justice
Coding scheme Brown et al. (1988) Lyons (1982) Lyons (1982), modified Colby et al. (1987) Eisenberg, Lennon, &
Roth (1983) Objective coding Other
Scale Categorical Continuous
Gender of protagonist Male Male and female No protagonist Participant is protagonist Same gender as participant
Publication status Unpublished Published
k
46 32 36 34 4
8
13 76 13 58
111 9
40
36 26 28 17 47 6
13 33 20 10 36
44 4
49 111
25 15 17 61 39
53 107
<4
-.08
-.53 -.18
-.33 -.63 -.48
-.08 -.27 -.42 -.32
-.32 -.25
-.09
-.19 -.20
-.37 -.57
-.17 -.18
-.23 -.40
-.61 -.25 -.08
-.23 -.23
-.38
-.26
-.36 -.03 -.57 -.27
-.11
-.29 -.28
95% confidence
interval
-.18 to .02
-.61 to -.44 -.24 to -.12
-.41 to -.25 -1.02 to -.25 -.61 to -.34
-.26 to .10 -.32 to -.22
-.58 to -.26 -.39 to -.25
-.36 to -.28 -.40 to -.10
-.19 to .01
-.28 to -.10 -.27 to -.14
-.46 to -.28 -.66 to -.48
-.26 to -.09 -.37 to .01
-.43 to -.04 -.50 to -.31
-.72 to -.51 -.39 to -.10
-.19 to .03
-.28 to -.18 -.41 to -.04
-.45 to -.30
-.30 to -.22
-.49 to -.23 -.14 to .09 -.66 to -.48 -.32 to -.22
-.21 to .00
-.35 to -.24 -.33 to -.23
HW
51.93
134.36" 72.37" 90.54' 2.82
15.50"
5.82 227.62" 38.99"
156.95"
385.81" 8.66
26.73
80.07" 50.99" 72.54"
108.07" 62.16
1.53
20.23" 101.39" 98.20" 8.68
22.90
112.31" 13.23"
121.78" 309.19"
58.43" 11.03
108.07' 151.84' 24.18
232.27" 206.03"
k
9 23 32 22 4 5
3 45
8 39
83 12 —
30 16 26 12 4 7
10 30 12
8 —
33 2
42 53
21 9
12 49 —
47 48
Justice reasoning
4
.35
.22
.04
.40
.63
.42
.15
.18
.43
.18
.19
.19 —
.10
.09
.42
.40
.05
.10
.37
.42
.32
.03 —
.04
.50
.41
.11
.34 -.01
.40
.17
—
.17
.21
95% confidence
interval
.1210.59
.1210.31 -.03 to .10
.30 to .50
.25 to 1.02
.22 to .62
-.26to.56 .12 to .23 .20 to .66 .10 to .26
.1410.23 -.01 to .39
—
.00 to. 19
.00 to. 17
.31 to .53
.28 to .51 -.14 to .23 -.02 to .22
.14 to .60
.32 to .51
.20 to .44 -.18 to .24
—
-.02 to. 10 .25 to .75
.32 to .50
.06 to. 16
.19 to .48 -.1410.12
.28to.51
.12 to .23
—
.10 to .23
.14to.27
#w
15.19
89.70" 35.03
103.44" 2.82
8.70'
2.18 134.83* 30.61"
135.52"
283.80" 23.64" —
75.02' 38.86" 52.49" 56.12* 40.25"
2.74
14.84
95.57" 33.37"
1.16 —
102.35" .05
119.15" 153.37'
49.95" 3.59
56.12" 157.48"
—
168.23" 138.53"
Note, k represents the number of effect sizes; dc is the effect size for care reasoning; d-s is the effect size for justice reasoning; Hw is the within-group homogeneity statistic (Hedges & Becker, 1986). * Significant nonhomogeneiry at p < .05, according to the chi-square test.
reasoning on Gilligan's description of the justice orientation or on
Kohlberg's fairness and/or normative orientations, HB = 0, ns.
Type of dilemma. A homogeneity analysis revealed that the
magnitude of the effect size differed significantly as a function of
the type of dilemma, ffB = 41.96, p < .001. For hypothetical
dilemmas (standard or real-rife) and dilemmas designed to elicit
care or justice, the magnitude of the effect size for gender differ-
ences in justice reasoning was close to zero. The effect sizes for
self-generated dilemmas and measures that did not use a dilemma
were moderate in size and favored males (d — .42 and d = .40,
respectively).
Coding, scheme. A homogeneity analysis revealed that the
magnitude of the effect size differed significantly as a function
of the coding scheme, HB = 60.10, p < .001. The effect size
was close to zero for those studies that used Kohlberg's scoring
scheme or objective scoring schemes (d = .03 and d = .04,
respectively) and d = .42 for studies that used Lyons's (1982)
coding scheme.
718 JAFFEE AND HYDE
Table 3
Regression Coefficients From Analysis Predicting Care Effect
Size From Moderator Variables
Moderator variable ft p sr sr2
Age group -.04 as -.03 ' Socioeeonomic status .04 ns .04 * Moral orientation construct
Eisenberg vs. Kohlberg & Gilligan -.01 ns -.01 " Kohlberg vs. Gilligan -.12 ns -.13 .02
Type of dilemma Self-generated vs. standard -.21 .05 -.21 .04
hypothetical, real-life hypothetical, designed to elicit care or justice
No dilemma vs. dilemma -.29 .001 -.29 .08 Coding scheme
Objective vs. others .12 ns .11 .01 Brown et al. & Lyons modified vs. .06 ns .06 a
Lyons, Eisenberg, & Kohlberg Scale (continuous vs. categorical) .18 .05 .18 .03 Gender of Protagonist
Participant vs. fictional -.07 ns -.06 • Male vs. nonmale —.07 ns —.07 a
Publication status -.05 ns -.05 '
Note. /3 - standardized beta; sr = semipartial correlation; sr2 — squared semipartial correlation. "ir2 < .01.
Scale. A homogeneity analysis revealed that the magnitude of
the effect size differed significantly as a function of the scale of
measurement, //B = 34.92, p < .005. Although the effect size was
rf = .11 for measures that yielded continuous outcomes, it was
larger in size for measures that yielded categorical outcomes
(d = .41).
Gender of protagonist. A homogeneity analysis revealed that
the magnitude of the effect size differed significantly as a function
of the gender of the protagonist, #„ = 40.30, p < .001. The
magnitude of the effect was small when the participant was the
protagonist or when there were both male and female protagonists.
The magnitude of the effect was moderate in size when the
protagonist was male or when the dilemma did not contain a
protagonist.
Publication status. The magnitude of the effect size did not
differ significantly as a function of whether the study was pub-
lished, HB = .68, ns.
Regression Analysis for Justice Orientation
Because the overall test for the homogeneity of variance re-
vealed that the effect sizes were nonhomogeneous, a multiple
regression analysis was conducted to determine how much the
moderator variables accounted for the variance in the effect sizes
(Table 4). The corrected effect size for justice orientation was the
criterion variable (Hedges, 1981), and the moderator variables
were entered in stepwise fashion. The moderator variables ac-
counted for a significant 17% of the variance in the effect sizes for
gender differences in justice reasoning, F — 9.38,p < .001. Again,
squared semipartial regression coefficients were computed to de-
termine how much of the variance in the effect sizes was uniquely
accounted for by each of the moderator variables when the other
moderators were controlled. Table 4 indicates that the contrast
comparing self-generated dilemmas with standard hypothetical
dilemmas, real-life hypothetical dilemmas, and dilemmas designed
to elicit care or justice reasoning was a significant predictor of the
magnitude of the effect size for gender differences in justice
reasoning, [} = .28, p < .01. and uniquely accounted for 8% of that
variance. The contrast comparing studies yielding categorical out-
comes versus those with continuous outcomes was also significant,
J3 = -.25, p < .01, and accounted for 7% of the variance in the
effect sizes for justice reasoning. None of the other moderator
variables had significant effects.
Estimated Effect Sizes Excluded
When the effect sizes estimated as zero were excluded, the
magnitude of the overall gender difference in care reasoning was
d = -.35, and the magnitude of the gender difference in justice
reasoning was d = .22. Because the effect sizes for justice and care
reasoning were similar in magnitude and sign regardless of
whether the estimated effect sizes were excluded, the moderator
analyses were not rerun.
Levels of Care Reasoning
As discussed in the introduction, the ECI (Skoe & Marcia, 1991)
measures the levels of care reasoning originally proposed by
Gilligan (1982). A separate meta-analysis was conducted to esti-
mate the magnitude of the effect size for gender differences in
level of care reasoning. The overall effect size d for gender
differences in level of care reasoning was -.34, indicating a small
to moderate difference favoring females. As there were only seven
independent samples included in this meta-analysis, it was not
possible to conduct moderator analyses. It is worth noting, how-
ever, that with one exception, females demonstrated higher levels
of care reasoning than males across age groups and that the
Table 4
Regression Coefficients From Analysis Predicting Justice Effect
Size From Moderator Variables
Moderator variable 6 p sr sr1
Age group Socioeeonomic status Moral orientation construct
Kohlberg vs. Gilligan Type of dilemma
Self-generated vs. standard hypothetical, real-life hypothetical, designed to elicit care or justice
No dilemma vs. dilemma Coding scheme
Objective vs. others Brown et al., & Lyons modified vs.
Lyons & Kohlberg Scale (continuous vs. categorical) Gender of protagonist
Participant vs. fictional Male vs. nonmale
Publication status
.04 ns .04
-.01 ns -.01
-.04 ns -.04
.28 .01 .28 .08
.14 ns
-.12 ns -.02 ns
.15 .02
-.09 -.01
-.25 .01 -.26 .07
-.03 ns .10 ns
-.03 .10 .01 .05 "
Nole. (3 — standardized beta; sr semipartial correlation. "sr2 < .01.
= semipartial correlation; sr2 = squared
GENDER DIFFERENCES IN MORAL ORIENTATION 719
magnitude of the effect size was greater among middle-aged and
older adults compared with adolescents and young adults.
Discussion
This review used modern meta-analytic techniques to analyze
113 studies of moral reasoning (160 independent samples measur-
ing care and 95 measuring justice). This meta-analysis allowed us
to evaluate empirically Gilligan's assertion that the justice and care
orientations are strongly gender differentiated. The magnitude of
the effect size for gender differences in care reasoning was -.28,
indicating a small difference favoring females. The magnitude of
the effect size for gender differences in justice reasoning was. 19,
indicating a small difference favoring males. The small magnitude
of these effects, combined with the finding that 73% of the studies
that measured care reasoning and 72% of the studies that measured
justice reasoning failed to find significant gender differences, leads
us to conclude that, although distinct moral orientations may exist,
these orientations are not strongly associated with gender.
This analysis also indicates that the magnitude of the effect size
for gender differences in care and justice reasoning differs as a
function of specific variables that have often been confounded
with the measurement of moral orientation. An important contri-
bution of this meta-analysis is the finding that specific moderator
variables uniquely account for variance in the effect sizes for care
and justice reasoning when other moderators are controlled.
What Moderates Gender Differences in
Moral Orientation?
All variables except publication status significantly moderated
the magnitude of the effect size for gender differences in care and
justice reasoning, and the implication of these findings is discussed
below. However, the results must be interpreted with caution,
bearing in mind that the regression analyses revealed relatively
few unique effects of these variables on the magnitude of the effect
size for justice or care reasoning.
Age. According to Gilligan and colleagues (Gilligan & Wig-
gins, 1987), gender differences in the use of care and justice
reasoning should increase as children get older and their experi-
ences of attachment and inequality are differentially reinforced.
The findings from the moderator analyses do not support this
claim. Although the magnitude of the effect size for gender dif-
ferences in care reasoning did increase from childhood to adoles-
cence, it then decreased sharply amongst university students. One
possibility is that as individuals get older, they become more adept
at using more than one mode of moral reasoning. Gilligan (1986a)
has suggested that to be successful in a male-dominated society,
females are expected to be conversant in both modes of moral
reasoning. Indeed, the more recent work of Gilligan and her
colleagues (Brown & Gilligan, 1993) has explored the idea that, in
adolescence, girls become aware that their own moral voice is
neither recognized nor valued by society. Consequently, they come
to question their own understanding of their experience and con-
form to interpretations imposed by others. In so doing, they learn
to use the justice perspective instead of or along with the care
perspective. Consistent with this possibility is the finding that
gender differences hi justice reasoning were also smallest among
university students. However, neither of these findings make clear
whether gender differences in care or justice reasoning decreased
among university students because young women changed their
mode of moral reasoning (e.g., used less care reasoning or more
justice reasoning), because young men changed their mode of
moral reasoning (e.g., used more care reasoning or less justice
reasoning), or because both groups changed.
Gender differences in both care and justice reasoning increased
again in the younger adult group. To the extent that younger adults
are more likely to be involved in family formation, this finding is
consistent with the possibility that family organization during the
period of active parenting leads to increased gender-role polariza-
tion and, consequently, to gender-stereotyped moral reasoning
(Pratt et al., 1988).
Socioeconomic status. Although gender differences in care
reasoning increased from lower-class to upper-class groups, the
magnitude of the effect size for justice reasoning did not differ as
a function of social class. Again, these findings do not speak to the
question of whether gender differences in care reasoning are
smallest among lower-class groups because low social status pro-
motes a concern with others who, in part, determine one's out-
comes (Tronto, 1987) or because low social status promotes a
concern with fairness and rights because these rectify the inequal-
ities faced by minority or subordinate groups (Beal et al., 1997).
The findings do, however, suggest that social class and gender do
not interact, such that lower-class women are more likely than
other groups to endorse the care orientation or the justice
orientation.
Moral orientation construct. Given Gilligan's (1982) claims
that Kohlberg's coding scheme is insensitive to the care orienta-
tion, it is notable that the magnitudes of the effect size for gender
differences in care reasoning as conceptualized by Gilligan and by
Kohlberg were of approximately equal size. In the case of justice
reasoning, the magnitude of the effect size was identical for studies
that defined the justice orientation according to Gilligan's defini-
tion versus those studies that defined the justice orientation ac-
cording to Kohlberg. It is also notable that the smallest effect size
for gender differences in care reasoning was from studies that
conceptualized care as prosocial moral reasoning. This might be
explained by the fact that the prosocial dilemmas "pull for" care
reasoning from both male and female respondents.
Type of dilemma. Moderately large gender differences were
found for both care and justice reasoning among measures that did
not include a dilemma. These included the Washington University
Sentence Completion Test (WUSCT; Rogers, 1987) , the CROS
(Atunzu, 1986), the Sociomoral Reflection Measure-Short Form
(SRM-SF; Gibbs, Basinger, & Fuller, 1992), and the Ford and
Lowery questionnaire (Ford & Lowery, 1986). The CROS and the
Ford and Lowery questionnaire present participants with descrip-
tions of care or justice reasoning and ask them how much these
characterize their own thinking about moral dilemmas or how
important such considerations are in resolving moral problems. In
the case of these measures, it is the participant, and not the
researcher, who determines whether or not the participant is a
justice or care reasoner. Participants' perceptions of their own
moral reasoning may be influenced by their own self-concept and
may differ from the perceptions of a researcher who is ostensibly
coding a response from a more objective perspective.
In the WUSCT, respondents complete sentence stems that are
then coded for the presence of justice and care reasoning. The
720 JAFFEE AND HYDE
SRM-SF is designed to measure developmental stages of moral
judgment. It elicits responses related to the evaluation and justifi-
cation of the importance of sociomoral values such as contract and
truth, affiliation, property and law, and legal justice. These can
then be coded for various stage aspects, such as empathic role-
taking, that are related to moral reasoning content. It is less clear
why these measures should lead to larger gender differences in
care reasoning. It is worth noting, however, that the no dilemma
group included only one study that used the SRM-SF and only
three studies that used the WUSCT.
Importantly, the moderator analyses also revealed that the mag-
nitudes of the effect size for both care and justice reasoning were
greater for self-generated dilemmas compared with dilemmas in
which the content was standardized. Indeed, this comparison ac-
counted for 4% of the variance in the magnitude of the effect sizes
for care and 8%-of the variance in the magnitude of the effect sizes
for justice reasoning. This finding provides strong support for
studies that have found that'gender differences in moral orientation
are a function of the situation to which individuals are responding
(Walker, 1989; Walker et al., 1987; Wark & Krebs, 1997). On the
basis of such findings, Walker (1986a) has argued that males and
females do not differ in basic moral orientation but rather in the
kinds of problems they encounter and report. This argument sug-
gests that researchers who study gender differences in moral
orientation must make sure that moral orientation is not inadver-
tently confounded with dilemma content.
Coding scheme. Although the magnitude of the effect size for
gender differences hi both care and justice reasoning differed
significantly as a function of the coding scheme, this variable
accounted for little of the variance hi effect size estimates when the
other moderators were controlled. This suggests that the coding
schemes themselves are less important than other characteristics
associated with them, such as the type of outcome (categorical vs.
continuous) they yield.
Scale. The effect size for gender differences in care and justice
reasoning was larger when moral orientation was a categorical
instead of a continuous outcome. There are at least two explana-
tions for this finding. One is that categorical measures cause males
and females to look more different than they are. For instance, in
Lyons's coding scheme, classification as a justice or care reasoner
is based on modal moral orientation. Consequently, if 51% of one
person's considerations are care considerations, she is considered
a care reasoner and if 51% of another person's considerations are
justice considerations, he is considered a justice reasoner. Despite
the fact that these two individuals offer virtually the same propor-
tion of care and justice considerations, they are categorized as
displaying distinct moral orientations.
The second possibility is that because categorical outcomes
were converted into continuous scores for the purpose of comput-
ing an effect size, the effect size for these transformed scores was
less precise than the effect size for continuous measures. In some
cases, these transformed scores may have inflated the magnitude of
the effect size. If this was the case, the overall effect size for
continuous measures better represents the true gender difference in
care orientation.
This finding implies that it might be useful for future studies to
use measures that can be scored both continuously and categori-
cally. Such measures need not assume that justice and care repre-
sent opposite ends of a single spectrum (although such a concep-
tualization is implicit in, e.g., a percent care score). Justice and
care reasoning can be measured as a two-dimensional construct as
in, for example, the MMO (Liddell, 1990).
Gender of protagonist. Although the magnitude of the effect
sizes for gender differences in both care and justice reasoning
differed significantly as a function of the gender of the protagonist
of the dilemma, it is important to note that this variable accounted
for 1% or less of the variance in the effect size estimates when
controlling for other moderators. Although the perspective from
which an individual responds to a dilemma may influence moral
orientation, it does not appear to do so differently for males and
females.
In summary, the results of the care and justice analyses suggest
that researchers who study gender differences in moral orientation
should be aware of the extent to which moral orientation is
influenced by the content of specific moral dilemmas and by the
context in which these dilemmas arise. Importantly, however, the
moderator analyses demonstrate that even though the magnitude of
the effect size does differ as a function of variables like age and
dilemma type, among no group is the effect size for gender
differences in care or justice reasoning ever more than moderate in
size, and in most cases, it is considerably smaller. Thus, research-
ers may benefit from turning their attention away from the study of
gender differences in moral orientation and toward a more sophis-
ticated characterization of moral orientation or to questions of how
moral orientations develop over time.
Implications
An important limitation of this meta-analysis is that we were
able to determine only relative gender differences in the use of care
and justice reasoning and not whether females predominantly use
care reasoning and males predominantly use justice reasoning. In
its strongest form, Gilligan's assertion is that females consistently
use the care orientation and infrequently use the justice orientation
and that males consistently use the justice orientation and infre-
quently use the care orientation. This meta-analysis was able to
evaluate only whether females used the care orientation more than
males and whether males used the justice orientation more than
females. However, it is entirely possible that all participants used
a mix of justice and care reasoning, with females using relatively
more care reasoning than males and males using relatively more
justice reasoning than females. Many of Ihe findings regarding
gender differences in moral orientation are consistent with this
possibility (Walker, 1991; Wark & Krebs, 1996).
It is possible that we might have observed a gender-related
pattern in justice and care reasoning if we had looked at differ-
ences in moral orientation as a function of gender-role identity
(i.e., masculine, feminine, or androgynous). This, however, was
not feasible because it would have required us to know the gender-
role identity classification for each study member. It is worth
noting, however, that the evidence for an association between
gender-role identity and moral orientation is weak (Pratt & Royer,
1982), with most studies failing to find significant differences in
moral reasoning as a function of gender-role identity (Abaris,
1990; Hagar, 1990; Stookey, 1994; Wark & Krebs, 1996). It is
unlikely, then, that gender-role identity would have proved to be
an important moderator of gender differences in moral orientation.
GENDER DIFFERENCES IN MORAL ORIENTATION 721
If, as the results of this meta-analysis imply, it is the case that
most males and females use a mix of justice and care reasoning,
the field should move beyond the debate about gender differences
in moral reasoning and focus instead on how individuals integrate
justice and care reasoning and under what conditions they deter-
mine which is a more adequate basis for moral action. For in-
stance, Johnston (1988) found that boys and girls were equally
capable of switching moral orientations when prompted to offer a
different solution to a moral dilemma, and the solution that par-
ticipants considered best was not always the one they offered
originally. In follow-up interviews with her participants, Johnston
found that boys, even if they thought the care solution was the best
one, would not use it unless they believed the relationship between
the characters in the dilemma could be salvaged. These findings
suggest that the practical constraints of the situation may dictate
one's choice of which moral orientation to implement.
What is most ironic is that the notion of moral voice that lies at
the heart of the current debate over gender differences in moral
reasoning has little to do with how Gilligan, Brown, and their
colleagues on the Harvard Project on Women's Psychology and
Girls' Development currently conceptualize moral voice (Brown,
1994; Brown & Gilligan, 1993; Gilligan, 1994). It is telling that
not one of the authors represented in this meta-analysis is included
in the body of work that, according to Gilligan (1994), best
supports her theory of women's psychological development. Much
of this discrepancy can be traced to the fact that Gilligan and her
colleagues (Brown, 1994; Gilligan, 1994) argue from a different
epistemological framework. They argue that quantitative tests of
gender differences in moral reasoning miss the point because they
construe justice and care as opposite ends of a single continuum
instead of recognizing that these represent distinct moral perspec-
tives. Quantitative researchers, on the other hand, argue that the
conclusions reached by the Harvard Project are not falsifiable
(Walker, 1995), that the care and justice constructs are insuffi-
ciently well defined and impossible to operationalize (Breakwell,
1994), and that measures such as the Reading Guide (Brown et al.,
1988) are not well validated. This is not so much a debate about a
specific theory or finding as it is a debate about epistemology.
Unfortunately, the intersection between qualitative and quantita-
tive methodologies is not well defined, and there are few common
benchmarks for evaluating different epistemological approaches
(Jaffee, Kling, Plant, Sloan, & Hyde, 1999). For instance, Brown,
Tappan, Gilligan, Miller, and Argyris (1989) pointed out that
traditional psychometric conceptions of reliability and validity are
based on assumptions that "render them inappropriate for inter-
pretive approaches" (p. 156). What this implies is that researchers
who develop psychometrically valid and reliable measures of
moral orientation will satisfy Gilligan's critics on the one hand but
will fail to capture the construct of moral orientation as she and her
colleagues have developed and refined it over the years.
Do We Need the Care Orientation?
Gilligan's critics suggest a number of reasons why the care
orientation is unnecessary and/or insufficient as a mode of moral
reasoning. For instance, Kohlberg and colleagues (Higgins, 1989;
Kohlberg et al., 1983) have claimed that his cognitive-
developmental framework encompasses both care and justice, sug-
gesting that the ability to take another's perspective, or moral
role-taking, fosters a sense of responsibility to care. In addition,
Colby et al. (1983) showed that considerations of care, relation-
ship, and interpersonal trust are represented as norms and elements
at each stage in the MJI scoring scheme, and various researchers
have demonstrated that care reasoning can be elicited by the MJI
dilemmas (Walker et al., 1987; Wark & Krebs, 1996, 1997). From
this standpoint, the care orientation does not require a separate
theoretical and psychometric structure.
On the other hand, these findings underscore that the care
orientation does exist and can be represented in individuals' re-
sponses to moral dilemmas. Moreover, the suggestion that the care
and justice orientations can be integrated at the highest levels of
Kohlbergian reasoning does not adequately capture Gilligan's no-
tion that these orientations represent distinct moral perspectives
that exist in tension. However, even if one agrees that Kohlbergian
moral reasoning does not encompass Gilligan's (1982) vision of
moral maturity, it is not clear whether Gilligan's conceptualization
forms an adequate basis for moral action in that it does not offer
guidelines as to when one moral perspective should precede the
other. Unfortunately, Gilligan and her colleagues (Brown & Gil-
ligan, 1993) have moved away from work on moral development
to a broader focus on girls' and women's psychological develop-
ment, and it is unlikely that the necessary clarification will be
offered.
Conclusion
The results of this meta-analysis do not indicate that the care and
justice orientations are strongly gender differentiated. Moreover,
the results of the moderator analyses support previous findings that
the type Of moral reasoning an individual uses is highly sensitive
to the context and content of the dilemma. Although these results
weaken Gilligan's strongest claims about the relation between
gender and moral reasoning, they do not diminish her larger point.
In a 1994 interview (Kitzinger & Gilligan, 1994), Gilligan high-
lighted the extent to which In a Different Voice was motivated by
the observation that there was a "difference between the way
psychologists talked about the self and about morality and the way
people actually experienced it" (p. 409). This, perhaps, is the most
important message to take from Gilligan's work. Debates over the
association between gender and moral orientation and the moral
adequacy of the care orientation, although important and neces-
sary, miss the larger point that if psychological theories of human
development intend to represent lived experience, then they must
be constructed with the diversity of such experience in mind.
References
References marked with an asterisk indicate studies included in the
meta-analysis.
*Abaris, N. L. (1990). Gender and sex-role differences in moral develop-
ment among undergraduates: Ethics of care versus ethics of justice
(Doctoral dissertation, Rutgers University, 1990). Dissertation Abstracts
International, 51, 3588B.
*Abide, M. M. (1994). Decisions about substance use: Moral reasoning
and moral orientation (Doctoral dissertation, University of Virginia,
1994). Dissertation Abstracts International, 55, MSA.
*Akman, D. (1991). Gender differences in moral orientation (Masters
thesis. University of Calgary, 1991). Masters Abstracts Interna-
tional, 30, 965.
722 JAFFEE AND HYDE
*Albrecht, E. F. (1989), The relationship between protagonist gender in
hypothetical moral dilemmas and the use of justice versus care moral
orientation (Doctoral dissertation, Adelphi University, 1989). Disserta-
tion Abstracts International, 49, 1734A.
*Arvizu, D. R. (1995). The care voice and American Indian college
students: An alternative perspective for student development profession-
als. Journal of American Indian Education, 34, 1-17.
*Atunzu, K. O. (1986). The relationship among ethics of care, ethics of
justice, psychosocial intimacy and gender (Doctoral dissertation, Clar-
emont Graduate University, 1986). Dissertation Abstracts Interna-
tional, 47, 2505A.
Auerbach, J,, Blum, L., Smith, V., & Williams, C. (1985). On Gilligan's In
a Different Voice. Feminist Studies, 11, 149-161.
*Barnett, M. A., Quackenbush, S. W., & Sinisi, C. S. (1995), The role of
critical experiences in moral development: Implications for justice and
care orientations. Basic and Applied Social Psychology, 17, 137-152.
Baumriml, D. (1986). Sex differences in moral reasoning: Response to
Walter's (1984) conclusion that there are none. Child Development, 57,
511-52).
*Beal, C. R., Garrod, A., Ruben, K., & Stewart, T. L. (1997). Children's
moral orientation: Does the gender of the dilemma character make a
difference? Journal of Moral Education, 26, 45-58.
Benenson, J. F., Morash, D., & Petrakos, H. (1998). Gender differences in
emotional closeness between preschool children and their mothers. Sex
Roles, 38, 975-985.
*Bollerud, K. H. (1987). Adolescent sexual decision-making: The influ-
ence of gender and pubertal development on moral reasoning (Doctoral
dissertation, Harvard University, 1987). Dissertation Abstracts Interna-
tional, 48, 1704A,
Breakwell, G. M. (1994). Review of In a Different Voice by Carol Gilligan.
Feminism and Psychology, 4, 404-406.
Brown, L. M. (1994). Standing in the crossfire: A response to Tavris,
Gremmen, Lykes, Davis and Contralto. Feminism and Psychology, 4,
382-398.
Brown, L. M., Argyris, D., Attanucci, J., Bardige, B., Gilligan, C.,
Johnston, D. K., Miller, B., Osborne, R., Tappan, M., Ward, J., Wiggins,
G., & Wilcox, D. (1988). A guide to reading narratives of conflict and
choice for self and relational voice. Cambridge, MA: Project on the
Psychology of Women and the Development of Girls, Harvard Graduate
School of Education.
Brown, L. M., & Gilligan. C. (1993). Meeting at the crossroads: Women's
psychology and girls' development. Feminism and Psychology, 3, 11-
35.
*Brown, L. M., Tappan, M. B.. Gilligan, C., Miller, B. A., & Argyris, D. E.
(1989). Reading for serf and moral voice: A method for interpreting
narratives of real-life moral conflict and choice. In M. J. Packer & R. B.
Addison (Eds.), Entering the circle: Hermeneutic investigation in psy-
chology (pp. 141-164). Albany: State University of New York Press.
Bussey, K., & Maughan. B. (1982). Gender differences in moral reasoning.
Journal of Personality and Social Psychology, 42, 701-706.
*Carlo, G,, Eisenberg, N., & Knight, G. P. (1992). An objective measure
of adolescents' prosocial moral reasoning. Journal of Research on
Adolescence, 2, 331-349.
*Carlo, G., Koller, S. H., & Eisenberg, N. (1998). Prosocial moral reason-
ing in institutionalized delinquent, orphaned, and noninstitutionalized
Brazilian adolescents. Journal of Adolescent Research, 13, 363-376.
*Carlo, G., Koller, S. H., Eisenberg, N., DaSilva, M. S., & Frolich, C. B.
(1996). A cross-national study on the relations among prosocial moral
reasoning, gender role orientations, and prosocial behaviors. Develop-
mental Psychology, 32, 231-240.
*Carney, C. C. (1992). Children's concepts of good and bad: Age and sex
differences in moral reasoning (Doctoral dissertation, University of
California, Berkeley, 1992). Dissertation Abstracts International, 53,
5458B.
*Cassidy. K. W., Chu, J. Y., & Dahlsgaard, K. K. (1997). Preschoolers'
ability to adopt justice and care orientations to moral dilemmas. Early
Education and Development, 8, 419-434.
*Castor-Scheufler, M, G. (1994). Looking for Gilligan's "other voice":
The relation of gender identity, empathy, and moral development (Doc-
toral dissertation, University of Houston, 1994). Dissertation Abstracts
International, 55, 4619B.
Chodorow, N. (1978). The reproduction of mothering. Berkeley: Univer-
sity of California Press.
*Clopton, N. A., & Sorell, G. T. (1993). Gender differences in moral
reasoning: Stable or situational? Psychology of Women Quarterly, 17,
85-101.
Colby, A., & Damon, W. (1983). Listening to a different voice: A review
of Gilligan's In a Different Voice. Merrill-Palmer Quarterly, 29, 473-
481.
Colby, A., Kohlberg, L., Gibbs, J., & Lieberman, M. (1983). A longitudinal
study of moral development. Monographs of the Society for Research in
Child Development, 48(\-2, Serial No. 200).
Colby, A., Kohlberg, L., Speicher, B., Hewer A., Candee, D., Gibbs, J., &
Power, C. (1987). The measurement of moral judgment: Standard issue
scoring manual (Vol. 2). Cambridge, England: Cambridge University
Press.
*Cole, E. H. S. (1987). In their own voices: Hawaiian-American adoles-
cents speak of moral choice and personal changes (Doctoral dissertation,
Harvard University, 1987). Dissertation Abstracts International, 48,
2576A.
*Conley, T. D., Jadack, R. A., & Hyde, J. S. (1997). Moral dilemmas,
moral reasoning, and genital herpes. Journal of Sex Research, 34,
256-266.
*Craft, D. A. (1992). Children's moral orientation: An instrument valida-
tion study to assess school children's moral orientation (Doctoral dis-
sertation, University of San Francisco, 1992). Dissertation Abstracts
International, 54, 170A.
*Crown, J., & Heatherington, L, (1989). The costs of winning? The role of
gender in moral reasoning and judgments about competitive athletic
encounters. Journal of Sport and Exercise Psychology, II, 281-289.
*Curror, S. (1994). The relationship between empathic listening styles and
moral reasoning development in adolescents (Masters thesis, Wilfrid
Laurier University, 1994). Masters Abstracts International, 33, 664.
*Dekovic, M., & Gerris, J. R. M. (1994). Developmental analysis of social
cognitive and behavioral differences between popular and rejected chil-
dren. Journal of Applied Developmental Psychology, 15, 367-386.
*Derry, R. (1989). An empirical study of moral reasoning among manag-
ers. Journal of Business Ethics, 8, 855-862.
*de Vries, B., & Walker, L. J. (1986). Moral reasoning and attitudes toward
capital punishment. Developmental Psychology, 22, 509-513.
*Dezolt, D. M. (1992). Themes, age, and gender differences in children's
stories about caring (Doctoral dissertation, Kent State University, 1992).
Dissertation Abstracts International, 53, 2741A.
*Diamonti, N. L. (1993). An investigation of gender differences in ado-
lescent moral reasoning about rule breaking behavior (Doctoral disser-
tation, Boston College, 1993). Dissertation Abstracts International, 54,
2892A.
*Dickey, B., Kroll, B. S., & Jenkins, L. (1987, November). Gilligan
revisited: Methodological issues in the study of gender and moral
development. Paper presented at the National Convention of the Speech
Communication Association, Boston, MA.
*Diederichs, M. (1993). Young children and moral understanding (Masters
thesis, California State University, Fullerton, 1993). Masters Abstracts
International, 32, 112.
*Dohrenwend, A. M. (1995), Moral orientation in law students: A longi-
tudinal study from a relational perspective (Doctoral dissertation, Uni-
versity of North Dakota—Grand Forks, 1995). Dissertation Abstracts
International, 57, 1030A.
*Donenberg, G. R., & Hoffman, L. W. (1988). Gender differences in moral
development. Sex Roles, IS, 701-717.
*Dossett, M. E. (1989). Moral orientation of males and females in parental
GENDER DIFFERENCES IN MORAL ORIENTATION 723
caregiving, parental non-caregiving, and non-parental roles (Doctoral
dissertation. University of Alabama, 1989). Dissertation Abstracts In-
ternational, 51, 792A.
*Eisenberg, N., Boehnke, K., Schuler, P., & Silbereisen, R. K. (1985). The
development of prosocial behavior and cognitions in German children.
Journal of Cross-Cultural Psychology, 16, 69-82.
*Eisenberg, N., Carlo, G., Murphy, B., & Van Court, P. (1995). Prosocial
development in late adolescence. Child Development, 66, 1179-1197.
Eisenberg, N., Fabes, R., & Shea, C. (1989). Gender differences in empa-
thy and prosocial moral reasoning: Empirical investigations. In M.
Brabeck (Ed.) Who cares? Theory, research, and implications of the
ethic of care (pp. 127-143). New York: Praeger.
*Eisenberg, N., Hertz-Lazarowitz, R., & Fuchs, I. (1990). Prosocial moral
judgment in Israeli kibbutz and city children: A longitudinal study.
Merrill-Palmer Quarterly, 36, 273-285.
*Eisenberg, N., Lennon, R., & Roth, K. (1983). Prosocial development: A
longitudinal study. Developmental Psychology, 19, 846-855.
'Eisenberg, N.. Miller, P. A., Shell, R., McNalley, S., & Shea, C. (1991).
Prosocial development in adolescence: A longitudinal study. Develop-
mental Psychology, 27, 849-857.
*Eisenberg, N., Pastemack, J. F., Cameron, E., & Tryon, K. (1984). The
relation of quantity and mode of prosocial behavior to moral cognitions
and social style. Child Development, 55, 1479-1485.
*Eisenberg, N., & Shell, R. (1986). Prosocial moral judgment and behavior
in children: The mediating role of cost. Personality and Social Psychol-
ogy Bulletin, 12, 426-433.
*Eisenberg, N., Shell, R., Pastemack, J., Lennon, R., Seller, R., & Mathy,
R. M. (1987). Prosocial development in middle childhood: A longitudi-
nal study. Developmental Psychology, 23, 712-718.
Eisenberg-Berg, N. (1979). Development of children's prosocial moral
judgment. Developmental Psychology, 15, 128-137.
*Eisenberg-Berg, N., & Hand, M. (1979). The relationship of preschoolers'
reasoning about prosocial moral conflicts to prosocial behavior. Child
Development, 50, 356-363.
*Eisenberg-Berg, N., & Mussen, P. (1978). Empathy and moral develop-
ment in adolescence. Developmental Psychology, 14, 185-186.
*Eisenberg-Berg, N., & Neal, C. (1981). Children's moral reasoning about
self and others: Effects of identity of the story character and cost of
helping. Personality and Social Psychology Bulletin, 7, 17-23.
Eisenberg-Berg, N., & Roth, K. (1980). Development of young children's
prosocial and moral judgment: A longitudinal follow-up. Developmental
Psychology, 16, 375-376.
*Estes, J. M. (1994). Gender differences in moral reasoning comparing
pre-professional university students in nursing, education, and engineer-
ing: A content analysis (Doctoral dissertation, University of Alabama,
Tuscaloosa, 1994). Dissertation Abstracts International, 55, 1866A.
Fishkin, J., Keniston, K., & MacKinnon, C. (1973). Moral reasoning and
political ideology. Journal of Personality and Social Psychology, 27,
109-119.
Flanagan, O., & Jackson, K. (1987). Justice, care, and gender: The
Kohlberg-Gilligan debate revisited. Ethics, 97, 622-637.
•Ford, M. R., & Lowery, C. R. (1986). Gender differences in moral
reasoning: A comparison of the use of justice and care orientations.
Journal of Personality and Social Psychology, 50, 777-783.
*Forsyth, D. R., Nye, J. L., & Kelley, K. (1988). Idealism, relativism, and
the ethic of caring. Journal of Psychology, 122, 243-248.
Freeman, S. J. M., & Giebink, J. W. (1979). Moral judgment as a function
of age, sex, and stimulus. Journal of Psychology, 102, 43-47.
•Friedman, W. J., Robinson, A. B., & Friedman, B. L. (1987). Sex
differences in moral judgments? A test of Gilligan's theory. Psychology of Women Quarterly, 11, 37-46.
*Fuchs, L, Eisenberg, N., Hertz-Lazarowitz, R., & Sharabany, R. (1986).
Kibbutz, Israeli city, and American children's moral reasoning about
prosocial moral conflicts. Merrill-Palmer Quarterly, 32, 37-50.
•Galotti, K. M., Kozberg, S. F., & Farmer, M. C. (1991). Gender and
developmental differences in adolescents' conceptions of moral reason-
ing. Journal of Youth and Adolescence, 20, 13-30.
*Garmon, L. C., Basinger, K. S., Gregg, V. R., & Gibbs, J. C. (1996).
Gender differences in stage and expression of moral judgment. Merrill-
Palmer Quarterly, 42, 418-437.
Garrod, A., & Beal, C. (1993). Voices of care and justice in children's
responses to fable dilemmas. In A. Garrod (Ed.), Approaches to moral
development: New research and emerging themes (pp. 59-71). New
York: Teachers College Press.
*Garrod, A., Beal, C., & Shin, P. (1990). The development of moral
orientation in elementary school children. Sex Roles, 22, 13-27.
Garwood, S. G., Levine, D. W., & Ewing, L. (1980). Effect of protagonist's
sex on assessing gender differences in moral reasoning. Developmental
Psychology, 16, 677-678.
•Gibbs, J. C., Arnold, K. D., & Burkhart, J. E. (1984). Sex differences in
the expression of moral judgment. Child Development, 55, 1040-1043.
Gibbs, J. C., Basinger, K. S., & Fuller, D. (1992). Moral maturity:
Measuring the development of sociomoral reflection. Hillsdale, NJ:
Erlbaum.
Gilligan, C. (1977). In a different voice: Women's conception of the self
and morality. Harvard Education Review, 47, 481-517.
Gilligau, C. (1982). In a different voice: Psychological theory and women's
development, Cambridge, MA: Harvard University Press.
Gilligan, C. (1986a). On In a different voice: An interdisciplinary forum:
Reply. Signs, 11, 324-333.
Gilligan, C. (1986b). Remapping the moral domain: New images of the self
in relationship. In T. C. Heller, M. Sosna, & D. E. Wellbery (Eds.),
Reconstructing individualism: Autonomy, individuality, and the self in
western thought (pp. 237-252). Stanford, CA: Stanford University Press.
Gilligan, C. (1987). Moral orientation and moral development In E. F.
Kittay & D. T. Meyers (Eds.), Women and moral theory (pp. 19-33).
Totowa, NJ: Rowman & Littlefield.
Gilligan, C. (1994). Afterword: The power to name. Feminism and Psy-
chology, 4, 420-424.
•Gilligan, C., & Attanucci, J. (1988). Two moral orientations: Gender
differences and similarities. Merrill-Palmer Quarterly, 34, 223-237.
Gilligan, C., Brown, L. M., & Rogers, A. G. (1990). Psyche embedded: A
place for body, relationships, and culture in personality theory. In A. I.
Rabin, R. A. Zucker, R. A. Emmons, & S. Frank (Eds.), Studying
persons and lives (pp. 86-147). New York: Springer.
Gilligan, C., & Wiggins, G. (1987). The origins of morality in early
childhood relationships. In J, Kagan & S. Lamb (Eds.), The emergence
of morality in young children (pp. 277-305). Chicago: University of
Chicago Press.
•Graham, S. (1995). The effectiveness and psychological impact of a high
school community service course (Doctoral dissertation, University of
California, Los Angeles, 1995). Dissertation Abstracts International, 56,
868A.
Guttman, D. (1985). The parental imperative revisited. In J. Meacham
(Ed.), The family and individual development (pp. 31-60). Basel, Swit-
zerland: Karger.
Haan, N. (1978). Two moralities in action contexts: Relationships to
thought, ego regulation, and development. Journal of Personality and
Social Psychology, 36, 286-305.
Haan, N., Smith, B., & Block, J. (1968). Moral reasoning of young adults.
Journal of Personality and Social Psychology, W, 183-201.
•Hagar, H. O. (1990). Exploring correlates of moral orientation: The roles
of dilemma content, psychological gender, and attachment history (Doc-
toral dissertation, University of Wisconsin—Madison, 1990). Disserta-
tion Abstracts International, 51, 6104B.
Hedges, L. V. (1981). Distribution theory for Glass's estimator of effect
size and related estimators. Journal of Educational Statistics, 7, 119-
137.
Hedges, L. V., & Becker, B. J. (1986). Statistical methods in the meta-
analysis of research on gender differences. In J. S. Hyde & M. C. Linn
724 JAFFEE AND HYDE
(Eds.), The psychology of gender: Advances through meta-analysis (pp.
14-50). Baltimore: Johns Hopkins University Press.
Higgins, A. (1989). The Just Community Educational Program: The de-
velopment of moral role-taking as the expression of justice and care. In
M. Brabeck (Ed.), Who cares? Theory, research, and educational im-
plications of the ethic of care (pp. 197-215). New York: Praeger.
Holstein, C. B. (1969). The relation of children's moral judgment to that of
their parents and to communication patterns in the family. Unpublished
doctoral dissertation, University of California, Berkeley.
Holstein, C. B. (1976). Irreversible, stepwise sequence in the development
of moral judgment: A longitudinal study of males and females. Child
Development, 47, 51-61.
*Hopkins-Baker, C. (1995). An examination of coping strategies and other
factors associated with Gilligan's moral orientations of care and justice
(Doctoral dissertation, University of Mississippi, 1995). Dissertation
Abstracts International, 56, 3476B,
*Hosick, J. (1994). Moral development: The effects of gender and the
transition to parenthood (Doctoral dissertation, Virginia Commonwealth
University, 1994). Dissertation Abstracts International, 55, 3035B.
*Jadack, R. A., Hyde, J. S., Moore, C. F., & Keller, M. L. (1995). Moral
reasoning about sexually transmitted diseases. Child Development, 66,
167-177.
Jaffee, S.. Kling, K. C., Plant, E. A., Sloan, M., & Hyde, J. S. (1999). The
view from down here: Feminist graduate students consider innovative
methodologies. Psychology of Women Quarterly, 23, 425-432.
*Janoff, S. (1991). The impact of a legal education on moral reasoning
(Doctoral dissertation, Temple University, 1991). Dissertation Abstracts
International, 52, 3891 B.
*Janssens, J, M., & Dekovic, M. (1997). Child rearing, prosocial moral
reasoning, and prosocial behaviour. International Journal of Behavioral
Development, 20, 509-527.
* Johnston, K. (1988). Adolescents' solutions to dilemmas in fables: Two
moral orientations. In C. Gilligan, J. V. Ward, J. M. Taylor, & B.
Bardige (Eds.), Mapping the moral domain: A contribution to psycho-
logical theory and education (pp. 49-69). Cambridge, MA: Harvard
University Press.
*Kalkoske, M. S. (1992). The internal representation of attachment and its
relationship to moral orientation in middle childhood (Doctoral disser-
tation, University of Minnesota, 1992). Dissertation Abstracts Interna-
tional, 53, 4393B.
Kitzinger, C., & Gilligan, C. (1994). Listening to a different voice. Fem-
inism and Psychology, 4, 408-419.
Kohlberg, L. (1969). Stage and sequence: The cognitive-developmental
approach to socialization. In D. A. Goslin (Ed.), Handbook of social-
ization theory and research (pp. 348-480). Chicago: Rand McNally.
Kohlberg, L. (1976). Moral stages and moralization: The cognitive-
developmental approach. In T. Lickona (Ed.), Moral development and
behavior: Theory, research, and social issues (pp. 31-53). New York:
Holt, Rinehart & Winston.
Kohlberg, L. (1984). Moral stages and moralization: The cognitive-
developmental approach. In Essays on moral development: Vol. 2. The
psychology of moral development: The nature and validity of moral
stages (pp. 170-205). San Francisco: Harper & Row.
Kohlberg. L., & Kramer, R. (1969). Continuities and discontinuities in
childhood and adult moral development. Human Development, 12, 93-
120.
Kohlberg, L., Leyine, C., & Hewer, A. (1983). Moral stages: A current
formulation and a response to critics. New York: Karger.
*Korschgen, A. J. (1988). Concepts of morality in administration: The role
of gender, rank, and personality type (Doctoral dissertation, University
of Wisconsin—Madison, 1988). Dissertation Abstracts Interna-
tional, 49, 2474A,
*Krebs, D. L., Vermeulen, S. C., Denton, K. L., & Carpendale, J. I. (1994).
Gender and perspective differences in moral judgment and moral orien-
tation. Journal of Moral Education, 23, 17-26.
*Lamm, C. L. (1992). Children as problem solvers: Voices of care and
justice in young children (Doctoral dissertation, Claremont Graduate
University, 1992). Dissertation Abstracts International, 53, 2675A.
*Langdale, C. J. (1986). A re-vision of structural-developmental theory. In
G. L. Sapp (Ed.), Handbook of moral development: Models, processes,
techniques, and research (pp. 15-54). Birmingham, AL: Religious Ed-
ucation Press.
*Lantsberger, T. J. (1993). Factors in the decision to use a justice or a care
orientation in moral reasoning (Doctoral dissertation, University of
Missouri, 1993). Dissertation Abstracts International, 52, 3076A.
Liddell, D. L. (1990). Measure of Moral Orientation: Construction of an
objective instrument measuring care and justice with an investigation of
gender differences (Doctoral dissertation, Auburn University, 1990).
Dissertation Abstracts International, 51, 5626B.
*Liddell, D. L. (1998). Comparison of semistructured interviews with a
quantitative measure of moral orientation. Journal of College Student
Development, 39, 169-178.
*Liddell, D. L., & Davis, T. L. (1996). The Measure of Moral Orientation:
Reliability and validity evidence. Journal of College Student Develop-
ment, 37, 485-493.
Liddell, D. L., Halpin, G., & Halpin, W. G. (1992). The Measure of Moral
Orientation: Measuring the ethics of care and justice. Journal of College
Student Development, 33, 325-330.
*Liddell, D. L., Halpin, G., & Halpin, W. G. (1993). Men, women, and
moral orientation: Accounting for our differences. NASPA Journal, 30,
138-144.
*Logan, C. E. (1990), Message design logic and moral reasoning: A social
cognitive approach to understanding messages (Doctoral dissertation,
University of Washington. 1990). Dissertation Abstracts Interna-
tional, 51, 3559A.
Lollis, S., Ross, H., & Leroux, L. (1996). An observational study of
parents' socialization of moral orientation during sibling conflicts.
Merrill-Palmer Quarterly, 42, 475-494.
Lonky, E., Roodin, P. A., & Rybash, J. M. (1988). Moral judgment and sex
role orientation as a function of self and other presentation mode.
Journal of Youth and Adolescence, 17, 189-195.
*Lourenco, O. M. (1991). Is the care orientation distinct from the justice
orientation? Some empirical data in ten- and eleven-year-old children.
Archives de Psychologic, 59, 17-30.
Lyons, N. P. (1982). Conceptions of self and morality and modes of moral
choice: Identifying justice and care judgments of actual moral dilemmas.
Unpublished doctoral dissertation, Harvard University.
*Lyons, N. P. (1983). Two perspectives: On self, relationships, and
morality. Harvard Education Review, 53, 125-145.
*Maqsud, M. (1998). Moral orientation of Batswana high school pupils in
South Africa. Journal of Social Psychology, 138, 255-257.
Mason, A. (1990). Gilligan's conception of moral maturity. Journal for the
Theory of Social Behavior, 20, 167-179.
Mednick, M. T. (1989). On the politics of psychological constructs: Stop
the bandwagon, I want to get off. American Psychologist, 44, 1118-
1123.
*MiIler, P. A., Eisenberg, N., Fabes, R. A., & Shell, R. (1996). Relations
of moral reasoning and vicarious emotion to young children's prosocial
behavior toward peers and adults. Developmental Psychology, 32, 210-
219.
*Moore, M. L. (1988). Sex roles and moral reasoning: The relationship
between moral orientation and the social domain (Doctoral dissertation,
University of North Carolina at Greensboro, 1988). Dissertation Ab-
stracts International, 50, 551 A.
Morris, S. B., & DeShon, R. P. (1997). Correcting effect sizes computed
from factorial analysis of variance for use in meta-analysis. Psycholog-
ical Methods. 2, 192-199.
*Norris, D. A. (1996). Factors related to the resolution of ethical dilemmas
by school psychologists (Doctoral dissertation, Arizona State University,
1996). Dissertation Abstracts International, 57, 1019A.
Orchowsky, S. J., & Jenkins, L. R. (1979). Sex biases in the measurement
of moral judgment. Psychological Reports, 44, 1040.
GENDER DIFFERENCES IN MORAL ORIENTATION 725
*Orenstein, S. H. (1991). Parental preference in moral dilemma resolution
with girls and boys (Doctoral dissertation, New York University, 1991),
Dissertation Abstracts International, 52, 2079A.
*Perez, M. (1994). The impact of moral development, moral orientation,
gender, and religiosity on reasoning about abortion (Doctoral disserta-
tion, Loyola College, 1994). Dissertation Abstracts International, 56,
1721B.
*Peter, E., & Gallop, R. (1994). The ethic of care: A comparison of nursing
and medical students. IMAGE: Journal of Nursing Scholarship, 26,
47-51.
Poppen, P. (1974). Sex differences in moral judgment. Personality and
Social Psychology Bulletin, 1, 313-315.
*Pratt, M. W., Diessner, R., Hunsberger, B., Pancer, S. M., & Savoy, K.
(1991). Four pathways in the analysis of adult development and aging:
Comparing analyses of reasoning about personal-life dilemmas. Psy-
chology and Aging, 6, 666-675.
*Pratt, M. W., Golding, G., & Hunter, W. (1983). Aging as ripening:
Character and consistency of moral judgment in young, mature, and
older adults. Human Development, 26, 277-288.
*Pratt, M. W., Golding, G., Hunter, W., & Sampson, R. (1988). Sex
differences in adult moral orientations. Journal of Personality, 56,
373-391.
*Pratt, M. W., & Royer, J. M. (1982). When rights and responsibilities
don't mix: Sex and sex-role patterns in moral judgment orientation.
Canadian Journal of Behavioral Science, 14, 190-204.
Puka, B. (1989). The liberation of caring: A different voice for Gilligan's
"different voice." In M. Brabeck (Ed.), Who cares? Theory, research,
and educational implications of the ethic of care (pp. 19-44). New
York: Praeger.
Puka, B. (1991). Interpretive experiments: Probing the care—justice debate
in moral development. Human Development, 34, 61-80.
Rest, J. R. (1979). Development in judging moral issues, Minneapolis:
University of Minnesota Press.
*Roberson, M. K. (1988). The adolescent's decision to engage in sexual
behavior (Doctoral dissertation, Michigan State University, 1988). Dis-
sertation Abstracts International, 49, 4577B,
*Rogers, A. G. (1987). Gender differences in moral reasoning: A validity
study of two moral orientations (Doctoral dissertation, Washington
University, 1987). Dissertation Abstracts International, 49, 1410B.
Rosenthal, R. (1979). The file drawer problem and tolerance for null
results. Psychological Bulletin, 86, 638-641.
*Rothbart, M. K., Hanley, D., & Albert, M. (1986). Gender differences in
moral reasoning. Sex Roles, 15, 645-653.
*Saloner, M. E. (1988). The contribution of a prosocial component in a
social skills training program for preschoolers (Doctoral dissertation,
University of Wisconsin—Madison, 1988). Dissertation Abstracts In-
ternational, 49, 2532A.
*Schiller, D. C. (1991). The effect of additional context on the usage of
moral orientation in the measurement of moral reasoning: An examina-
tion of sex differences (Doctoral dissertation, California School of
Professional Psychology, Los Angeles, 1991). Dissertation Abstracts
International, 52, 2784B.
*Schlesinger, M. (1987). Gender differences in moral stage, moral orien-
tation, and sex role identity of academic administrators: A comparison of
Kohlberg's and Gilligan's theories (Doctoral dissertation, Temple Uni-
versity, 1987). Dissertation Abstracts International, 50, 3192A.
*Se1f, D. J., Skeel, J. D., & Jecker, N. S. (1993). A comparison of the moral
reasoning of physicians and clinical medical ethicists. Academic Medi-
cine, 68, 852-855.
Sichel, B. A. (1985). Women's moral development in search of philosoph-
ical assumptions. Journal of Moral Education, 14, 149-161.
*Sitzer, D. L. (1990). A theoretical and empirical linkage between moral
and personality development: A study correlating narcissism, affective
empathy, and the justice and care moral orientations (Doctoral disserta-
tion, California School of Professional Psychology, Los Angeles, 1990).
Dissertation Abstracts International, 51, 5042B.
*Skoe, E. E., & Diessner, R. (1994). Ethic of care, justice, identity, and
gender: An extension and replication. Merrill-Palmer Quarterly, 40,
272-289.
*Skoe, E. E., & Gooden, A. (1993). Ethic of care and real-life moral
dilemma content in male and female early adolescents. Journal of Early
Adolescence, 13, 154-167.
Skoe, E. E., & Marcia, J. E. (1991). A measure of care-based morality and
its relation to ego identity. Merrill-Palmer Quarterly, 37, 289-304.
*Skoe, E. E., Pratt, M. W., Matthew, M., & Curror, S. E. (1996). The ethic
of care: Stability over time, gender differences, and correlates in mid- to
late adulthood. Psychology and Aging, 11, 280-292.
Smetana, J. G. (1984). Morality and gender: A commentary on Pratt,
Golding, and Hunter. Merrill-Palmer Quarterly, 30, 341-348.
*Sochting, I., Skoe, E. E., & Marcia, J. E. (1994), Care-oriented moral
reasoning and prosocial behavior: A question of gender or sex role
orientation. Sex Roles, 31, 131-147.
*Stander, V., & Jensen, L. (1993). The relationship of value orientation to
moral cognition: Gender and cultural differences in the United States
and China explored. Journal of Cross-Cultural Psychology, 24, 42-52.
*Stiller, N. J., & Forrest, L. (1990). An extension of Gilligan's and Lyon's
investigation of morality: Gender differences in college students. Jour-
nal of College Student Development, 31, 54-63.
*Stookey, K. (1994). Gender, gender-roles, and moral judgments: An
investigation of the theories of Kohlberg and Gilligan (Doctoral disser-
tation, University of Alabama, 1994). Dissertation Abstracts Interna-
tional, 56, 2178A.
Thoma, S. J. (1986). Estimating gender differences in the comprehension
and preference of moral issues. Developmental Review, 6, 165-180.
Tietjen, A. M. (1986). Prosocial reasoning among children and adults in
a Papua New Guinea society. Developmental Psychology, 22, 861-868.
Time 25. (1996, June 17). Time, 147, 52-79.
Tronto, J. C. (1987). Beyond gender difference to a theory of care. Signs:
Journal of Women in Culture and Society, 12, 644-663.
Turiel, E. (1976). A comparative analysis of moral knowledge and moral
judgment in males and females. Journal of Personality, 44, 195-208.
Van Gelder, L. (1984, January). Carol Gilligan—Leader for a different
kind of future. Ms., 12, 37-40.
*Vera, E. M., & Levin, I. P. (1989, May). Sex differences in moral decision
making. Paper presented at the annual meeting of the Midwestern
Psychological Association, Chicago, IL.
Walker, L. J. (1984). Sex differences in the development of moral reason-
ing: A critical review. Child Development, 55, 677-691.
*Walker, L. J. (1986a). Experimental and cognitive sources of moral
development in adulthood. Human Development, 29, 113-124.
Walker, L. J. (1986b). Sex differences in the development of moral
reasoning: A rejoinder to Baumrind. Child Development, 57, 522-526.
Walker, L. J. (1989). A longitudinal study of moral reasoning. Child
Development, 60, 157-166.
Walker, L. J. (1991). Sex differences in moral reasoning. In W. M.
Kurtines & J. L. Gewirtz (Eds.), Handbook of moral behavior and
development: Vol. 2. Research (pp. 333-364). Hillsdale, NJ: Erlbaum.
Walker, L. J. (1995). Sexism in Kohlberg's moral psychology? In W. M,
Kurtines & J. L. Gewirtz (Eds.), Moral development; An introduction
(pp. 83-107). Boston: Allyn & Bacon.
Walker, L. J. (1997). Is morality gendered in early parent-child relation-
ships? A commentary on the Lollis, Ross, and Leroux study. Merrill-
Palmer Quarterly, 43, 148-159.
*Walker, L. J., de Vries, B., & Trevethan, S. D. (1987). Moral stages and
moral orientations in real-life and hypothetical dilemmas. Child Devel-
opment, 58, 842-858,
*Walker, L. J., & Moran, T. J. (1991). Moral reasoning in Communist
Chinese society. Journal of Moral Education, 20, 139-155.
*Wark, G. R. (1993). The effects of sex, sex-role, and type of moral
dilemma on moral maturity and moral orientation (Masters thesis, Simon
Fraser University, 1993). Masters Abstracts International, 33, 652.
726 JAFFEE AND HYDE
*Wark, O. R., & Krebs, D. L. (1596). Gender and dilemma differences in
real-life moral judgment. Developmental Psychology, 32, 220-230.
»Wark, G. R., & Krebs, D. L. (1997). Sources of variation in real-life
moral judgment: Toward a new model of morality. Manuscript submitted
for publication, Simon Fraser University, Burnaby, British Columbia, Canada.
*Webster, P. A. (1996). Moral orientation, ego maturity, and religious commitment: An exploratory study from a developmental perspective
(Doctoral dissertation, Andrews University, 1996). Dissertation Ab-
stracts International, 57, 1982A.
•White, 1. (1994). Individual characteristics and social knowledge in
ethical reasoning. Psychological Reports, 75, 627-649.
•White, !., & Manolis, C. (1997). Individual differences in ethical
reasoning among law students. Social Behavior and Personality, 25, 19-48.
•Wilson, F. L. (1995). The effects of age, gender, and ethnic/cultural
background on moral reasoning. Journal of Social Behavior and Per- sonality, 10, 67-78.
*Yacker, N., & Weinberg, S. L. (1990). Care and justice moral orientation:
A scale for its assessment. Journal of Personality Assessment, 55, 18-27.
Received August 25, 1997
Revision received March 5, 2000
Accepted March 5, 2000
Call for Nominations
The Publications and Communications Board has opened nominations for the
editorships of Journal of Applied Psychology, Journal of Consulting and Clinical
Psychology, Journal of Educational Psychology, Psychological Bulletin, and Journal
of Personality and Social Psychology: Interpersonal Relations and Group Processes
for the years 2003-2008. Kevin R. Murphy, PhD, Philip C. Kendall, PhD, Michael
Pressley, PhD, Nancy Eisenberg, PhD, and Chester A. Insko, PhD, respectively, are the
incumbent editors.
Candidates should be members of APA and should be available to start receiving
manuscripts in early 2002 to prepare for issues published in 2003. Please note that the
P&C Board encourages participation by members of underrepresented groups in the
publication process and would particularly welcome such nominees. Self-nominations
are also encouraged.
To nominate candidates, prepare a statement of one page or less in support of
each candidate and send to
• Margaret B. Spencer, PhD, for the Journal of Applied Psychology
• Donna M. Gelfand, PhD, and Lucia Albino Gilbert, PhD, for the Journal of Con-
sulting and Clinical Psychology
• Lauren B. Resnick, PhD, for the Journal of Educational Psychology
• Janet Shibley Hyde, PhD, and Randi C. Martin, PhD, for Psychological Bulletin
• Sara B. Kiesler, PhD, for JPSP: Interpersonal Relations and Group Processes
Address all nominations to the appropriate search committee at the following
address:
[Name of journal] Search Committee
c/o Karen Sellman, P&C Board Search Liaison
Room 2004
American Psychological Association
750 First Street, NE
Washington, DC 20002-4242
The first review of nominations will begin December 11, 2000.