Extra credit
Studies on metamemory (see, e.g., Sampaio & Brewer, 2009), metacomprehension (e.g., Maki, Jonas, & Kallod, 1994; Mengelkamp & Bannert, 2009), and test taking (e.g., Kleitman & Stankov, 2007) use confidence judgments and calculate the accuracy of these judgments. Confidence judgments are made after an item in a test has been an- swered, in contrast with judgments about the ease of learn- ing or judgments of knowing that are made before the items are solved (Nelson & Narens, 1990). Accuracy of judg- ments is interpreted as an indicator for metacognitive mon- itoring, which, besides metacognitive control, is a crucial process in research on metamemory (Nelson & Narens, 1990, 1992). In the present study, we examined whether the accuracy of confidence judgments that is operationalized by different measures is stable during learning and whether it generalizes over two different tests. Moreover, the predic- tive validity of different accuracy measures for learning outcome was investigated. Although there are studies avail- able that have examined stability or generality using post- test confidence judgments (e.g., Jonsson & Allwood, 2003; Thompson & Mason, 1996), they mostly report only one or two measures of accuracy and do not compare the results of different measures. Since the accuracy of confidence judg- ments is widely used in current research on metamemory,
metacomprehension, and test taking, the stability, gener- ality, and validity of different measures of accuracy are important factors when findings are compared, and con- sidering these factors may encourage further research. We will first describe the measures of accuracy briefly. Then, theory and studies concerning the stability, generality, and validity of such measures will be summarized.
Measures of Accuracy One fundamental distinction of the measures is drawn
between relative and absolute accuracy (Maki, 1998; Nel- son, 1996; Nelson & Dunlosky, 1991). Measures of ab- solute accuracy are based on the difference between the judgment about an item and the actual performance for the same item. The most common measure of the abso- lute accuracy of judgments is bias. Bias is calculated as the signed difference between the average of judgments for a test and the average performance on this test (Yates, 1990), and therefore it allows the distinction between persons who are overconfident versus persons who are underconfident. A further measure of absolute accuracy is calculated by ignoring the direction of the difference between judgment and performance—for example, by av- eraging the squares of the differences (Schraw, 2009). In
441 © 2010 The Psychonomic Society, Inc.
Accuracy of confidence judgments: Stability and generality in the learning process
and predictive validity for learning outcome
CHRISTOPH MENGELKAMP University of Koblenz-Landau, Landau, Germany
AND
MARIA BANNERT Chemnitz University of Technology, Chemnitz, Germany
One aspect of metacognition is the monitoring of memory or comprehension measured with retrospective confidence judgments after test taking. The research questions of the present study were whether different mea- sures for the accuracy of such confidence judgments are stable over learning time, whether they generalize over two different tests, and whether they predict the learning outcome. In order to answer these questions, a study was conducted in which university students (N 113) learned about the basic concepts of operant condition- ing for 30 min. Knowledge tests with confidence judgments presented after each item were obtained before learning, after 10 min, and at the end of the learning session. Bias and absolute bias were calculated as absolute measures of accuracy, and gamma, Pearson’s r, and da were calculated as relative measures of accuracy. The results showed that the absolute measures were stable, but that the relative measures were not. Furthermore, absolute bias, gamma, and Pearson’s r obtained 10 min after the beginning of the learning process predicted the learning outcome. The results are discussed with regard to research on different measures of accuracy for confidence judgments.
Memory & Cognition 2010, 38 (4), 441-451 doi:10.3758/MC.38.4.441
C. Mengelkamp, [email protected]
442 MENGELKAMP AND BANNERT
basis of their theories or beliefs—for example, beliefs about self-efficacy or about their own ability (Ehrlinger & Dunning, 2003) or expertise (Glenberg & Epstein, 1987). Self-efficacy is at least moderately stable over a period of 1 week (Lane & Lane, 2001), and the ability of reading comprehension is at least moderately stable over 5 months, as well (Byrne, 1986). Moreover, self-efficacy predicts academic learning outcome (Lane & Lane, 2001; Lane, Lane, & Kyprianou, 2004). Therefore, it can be ex- pected that some of the stable between-person variance of self-efficacy will be found in the judgments, and some of the stability of ability in reading comprehension will be found in the learning outcome. Furthermore, self-efficacy may influence all judgments to almost the same extent, and ability may influence performance on test items to al- most the same extent. Absolute measures of accuracy are not independent of the magnitude of the underlying judg- ments and performances (Nelson, 1984); thus, the stability of judgments and performances is inherited by the mea- sures of absolute accuracy. In contrast, relative accuracies should not be affected by factors that influence the judg- ments and/or performances evenly (cf. Nelson, 1984).
Besides theory-based cues, people use experience-based cues to generate judgments. These cues arise from the in- formation that they process when learning or answering test items (Koriat, 2007; Koriat et al., 2008). In the case of confidence judgments, such cues are, for example, the time needed to retrieve information (Kelley & Lindsay, 1993; Nelson & Narens, 1990), the completeness of the recall (Brewer, Sampaio, & Barlow, 2005), and the kind of questions asked (Maki, 1995, Experiment 1). There- fore, confidence judgments vary depending on factors that are not stable characteristics of a person but that are characteristics of the learning and testing situation. Thus, judgments vary within a person across tests and over time. Furthermore, if experience-based cues are not used in the same manner by all persons, this results in the instability of the accuracy of judgments in a sample of persons that is reflected in both absolute and relative accuracy. To sum up these deliberations, we may expect at least some stability in judgments and some stability in the absolute accuracy of judgments caused by theory-based cues. And, indeed, some evidence for such stability has been found and will be reported next.
Leonesio and Nelson (1990) found only small gamma correlations between judgments of learning, judgments of knowing, and feeling-of-knowing judgments. They sug- gested that there were different bases—experience-based cues, in the terminology of Koriat (2007)—for the judg- ments made at different points in time during the learning process. Another study (Kelemen, Frost, & Weaver, 2000) used English word pairs, English–Swahili word pairs, gen- eral knowledge questions, and narrative tests as materials. In addition to different domains, they used different types of judgments: ease of learning, judgments of learning, feeling of knowing, and text comprehension. They found distinctly higher Spearman correlations between the same types of judgments in comparison with the low stabilities found by Leonesio and Nelson between different kinds of
the following, the term absolute bias is used for measures that ignore the direction of the difference.
Measures of relative accuracy describe how accurately a learner discriminates between the test performance on correct and incorrect items. Relative measures of accuracy correlate the judgments with the performances within each person in the sample. If one accepts that judgments are on a metric level of measurement, Pearson’s r can be calculated as a correlation coefficient. Nelson (1984, 1996) advocated the use of the nonparametric correlation gamma instead of Pearson’s r because gamma does not require a metric but requires an ordinal scale level. The calculation of gamma is based on dyads of items; that is, to reach high accuracy, the judgments about two items have to be in the same order as the performances on these two items (see Gonzalez & Nelson, 1996). Furthermore, gamma is unaffected by the overall performance on the items and the magnitude of the metacognitive judgments (Nelson, 1984). A recent Monte Carlo study (Benjamin & Diaz, 2008) showed that da from signal detection theory is an equivalent or even a better measure of relative accuracy than is gamma.
The decision for one measure of accuracy should be guided by the nature of the data, especially of the data level. Measures of absolute accuracy require the calcula- tion of differences that are meaningful only in the case of a metric data level. The same is true for the intra- individual correlation Pearson’s r as a measure of rela- tive accuracy. If the data are on an ordinal level, gamma is an appropriate measure of relative accuracy (Nelson, 1984, 1996). Of course, gamma can be used for met- ric data as well, since it is a more conservative measure than r. Moreover, one could use Kim’s dxy when ties on the judgments are not forced by the procedure of data collection (Gonzalez & Nelson, 1996). Nevertheless, all of the measures have been used in past research with con- fidence judgments—for example, bias (see, e.g., Maki, 1998) and absolute bias (e.g., Nietfeld & Schraw, 2002) as measures of absolute accuracy, and Pearson’s r (e.g., Glenberg, Sanocki, Epstein, & Morris, 1987), gamma (e.g., Dunlosky, Rawson, & Middleton, 2005), and d (e.g., Tobias & Everson, 2000) as measures of relative accuracy. Measures of absolute accuracy in particular are favored by researchers working in classroom settings (see, e.g., Hacker, Bol, & Bahbahani, 2008; Nietfeld, Cao, & Osborne, 2005). Our aim was not to limit our results to one measure within each class of measures, but to show that the results hold true, irrespective of the measure used within each class. Therefore, we used a finely graded percentage scale to meet the assumption of metric data of the judgments.
Stability and Generality What determines the stability or instability of the ac-
curacy of confidence judgments? In regard to this ques- tion, Koriat’s (Koriat, 2007; Koriat, Nussinson, Bless, & Shaked, 2008) work is essential. He proposed that judg- ments are generated on the basis of cues, and he distin- guished theory- based cues from experience-based cues. Using theory-based cues, people judge items on the
ACCURACY OF CONFIDENCE JUDGMENTS: STABILITY, GENERALITY, AND VALIDITY 443
cause the control of the learning process will fail if the ac- curacy is too low (Dunlosky, Hertzog, Kennedy, & Thiede, 2005). According to the transfer-appropriate monitoring hypothesis, the accuracy of judgments is a predictor of the learning outcome if the items used for the judgments are the same as those used to measure the learning outcome (see Dunlosky, Rawson, & McDonald, 2002, p. 86ff.). Ad- ditionally, there is evidence that the accuracy of pretest judgments even predicts the learning outcome for new items if the items are constructed to measure the same content (Thiede, Anderson, & Therriault, 2003).
Besides the cited study by Thiede et al. (2003), there is also some evidence for the validity of the accuracy of posttest confidence judgments. Accuracy measured by gamma and performance on a comprehension test cor- related significantly (see, e.g., Maki, 1998, p. 240; Maki, et al., 1994). However, using bias as the absolute measure of accuracy, no correlation was found between bias and performance on the test (Maki, 1998). Nevertheless, the results were correlational in nature; thus, other variables— for example, general intelligence, reading ability, or prior knowledge—may explain the correlation. General intel- ligence was controlled in a classroom study (Nietfeld et al. 2005) that used absolute bias as the measure of ac- curacy. Thus, absolute bias was predicted by both general intelligence and the performance on the test. In another classroom study (Nietfeld, Cao, & Osborne, 2006), the students’ prior knowledge was controlled. Path analyses showed that the effect of prior knowledge on the final test score was partially mediated by absolute bias. To sum up, there is evidence for the validity of gamma and absolute bias, and there is no evidence for bias being a valid predic- tor for test performance.
Research Questions and Hypotheses Our first research question was whether the accuracy of
confidence judgments is stable over time and whether it generalizes over two different tests within the same learn- ing domain. We argued that the beliefs and theories of a person may cause some stability in judgments and the ab- solute accuracy of those judgments. On the basis of these thoughts and the evidence from the outlined studies, we hypothesized (1) that measures of absolute accuracy are stable over time, but measures of relative accuracy are not; and (2) with regard to generality, we anticipated no gen- erality of relative accuracy over two different tests within the same knowledge domain. Since it is assumed that the accuracy of monitoring is important for effective learning (Dunlosky, Hertzog, et al., 2005), valid measures of the accuracy of monitoring during learning should predict the learning outcome; (3) hence, we expected the accuracy of confidence judgments during the learning process to be prognostic for the learning outcome.
METHOD
Participants A total of 113 students from a German university participated in
the study; 86 (76%) were female. The mean age of the participants
judgments. Kelemen et al. reported not only correlations between the judgments, but also correlations between the accuracies. Using bias, they found some stability and gen- erality, but hardly any stability or generality was found for gamma. In the two studies cited so far (Kelemen et al., 2000; Leonesio & Nelson, 1990), judgments were made before the recognition test was taken, and stability was investigated over a period of weeks. In addition, Kelemen et al. used different domains and materials to investigate the generality of judgments. In contrast, our present re- search examined confidence judgments taken after four tests, and the judgments were made within the same do- main over a short period of approximately 1 h.
Results from studies on test taking (Jonsson & Allwood, 2003; Schraw, Dunkle, Bendixen, & DeBacker Roedel, 1995; Schraw & Nietfeld, 1998) revealed that bias gener- alized over different tests to a considerable degree, since almost all correlations between bias calculated from dif- ferent tests were significant and reached at least medium effect size (cf. Cohen, 1988). Moreover, bias showed at least some stability over a period of 2 weeks (Jonsson & Allwood, 2003). For absolute bias, the generality is lower than that for bias, but most correlations between different tests are still significant (Schraw & Nietfeld, 1998). Using relative accuracy measured by gamma, no stability was found for three different tasks administered at two times within a period of 2 weeks (Thompson & Mason, 1996, Experiments 1 and 2). Moreover, the split-half reliabilities of the tests were not significant either, indicating a lack of generality for gamma, even within the same test. This result is in accordance with another study in which no evi- dence was found for generality using gamma computed from a comprehension test, an analogy test, and a test of opposites (Pressley & Ghatala, 1988).
To sum up, there is some evidence for the stability and generality of accuracy of confidence judgments if absolute measures of accuracy are used, but there is no evidence for relative accuracy measured by gamma. This conclu- sion is somewhat limited, due to the fact that no study has compared more than two measures of accuracy, and no study compared an absolute with a relative measure. Fur- thermore, the stability of confidence judgments has only been investigated with regard to periods of weeks between the tests. Thus, the question remains whether there is any stability in shorter periods of time in the range of hours or minutes. Relative accuracy might be more stable over such short periods of time with less intervening study.
Validity Why is the accuracy of monitoring important for learn-
ing processes and learning outcome? The judgments themselves are crucial for the control of learning, espe- cially for the allocation of study time (Metcalfe, 2002; Son & Metcalfe, 2000; Thiede & Dunlosky, 1999). That is, study time is allocated to items that are judged as “least understood” (Dunlosky & Hertzog, 1998) or to items that are in a region of proximal learning (Metcalfe, 2002). But this allocation of study time is effective with regard to learning outcome only if the judgment is accurate, be-
444 MENGELKAMP AND BANNERT
alternative multiple choice test, ranging up to 100% as the highest possible percentage. In the case of the transfer test with its open- answer format, the scale ranged from 0% up to 100%.
Design and Procedure First, the participants practiced navigating in the hypertext using
another chapter about educational psychology. After that, the PT was filled in, and the participants began to study the chapter about operant conditioning. The participants learned for 10 min; then, they took the IT before learning again for another 20 min. After that, the two final tests (FCT and FTT) took place, and the participants were informed about the aims of the study and were offered feedback about their performance.
Remarks on Statistical Analyses All of the statistics were calculated using R. Two measures of
absolute accuracy and three measures of relative accuracy were cal- culated. Bias and absolute bias were calculated for each person using the two following formulas:
bias 1
1n c pi i
i
n
and
abs.bias 1 n
ci pi i
n 2
1 ,
where n number of items; ci confidence judgment about the item i; and pi performance on the item i.
In the case of the FTT with an open-answer format, no measures of absolute accuracy were calculated. Although the FTT yielded an objective and reliable score, those data were obtained on an interval level of measurement instead of the ratio level of measurement ob- tained by the multiple-choice tests. Therefore, the value of 0 is not fixed on the performance scale, and calculating differences between judgment and performance makes no sense.
As a first measure of relative accuracy, the nonparametric cor- relation gamma between confidence judgments and performance in the items was calculated within each person (see Gonzalez & Nelson, 1996). Second, the within-person correlation Pearson’s r was calculated as a relative measure. Third, da was calculated as proposed by Benjamin and Diaz (2008). Using their procedure, the confidence judgments and items were recoded into four bins. Each bin contained one quarter of the items, ordered from the items judged lowest to the items judged highest. The first row contained the number of correct answers, and the second row contained the number of incorrect items. In cases in which a person had values of 0 in one cell of the 2 4 data array—for example, no incor- rect answer in the first bin—no value for da could be calculated. Furthermore, if a person had equal values for all judgments or had solved all items correctly, no measures of relative accuracy could be calculated. Thus, these participants were excluded from some analyses.
RESULTS
Instruments The statistics for the scales of the instruments used are
listed in Table 1. The alphas for the knowledge tests were far from optimal but were sufficiently reliable for research purposes. All of the alphas for the confidence judgments were good or very good (D. H. Rost, 2005, p. 132). The FTT, with its open-answer format, yielded intraclass coef- ficients adjusted for systematic variance between raters (ICCa) between .86 and .97, with a mean ICCa of .93 for all 11 items. Thus, the rater reliability was rather good, and the internal consistency of .79 was also good.
was 23.6 years old (SD 5.2). A total of 65 (58%) participants were studying psychology; 39 (35%) were studying education, and 9 (8%) were studying other social sciences. With a mean of 2.9 semesters— 1.5 years of study time—the participants were rather at the begin- ning of their studies. The students received either €15 for participat- ing in the study or a certificate needed for their studies.
Learning Material By means of a hypermedia environment that was written in Ger-
man, the students learned the principles and the application of oper- ant conditioning. The hypermedia environment has been used with comparable samples and learning settings in previous studies (Ban- nert, 2006), and its intermediate difficulty has been proven. The hy- pertext consisted of 44 nodes with about 12,500 words, 19 pictures/ diagrams, and 240 links in total. The part that was relevant for the learning task involved 9 nodes including 2,300 words, 3 pictures, and 60 links. Navigation was possible by using the hierarchical navi- gation menu, the forward and backward buttons on each node, and the hotwords placed directly in the text.
Instruments Knowledge tests. Three types of learning outcomes are distin-
guished using Bloom’s (Bloom, Engelhart, Furst, Hill, & Krathwohl, 1973) taxonomy of learning objectives as a framework to generate items about operant conditioning:
1. Knowledge about facts: To answer these items, it was neces- sary to remember the text base (see Graesser, Millis, & Zwaan, 1997, for levels of comprehension). The participants were usually able to answer the questions without knowing the exact wording that had been used in the learning material, and it was not neces- sary to infer anything that was not written in the learning material. The students had to select one out of four responses to solve the items.
2. Comprehension: To answer these questions, it was necessary to know at least two facts and to draw an inference from them, because the right answers to these questions could not be found directly in the text. According to Graesser et al. (1997), these questions mostly refer to situational models; that is, a situational model was needed or had to be constructed to solve the tasks. Again, an answer format with three distractors was chosen.
3. Transfer: Questions concerning transfer required the appli- cation of knowledge to a new situation that was not mentioned in the learning material. Short texts were presented that described a situation in which operant conditioning occurred. Here, the answer format was open: The participants had to write one or two words to indicate who was conditioned by whom, what the stimulus and what the reaction were, and which principle of operant conditioning was used. Answers were rated by two raters, independently.
No items were constructed for the three highest objectives in Bloom et al.’s (1973) taxonomy because the learning goals did not include analysis, synthesis, or evaluation objectives. Three tests that were used at different times during the study were composed from the items on knowledge about facts and comprehension. The pre- test (PT) consisted of 10 items. The intermediate test (IT) consisted of 20 new items. The final comprehension test (FCT) included the 10 items from the PT and 10 new items that had been used nei- ther in the PT nor in the IT. Each item of the FCT corresponded to an item of the IT; that is, they required the same information that was given at the same node of the hypermedia. Additionally, a final transfer test (FTT) with 11 transfer items followed. Examples for the items from the comprehension tests and the FTT can be found in the Appendix.
Retrospective confidence judgments. For each item on the tests, the participants were asked retrospectively to judge their con- fidence that their answers were correct. Following studies on test taking (e.g., Kleitman & Stankov, 2001; Pallier et al., 2002; Stankov, 1998), percentage scales were used to assess the confidence judg- ments. That is, the lowest possible percentage was 25% for the four-
ACCURACY OF CONFIDENCE JUDGMENTS: STABILITY, GENERALITY, AND VALIDITY 445
dence for the upper judgments were found. Over- or un- derconfidence was tested using two-tailed paired-sample t tests to compare the actual performance with the judged performance. Neither over- nor underconf idence was found in the PT [bias .02, t(112) 1.14, n.s.], or in the IT [bias .02, t(112) 1.59, n.s.]. But, there was a small, statistically significant amount of over- confidence in the FCT [bias .03, t(112) 2.58, p .05, 2 .06]. Thus, the calibration curves and the t tests show that the participants were able to judge their perfor- mance quite accurately.
Furthermore, the shape of the curves remained the same over all three tests, but the distribution of judgments changed over time, as can be seen from the frequencies of the judgment categories.1 That is, before learning, more low judgments were made, and after learning, more high judgments were given. According to a Monte Carlo study by Weaver and Kelemen (1997), a shift to the end of the distribution may alter the measure of relative accuracy, even when there is no change in the genuine metacogni- tive accuracy of the judgments. With this potential artifact in mind, we analyzed the measures of relative accuracy. All values for the three relative measures (gamma, Pear- son’s r, and da) in all tests reached values significantly dif- ferent from 0, as can be seen in Table 3. Thus, for all tests, the judgments were considerably accurate, indicating that the participants were able to judge their knowledge at least better than chance. However, the gain in accuracy from the PT to the FCT and FTT may be at least partly due to the artifact described by Weaver and Kelemen. Since the increase in accuracy is of no interest for the calcula- tion of correlations, we will not consider this problem any further.
As was described earlier, the items in the IT and the FCT were not the same, but one item from each test refers to the same knowledge. We correlated all items between all persons to each other. According to a two-tailed Welch test for independent samples, the corresponding items yielded a greater mean gamma correlation [M( ) .44, SD( ) .25] than the noncorresponding items [M( ) .36, SD( ) .23] [t(28) 2.77, p .01]. Thus, even though the items in the two tests were not the same, we have evidence that the items corresponded to each other.
Stability The investigation of strict stability is not appropriate
in learning processes because it is inherent to learning
Descriptive statistics for the measures of accuracy can be found in Table 2. For bias and absolute bias, a value of 0 indicates perfect accuracy. For bias, the value of one indicates a maximum of overconfidence, and a value of
1 indicates a maximum of underconfidence. For the relative- measures gamma and Pearson’s r, perfect accu- racy is indicated by a value of 1, and 0 indicates no accu- racy, whereas negative values indicate that a person judges easier items as difficult, and vice versa. The value of da is bounded at 0 and , with zero indicating no accuracy.
Shapiro–Wilk normality tests indicated a significant deviation from the normal distribution for the values of some scales and measures of accuracy. But an inspection of the histograms showed that the distributions violate the assumption of normality only slightly. Nevertheless, all further analyses were calculated using nonparametric correlations.
Initial Analyses Before we discuss the testing of our hypotheses, some
initial analyses concerning the performances, the judg- ments, and the resulting accuracy measures are reported. To inspect the data, we plotted calibration curves that can be seen in Figure 1 (see Stankov, 1998, for calibra- tion curves). The deviance of the calibration curve from the dotted line indicates over- and underconfidence. In all three tests, a slight amount of overconfidence for the lower judgments and a marginal amount of underconfi-
Table 1 Descriptive Statistics of the Instruments (N 113)
Number of Items
Ma
SD
Knowledge Pretest (PT) 10 .58 .21 .55 Intermediate test (IT) 20 .65 .15 .62 Final comprehension test (FCT) 20 .79 .16 .73 Final comprehension test (repeated)b 10 .80 .18 .56 Final transfer test (FTT)c 11 .75 .09 .79
Confidence Judgmentsd
PT 10 .56 .19 .89 IT 20 .62 .16 .90 FCT 10 .82 .13 .89 FTTe 10 .61 .22 .89 aProportion of items solved correctly or, for confidence judgments, esti- mated by participants. bContains the same 10 items as the PT. cPro- portion of points achieved (a maximum of 10 points per item was as- signed). dScale from .25 up to 1.00. eScale from 0 up to 1.00.
Table 2 Descriptive Statistics of the Measures of Accuracy
Measure of
Pretest
Intermediate
Test
Final Comprehension
Test
Final
Transfer Test
Accuracy N M SD N M SD N M SD N M SD
Bias 113 .02 .18 113 .02 .15 113 .03 .14 – – – Absolute bias 113 .47 .11 113 .45 .08 113 .37 .12 – – – Gamma 101 .37 .51 112 .57 .27 105 .62 .38 110 .30 .33 Pearson’s r 101 .25 .34 112 .37 .20 105 .39 .23 110 .30 .31 da 72 .25 .55 62 .54 .38 52 .57 .38 – – –
446 MENGELKAMP AND BANNERT
sumption of strict stability is true for the accuracy of the judgments as well as, since all measures of accuracy were based on different tests with different difficulties (see Table 1), and since test difficulty affects the value of relative accuracy (Pressley & Ghatala, 1988). Further- more, the length of the test also influences the relative accuracy (Weaver, 1990, Experiment 2). Therefore, no strict stability can be expected in the accuracy measures, but correlations between measures of accuracy at differ-
that knowledge increases and is not on a stable level. This is also the case in the present study. The partici- pants’ knowledge increased during the learning process, as was demonstrated by a comparison of the PT (M .58, SD .21) with the repeated version of the FCT (M .80, SD .18) (Wilcoxon z 8.01, p .001). But, the investigation of monotonic stability is reason- able; that is, the rank order of learners according to their knowledge may be stable. The argument against the as-
(201)
(268)
(60)
(338)
(1,393)
(648)
(458)
(84)
(372)
(698)
40 60 80 100
40
60
80
100
Pretest
Judgments (%)
P e
rc e
n ta
g e
C o
rr e
ct
40 60 80 100
40
60
80
100
Final Comprehension Test
Judgments (%)
P e
rc e
n ta
g e
C o
rr e
ct
40 60 80 100
40
60
80
100
Intermediate Test
Judgments (%) P
e rc
e n
ta g
e C
o rr
e ct
(413)
(258) (25)
(189)
(245)
Figure 1. Calibration curves with the dotted line indicating perfectly accurate judgments. Frequencies are printed in parentheses.
ACCURACY OF CONFIDENCE JUDGMENTS: STABILITY, GENERALITY, AND VALIDITY 447
vs. M .39) (W 865.5, p .001). Accordingly, the reduced sample differed from the remaining participants in all four measures, and the reduced sample is not rep- resentative of the whole sample. Furthermore, none of the correlations between the values for da reached sig- nificance. This may also be due to the nonrepresentative- ness of the sample. Additionally, the power of the tests is reduced, since the Ns are clearly smaller than these for gamma and Pearson’s r. To support the explanation that the different results for da were caused by the reduced sample, the three measures of relative accuracy were correlated to each other for the PT. The PT was chosen because the reduction of the sample was less than that in the IT and the FT; that is, a da value could be calculated for 72 participants. Results revealed Spearman corre- lations of .95 for all combinations of the three relative measures. Therefore, one may expect very similar results concerning stability and generality for all three relative measures of accuracy if da can be calculated for the com- plete sample.
Generality Before the hypothesis concerning the generality of ac-
curacies is addressed, the generality of the underlying two final tests and the confidence judgments shall be consid- ered. As can be seen in the lower part of Table 4, the corre- lation between the FCT and the FTT was significant. This indicates that the two tests at least partly measured the same knowledge. Descriptively speaking, the correlation between the judgments was even higher than the correla- tion between the tests themselves.
In order to test our second hypothesis concerning gen- erality, the relative accuracies in the two final tests were
ent points in time can be calculated to test whether there is any monotonic stability.
Before the stability of the accuracies is reported, the stability of the underlying judgments and performances will be described. In Table 4, the correlations between the performances across all tests are listed in the second col- umn. All correlations between tests at different points in time were significant, indicating that despite the absolute gain in knowledge, the participants’ rank order of knowl- edge was at least partly stable over time. Furthermore, Table 4 contains the correlations between judgments across all tests in the third column. Descriptively speak- ing, the correlations were consistently higher than those for the performances, indicating an even higher stability in the confidence judgments about knowledge than in the knowledge itself.
To test our first hypothesis concerning the stability of accuracies, gamma correlations were calculated between the measures of accuracy derived before, during, and after learning. Correlations between biases and absolute biases are listed in Table 4, in columns four and five. All cor- relations between biases were significant, and all correla- tions between absolute biases were significant as well, indicating some stability over time. For relative accuracy, the gamma correlations are presented in the upper part of Table 5. Most of these correlations were not signifi- cantly greater than 0, indicating no stability at all. Two correlations revealed significantly negative values that do not support any stability, either. There was only one sig- nificant positive correlation between the IT and the FCT for gamma, indicating some stability over time. To sum up, relative accuracy was not stable over time at all, but absolute accuracy was.
Comparing the correlations for gamma and Pearson’s r reveals few differences between the two measures of relative accuracy. But for da as a measure of relative ac- curacy, the picture changes somewhat. Descriptively, the correlation between da in the PT and the IT was not 0 as it was for gamma and r, but was negative. This result may be due to the reduced sample. A comparison between the 48 participants for which da could be calculated and the remaining 65 participants showed that the 48 partici- pants had less knowledge (M .22 vs. M .36) in the PT (Wilcoxon W 1,100.5, p .001). Furthermore, they had less knowledge (M .24 vs. M .40) in the IT (W 828.0, p .001); their judgments in the PT were lower (M .20 vs. M .36) (W 898.0, p .001), and their judgments in the IT were also lower (M .23
Table 3 Independent One-Sample t Tests for Gamma, Pearson’s r, and da
Gamma Pearson’s r da t df r t df da t df
PT .37 7.25*** 100 .25 7.34*** 100 .25 3.79*** 71 IT .57 22.64*** 111 .37 19.43*** 111 .54 11.25*** 61 FCT .62 16.91*** 104 .39 17.35*** 104 .57 11.35*** 54 FTT .30 9.56*** 109 .30 10.09*** 109 − − −
Note—PT, pretest; IT, intermediate test; FCT, final comprehension test; FTT, final transfer test. ***p .001.
Table 4 Gamma Correlations Across Tests for Performance, Judgments,
and Absolute Measures of Accuracy (N 113)
Tests Performance Judgment Bias Absolute Bias
Stability
PT–IT .47*** .64*** .21** .23***
PT–FCT .53*** .53*** .20** .23***
PT–FTT .34*** .36*** – – IT–FCT .50*** .61*** .26*** .34***
IT–FTT .41*** .47*** – –
Generality
FCT–FTT .47*** .54*** – –
Note—PT, pretest; IT, intermediate test; FCT, final comprehension test; FTT, final transfer test. **p .01. ***p .001.
448 MENGELKAMP AND BANNERT
stable over learning time, whether they generalize over different tests within the same domain, and whether they predict the learning outcome. Concerning stability over time, our results confirm our first hypothesis assuming that absolute accuracies were considerably stable, since all correlations were significant. In contrast, all three relative accuracies were not stable over time, with the exception of gamma correlating significantly between the IT and the FCT. This finding is not due to the use of a specific mea- sure within these two classes of accuracy measures.
Moreover, bias derived from predictive judgments has some stability as well (Kelemen et al., 2000). One reason for the stability of absolute accuracies could be that they are not independent of the two underlying measures of knowledge and confidence (see Nelson, 1984). That is, variance in the performance test and variance in the con- fidence judgments are part of the measures of absolute accuracy, due to their calculation. Therefore, the stability in absolute accuracy may purely reflect the stability of the performance and the judgments. In accordance with results from studies on test taking (see, e.g., Jonsson & Allwood, 2003; Schraw & Nietfeld, 1998), the rank order of judgments in this sample was even more stable than the rank order of the performance itself. So far, there is no method, to the authors’ knowledge, that solves the de- scribed problem of the results potentially being a statisti- cal artifact.
In contrast with absolute measures of accuracy, rela- tive measures of accuracy do not suffer from the problem of inherited variance, as was shown for gamma (Nelson, 1984). Thus, the findings for relative accuracy are not artifacts but can be interpreted unrestrictedly. Almost no evidence for the stability of relative accuracy was found, either with retrospective judgments in the present study or with predictive judgments and gamma in the study by Kelemen et al. (2000). Instead, the accuracy of monitoring seems to depend strongly on the situation. This is in ac- cordance with our assumption that relative accuracy is de- pendent on experience-based cues rather than on theory- based cues, and it is true despite the considerable stability of the judgments themselves. Low stability enables the manipulation of relative accuracy, as was done in the study by Thiede et al. (2003). Further interventions that increase the accuracy of judgments have been reviewed recently (Dunlosky & Lipko, 2007).
The second hypothesis stated that relative accuracy does not generalize over different tests. Results showed that there was a significant correlation between the gam- mas and Pearson’s r for the FCT and the FTT. At f irst glance, this result contradicts the findings with gamma and retrospective judgments (Pressley & Ghatala, 1988), as well as findings with predictive judgments (Glenberg & Epstein, 1987; Kelemen et al., 2000). However, all of these studies used tests of different knowledge domains from which to calculate the gammas. Our interpretation of the findings is that relative accuracy is domain spe- cific; thus, relative accuracy generalizes over different tests within a domain, but not between domains. This in- terpretation is supported by findings about theory-based monitoring (see Koriat, 2007), indicating that, among
correlated. The correlations can be found in the lower part of Table 5. Correlations between relative accuracy in the FCT and the FTT were significant for gamma as well as for Pearson’s r. These correlations indicate some generality of relative accuracy between the two tests. Note that these tests measured comprehension and transfer on the basis of the same learning material, but with differ- ent items and different answer formats. Thus, relative ac- curacy generalized at least partly between different tests administered at the same time.
Predictive Validity To avoid the methodological problem of part–whole
correlations mentioned by Hasselhorn and Hager (1989), no performance was predicted with absolute measures of accuracy that were derived from the same test. Thus, the aim of the analyses was to predict the performance in the FCT and FTT from measures of accuracy in the IT—that is, from measures of accuracy during the learn- ing process.
For absolute accuracy, the bivariate correlation between bias in the IT and the FCT was rs(111) .18, n.s. For absolute bias, the correlation was rs(111) .42, p .001. Taking the FTT as a criterion changed the correla- tions for bias [rs(111) .11, n.s.] and for absolute bias [rs(111) .43, p .001]. Thus, bias was not a valid predictor for learning outcome, but absolute bias was. For the relative measures of accuracy, the bivariate correlation between gamma in the IT and the FCT was rs(110) .20, p .05; for Pearson’s r the correlation was rs(110) .12, n.s.; and for da, the correlation was rs(60) .05, n.s. For the FTT, the predictive validity for gamma was rs(110) .26, p .01; for Pearson’s r, the validity was rs(110) .21, p .05; and for da, the validity was rs(60) .11, n.s. Thus, relative accuracy was a valid predictor for the learning outcome for gamma and partly for Pearson’s r, but da failed to show its predictive validity. As was argued above, this may be due to the restricted sample that was not representative of the whole sample.
DISCUSSION
In the present study, we analyzed whether different measures for the accuracy of confidence judgments are
Table 5 Gamma Correlations Across Tests for Relative Measures of Accuracy
Gamma Pearson’s r da Tests N N N
Stability
PT–IT 101 .06 101 .02 48 .17 PT–FCT 96 .03 96 .04 40 .08 PT–FTT 99 .16* 99 .14* − − IT–FCT 105 .15* 105 .09 37 .17 IT–FTT 110 .11 110 .05 − −
Generality
FCT–FTT 103 .15* 103 .13* − −
Note—PT, pretest; IT, intermediate test; FCT, final comprehension test; FTT, final transfer test. *p .05.
ACCURACY OF CONFIDENCE JUDGMENTS: STABILITY, GENERALITY, AND VALIDITY 449
bility, and this has not been investigated systematically so far. That is, the absolute accuracy of confidence judg- ments was stable, whereas the relative accuracy showed no stability. In our view, it seems promising to investi- gate the idea of theory-based versus experience-based cues (Koriat, 2007) more deeply; for example, measures of self-efficacy should be used to test our rationale that theory-based cues cause stability in absolute accuracies.
AUTHOR NOTE
The present article was mainly written during a research stay of M.B. at the CoCo Research Centre, University of Sydney, Australia, which was supported by funds from the German Science Foundation (DFG: BA 2044/5-1). We thank Susanne Narciss and the anonymous reviewers for their valuable advice and comments on the manuscript. Address correspon- dence to C. Mengelkamp, University of Koblenz-Landau, Fortstrasse 7, 76829 Landau, Germany (e-mail: [email protected]).
REFERENCES
Bannert, M. (2006). Effects of reflection prompts when learning with hypermedia. Journal of Educational Computing Research, 35, 359- 375. doi:10.2190/94V6-R58H-3367-G388
Benjamin, A. S., & Diaz, M. (2008). Measurement of relative meta- mnemonic accuracy. In J. Dunlosky & R. A. Bjork (Eds.), Handbook of metamemory and memory (pp. 73-94). New York: Psychology Press.
Bloom, B. S., Engelhart, M. D., Furst, E. J., Hill, W. H., & Krath- wohl, D. R. (1973). Taxonomie von Lernzielen im kognitiven Be- reich [Taxonomy of educational objectives] (E. Füner & R. Horn, Trans., 3rd ed.). Weinheim, Germany: Beltz.
Brewer, W. F., Sampaio, C., & Barlow, M. R. (2005). Confidence and accuracy in the recall of deceptive and nondeceptive sen- tences. Journal of Memory & Language, 52, 618-627. doi:10.1016/j .jml.2005.01.017
Byrne, B. M. (1986). Self-concept/academic achievement relations: An investigation of dimensionality, stability, and causality. Canadian Journal of Behavioural Science, 18, 173-186.
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Erlbaum.
Dunlosky, J., & Hertzog, C. (1998). Training programs to improve learning in later adulthood: Helping older adults educate themselves. In D. J. Hacker, J. Dunlosky, & A. C. Graesser (Eds.), Metacogni- tion in educational theory and practice (pp. 249-275). Mahwah, NJ: Erlbaum.
Dunlosky, J., Hertzog, C., Kennedy, M. R. T., & Thiede, K. W. (2005). The self-monitoring approach for effective learning. Cognitive Technology, 10, 4-11.
Dunlosky, J., & Lipko, A. R. (2007). Metacomprehension: A brief history and how to improve its accuracy. Current Directions in Psychological Science, 16, 228-232. doi:10.1111/j.1467-8721.2007 .00509.x
Dunlosky, J., Rawson, K. A., & McDonald, S. L. (2002). Influence of practice tests on the accuracy of predicting memory performance for paired associates, sentences, and text material. In T. J. Perfect & B. L. Schwartz (Eds.), Applied metacognition (pp. 68-92). Cambridge: Cambridge University Press.
Dunlosky, J., Rawson, K. A., & Middleton, E. L. (2005). What constrains the accuracy of metacomprehension judgments? Test- ing the transfer-appropriate-monitoring and accessibility hypoth- eses. Journal of Memory & Language, 52, 551-565. doi:10.1016/j .jml.2005.01.011
Ehrlinger, J., & Dunning, D. (2003). How chronic self-views influ- ence (and potentially mislead) estimates of performance. Journal of Personality & Social Psychology, 84, 5-17. doi:10.1037/0022 -3514.84.1.5
Glenberg, A. M., & Epstein, W. (1987). Inexpert calibration of com- prehension. Memory & Cognition, 15, 84-93.
Glenberg, A. M., Sanocki, T., Epstein, W., & Morris, C. (1987). En-
other sources, domain knowledge can be used to gener- ate judgments. Our interpretation is supported further by a metacomprehension study (Moore, Lin-Agler, & Zabrucky, 2005) in which the retrospective judgments of comprehension over three trials were influenced much more by prior judgments than by the performance within each trial. Moore et al. (2005, p. 261) concluded that the judgments are based on “a general perception of oneself as being a good or poor reader.”
The third hypothesis of the present study stated that the accuracy measures should be predictive for learning outcome. For bias, no significant bivariate correlations between the accuracy in the IT and in the final tests were found, but the correlations were significant for absolute bias. The former result was also found in a laboratory study (Maki, 1998). The latter result parallels findings from classroom studies that showed predictive validity for absolute bias (Nietfeld et al., 2005, 2006). One expla- nation for the nonsignificance of bias is that bias reflects over- and underconf idence. We based our assumption of predictive validity on the model of effective learning (Dunlosky, Hertzog, et al., 2005). The model does not differentiate between over- and underconfidence; that is, in both cases, monitoring is inaccurate, and the learning process is not controlled effectively. Therefore, the model does not predict the validity of bias, but the validity of absolute bias; hence, the results in the present study are in accordance with the hypotheses drawn from the model.
For relative accuracy, significant correlations between gamma in the IT and in both final tests were found. For Pearson’s r, the correlation with the FCT was not signifi- cant, but the correlation with the FTT was. The finding that relative accuracy is a valid predictor for learning out- come corresponds with results from metacomprehension research, which also have shown significant correlations for gamma (see, e.g., Maki, 1998; Maki et al., 1994), and is in accordance with the model by Dunlosky, Hertzog, et al. (2005). For da, no significant correlation was found, but this may be due to the smaller sample size.
A methodological problem concerning the stability, generality, and validity of the measures of accuracy arises from the confounding of these coefficients with the reli- ability of the measures. That is, a correlation cannot ex- ceed the minimum of the square roots of the reliabilities (J. Rost, 2004, p. 389). Thus, for future research on sta- bility, generality, and validity of accuracy measures, we suggest using structural equation models to estimate the accuracy at a latent level, thereby eliminating errors of measurement (see Schilling, 2005). Doing this requires at least four measures in time for each person and larger samples than in the present study. Bias can be estimated as a latent variable using models of true change (see, e.g., Little, Bovaird, & Slegers, 2006). To gain an estimation of latent relative accuracy, we suggest obtaining two relative accuracies at one point in time using two parallel tests.
Despite this statistical discussion, our present study highlighted the role of different classes of measures of accuracy. Apparently, absolute and relative measures of accuracy produced different results concerning their sta-
450 MENGELKAMP AND BANNERT
of metacomprehension inaccuracy. Reading Psychology, 26, 251-265. doi:10.1080/02702710590962578
Nelson, T. O. (1984). A comparison of current measures of the accuracy of feeling-of-knowing predictions. Psychological Bulletin, 95, 109- 133. doi:10.1037/0033-2909.95.1.109
Nelson, T. O. (1996). Gamma is a measure of the accuracy of predicting performance on one item relative to another item, not of the absolute performance on an individual item. Applied Cognitive Psychology, 10, 257-260.
Nelson, T. O., & Dunlosky, J. (1991). When people’s judgments of learning (JOLs) are extremely accurate at predicting subsequent re- call: The “delayed-JOL effect.” Psychological Science, 2, 267-270. doi:10.1111/j.1467-9280.1991.tb00147.x
Nelson, T. O., & Narens, L. (1990). Metamemory: A theoretical framework and new findings. In G. H. Bower (Ed.), The psychology of learning and motivation (Vol. 26, pp. 125-173). New York: Aca- demic Press.
Nelson, T. O., & Narens, L. (1992). Metamemory: A theoretical frame- work and new findings. In T. O. Nelson (Ed.), Metacognition: Core readings (pp. 117-130). Needham Heights, MA: Allyn and Bacon.
Nietfeld, J. L., Cao, L., & Osborne, J. W. (2005). Metacognitive monitoring accuracy and student performance in the postsecondary classroom. Journal of Experimental Education, 74, 7-28.
Nietfeld, J. L., Cao, L., & Osborne, J. W. (2006). The effect of dis- tributed monitoring exercises and feedback on performance, moni- toring accuracy, and self-eff icacy. Metacognition & Learning, 1, 159-179.
Nietfeld, J. L., & Schraw, G. (2002). The effect of knowledge and strategy training on monitoring accuracy. Journal of Educational Re- search, 95, 131-142.
Pallier, G., Wilkinson, R., Danthiir, V., Kleitman, S., Knezevic, G., Stankov, L., & Roberts, R. D. (2002). The role of individual differ- ences in the accuracy of confidence judgments. Journal of General Psychology, 129, 257-299.
Pressley, M., & Ghatala, E. S. (1988). Delusions about performance on multiple-choice comprehension tests. Reading Research Quar- terly, 23, 454-464. doi:10.2307/747643
Rost, D. H. (2005). Interpretation und Bewertung pädagogisch- psychologischer Studien. Eine Einführung [Interpretation and evalu- ation of studies in educational psychology. An introduction]. Wein- heim, Germany: Beltz.
Rost, J. (2004). Lehrbuch Testtheorie—Testkonstruktion [Test theory and test construction textbook]. Bern, Switzerland: Huber.
Sampaio, C., & Brewer, W. F. (2009). The role of unconscious memory errors in judgments of confidence for sentence recognition. Memory & Cognition, 37, 158-163. doi:10.3758/MC.37.2.158
Schilling, O. (2005). Strukturgleichungsmodelle zur Analyse von Simplex- Strukturen [Structural equation models for the analysis of simplex structures]. In J. Werner (Ed.), Zeitreihenanalysen mit Beispielen aus der Psychologie (pp. 37-72). Berlin: Logos.
Schraw, G. (2009). A conceptual analysis of five measures of metacog- nitive monitoring. Metacognition & Learning, 4, 33-45. doi:10.1007/ s11409-008-9031-3
Schraw, G., Dunkle, M. E., Bendixen, L. D., & DeBacker Roedel, T. (1995). Does a general monitoring skill exist? Jour- nal of Educational Psychology, 87, 433-444. doi:10.1037/0022- 0663.87.3.433
Schraw, G., & Nietfeld, J. [L.] (1998). A further test of the general monitoring skill hypothesis. Journal of Educational Psychology, 90, 236-248. doi:10.1037/0022-0663.90.2.236
Son, L. K., & Metcalfe, J. (2000). Metacognitive and control strate- gies in study-time allocation. Journal of Experimental Psychology: Learning, Memory, & Cognition, 26, 204-221. doi:10.1037/0278 -7393.26.1.204
Stankov, L. (1998). Calibration curves, scatterplots and the distinction between general knowledge and perceptual tasks. Learning & Indi- vidual Differences, 10, 29-50. doi:10.1016/S1041-6080(99)80141-1
Thiede, K. W., Anderson, M. C. M., & Therriault, D. (2003). Accu- racy of metacognitive monitoring affects learning of texts. Journal of Educational Psychology, 95, 66-73. doi:10.1037/0022-0663.95.1.66
Thiede, K. W., & Dunlosky, J. (1999). Toward a general model
hancing calibration of comprehension. Journal of Experimental Psy- chology: General, 116, 119-136. doi:10.1037/0096-3445.116.2.119
Gonzalez, R., & Nelson, T. O. (1996). Measuring ordinal association in situations that contain tied scores. Psychological Bulletin, 119, 159- 165. doi:10.1037/0033-2909.119.1.159
Graesser, A. C., Millis, K. K., & Zwaan, R. A. (1997). Discourse com- prehension. Annual Review of Psychology, 48, 163-189. doi:10.1146/ annurev.psych.48.1.163
Hacker, D. J., Bol, L., & Bahbahani, K. (2008). Explaining calibra- tion accuracy in classroom contexts: The effects of incentives, reflec- tion, and explanatory style. Metacognition & Learning, 3, 101-121. doi:10.1007/s11409-008-9021-5
Hasselhorn, M., & Hager, W. (1989). Prediction accuracy and mem- ory performance: Correlational and experimental tests of a metamem- ory hypothesis. Psychological Research, 51, 147-152. doi:10.1007/ BF00309310
Jonsson, A.-C., & Allwood, C. M. (2003). Stability and variability in the realism of confidence judgments over time, content domain, and gender. Personality & Individual Differences, 34, 559-574. doi:10.1016/S0191-8869(02)00028-4
Kelemen, W. L., Frost, P. J., & Weaver, C. A., III (2000). Individual differences in metacognition: Evidence against a general metacogni- tive ability. Memory & Cognition, 28, 92-107.
Kelley, C. M., & Lindsay, D. S. (1993). Remembering mistaken for knowing: Ease of retrieval as a basis for confidence in answers to general knowledge questions. Journal of Memory & Language, 32, 1-24. doi:10.1006/jmla.1993.1001
Kleitman, S., & Stankov, L. (2001). Ecological and person-oriented aspects of metacognitive processes in test-taking. Applied Cognitive Psychology, 15, 321-341. doi:10.1002/acp.705
Kleitman, S., & Stankov, L. (2007). Self-confidence and metacog- nitive processes. Learning & Individual Differences, 17, 161-173. doi:10.1016/j.lindif.2007.03.004
Koriat, A. (2007). Metacognition and consciousness. In P. D. Zelazo, M. Moscovitch, & E. Thompson (Eds.), The Cambridge handbook of consciousness (pp. 289-325). Cambridge: Cambridge University Press.
Koriat, A., Nussinson, R., Bless, H., & Shaked, N. (2008). Information- based and experience-based metacognitive judgments: Evidence from subjective confidence. In J. Dunlosky & R. A. Bjork (Eds.), Handbook of metamemory and memory (pp. 117-135). New York: Psychology Press.
Lane, J., & Lane, A. [M.] (2001). Self-efficacy and academic perfor- mance. Social Behavior & Personality, 29, 687-693. doi:10.2224/ sbp.2001.29.7.687
Lane, J., Lane, A. M., & Kyprianou, A. (2004). Self-efficacy, self- esteem and their impact on academic performance. Social Behavior & Personality, 32, 247-256. doi:10.2224/sbp.2004.32.3.247
Leonesio, R. J., & Nelson, T. O. (1990). Do different metamemory judgments tap the same underlying aspects of memory? Journal of Experimental Psychology: Learning, Memory, & Cognition, 16, 464- 470. doi:10.1037/0278-7393.16.3.464
Little, T. D., Bovaird, J. A., & Slegers, D. W. (2006). Methods for the analysis of change. In D. K. Mroczek & T. D. Little (Eds.), Handbook of personality development (pp. 181-211). Mahwah, NJ: Erlbaum.
Maki, R. H. (1995). Accuracy of metacomprehension judgments for questions of varying importance levels. American Journal of Psychol- ogy, 108, 327-344. doi:10.2307/1422893
Maki, R. H. (1998). Metacomprehension of text: Influence of absolute confidence level on bias and accuracy. In D. L. Medin (Ed.), The psy- chology of learning and motivation (Vol. 38, pp. 223-248). San Diego: Academic Press.
Maki, R. H., Jonas, D., & Kallod, M. (1994). The relationship be- tween comprehension and metacomprehension ability. Psychonomic Bulletin & Review, 1, 126-129.
Mengelkamp, C., & Bannert, M. (2009). Judgements about knowl- edge: Searching for factors that influence their validity. Electronic Journal of Research in Educational Psychology, 7, 163-190.
Metcalfe, J. (2002). Is study time allocated selectively to a region of proximal learning? Journal of Experimental Psychology: General, 131, 349-363. doi:10.1037/0096-3445.131.3.349
Moore, D., Lin-Agler, L. M., & Zabrucky, K. M. (2005). A source
ACCURACY OF CONFIDENCE JUDGMENTS: STABILITY, GENERALITY, AND VALIDITY 451
Weaver, C. A., III, & Kelemen, W. L. (1997). Judgments of learning at delays: Shifts in response patterns or increased metamemory accuracy? Psychological Science, 8, 318-321. doi:10.1111/j.1467-9280.1997 .tb00445.x
Yates, J. F. (1990). Judgment and decision making. Englewood Cliffs, NJ: Prentice Hall.
NOTE
1. The low frequencies for the middle category of judgments are due to the construction of the categories. Participants tended to use judgments such as 25%, 50%, or 75%. As the middle category reaches from 55% up to 69%, neither 50% nor 75% is included in the interval; thus, the frequencies are rather low.
of self-regulated study: An analysis of selection of items for study and self-paced study time. Journal of Experimental Psychology: Learning, Memory, & Cognition, 25, 1024-1037. doi:10.1037/0278 -7393.25.4.1024
Thompson, W. B., & Mason, S. E. (1996). Instability of individual dif- ferences in the association between confidence judgments and mem- ory performance. Memory & Cognition, 24, 226-234.
Tobias, S., & Everson, H. (2000). Assessing metacognitive knowledge monitoring. In G. Schraw & J. C. Impara (Eds.), Issues in the measure- ment of metacognition (pp. 147-222). Lincoln, NE: Buros Institute of Mental Measurements.
Weaver, C. A., III (1990). Constraining factors in calibration of compre- hension. Journal of Experimental Psychology: Learning, Memory, & Cognition, 16, 214-222. doi:10.1037/0278-7393.16.2.214
APPENDIX
Example of an Item From the Comprehension Test Positive reinforcement can be obtained with… o a positive consequence, but not with the withdrawal of an aversive stimulus. o the withdrawal of an aversive stimulus, but not with a positive consequence. o a positive consequence as well as with the withdrawal of an aversive stimulus.* o a positive consequence as well as with an aversive stimulus. *Correct answer
Example of an Item From the Transfer Test In many families, getting the children to bed is not easy. They get out of bed, want something to eat or drink,
want their parents to sing a song for them, etc. This scenario is repeated several times in one evening, and the parents become annoyed. If the parents decide not to respond to the wishes of their children but do not do this consistently, the children will behave in an even more bothersome manner.
Acting persons: parents influence children Influenced behavior: getting out of bed, wanting something to drink, etc. Stimulus: eating, drinking, hearing a song Principle: variable interval periodic reinforcement
(The underlined passages had to be filled in by the participants.)
(Manuscript received May 28, 2009; revision accepted for publication October 29, 2009.)
<< /ASCII85EncodePages false /AllowTransparency false /AutoPositionEPSFiles true /AutoRotatePages /None /Binding /Left /CalGrayProfile (Gray Gamma 2.2) /CalRGBProfile (sRGB IEC61966-2.1) /CalCMYKProfile (ISO Coated v2 300% \050ECI\051) /sRGBProfile (sRGB IEC61966-2.1) /CannotEmbedFontPolicy /Error /CompatibilityLevel 1.3 /CompressObjects /Off /CompressPages true /ConvertImagesToIndexed true /PassThroughJPEGImages true /CreateJobTicket false /DefaultRenderingIntent /Perceptual /DetectBlends true /DetectCurves 0.1000 /ColorConversionStrategy /sRGB /DoThumbnails true /EmbedAllFonts true /EmbedOpenType false /ParseICCProfilesInComments true /EmbedJobOptions true /DSCReportingLevel 0 /EmitDSCWarnings false /EndPage -1 /ImageMemory 1048576 /LockDistillerParams true /MaxSubsetPct 100 /Optimize true /OPM 1 /ParseDSCComments true /ParseDSCCommentsForDocInfo true /PreserveCopyPage true /PreserveDICMYKValues true /PreserveEPSInfo true /PreserveFlatness true /PreserveHalftoneInfo false /PreserveOPIComments false /PreserveOverprintSettings true /StartPage 1 /SubsetFonts false /TransferFunctionInfo /Apply /UCRandBGInfo /Preserve /UsePrologue false /ColorSettingsFile () /AlwaysEmbed [ true ] /NeverEmbed [ true ] /AntiAliasColorImages false /CropColorImages true /ColorImageMinResolution 149 /ColorImageMinResolutionPolicy /Warning /DownsampleColorImages true /ColorImageDownsampleType /Bicubic /ColorImageResolution 150 /ColorImageDepth -1 /ColorImageMinDownsampleDepth 1 /ColorImageDownsampleThreshold 1.50000 /EncodeColorImages true /ColorImageFilter /DCTEncode /AutoFilterColorImages true /ColorImageAutoFilterStrategy /JPEG /ColorACSImageDict << /QFactor 0.40 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /ColorImageDict << /QFactor 0.15 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /JPEG2000ColorACSImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /JPEG2000ColorImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /AntiAliasGrayImages false /CropGrayImages true /GrayImageMinResolution 149 /GrayImageMinResolutionPolicy /Warning /DownsampleGrayImages true /GrayImageDownsampleType /Bicubic /GrayImageResolution 150 /GrayImageDepth -1 /GrayImageMinDownsampleDepth 2 /GrayImageDownsampleThreshold 1.50000 /EncodeGrayImages true /GrayImageFilter /DCTEncode /AutoFilterGrayImages true /GrayImageAutoFilterStrategy /JPEG /GrayACSImageDict << /QFactor 0.40 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /GrayImageDict << /QFactor 0.15 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /JPEG2000GrayACSImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /JPEG2000GrayImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /AntiAliasMonoImages false /CropMonoImages true /MonoImageMinResolution 599 /MonoImageMinResolutionPolicy /Warning /DownsampleMonoImages true /MonoImageDownsampleType /Bicubic /MonoImageResolution 600 /MonoImageDepth -1 /MonoImageDownsampleThreshold 1.50000 /EncodeMonoImages true /MonoImageFilter /CCITTFaxEncode /MonoImageDict << /K -1 >> /AllowPSXObjects false /CheckCompliance [ /None ] /PDFX1aCheck false /PDFX3Check false /PDFXCompliantPDFOnly false /PDFXNoTrimBoxError true /PDFXTrimBoxToMediaBoxOffset [ 0.00000 0.00000 0.00000 0.00000 ] /PDFXSetBleedBoxToMediaBox true /PDFXBleedBoxToTrimBoxOffset [ 0.00000 0.00000 0.00000 0.00000 ] /PDFXOutputIntentProfile (None) /PDFXOutputConditionIdentifier () /PDFXOutputCondition () /PDFXRegistryName () /PDFXTrapped /False /CreateJDFFile false /Description << /ARA <FEFF06270633062A062E062F0645002006470630064700200627064406250639062F0627062F0627062A002006440625064606340627062100200648062B062706260642002000410064006F00620065002000500044004600200645062A064806270641064206290020064406440637062806270639062900200641064A00200627064406450637062706280639002006300627062A0020062F0631062C0627062A002006270644062C0648062F0629002006270644063906270644064A0629061B0020064A06450643064600200641062A062D00200648062B0627062606420020005000440046002006270644064506460634062306290020062806270633062A062E062F062706450020004100630072006F0062006100740020064800410064006F006200650020005200650061006400650072002006250635062F0627063100200035002E0030002006480627064406250635062F062706310627062A0020062706440623062D062F062B002E0635062F0627063100200035002E0030002006480627064406250635062F062706310627062A0020062706440623062D062F062B002E> /BGR <FEFF04180437043f043e043b043704320430043904420435002004420435043704380020043d0430044104420440043e0439043a0438002c00200437043000200434043000200441044a0437043404300432043004420435002000410064006f00620065002000500044004600200434043e043a0443043c0435043d04420438002c0020043c0430043a04410438043c0430043b043d043e0020043f044004380433043e04340435043d04380020043704300020043204380441043e043a043e043a0430044704350441044204320435043d0020043f04350447043004420020043704300020043f044004350434043f0435044704300442043d04300020043f043e04340433043e0442043e0432043a0430002e002000200421044a04370434043004340435043d043804420435002000500044004600200434043e043a0443043c0435043d044204380020043c043e0433043004420020043404300020044104350020043e0442043204300440044f0442002004410020004100630072006f00620061007400200438002000410064006f00620065002000520065006100640065007200200035002e00300020043800200441043b0435043404320430044904380020043204350440044104380438002e> /CHS <FEFF4f7f75288fd94e9b8bbe5b9a521b5efa7684002000410064006f006200650020005000440046002065876863900275284e8e9ad88d2891cf76845370524d53705237300260a853ef4ee54f7f75280020004100630072006f0062006100740020548c002000410064006f00620065002000520065006100640065007200200035002e003000204ee553ca66f49ad87248672c676562535f00521b5efa768400200050004400460020658768633002> /CHT <FEFF4f7f752890194e9b8a2d7f6e5efa7acb7684002000410064006f006200650020005000440046002065874ef69069752865bc9ad854c18cea76845370524d5370523786557406300260a853ef4ee54f7f75280020004100630072006f0062006100740020548c002000410064006f00620065002000520065006100640065007200200035002e003000204ee553ca66f49ad87248672c4f86958b555f5df25efa7acb76840020005000440046002065874ef63002> /CZE <FEFF005400610074006f0020006e006100730074006100760065006e00ed00200070006f0075017e0069006a007400650020006b0020007600790074007600e101590065006e00ed00200064006f006b0075006d0065006e0074016f002000410064006f006200650020005000440046002c0020006b00740065007200e90020007300650020006e0065006a006c00e90070006500200068006f006400ed002000700072006f0020006b00760061006c00690074006e00ed0020007400690073006b00200061002000700072006500700072006500730073002e002000200056007900740076006f01590065006e00e900200064006f006b0075006d0065006e007400790020005000440046002000620075006400650020006d006f017e006e00e90020006f007400650076015900ed007400200076002000700072006f006700720061006d0065006300680020004100630072006f00620061007400200061002000410064006f00620065002000520065006100640065007200200035002e0030002000610020006e006f0076011b006a016100ed00630068002e> /DAN <FEFF004200720075006700200069006e0064007300740069006c006c0069006e006700650072006e0065002000740069006c0020006100740020006f007000720065007400740065002000410064006f006200650020005000440046002d0064006f006b0075006d0065006e007400650072002c0020006400650072002000620065006400730074002000650067006e006500720020007300690067002000740069006c002000700072006500700072006500730073002d007500640073006b007200690076006e0069006e00670020006100660020006800f8006a0020006b00760061006c0069007400650074002e0020004400650020006f007000720065007400740065006400650020005000440046002d0064006f006b0075006d0065006e0074006500720020006b0061006e002000e50062006e00650073002000690020004100630072006f00620061007400200065006c006c006500720020004100630072006f006200610074002000520065006100640065007200200035002e00300020006f00670020006e0079006500720065002e> /ESP <FEFF005500740069006c0069006300650020006500730074006100200063006f006e0066006900670075007200610063006900f3006e0020007000610072006100200063007200650061007200200064006f00630075006d0065006e0074006f00730020005000440046002000640065002000410064006f0062006500200061006400650063007500610064006f00730020007000610072006100200069006d0070007200650073006900f3006e0020007000720065002d0065006400690074006f007200690061006c00200064006500200061006c00740061002000630061006c0069006400610064002e002000530065002000700075006500640065006e00200061006200720069007200200064006f00630075006d0065006e0074006f00730020005000440046002000630072006500610064006f007300200063006f006e0020004100630072006f006200610074002c002000410064006f00620065002000520065006100640065007200200035002e003000200079002000760065007200730069006f006e0065007300200070006f00730074006500720069006f007200650073002e> /ETI <FEFF004b00610073007500740061006700650020006e0065006900640020007300e4007400740065006900640020006b00760061006c006900740065006500740073006500200074007200fc006b006900650065006c007300650020007000720069006e00740069006d0069007300650020006a0061006f006b007300200073006f00620069006c0069006b0065002000410064006f006200650020005000440046002d0064006f006b0075006d0065006e00740069006400650020006c006f006f006d006900730065006b0073002e00200020004c006f006f0064007500640020005000440046002d0064006f006b0075006d0065006e00740065002000730061006100740065002000610076006100640061002000700072006f006700720061006d006d006900640065006700610020004100630072006f0062006100740020006e0069006e0067002000410064006f00620065002000520065006100640065007200200035002e00300020006a00610020007500750065006d006100740065002000760065007200730069006f006f006e00690064006500670061002e000d000a> /FRA <FEFF005500740069006c006900730065007a00200063006500730020006f007000740069006f006e00730020006100660069006e00200064006500200063007200e900650072002000640065007300200064006f00630075006d0065006e00740073002000410064006f00620065002000500044004600200070006f0075007200200075006e00650020007100750061006c0069007400e90020006400270069006d007000720065007300730069006f006e00200070007200e9007000720065007300730065002e0020004c0065007300200064006f00630075006d0065006e00740073002000500044004600200063007200e900e90073002000700065007500760065006e0074002000ea0074007200650020006f007500760065007200740073002000640061006e00730020004100630072006f006200610074002c002000610069006e00730069002000710075002700410064006f00620065002000520065006100640065007200200035002e0030002000650074002000760065007200730069006f006e007300200075006c007400e90072006900650075007200650073002e> /GRE <FEFF03a703c103b703c303b903bc03bf03c003bf03b903ae03c303c403b5002003b103c503c403ad03c2002003c403b903c2002003c103c503b803bc03af03c303b503b903c2002003b303b903b1002003bd03b1002003b403b703bc03b903bf03c503c103b303ae03c303b503c403b5002003ad03b303b303c103b103c603b1002000410064006f006200650020005000440046002003c003bf03c5002003b503af03bd03b103b9002003ba03b103c42019002003b503be03bf03c703ae03bd002003ba03b103c403ac03bb03bb03b703bb03b1002003b303b903b1002003c003c103bf002d03b503ba03c403c503c003c903c403b903ba03ad03c2002003b503c103b303b103c303af03b503c2002003c503c803b703bb03ae03c2002003c003bf03b903cc03c403b703c403b103c2002e0020002003a403b10020005000440046002003ad03b303b303c103b103c603b1002003c003bf03c5002003ad03c703b503c403b5002003b403b703bc03b903bf03c503c103b303ae03c303b503b9002003bc03c003bf03c103bf03cd03bd002003bd03b1002003b103bd03bf03b903c703c403bf03cd03bd002003bc03b5002003c403bf0020004100630072006f006200610074002c002003c403bf002000410064006f00620065002000520065006100640065007200200035002e0030002003ba03b103b9002003bc03b503c403b103b303b503bd03ad03c303c403b503c103b503c2002003b503ba03b403cc03c303b503b903c2002e> /HEB <FEFF05D405E905EA05DE05E905D5002005D105D405D205D305E805D505EA002005D005DC05D4002005DB05D305D9002005DC05D905E605D505E8002005DE05E105DE05DB05D9002000410064006F006200650020005000440046002005D405DE05D505EA05D005DE05D905DD002005DC05D405D305E405E105EA002005E705D305DD002D05D305E405D505E1002005D005D905DB05D505EA05D905EA002E002005DE05E105DE05DB05D90020005000440046002005E905E005D505E605E805D5002005E005D905EA05E005D905DD002005DC05E405EA05D905D705D4002005D105D005DE05E605E205D505EA0020004100630072006F006200610074002005D5002D00410064006F00620065002000520065006100640065007200200035002E0030002005D505D205E805E105D005D505EA002005DE05EA05E705D305DE05D505EA002005D905D505EA05E8002E05D005DE05D905DD002005DC002D005000440046002F0058002D0033002C002005E205D905D905E005D5002005D105DE05D305E805D905DA002005DC05DE05E905EA05DE05E9002005E905DC0020004100630072006F006200610074002E002005DE05E105DE05DB05D90020005000440046002005E905E005D505E605E805D5002005E005D905EA05E005D905DD002005DC05E405EA05D905D705D4002005D105D005DE05E605E205D505EA0020004100630072006F006200610074002005D5002D00410064006F00620065002000520065006100640065007200200035002E0030002005D505D205E805E105D005D505EA002005DE05EA05E705D305DE05D505EA002005D905D505EA05E8002E> /HRV (Za stvaranje Adobe PDF dokumenata najpogodnijih za visokokvalitetni ispis prije tiskanja koristite ove postavke. Stvoreni PDF dokumenti mogu se otvoriti Acrobat i Adobe Reader 5.0 i kasnijim verzijama.) /HUN <FEFF004b0069007600e1006c00f30020006d0069006e0151007300e9006701710020006e0079006f006d00640061006900200065006c0151006b00e90073007a00ed007401510020006e0079006f006d00740061007400e100730068006f007a0020006c006500670069006e006b00e1006200620020006d0065006700660065006c0065006c0151002000410064006f00620065002000500044004600200064006f006b0075006d0065006e00740075006d006f006b0061007400200065007a0065006b006b0065006c0020006100200062006500e1006c006c00ed007400e10073006f006b006b0061006c0020006b00e90073007a00ed0074006800650074002e0020002000410020006c00e90074007200650068006f007a006f00740074002000500044004600200064006f006b0075006d0065006e00740075006d006f006b00200061007a0020004100630072006f006200610074002000e9007300200061007a002000410064006f00620065002000520065006100640065007200200035002e0030002c0020007600610067007900200061007a002000610074007400f3006c0020006b00e9007301510062006200690020007600650072007a006900f3006b006b0061006c0020006e00790069007400680061007400f3006b0020006d00650067002e> /ITA <FEFF005500740069006c0069007a007a006100720065002000710075006500730074006500200069006d0070006f007300740061007a0069006f006e00690020007000650072002000630072006500610072006500200064006f00630075006d0065006e00740069002000410064006f00620065002000500044004600200070006900f900200061006400610074007400690020006100200075006e00610020007000720065007300740061006d0070006100200064006900200061006c007400610020007100750061006c0069007400e0002e0020004900200064006f00630075006d0065006e007400690020005000440046002000630072006500610074006900200070006f00730073006f006e006f0020006500730073006500720065002000610070006500720074006900200063006f006e0020004100630072006f00620061007400200065002000410064006f00620065002000520065006100640065007200200035002e003000200065002000760065007200730069006f006e006900200073007500630063006500730073006900760065002e> /JPN <FEFF9ad854c18cea306a30d730ea30d730ec30b951fa529b7528002000410064006f0062006500200050004400460020658766f8306e4f5c6210306b4f7f75283057307e305930023053306e8a2d5b9a30674f5c62103055308c305f0020005000440046002030d530a130a430eb306f3001004100630072006f0062006100740020304a30883073002000410064006f00620065002000520065006100640065007200200035002e003000204ee5964d3067958b304f30533068304c3067304d307e305930023053306e8a2d5b9a306b306f30d530a930f330c8306e57cb30818fbc307f304c5fc59808306730593002> /KOR <FEFFc7740020c124c815c7440020c0acc6a9d558c5ec0020ace0d488c9c80020c2dcd5d80020c778c1c4c5d00020ac00c7a50020c801d569d55c002000410064006f0062006500200050004400460020bb38c11cb97c0020c791c131d569b2c8b2e4002e0020c774b807ac8c0020c791c131b41c00200050004400460020bb38c11cb2940020004100630072006f0062006100740020bc0f002000410064006f00620065002000520065006100640065007200200035002e00300020c774c0c1c5d0c11c0020c5f40020c2180020c788c2b5b2c8b2e4002e> /LTH <FEFF004e006100750064006f006b0069007400650020016100690075006f007300200070006100720061006d006500740072007500730020006e006f0072011700640061006d00690020006b0075007200740069002000410064006f00620065002000500044004600200064006f006b0075006d0065006e007400750073002c0020006b00750072006900650020006c0061006200690061007500730069006100690020007000720069007400610069006b007900740069002000610075006b01610074006f00730020006b006f006b007900620117007300200070006100720065006e006700740069006e00690061006d00200073007000610075007300640069006e0069006d00750069002e0020002000530075006b0075007200740069002000500044004600200064006f006b0075006d0065006e007400610069002000670061006c006900200062016b007400690020006100740069006400610072006f006d00690020004100630072006f006200610074002000690072002000410064006f00620065002000520065006100640065007200200035002e0030002000610072002000760117006c00650073006e0117006d00690073002000760065007200730069006a006f006d00690073002e> /LVI <FEFF0049007a006d0061006e0074006f006a00690065007400200161006f00730020006900650073007400610074012b006a0075006d00750073002c0020006c0061006900200076006500690064006f00740075002000410064006f00620065002000500044004600200064006f006b0075006d0065006e007400750073002c0020006b006100730020006900720020012b00700061016100690020007000690065006d01130072006f00740069002000610075006700730074006100730020006b00760061006c0069007401010074006500730020007000690072006d007300690065007300700069006501610061006e006100730020006400720075006b00610069002e00200049007a0076006500690064006f006a006900650074002000500044004600200064006f006b0075006d0065006e007400750073002c0020006b006f002000760061007200200061007400760113007200740020006100720020004100630072006f00620061007400200075006e002000410064006f00620065002000520065006100640065007200200035002e0030002c0020006b0101002000610072012b00200074006f0020006a00610075006e0101006b0101006d002000760065007200730069006a0101006d002e> /NLD (Gebruik deze instellingen om Adobe PDF-documenten te maken die zijn geoptimaliseerd voor prepress-afdrukken van hoge kwaliteit. De gemaakte PDF-documenten kunnen worden geopend met Acrobat en Adobe Reader 5.0 en hoger.) /NOR <FEFF004200720075006b00200064006900730073006500200069006e006e007300740069006c006c0069006e00670065006e0065002000740069006c002000e50020006f0070007000720065007400740065002000410064006f006200650020005000440046002d0064006f006b0075006d0065006e00740065007200200073006f006d00200065007200200062006500730074002000650067006e0065007400200066006f00720020006600f80072007400720079006b006b0073007500740073006b00720069006600740020006100760020006800f800790020006b00760061006c0069007400650074002e0020005000440046002d0064006f006b0075006d0065006e00740065006e00650020006b0061006e002000e50070006e00650073002000690020004100630072006f00620061007400200065006c006c00650072002000410064006f00620065002000520065006100640065007200200035002e003000200065006c006c00650072002000730065006e006500720065002e> /POL <FEFF0055007300740061007700690065006e0069006100200064006f002000740077006f0072007a0065006e0069006100200064006f006b0075006d0065006e007400f300770020005000440046002000700072007a0065007a006e00610063007a006f006e00790063006800200064006f002000770079006400720075006b00f30077002000770020007700790073006f006b00690065006a0020006a0061006b006f015b00630069002e002000200044006f006b0075006d0065006e0074007900200050004400460020006d006f017c006e00610020006f007400770069006500720061010700200077002000700072006f006700720061006d006900650020004100630072006f00620061007400200069002000410064006f00620065002000520065006100640065007200200035002e0030002000690020006e006f00770073007a0079006d002e> /PTB <FEFF005500740069006c0069007a006500200065007300730061007300200063006f006e00660069006700750072006100e700f50065007300200064006500200066006f0072006d00610020006100200063007200690061007200200064006f00630075006d0065006e0074006f0073002000410064006f0062006500200050004400460020006d00610069007300200061006400650071007500610064006f00730020007000610072006100200070007200e9002d0069006d0070007200650073007300f50065007300200064006500200061006c007400610020007100750061006c00690064006100640065002e0020004f007300200064006f00630075006d0065006e0074006f00730020005000440046002000630072006900610064006f007300200070006f00640065006d0020007300650072002000610062006500720074006f007300200063006f006d0020006f0020004100630072006f006200610074002000650020006f002000410064006f00620065002000520065006100640065007200200035002e0030002000650020007600650072007300f50065007300200070006f00730074006500720069006f007200650073002e> /RUM <FEFF005500740069006c0069007a00610163006900200061006300650073007400650020007300650074010300720069002000700065006e007400720075002000610020006300720065006100200064006f00630075006d0065006e00740065002000410064006f006200650020005000440046002000610064006500630076006100740065002000700065006e0074007200750020007400690070010300720069007200650061002000700072006500700072006500730073002000640065002000630061006c006900740061007400650020007300750070006500720069006f006100720103002e002000200044006f00630075006d0065006e00740065006c00650020005000440046002000630072006500610074006500200070006f00740020006600690020006400650073006300680069007300650020006300750020004100630072006f006200610074002c002000410064006f00620065002000520065006100640065007200200035002e00300020015f00690020007600650072007300690075006e0069006c006500200075006c0074006500720069006f006100720065002e> /RUS <FEFF04180441043f043e043b044c04370443043904420435002004340430043d043d044b04350020043d0430044104420440043e0439043a043800200434043b044f00200441043e043704340430043d0438044f00200434043e043a0443043c0435043d0442043e0432002000410064006f006200650020005000440046002c0020043c0430043a04410438043c0430043b044c043d043e0020043f043e04340445043e0434044f04490438044500200434043b044f00200432044b0441043e043a043e043a0430044704350441044204320435043d043d043e0433043e00200434043e043f0435044704300442043d043e0433043e00200432044b0432043e04340430002e002000200421043e043704340430043d043d044b04350020005000440046002d0434043e043a0443043c0435043d0442044b0020043c043e0436043d043e0020043e0442043a0440044b043204300442044c002004410020043f043e043c043e0449044c044e0020004100630072006f00620061007400200438002000410064006f00620065002000520065006100640065007200200035002e00300020043800200431043e043b043504350020043f043e04370434043d043804450020043204350440044104380439002e> /SKY <FEFF0054006900650074006f0020006e006100730074006100760065006e0069006100200070006f0075017e0069007400650020006e00610020007600790074007600e100720061006e0069006500200064006f006b0075006d0065006e0074006f0076002000410064006f006200650020005000440046002c0020006b0074006f007200e90020007300610020006e0061006a006c0065007001610069006500200068006f0064006900610020006e00610020006b00760061006c00690074006e00fa00200074006c0061010d00200061002000700072006500700072006500730073002e00200056007900740076006f00720065006e00e900200064006f006b0075006d0065006e007400790020005000440046002000620075006400650020006d006f017e006e00e90020006f00740076006f00720069016500200076002000700072006f006700720061006d006f006300680020004100630072006f00620061007400200061002000410064006f00620065002000520065006100640065007200200035002e0030002000610020006e006f0076016100ed00630068002e> /SLV <FEFF005400650020006e006100730074006100760069007400760065002000750070006f0072006100620069007400650020007a00610020007500730074007600610072006a0061006e006a006500200064006f006b0075006d0065006e0074006f0076002000410064006f006200650020005000440046002c0020006b006900200073006f0020006e0061006a007000720069006d00650072006e0065006a016100690020007a00610020006b0061006b006f0076006f00730074006e006f0020007400690073006b0061006e006a00650020007300200070007200690070007200610076006f0020006e00610020007400690073006b002e00200020005500730074007600610072006a0065006e006500200064006f006b0075006d0065006e0074006500200050004400460020006a00650020006d006f0067006f010d00650020006f0064007000720065007400690020007a0020004100630072006f00620061007400200069006e002000410064006f00620065002000520065006100640065007200200035002e003000200069006e0020006e006f00760065006a01610069006d002e> /SUO <FEFF004b00e40079007400e40020006e00e40069007400e4002000610073006500740075006b007300690061002c0020006b0075006e0020006c0075006f00740020006c00e400680069006e006e00e4002000760061006100740069007600610061006e0020007000610069006e006100740075006b00730065006e002000760061006c006d0069007300740065006c00750074007900f6006800f6006e00200073006f00700069007600690061002000410064006f0062006500200050004400460020002d0064006f006b0075006d0065006e007400740065006a0061002e0020004c0075006f0064007500740020005000440046002d0064006f006b0075006d0065006e00740069007400200076006f0069006400610061006e0020006100760061007400610020004100630072006f0062006100740069006c006c00610020006a0061002000410064006f00620065002000520065006100640065007200200035002e0030003a006c006c00610020006a006100200075007500640065006d006d0069006c006c0061002e> /SVE <FEFF0041006e007600e4006e00640020006400650020006800e4007200200069006e0073007400e4006c006c006e0069006e006700610072006e00610020006f006d002000640075002000760069006c006c00200073006b006100700061002000410064006f006200650020005000440046002d0064006f006b0075006d0065006e007400200073006f006d002000e400720020006c00e4006d0070006c0069006700610020006600f60072002000700072006500700072006500730073002d007500740073006b00720069006600740020006d006500640020006800f600670020006b00760061006c0069007400650074002e002000200053006b006100700061006400650020005000440046002d0064006f006b0075006d0065006e00740020006b0061006e002000f600700070006e00610073002000690020004100630072006f0062006100740020006f00630068002000410064006f00620065002000520065006100640065007200200035002e00300020006f00630068002000730065006e006100720065002e> /TUR <FEFF005900fc006b00730065006b0020006b0061006c006900740065006c0069002000f6006e002000790061007a006401310072006d00610020006200610073006b013100730131006e006100200065006e0020006900790069002000750079006100620069006c006500630065006b002000410064006f006200650020005000440046002000620065006c00670065006c0065007200690020006f006c0075015f007400750072006d0061006b0020006900e70069006e00200062007500200061007900610072006c0061007201310020006b0075006c006c0061006e0131006e002e00200020004f006c0075015f0074007500720075006c0061006e0020005000440046002000620065006c00670065006c0065007200690020004100630072006f006200610074002000760065002000410064006f00620065002000520065006100640065007200200035002e003000200076006500200073006f006e0072006100730131006e00640061006b00690020007300fc007200fc006d006c00650072006c00650020006100e70131006c006100620069006c00690072002e> /UKR <FEFF04120438043a043e0440043804410442043e043204430439044204350020044604560020043f043004400430043c043504420440043800200434043b044f0020044104420432043e04400435043d043d044f00200434043e043a0443043c0435043d044204560432002000410064006f006200650020005000440046002c0020044f043a04560020043d04300439043a04400430044904350020043f045604340445043e0434044f0442044c00200434043b044f0020043204380441043e043a043e044f043a04560441043d043e0433043e0020043f0435044004350434043404400443043a043e0432043e0433043e0020043404400443043a0443002e00200020042104420432043e04400435043d045600200434043e043a0443043c0435043d0442043800200050004400460020043c043e0436043d04300020043204560434043a0440043804420438002004430020004100630072006f006200610074002004420430002000410064006f00620065002000520065006100640065007200200035002e0030002004300431043e0020043f04560437043d04560448043e04570020043204350440044104560457002e> /ENU (Use these settings to create Adobe PDF documents best suited for high-quality prepress printing. Created PDF documents can be opened with Acrobat and Adobe Reader 5.0 and later.) /DEU <FEFF004a006f0062006f007000740069006f006e007300200066006f00720020004100630072006f006200610074002000440069007300740069006c006c0065007200200038002000280038002e0032002e00310029000d00500072006f006400750063006500730020005000440046002000660069006c0065007300200077006800690063006800200061007200650020007500730065006400200066006f00720020006f006e006c0069006e0065002e000d0028006300290020003200300031003000200053007000720069006e006700650072002d005600650072006c0061006700200047006d006200480020000d000d0054006800650020006c00610074006500730074002000760065007200730069006f006e002000630061006e00200062006500200064006f0077006e006c006f0061006400650064002000610074002000680074007400700073003a002f002f0070006f007200740061006c002d0064006f0072006400720065006300680074002e0073007000720069006e006700650072002d00730062006d002e0063006f006d002f00500072006f00640075006300740069006f006e002f0046006c006f0077002f00740065006300680064006f0063002f00640065006600610075006c0074002e0061007300700078000d0054006800650072006500200079006f0075002000630061006e00200061006c0073006f002000660069006e0064002000610020007300750069007400610062006c006500200045006e0066006f0063007500730020005000440046002000500072006f00660069006c006500200066006f0072002000500069007400530074006f0070002000500072006f00660065007300730069006f006e0061006c00200030003800200061006e0064002000500069007400530074006f0070002000530065007200760065007200200030003800200066006f007200200070007200650066006c00690067006800740069006e006700200079006f007500720020005000440046002000660069006c006500730020006200650066006f007200650020006a006f00620020007300750062006d0069007300730069006f006e002e000d> >> /Namespace [ (Adobe) (Common) (1.0) ] /OtherNamespaces [ << /AsReaderSpreads false /CropImagesToFrames true /ErrorControl /WarnAndContinue /FlattenerIgnoreSpreadOverrides false /IncludeGuidesGrids false /IncludeNonPrinting false /IncludeSlug false /Namespace [ (Adobe) (InDesign) (4.0) ] /OmitPlacedBitmaps false /OmitPlacedEPS false /OmitPlacedPDF false /SimulateOverprint /Legacy >> << /AddBleedMarks false /AddColorBars false /AddCropMarks false /AddPageInfo false /AddRegMarks false /ConvertColors /ConvertToCMYK /DestinationProfileName () /DestinationProfileSelector /DocumentCMYK /Downsample16BitImages true /FlattenerPreset << /PresetSelector /MediumResolution >> /FormElements false /GenerateStructure false /IncludeBookmarks false /IncludeHyperlinks false /IncludeInteractive false /IncludeLayers false /IncludeProfiles false /MultimediaHandling /UseObjectSettings /Namespace [ (Adobe) (CreativeSuite) (2.0) ] /PDFXOutputIntentProfileSelector /DocumentCMYK /PreserveEditing true /UntaggedCMYKHandling /LeaveUntagged /UntaggedRGBHandling /UseDocumentProfile /UseDocumentBleed false >> ] >> setdistillerparams << /HWResolution [2400 2400] /PageSize [595.276 841.890] >> setpagedevice