Research Proposal

profileybilmz1314
Meng2013-WCF.pdf

Retrievable at www.tc.columbia.edu/tesolalwebjournal

69

Written Corrective Feedback:

A Review of Studies since Truscott (1996)

Yuan-Yuan Meng 1

INTRODUCTION

Linguistic errors are pervasive in second language (L2) students’ writing. Depending on their

gravity, the errors may cause a minor degree of irritation to the reader or even lead to total

communication breakdown. As such, errors have always been a major concern to both students

and teachers, and error correction has also assumed a central position in language teaching.

Students generally expect that their errors will be pointed out and dealt with by their teachers.

For instance, in a study on students’ attitudes toward corrective feedback (CF) in college-level

English writing classes, Leki (1991) surveyed 100 English as a Second Language (ESL) students,

asking them such questions as how concerned they were with their written errors, what they

thought were the most important features in their writing that the teacher should attend to, and

what they looked at when receiving a graded paper from the teacher. The results of the survey

indicated that the students believed that good writing should be error-free, and the majority

wanted all their written errors to be corrected. For L2 teachers, providing written CF on student

writing has long been an essential practice. In fact, “grammar correction is something of an

institution” (Truscott, 1996, p. 327) in L2 writing courses. Despite the fact that correcting

students’ written errors is a time-consuming ordeal, and the endeavor is “fraught with uncertainty

about its long-term effectiveness” (Ferris, 1999, p. 1), most L2 teachers have continued to slave

over students’ errors in one form or another. As confirmed by a recent study on practitioners’

perspectives, the majority of teachers believe that students need CF and that written CF is overall

an effective pedagogical practice (Evans et al., 2010).

In the field of Second Language Acquisition (SLA), the positive effect of oral CF in

facilitating L2 acquisition is generally supported by research (Li, 2010; Lyster & Saito, 2010;

Russell & Spada, 2006), but research on the role of written CF in SLA has produced mixed

results, and a consensus on the positive role of written CF has not yet been obtained (Ferris, 2010;

Guenette, 2007; Hyland and Hyland, 2006; Van Beuningen, 2010). Hyland and Hyland (2006)

commented that due to the varied populations, treatments, and research designs of many of the

earlier studies on written CF, it was difficult to draw valid conclusions and generalizations from

them. Likewise, Guenett (2007) attributed the conflicting research findings on written CF to

methodological inconsistencies between the earlier studies and the presence of confounding

external variables.

1 Yuan-Yuan Meng is currently an EdM student in Applied Linguistics at Teachers College, Columbia University,

where she obtained an MA in TESOL and an MA in Applied Linguistics. She has been teaching Chinese as a

foreign language at Columbia University since 1993. Her research interests include formulaic expressions, literacy

education, and the stylistic features of Chinese journalistic language. Correspondence should be sent to:

[email protected].

Retrievable at www.tc.columbia.edu/tesolalwebjournal

70

Can written CF help L2 learners improve their grammatical accuracy in writing? Is it

worthwhile for L2 teachers to devote countless hours to written error correction? These questions

not only concern L2 teachers; they have also become a source of controversy among SLA

researchers, who, for over a decade now, have vigorously debated the effectiveness of written CF

in facilitating L2 development (Ferris, 1999, 2004; Truscott, 1996, 1999, 2004, 2007). The

debate was famously triggered by Truscott’s (1996) review article, “The Case against Grammar

Correction in L2 Writing Classes,” in which he made the strong claim that grammar correction is

ineffective and potentially harmful and should therefore be abandoned. Responding to Truscott,

Ferris (1999) argued that when correction is “selective, prioritized, and clear” (p. 4), it can be

beneficial for some student writers. She also argued that research evidence up to the late 1990s

had been too limited to support Truscott’s strong position. Furthermore, many of those earlier

studies suffered such design flaws as the lack of a control group or generalizability, so she

contended that the results obtained from them could not serve as a basis for abolishing the long-

valued tradition of error correction. In 2004, Ferris argued again in a review article that the

existing research findings were still inconclusive in determining the efficacy of written CF and

called for more robust investigation into the effect of error correction, which ideally should

address the design flaws found in many previous studies. Specifically, she called for replicable

longitudinal studies, carefully designed with a control group that does not receive feedback.

Following this call, there has emerged a good number of experimental and quasi-experimental

studies (e.g., Bitchener, 2008; Ellis et al., 2008; Sheen, 2007; Truscott & Hsu, 2008), which

made an effort to address the shortcomings of prior research by including a control group and

examining the effect of written CF on new pieces of writing. Apart from these tightly controlled

studies, descriptive research has also been carried out (e.g., Evans et al., 2010; Vyatkina, 2011).

In this paper I review 15 empirical studies on written CF conducted since Truscott’s

(1996) call for the abandonment of grammar correction. In the sections below, I first summarize

the arguments presented in Truscott’s (1996) review and then provide the definitions for several

key terms related to the studies on written CF. Following that, I review the empirical studies and

discuss their findings and implications. I conclude this paper by making suggestions for

pedagogy and future studies.

SUMMARY OF TRUSCOTT (1996)

In “The Case against Grammar Correction in L2 Writing Classes,” Truscott (1996)

advanced the position that grammar correction is ineffective, counterproductive, and should have

no place in writing courses. By grammar correction, he meant “correction of grammatical errors

for the purpose of improving a student’s ability to write accurately” (p. 329). Citing research

evidence from first language (L1) writing studies (e.g., Knoblauch & Brannon, 1981) and L2

writing studies (e.g., Kepner, 1991; Semke, 1984; Sheppard, 1992), and arguing on both

theoretical and practical grounds, he repeatedly emphasized that grammar correction does not

work. His major arguments are summarized as follows:

1. Second language development involves complex learning processes rather than the

simple process of passing information from teacher to student. Error correction is a form of

information transfer. To harbor favorable intuitions about error correction is thus to subscribe to

the simplistic “information-transfer” view of learning, which expects, unrealistically, that

students will take in everything the teacher imparts to them.

Retrievable at www.tc.columbia.edu/tesolalwebjournal

71

2. Related to the notion of complex learning processes, research has shown that in

acquiring certain grammatical features, L2 learners follow a natural pre-determined acquisition

order, which is unaffected by instruction. It follows then that error correction, which functions as

a form of instructional intervention and which often fails to match students’ current stage of

interlanguage development, is not likely to have much value.

3. Not all learning is equal. Learning which does not affect learners’ underlying,

developing system and which only results in a superficial, transient form of knowledge, is

“pseudo-learning.” Grammar correction only produces pseudo-learning because it is “typically

done in terms of isolated points and without reference either to the processes by which the

linguistic system develops or to the learner’s current developmental stage” (p. 347).

4. From a practical perspective, grammar correction is unlikely to be successful as it

requires that many other conditions be met. For one, the teacher must have the ability to

recognize errors and be able to explain the problems well. But grammar is often too complex to

be adequately explained by any single linguistic theory or teacher, and even if the teacher is

capable enough to recognize and explain all errors, he may still lack the time and patience

required for high-quality correction. On the students’ part, they may fail to understand the

teacher’s corrections or only achieve a surface understanding of the teacher’s comments without

grasping the general principles involved. In the latter case, they are likely to forget the new

knowledge quickly and repeat the errors later. Students may also lack the motivation to process

the teacher’s feedback thoughtfully.

5. The inherent unpleasantness of correction triggers much anxiety in students and

discourages them from experimenting with the language. In order to avoid errors and corrections,

students tend to shorten and simplify their writing, which hurts the development of their

linguistic complexity.

6. The time spent on providing and processing low-level grammar feedback diverts the

attention of both teacher and student from other high-level aspects of writing such as

organization and content. When effort directed at grammar correction diverts class resources

from other important activities like additional writing practice, the correction harms students’

interlanguage development.

DEFINITION OF TERMS

Direct vs. Indirect Written CF

Direct written CF involves the provision of the correct form on students’ work, including

such treatments as crossing out an unnecessary element, inserting a missing element, and writing

the correct form above or near the error (Ellis, 2009). Indirect written CF only requires singling

out the errors without actually providing the corrections. It can be done by underlining or circling

errors, inserting a caret (‸) sign to indicate a missing word, or placing a question mark alongside a confusing phrase (Hendrickson, 1980). It can also include the use of error codes.

Metalinguistic CF

Metalinguistic CF involves providing explicit explanations of the nature of the student’s

errors (Ellis, 2009). The explanations can be directly written down on the student’s paper or

given through face-to-face conferencing.

Retrievable at www.tc.columbia.edu/tesolalwebjournal

72

Focused vs. Unfocused Written CF

The teacher can choose to treat students’ errors selectively (focused approach) or

comprehensively (unfocused approach). The focused approach targets only one or a limited

range of error types (e.g., the use of English articles), whereas the unfocused approach involves

correcting all errors in the student’s text (Van Beuningen, 2010).

REVIEW OF STUDIES

At the start of the 2000s, two studies investigating the effect of written CF on revised

texts have provided support for the short-term effectiveness of CF (Ashwell, 2000; Ferris &

Roberts, 2001). Ashwell (2000) compared the effect of different feedback patterns on the writing

by 50 Japanese college students in the context of a process-oriented writing instruction. He

wanted to find out if the content-then-form feedback sequence as suggested by Zamel (1985) was

superior to other sequences in improving students’ writing. The students were assigned to one of

four different groups: the content-then-form feedback group, the form-then-content feedback

group, the form-and-content feedback group, and the non-feedback group. Form feedback was

provided by underlining or circling linguistic and mechanical errors, and content feedback

addressed discourse-level issues such as organization, paragraphing, and cohesion. The students

produced three drafts for a writing assignment. The first group received content feedback on

their first draft, then form feedback on the second draft; the second group received feedback in

the reverse order; the third group received mixed feedback on both drafts. The results showed

that the recommended feedback sequence was not superior to the other two feedback patterns,

and the three groups that received feedback outperformed the non-feedback group. When

revising their texts, the students acted on three quarters of the form feedback, which helped them

to improve the linguistic accuracy of their writing.

Ferris and Roberts’s (2001) study, also involving the provision of indirect CF (codes and

underlining) and text revision, likewise showed that ESL college students who received CF

produced significantly better revised texts than those who did not receive feedback. This study

consisted of a four-stage cycle (pre-test, writing, feedback, editing) and targeted five error

categories. Prior to the writing, a “Grammar Knowledge Questionnaire” and a “Grammar

Knowledge Pretest” were administered to ensure that the students’ knowledge of the five error

categories was comparable. Students were assigned to one of three groups: Group A had their

errors underlined and coded by error type; Group B only had their errors underlined; and Group

C did not receive any feedback. Two weeks after the original writing, the students self-edited

their texts in class based on the papers — corrected or uncorrected — that they received. Apart

from providing support for the value of CF for text revision, this study also showed that there

were no significant differences in revision performance between the two feedback groups.

Notwithstanding the short-term effectiveness of written CF demonstrated by the above

revision studies and an earlier study by Fathman and Whalley (1990), Truscott (2007) contended

that because these studies did not produce evidence of learning, which in his term refers to

improved accuracy on new pieces of writing, the investigation was of limited value.

In the same year as Ferris and Roberts’s study, Fazio (2001) conducted an experiment

investigating the effect of different feedback on the writing accuracy of 112 fifth-graders. These

fifth-graders were L1 and L2 learners of French at a French-language school in Montreal,

Retrievable at www.tc.columbia.edu/tesolalwebjournal

73

randomly assigned to one of three feedback conditions — form-focused, content-based, and the

combination of form and content feedback. For this study, the students produced one piece of

journal writing each week over the course of four months, with complete freedom to topic

selection and length of entry. For each journal entry, the form-focused group received direct CF

on two targeted grammatical features; the content group received a content-related comment; and

those in the combination group received both types of feedback. Accuracy of writing was

measured as the total number of targeted errors committed divided by the total number of uses of

the targeted structures. The findings showed that neither the form-focused nor the content-based

feedback exerted much positive influence on the L1 and L2 students’ grammatical accuracy in

the targeted features.

Chandler (2003) reported positive findings in two studies on ESL music students, who

wrote five autobiographical essays over one semester for a writing course. In the first study,

students were assigned to an experimental group (n = 15) and a control group (n = 16), with the

former required to correct the errors marked by the teacher and the latter not. The experimental

group’s papers were treated in two steps: First, the teacher underlined all the errors in the

students’ first draft. Then, after students rewrote the paper based on the initial feedback, the

teacher provided direct correction on their revised text. The control group, who were not required

to rewrite the first draft, only received the initial CF from the teacher. Students’ errors were

classified into 23 different categories, including but not limited to word form, word choice, verb

tense, and word order. Error rate was calculated as number of errors per 100 words, and writing

accuracy was measured by calculating error rate on the first and fifth papers. The results revealed

that the experimental group showed significant improvement in grammatical and lexical

accuracy on the fifth paper, but no significant difference was found between the control group’s

first and fifth papers. Between the experimental and the control groups, there was also significant

difference in their improvement in accuracy over the 10 weeks. Furthermore, both groups

showed marked increase in writing fluency, measured by the time spent on writing each piece.

Chandler’s (2003) second study investigated the differential efficacy of four types of

feedback methods on students’ writing accuracy. It also examined change in students’ writing

fluency over one semester and looked into students’ attitudes to the different feedback methods

through surveys. Participants were 20 students enrolled in the same ESL writing course in a

different year, who were also required to produce five autobiographical essays. For each of the

first four essays they wrote, the students had to revise based on the teacher’s feedback, which

was provided in four ways: direct correction, underlining with description, description of error

type only, and underlining. All students received these four types of feedback but in varying

order. Error rate was calculated in the same way as for the first study, and improvement in

writing accuracy was assessed by looking at the change in error rate in the revised and

subsequent new papers. The overall writing quality of the first draft of the first paper and that of

the fifth paper was rated holistically, on a scale of 1-6, by two trained raters. In addition, writing

fluency was measured by using students’ self-reports of the time spent on writing each paper.

The study yielded five main findings: (1) that students’ writing accuracy and fluency both

improved significantly over the semester; (2) that concerning the overall writing quality, there

was only a small difference between the ratings for the first paper and the last; (3) that with

respect to accuracy in revision, students produced fewest errors with the direct correction method,

second fewest errors with underlining plus description, and most errors with description alone; (4)

that in terms of accuracy in subsequent writing, the direct correction and underlining methods led

Retrievable at www.tc.columbia.edu/tesolalwebjournal

74

to more accurate writing on the next assignment; and (5) that most students considered direct

correction to be the easiest and the most preferred feedback method.

Although Chandler’s (2003) two studies suggest that the provision of CF, when

combined with revision requirement, helps learners improve their writing accuracy over time,

some researchers (Guenette, 2007; Truscott, 2004, 2007) argued that due to the absence of a non-

feedback control group in both studies, their findings failed to provide solid evidence about the

superiority of CF. To answer what Truscott (2004) called the “Big Questions” — whether

written CF facilitates the development of grammatical accuracy and whether teachers should

provide written CF — Ferris (2004) stressed that studies with more robust design were needed.

As mentioned previously, Ferris’s (2004) call for more robust investigation into the role

of written CF has given rise to a growing body of new studies, the most noteworthy strand of

which investigated the effect of focused written CF on learners’ development of grammatical

accuracy (Bitchener, 2008; Bitchener & Knoch, 2008; Bitchener & Knoch, 2009; Bitchener &

Knoch, 2010; Bitchener et al., 2005; Ellis et al., 2008; Sheen, 2007; Sheen et al., 2009). This

body of research shares several methodological features. First, all employed a quasi-

experimental design with a non-feedback control group for comparison. Second, most targeted

only the use of two functions of the English article system: the indefinite article a for the first

mention and the definite article the for subsequent mention. These studies also went beyond

examining short-term improvement in revisions and sought to measure accuracy gain on new

pieces of writing. The research questions that they explored generally fall into two categories:

whether written CF improves accuracy in the use of the targeted features on new pieces of

writing, and whether there is a difference in the effect of the different feedback strategies. Below

I describe each of them in more detail.

Bitchener et al.’s (2005) 12-week study examined the effects of written CF and face-to-

face conference on the development of three English grammatical features: prepositions, simple

past tense, and articles. Participants were 53 ESL immigrant learners, who completed four

writing tasks, each in the form of an informal letter, and were divided into three treatment groups.

Group 1 received direct CF and a 5-minute student-researcher conference; Group 2 received

direct CF only; and Group 3 received no CF. Accuracy was measured as the percentage of

correct suppliance of each targeted linguistic feature. The study found that Group 1 outperformed

Group 2 and the control group. The combination of direct CF and individual conferences helped

learners improve their use of the simple past tense and the definite article, but such effect was not

found for the use of prepositions. The researchers attributed the positive results with simple past

tense and definite article to the fact that they are more rule-based, and the negative results with

prepositions to their idiosyncratic nature. Also, the meta-linguistic explanations students received

during the one-on-one meeting helped them notice the gap between their incorrect usage and the

correct form, and thereby facilitated their understanding and acquisition of the rule-based

features.

Following Bitchener et al. (2005), Bitchener (2008) and Bitchener and Knoch (2008,

2009, 2010) conducted a series of narrowly focused studies, all targeting the use of two functions

of the English article system as described above. In Bitchener (2008), 75 intermediate-level ESL

students participated in an experiment involving a pre-test, an immediate post-test, and a delayed

post-test. The delayed post-test was administered two months after the immediate post-test. For

each of these tests, the students completed a 30-minute in-class writing task describing what was

happening in a given picture, and they were assigned to four different conditions: direct CF plus

written and oral meta-linguistic explanation; direct CF and written meta-linguistic explanation;

Retrievable at www.tc.columbia.edu/tesolalwebjournal

75

direct CF only; and no feedback. Oral meta-linguistic feedback was delivered to Group 1 via a

30-minute mini-lesson. Obligatory contexts for the targeted article uses were identified, and

accuracy was calculated as percentage of correct uses in obligatory contexts. The results

demonstrated that all the treatment groups receiving CF outperformed the non-feedback control

group on both the immediate and delayed post-tests, suggesting that written CF is not only

effective in the short-term, but the level of effectiveness can be sustained for a longer period. The

study also showed that of the three feedback methods, the combination of direct CF plus written

and oral metalinguistic explanation produced the best results in helping students improve the

accuracy of their use of articles. Interestingly, though, Group 2 who received direct CF plus

written metalinguistic explanation performed worse than Group 3 who only received direct CF.

Extending Bitchener’s (2008) study with the same design, Bitchener and Knoch (2008)

carried out a two-month experiment with 114 low-intermediate ESL learners and obtained

slightly different results. While the study also showed that students who received written CF

performed better than those who did not, and that the level of accuracy was sustained for seven

weeks, nonetheless no significant difference was found in the efficacy of the different feedback

methods.

In Bitchener and Knoch’s (2009) ten-month investigation on the effect of written CF, 52

low-intermediate ESL students were assigned to the same three treatment groups and a non-

feedback control group. The major difference between this study and the previous two lies in its

duration. After the immediate post-test, the students were evaluated on three delayed post-tests at

three different points of time. The study found (1) that all feedback groups outperformed the

control group; (2) that there was no difference in effectiveness among the three feedback options;

and (3) that the accuracy-gain was durable, retained over a 10-month period.

Still another study by Bitchener and Knoch (2010) applied similar methodology to

compare the effects of three feedback strategies on 63 advanced ESL learners’ use of the two

articles. The three treatment conditions were: written meta-linguistic explanation, indirect

circling of errors, and written meta-linguistic explanation plus oral form-focused instruction. The

study showed that all experimental groups outperformed the control group in the immediate

post-test, and only the two meta-linguistic groups retained the accuracy-gains across the 10-week

period.

In a similar vein, Sheen (2007) examined the differential effect of two types of written

CF — direct CF and direct CF plus meta-linguistic comments — on the acquisition of English

articles by 111 intermediate ESL students. Through a correlational analysis, she also investigated

how language aptitude affected the results. The students took a language analytic ability test two

weeks prior to the experiment, which was then followed by the pre-test, two CF treatment

sessions, the immediate post-test, and the delayed post-test. During each of the two treatment

sessions, the experimental groups wrote a narrative based on a story they read. Then the narrative

was corrected by the researcher, who focused mainly on article errors, and returned to the

students 2-4 days later. The control group received no CF treatment. To measure acquisition,

three kinds of tests (a speeded dictation test, a narrative writing test, and an error correction test)

were given to all three groups during the pre-test, immediate post-test, and delayed post-test.

This two-month study reached three conclusions. First, the CF groups outperformed the control

group on both post-tests. Second, there was no difference between the two feedback methods in

the immediate post-test. Third, in the delayed post-test, there is an advantage for the combination

of direct CF and written metalinguistic explanation over direct CF alone. This third conclusion

supports Bitchener et al.’s (2005) findings that when meta-linguistic explanation accompanies

Retrievable at www.tc.columbia.edu/tesolalwebjournal

76

direct CF, learners are more likely to achieve enduring improvement. The study also showed that

learners with stronger language aptitude benefitted more from the CF.

Building on Sheen (2007), Sheen et al. (2009) examined the differential effects of three

treatments – direct focused CF, direct unfocused CF, and writing practice alone – on the use of

the English articles by 80 ESL learners. Students in the focused CF group only had their article

errors corrected, whereas the unfocused CF group had their errors in five categories (articles,

copula be, regular past tense, irregular past tense, and preposition) corrected. Like those in the

Sheen (2007) study, these two CF groups followed the same procedure to complete two written

narrative tasks and receive CF during the treatment sessions. The writing-practice group only did

the narrative tasks without receiving CF; the control group received no CF. All four groups

completed the pre-test, immediate post-test, and delayed post-test, and acquisition of the articles

was measured by three versions of a narrative writing test, which asked the students to write a

story based on a series of pictures. The results showed that all three treatment groups

outperformed the control group. For the acquisition of the articles, the focused group showed an

advantage over the unfocused group in both the immediate and the delayed post-tests, suggesting

that focused CF is more effective than unfocused CF. What is surprising, though, is that in terms

of the overall accuracy in the five targeted features, the focused group, who did not receive

corrections on features beyond the articles, also outperformed the unfocused and the control

groups. The unfocused group, who in fact received CF for the targeted features, did not

demonstrate significantly greater accuracy than the control group. Sheen attributed this to the

possibility that when CF addresses a range of errors, learners might be less able to process the

feedback effectively. Another reason might have to do with the manner in which the CF was

provided — the feedback which the focused group received was systematic, but that which the

unfocused group received was much less so. Although the writing-practice group also

outperformed the control group in both the article uses and in the other grammatical features,

suggesting that accuracy can improve through the kind of writing practice used in this

experiment, the effectiveness of focused CF was still greater than that achieved by writing

practice alone.

Adopting several methodological features from Sheen (2007), Ellis et al. (2008) also

compared the effects of focused and unfocused written CF on Japanese EFL learners’ use of the

two English articles. On three written narratives, students in the focused group received

corrections of article errors, the unfocused group received corrections of all errors, and the

control group received no CF. Accuracy-gain on the article uses was measured by narrative

writing tests and error correction tests. The study found that both CF groups performed

significantly better than the control group on the delayed post-test. The researchers contended

that contrary to Trucscott’s claim (1996, 1999), written CF can indeed facilitate acquisition. But

different from the findings revealed in Sheen et al. (2009), the results indicated that there was no

statistically significant difference between the effects of focused and unfocused CF options.

The next three studies represent a refreshing break from the above “article craze,” in that

they all employed a true experimental design and treated errors comprehensively.

Truscott and Hsu (2008) tested the effectiveness of unfocused CF on students’ revision

and new writing. Participants were 47 university students in Taiwan, who were randomly

assigned to either the experimental group that received CF from the teacher, or the control group

that did not receive any CF. Both groups were given 30 minutes to write a narrative essay in

class during the 12 th

, 13 th

, and 14 th

weeks of the semester. In week 12, all students wrote a

narrative story based on a sequence of eight pictures. One week later, they received the first draft

Retrievable at www.tc.columbia.edu/tesolalwebjournal

77

back from the teacher and revised it. The experimental group rewrote the story based on the

teacher’s feedback, delivered via underlining of errors without codes. The control group simply

rewrote the draft without the benefit of CF. In week 14, the two groups wrote another narrative

story based on a new sequence of eight pictures. In this study, the treatment of errors focused on

grammatical and spelling errors only. Statistical analysis revealed that for the rewrites, there was

a significant difference between the two groups in error reduction. However, for the new writing

task, the error rate between the two groups was virtually identical, suggesting that CF did not

exert a strong influence on students’ writing development. On the one hand, these findings

corroborated previous research that CF does have a positive effect on reducing errors in revision.

On the other hand, they failed to show that success with the revision task can translate into

improved accuracy on a new writing task. Truscott and Hsu thus concluded that successful error

reduction on the revision cannot be taken as evidence of learning.

Van Beuningen et al.’s (2008) study, however, provided evidence to support the

beneficial role of unfocused CF. The study was carried out on 62 secondary-school L1 and L2

learners of Dutch, who were randomly assigned to one of four groups: direct unfocused CF,

indirect unfocused CF, practicing writing (practice), and self-correction. All four groups

participated in a three-session experiment and completed two writing tasks for Session 1 and

another two tasks for Session 3. The two writing tasks for each session were of different topics,

both requiring that the students write an email explaining the topics to a classmate. Each topic

was accompanied by a series of pictures, and students were allowed 20 minutes to perform each

task. The purpose of using two task topics was to control for topic influences and increase

generalizability. Students in the CF groups received either direct corrections or coded CF on

their writing. The Practice group and the self-correction group received no CF. During Session 2,

the CF groups revised their texts based on the corrections they received; the Practice group did

not revise their uncorrected texts but instead, completed two additional writing tasks; and the

Self-correction group was required to revise their texts without the benefit of CF. The results

revealed that the two feedback groups significantly outperformed the self-correction group on the

revised texts. With regard to performance on the new tasks given at Session 3, only direct CF

resulted in improved accuracy. Of the four groups, the practice group performed worst,

displaying no improvement from Session 1 to Session 3. This is different from the results

obtained by Sheen et al. (2009), which showed that writing practice alone was more helpful than

receiving no feedback. It also challenged Truscott’s (1996) suggestion that having students do

additional writing practice might be more worthwhile than giving them CF.

Following up on their 2008 study, Van Beuningen et al. (2012) investigated the effect of

direct and indirect unfocused CF on 268 L1 and L2 Dutch learners. This study is unmatched by

previous experiments in that it not only dealt with a language other than English, but also

involved a large number of secondary-school participants representing two educational levels. It

also explored eight research questions, some of which had not been addressed or only touched

upon briefly before. For instance, the study sought to test Truscott’s (2007) hypotheses that CF

might only have value for non-grammatical errors and that CF compels students to simplify their

writing. Like in the previous study (Van Beuningen et al., 2008), the students were randomly

assigned to one of four conditions: direct CF, indirect CF (via error codes), Self-correction, and

Practice. The experiment consisted of four sessions: a pre-test session (Session1), a

treatment/control session (Session 2), a post-test session (Session 3), and a delayed post-test

session (Session 4). At Session 1, all four groups completed the first writing task. One week later

at Session 2, the two CF groups revised their first written texts based on the CF they received;

Retrievable at www.tc.columbia.edu/tesolalwebjournal

78

the Control group self-corrected their original texts without the help of CF; and the Practice

group completed a new writing task. During Session 3 (week 3) and Session 4 (week 6), all four

groups produced a new text based on a new topic. All writing topics were related to biology (e.g.,

honeybees, ladybugs), and each writing task required that the students wrote an email to a

classmate explaining the topic.

The results showed that unfocused CF led to improved accuracy in both the revised texts

and the new texts. The positive effect of unfocused CF observed at the revision stage was

retained four weeks later. In terms of the differential efficacy of direct and indirect CF, the study

found that only direct CF facilitated “durable grammatical accuracy improvements of a medium

size” (p. 32), and that indirect CF had a greater effect on non-grammatical errors. The findings

also rebutted Truscott’s (1996) hypotheses that CF harms the complexity of students’ writing and

that additional writing practice may be more beneficial than the provision of CF. Lastly, the

study found no significant interaction between feedback effectiveness and the participants’

educational level. In short, this study demonstrated that comprehensive treatment of errors can

help students improve their grammatical accuracy over time, and the findings do not support

Truscott’s claim that grammar correction may bring about harmful effects.

DISCUSSION

Effects of Written CF on Text Revision

The findings of two revision studies (Ashwell, 2000; Ferris & Roberts, 2001), together

with Truscott and Hsu (2008) and the two studies by Van Beuningen et al. (2008, 2012), have

provided clear evidence that written CF can serve as an effective editing tool — students who

received both direct and indirect CF outperformed those who received no CF on the revision task.

By now, a consensus regarding the short-term effectiveness of CF is apparently established.

Effects of Written CF on New Texts

But what is of greater concern is whether the short-term effect can be transferred to new

tasks and help students acquire grammar over time. Most of the studies reviewed above have

striven to address this issue. Even though a unanimous answer to the question has yet been

reached, there is growing evidence that written CF can help learners improve their linguistic

accuracy for the longer term. Of the 11 recent quasi-experimental and experimental studies, ten

have shown that both selective and comprehensive correction of errors can result in learners’

improved accuracy in new texts, and the effect could be as durable as up to ten months. Most

notably, with its robust research design (i.e. true experimental design using a large sample), Van

Beuningen et al.’s (2012) study provided compelling evidence of the effectiveness of written CF.

Nevertheless, problems still exist in some of these studies, which I will turn to shortly.

Focused Written CF vs. Unfocused Written CF

If written CF can be useful for the acquisition of certain linguistic features for the long

term, it is worth asking then whether focused written CF (selective correction) is more effective

than unfocused feedback (comprehensive correction) and whether direct written CF is more

Retrievable at www.tc.columbia.edu/tesolalwebjournal

79

effective than indirect feedback. For the focused-unfocused dichotomy, the limited findings so

far do not provide a clear picture. Ellis et al. (2008) and Sheen et al. (2009) both compared the

relative effectiveness of focused and unfocused CF, but the former found the two approaches to

be equally effective, while the latter showed that the focused approach yielded better results. To

complicate the picture a little, Van Beuningen et al.’s (2008, 2012) studies showed that

unfocused CF helped their Dutch learners improve written accuracy both for the short- and the

longer-term.

Direct Written CF vs. Indirect Written CF

With respect to the relative effectiveness of direct and indirect CF, four studies that

compare the two approaches (Bitchener & Knoch, 2010; Chandler, 2003; Van Beuningen et al.,

2008, 2012) demonstrated that direct CF led to greater accuracy gain than indirect CF. Bitchener

and Knoch’s (2010) and Van Beuningen et al.’s (2008, 2012) particularly showed that while both

direct and indirect CF were effective for the short-term, only direct CF yielded a more significant

long-term effect. From a theoretical point of view, some researchers (e.g., Ferris, 2002; Lalande,

1982) had suggested that indirect CF has the greatest potential to facilitate learning because it

engages learners in deeper cognitive processing and “promotes the type of reflection on existing

knowledge or partially internalized knowledge that is more likely to foster long-term acquisition

and written accuracy” (Bitchener & Ferris, 2011, p. 65). Chandler (2003), however, argued that

the benefit created by indirect CF may be cancelled by delayed access to the correct form. Direct

CF, on the other hand, allows learners prompt access to the target form, enabling them to

confirm or abandon their hypotheses about the language soon after they write, and thereby helps

them to internalize the corrections better. Other benefits of direct CF may include reducing

confusion resulted from ambiguous indirect CF and providing learners with information to solve

complex errors involving syntactic structure or idiomatic expressions (Bitchener & Ferris, 2011).

Amenability to Written CF

Not all errors are equally treatable — different types may respond to correction

differently (Ferris, 1999; Hyland & Hyland, 2006). According to Ferris (1999), “treatable errors”

are those that occur in “a patterned, rule-governed way” (p. 6) such as verb-form errors and the

aforementioned article use, whereas untreatable errors (e.g., lexical errors, sentence construction)

are more idiosyncratic in nature. Truscott (2001) also suggested that some error types might be

more correctable than others. To him, the more appropriate targets for selective correction are

discrete items involving simple principles (e.g., lexical items, spelling, punctuation), and the

uncorrectable errors are those “stemming from problems in a complex system, particularly the

syntactic system” (p. 93). In his view, grammar errors are generally poor candidates for CF, but

non-grammatical errors can fare well with correction. On this point, Van Beuningen et al.’s

(2012) study demonstrated that both complex grammatical problems (e.g., word order) and non-

grammatical errors (e.g., spelling) are susceptible to CF. Regarding the correctability of lexical

errors, it is interesting to see how Truscott’s view contradicts Ferris’s. In my opinion, lexical

items may just be as complex as articles. Some lexical items are indeed simple and easily

correctable, but others that involve multiple meanings or complicated selectional restrictions may

not respond to CF well. Future research could investigate whether lexical errors would be good

targets for correction since the extant research has not shed much light on that.

Retrievable at www.tc.columbia.edu/tesolalwebjournal

80

Limitations of the Current Research

Methodological flaws are apparent in some of the studies. In Bitchener et al.’s (2005)

experiment, the participants, who were divided into three feedback groups, received different

amount of instruction from their respective classes — 20 hours a week for Group 1, ten hours a

week for Group 2, and four hours a week for Group 3. Though the authors claimed that all three

groups received the same amount of grammar instruction and attention to writing, such dramatic

differences in instructional exposure would most likely have confounded the results. It is

possible that the positive effects that Group 1 obtained did not actually result from the written CF

but rather were attributable to the longer hours of instruction they had received. Given this flaw,

Truscott (2007) argued that the study did not qualify as a controlled experiment. A closer

examination of the three groups’ performance on the writing tasks also led him to question the

validity of the study’s findings.

As noted earlier, the majority of the quasi-experimental studies post 2005 only targeted

one single aspect of the English article system. With improved research design, this body of

work has consistently shown that written CF can facilitate the acquisition of the targeted feature.

Despite the optimistic findings, however, these studies have come under criticisms chiefly

because of their extremely narrow focus. Bruton (2009b) and Storch (2010) both criticized these

studies for lacking ecological validity, having little pedagogical relevance to real classrooms.

Bruton (2009b) commented that with the single-feature focus, these studies give the impression

that the writing tasks more resembles “the third stage in a Present-Practice-Produce (PPP)

sequence” (p. 608) than the different stages of a communicative writing process. Similarly,

Storch (2010) questioned the generalizability of these studies’ findings, arguing that the

provision of CF on only one grammatical feature in controlled environments will not be of much

value to language teachers. Sympathetic to these legitimate concerns, Ferris (2010) also noted

that

L2 writing researchers and practitioners might wonder if, in the interest of empirical

rigor, some of the SLA research efforts on written CF have been so narrowly focused

that it would be difficult to transfer their approach and findings to a real writing

classroom or to a diverse group of students. (p. 186)

Another problem with this body of research, which relates more to grammar instruction

than methodology, has to do with how the English article system was presented to the students.

As is well known, the article system is highly complex. Certainly there is more to the articles

than what was presented in these studies. By limiting the article usage to the simple contrast of

“first mention” and “anaphoric second mention,” students might be misled into thinking that that

was the whole picture of the article system, and might even feel confused when they encounter

other types of article uses that are beyond the simple scope of “first mention vs. second mention”.

As an avid critic of empirical research on written CF, Truscott himself has only

conducted one study on the subject with Hsu in 2008, and this study did not escape scrutiny,

either. Truscott and Hsu concluded that learning did not occur as a result of the provision of

indirect written CF. On the basis of the surface evidence, this conclusion seemed justified. It is

true that compared to the control group, the experimental group showed significant improvement

on their revision texts after receiving correction, but on the subsequent new writing task, they

Retrievable at www.tc.columbia.edu/tesolalwebjournal

81

produced virtually the same amount of errors as the control group. However, upon careful

examination of the data provided by Truscott and Hsu, Bruton (2009a) pointed out two problems

that called into question the author’s claims. First, the experimental group’s lack of improvement

from the pre-test to the post-test might have been due to a “ceiling effect,” which refers to the

fact that the experimental students did not make that many errors on their first writing piece to

begin with. These students were thus left with little room to make significant improvement on

the subsequent text. But more crucially, the way accuracy gain was measured has seriously

undermined the validity of the study’s findings.

In this experiment, each piece of the students’ writing was assigned an error rate,

calculated as the total number of errors divided by the total number of words produced, and the

mean error rates for both the experimental and the control groups were also calculated. The

authors then compared the global error rates of the two groups without taking into account what

specific errors were made on the initial text and the new text. This approach is problematic

because to truly capture the changes in the student’s writing, one would need to examine the

specific details in each of the texts. To accurately gauge the effect of written CF on error

reduction, one would also need to consider if the student’s original errors in the initial text, for

which he receives CF, actually reappear or disappear in the new one. If what shows up in the

new writing are mostly new errors which have never before been treated, the high error rate on

the new text cannot be taken as evidence of non-learning or ineffectiveness of correction.

Such is precisely the case with a particular participant, whose three sample texts (initial

text, revision, second text) were included in the study, and on which Bruton (2009a) conducted a

detailed qualitative analysis. It was found that the student’s most frequent error in the initial text

was the past tense of to be, which was actually successfully reduced in the subsequent new text

by 75%. As for other error types such as lexical items and verb prepositions, none of those

occurred in the new text bore any relation to those in the original one. These valuable details and

insights were lost in the global error rate comparison that Truscott and Hsu (2008) had opted for.

If not for Bruton’s (2009a) meticulous analysis of the “sequential details that reveals the

underlying inconsistency in the design and the argument” (p. 139), the reader would have been

left with only one interpretation of the results as presented by the authors and perhaps would

have even gone on to accept Truscott’s view that error correction is indeed unhelpful and should

thus be avoided.

CONCLUSION Since Truscott (1996) called for the abandonment of grammar correction in L2 writing,

the past 16 years has seen a proliferation of studies on written CF. This paper reviews the

empirical evidence provided by 15 recent studies, with the aim to gain more insights into the

state of affairs regarding the written CF debate. Overall, these studies have shown that error

correction is effective for short-term revision, and increasing evidence also suggests that both

focused and unfocused CF can facilitate the acquisition of either a single or a wider range of

grammatical features. The findings of some studies also challenged Truscott’s hypothesis that

grammar correction may bring about detrimental effects such as reducing the complexity of

students’ writing or that mere writing practice would be more valuable than providing CF.

Despite the promising results, there are still lingering concerns. For one, most of the

studies dealt with English learners in an L2 context, and many of them only focused on the

simple usage of the English article system. It is therefore difficult to generalize the results to

other contexts, learners, or linguistic features. Now is time that the research community puts the

Retrievable at www.tc.columbia.edu/tesolalwebjournal

82

narrow article investigation to rest and turns its attention to other syntactic and lexical features.

Given the scarcity of research on unfocused CF and given the positive findings revealed by Van

Beuningen et al. (2012), future studies should continue to explore the potential of comprehensive

error correction. In addition, to better understand how learners process CF and how the depth of

their processing affects their interlanguage development, more qualitative research examining

learners’ uptake of CF is also needed.

Drawing on the available research evidence and discussions of written CF, I would like

to make the following pedagogical suggestions. First of all, written CF can help learners develop

greater control over the use of grammar, so the teacher should not hesitate about providing it. But

s/he should also know that, however great its potential might be, CF will not cure all of the

learners’ language problems. Its success will be determined by a host of other factors such as

error types, feedback characteristics (e.g., explicitness, clarity, consistency), the learner (e.g.,

language aptitude, proficiency level, motivation, developmental readiness), the teacher (e.g.,

personality, teaching methods), and the learning context. It is also vital that the teacher make

sure that learners attend to the CF by taking such measures as requiring them to revise the

original text. Having learners process feedback is essential because as Schmidt’s Noticing

Hypothesis (1990) suggests, linguistic elements in the input cannot be learned unless they have

been noticed by the learner. Last but not least, it is important for the teacher to remember that

“second language acquisition is slow, gradual, and often arduous, and that corrective feedback is

only one of the many factors that contribute to that process” (Guenette, 2007, p. 52). Sometimes

it may take an extended period of time before one begins to see CF bear fruit. Patience and

persistence are key.

REFERENCES

Ashwell, T. (2000). Patterns of teacher response to student writing in a multi-draft composition

classroom: Is content feedback followed by form feedback the best method? Journal of

Second Language Writing, 9, 227–257.

Bitchener, J. (2008). Evidence in support of written corrective feedback. Journal of Second

language Writing, 17, 102-118.

Bitchener, J. & Ferris, D. (2011). Written Corrective Feedback in Second Language Acquisition

and Writing. New York, NY: Routledge.

Bitchener, J. & Knoch, U. (2008). The value of written corrective feedback for migrant and

international students. Language Teaching Research, 12, 409–431.

Bitchener, J. & Knoch, U. (2009). The contribution of written corrective feedback to language

development: A ten month investigation. Applied Linguistics, 31, 193–214.

Bitchener, J. & Knoch, U. (2010). Raising the linguistic accuracy level of advanced L2 writers

with written corrective feedback. Journal of Second Language Writing, 19, 207–217.

Bitchener, J., Young, S., & Cameron, D. (2005). The effect of different types of corrective

feedback on ESL student writing. Journal of Second Language Writing, 14, 191–205.

Bruton, A. (2009a). Designing research into the effects of grammar correction in L2 writing: Not

so straightforward. Journal of Second Language Writing, 18, 136-140.

Bruton, A. (2009b). Improving accuracy is not the only reason for writing, and even if it were...

System, 37, 600-613.

Retrievable at www.tc.columbia.edu/tesolalwebjournal

83

Chandler, J. (2003). The efficacy of various kinds of error feedback for improvement in the

accuracy and fluency of L2 student writing. Journal of Second Language Writing, 12, 267–

296.

Ellis, R. (2009). A typology of written corrective feedback types. ELT Journal, 63(2), 97-107.

Ellis, R., Sheen, Y., Murakami, M., & Takashima, H. (2008). The effects of focused and

unfocused written corrective feedback in an English as a foreign language context. System,

36, 353-371.

Evans, N., Hartshorn, K., & Tuioti, E. (2010). Written corrective feedback: Practitioners’

Perspectives. International Journal of English Studies, 10, 47-77.

Fathman, A., & Whalley, E. (1990). Teacher response to student writing: Focus on form versus

content. In B. Kroll (Ed.), Second language writing: Research insights for the classroom (pp.

178-190). Cambridge: Cambridge University Press.

Fazio, L. (2001). The effect of corrections and commentaries on the journal writing accuracy of

minority- and majority-language students. Journal of Second Language Writing, 10, 235-249.

Ferris, D.R. (1999). The case for grammar correction in L2 writing classes: A response to

Truscott (1996). Journal of Second Language Writing, 8, 1–10.

Ferris, D. R. (2002). Treatment of error in second language student writing. Ann Arbor, MI:

University of Michigan Press.

Ferris, D. R. (2004). The “grammar correction” debate in L2 writing: Where are we, and where

do we go from here? (and what do we do in the meantime…?) . Journal of Second Language

Writing, 13, 49 – 62.

Ferris, D.R. (2010). Second language writing research and written corrective feedback in SLA.

Studies in Second Language Acquisition, 32, 181–201.

Ferris, D. R., & Roberts, B. (2001). Error feedback in L2 writing classes: How explicit does it

need to be?. Journal of Second Language Writing, 10, 161–184.

Guenette, D. (2007). Is feedback pedagogically correct? Research design issues in studies of

feedback on writing. Journal of Second Language Writing, 16, 40-53.

Hendrickson, J. M. (1980). The treatment of error in written work. Modern Language Journal,

64, 216-221.

Hyland, K., & Hyland, F. (2006). Feedback on second language students’ writing. Language

Teaching, 39, 83-101.

Kepner, C. G. (1991). An experiment in the relationship of types of written feedback to the

development of second-language writing skills. Modern Language Journal, 75, 305-313.

Knoblauch, C. H., & Brannon, L. (1981). Teacher commentary on student writing: The state of

the art. Freshman English News, 10(2), 1-4.

Lalande, J. (1982). Reducing composition error: An experiment. Modern Language Journal, 66,

140-149.

Leki, I. (1991). The preferences of ESL students for error correction in college-level writing

classes. Foreign Language Annals, 24, 203–218.

Li, S. (2010). The effectiveness of corrective feedback in SLA: A meta-analysis. Language

Learning, 60, 309-365.

Lyster, R., & Saito, K. (2010). Oral feedback in classroom SLA: A meta-analysis. Studies in

Second Language Acquisition, 32, 265-302.

Russell, J., & Spada, N. (2006). The effectiveness of corrective feedback for the acquisition of

L2 grammar: A meta-analysis of the research. In J. M. Norris & L. Ortega (Eds.),

Retrievable at www.tc.columbia.edu/tesolalwebjournal

84

Synthesizing research on language learning and teaching (pp. 133-164). Amsterdam:

Benjamins.

Schmmidt, R. W. (1990). The role of consciousness in second language learning. Applied

Linguistics, 11, 128-158.

Semke, H. (1984). Effects of the red pen. Foreign Language Annals, 17, 195-202.

Sheen, Y. (2007). The effect of focused written corrective feedback and language aptitude on

ESL learners’ acquisition of articles. TESOL Quarterly, 41, 255–283.

Sheen, Y., Wright, D., & Moldawa, A. (2009). Differential effects of focused and unfocused

written correction on the accurate use of grammatical forms by adult ESL learners. System,

37, 556–569.

Sheppard, K. (1992). Two feedback types: Do they make a difference? RELC Journal, 23, 103-

110.

Storch, N. (2010). Critical feedback on written corrective feedback research. International

Journal of English Studies, 10, 29-46.

Truscott, J. (1996). The case against grammar correction in L2 writing classes. Language

Learning, 46, 327–369.

Truscott, J. (1999). The case for “The case against grammar correction in L2 writing classes”: A

response to Ferris. Journal of Second Language Writing, 8, 111–122.

Truscott, J. (2001). Selecting errors for selective error correction. Concentric: Studies in English

Literature and Linguistics, 27, 93-108.

Truscott, J. (2004). Evidence and conjecture on the effects of correction: A response to Chandler.

Journal of Second Language Writing, 13, 337-343.

Truscott, J. (2007). The effect of error correction on learners’ ability to write accurately. Journal

of Second Language Writing, 16, 255-272.

Truscott, J., & Hsu, A. (2008). Error correction, revision, and learning. Journal of Second

Language Writing, 17, 292-305.

Van Beuningen, C. (2010). Corrective feedback in L2 writing: Theoretical perspectives,

empirical insights, and future directions. International Journal of English Studies, 10, 1-27.

Van Beuningen, C. G., De Jong, N. H., & Kuiken, F. (2008). The effect of direct and indirect

corrective feedback on L2 learners’ written accuracy. International Journal of Applied

Linguistics, 156, 279-296.

Van Beuningen, C. G., De Jong, N. H., & Kuiken, F. (2012). Evidence on the effectiveness of

comprehensive error correction in second language writing. Language Learning, 62, 1-41.

Vyatkina, N. (2011). Writing instruction and policies for written corrective feedback in the basic

language sequence. L2 Journal, 3, 63-92.

Zamel, V. (1985). Responding to student writing. TESOL Quarterly, 19, 79-101.