Journal#5

profilemarietorres
2011-Shohamy-AssessingMultilingualCompetencies.pdf

Assessing Multilingual Competencies: Adopting Construct Valid Assessment Policies Author(s): ELANA SHOHAMY Source: The Modern Language Journal, Vol. 95, No. 3, Special Issue: Toward a Multilingual Approach in the Study of Multilingualism in School Contexts (Fall 2011), pp. 418-429 Published by: Wiley on behalf of the National Federation of Modern Language Teachers Associations Stable URL: https://www.jstor.org/stable/41262376 Accessed: 26-07-2018 22:03 UTC

REFERENCES Linked references are available on JSTOR for this article: https://www.jstor.org/stable/41262376?seq=1&cid=pdf-reference#references_tab_contents You may need to log in to JSTOR to access the linked references.

JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide

range of content in a trusted digital archive. We use information technology and tools to increase productivity and

facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected].

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

https://about.jstor.org/terms

National Federation of Modern Language Teachers Associations, Wiley are collaborating with JSTOR to digitize, preserve and extend access to The Modern Language Journal

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Highly situated in but also critiquing the GoI; interesting argument about paradigm shift that is killed in conclusion regarding ‘legitimate to assess in lang; also older (2011) newer research has come out since then.

Assessing Multilingual Competencies: Adopting Construct Valid Assessment Policies ELANA SHOHAMY

Tel Aviv University

School of Education Tel Aviv, Israel 69978

Email: [email protected]

All assessment policies and practices are based on monolingual constructs whereby test-takers are expected to demonstrate their language proficiency in one language at a time. Thus, the construct underlying these assessment approaches and/or scales (e.g., the CEFR) is of language as a closed and finite system that does not enable other languages to "smuggle in." This view is in stark contrast to the current understanding of multilingual competencies for which various languages and aspects "bleed" into one another in creative ways as manifested by a growing number of users, especially immigrants, who are born into one language and acquire additional language (s), resulting in multilingual competencies. This is manifested in codeswitching and in the simultaneous use of different language functions (e.g., reading in one and speaking in another in the process of academic functioning) . Yet, this multilingual functioning receives no attention in language testing practices. Further, multilingual users who rarely reach language proficiency in each of the languages that is identical to that of their monolingual counterparts are always being compared to them and thus receive lower scores. Consequently, they are penalized for their multilingual competencies, sending a message that multilingual knowledge is a liability. Given the current policies of cultivating multilingualism in schools and societies as expressed in the articles in this special issue, I critique the current monolingual assessment approaches within a political and social context. I argue that these approaches are rooted in nation-state ideologies that are still attempting to promote national collective agendas of "wishful thinking" and ignore the reality of how languages are being used. This is followed by empirical data pointing to the cost of the continued use of monolingual tests for individual students, especially those who are bilingual, as is the case with immigrants. All of these will lead to initial proposals and examples for the adoption of different types of multilingual testing and assessment policies and practices in various contexts. These approaches, I argue, are more construct valid, as they enable the manifestation of fuller knowledge in integrated ways, thus highlighting the advantages, rather than the problems, that multilingual users possess.

BETWEEN 1999 AND 2003, T. LEVIN, B. Spol- sky, and I worked on a research project for the Israeli Ministry of Education, the aim of which was to compare the academic achievements of immigrant students from the former USSR and Ethiopia with Israeli Hebrew native speakers. The

The Modern Language Journal, 95, iii, (2011) DOI: 10.1111/j.l540-4781.2011.01210.x 0026-7902/11/418-429 $1.50/0 ©201 1 The Modern Language Journal

main aim of the research was to answer the ques- tion of "how long," that is, how many years it took immigrant students to close the gap between them and the Israeli-born students. The tests used for

the comparison were all in Hebrew, the dominant official language of Israel and the language of in- struction in all Israeli Jewish schools. As expected, the results of the study pointed to large and signif- icant gaps between the immigrant students and the native Israelis. It takes about 9-11 years for students from the former USSR to close the gap, whereas the Ethiopian students never close it, not

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Elana Shohamy 419

even in the second generation. Eight years later, I rethink the ethical dimensions of the compara- tive study and my own compliance with research, the aim of which was to compare groups of in- comparable conditions. Carrying out these tests in a language that the immigrants have not yet acquired is likely to yield lower scores, which will lead to wrong conclusions regarding their true levels of academic achievements. Although it is clear that the immigrant students did acquire vast amounts of academic knowledge prior to migra- tion, it is not possible to access that knowledge via Hebrew tests. The lack of Hebrew proficiency masks that picture. For these students who acquired academic knowledge via Russian or Amharic, these types of comparisons are of low validity and fairness, as the students do not have the appropriate channel to demonstrate their aca- demic knowledge. Such results are also likely to have effects on their identity, confidence, and self-concept. Using such deficient/ subtractive re- search designs denies immigrants the fair oppor- tunity to demonstrate their true academic knowl- edge. These types of studies are common world- wide and it is easy for researchers to fall into sub- tractive research designs, believing that pointing out that the obvious gaps would lead to improved policies toward the groups - they rarely do. The expectation to demonstrate academic knowledge via national languages is the dominant model ev- erywhere. The use of a dominant national lan- guage as the means of demonstrating the aca- demic achievements on tests is an example of buying into research designs that fit national lan- guage ideologies of one nation, one language, and hence mask the real trait that is the target of the measurement. Researchers and language testers face dilemmas as to whether they should refuse to participate in such studies or whether new methods of assessment should be invented to

highlight the true academic knowledge in which multilingual students have more than one way to see beyond just temporary accommodations. This experience is the impetus for the current article. It is about ways to address multiple ways of seeing.

The articles included in this special issue of The Modern Language Journal all argue for the expan- sion of the language learning construct beyond monolingual views of language, toward different forms of multilingualism. There are two main ap- proaches to multilingualism. One approach calls for the legitimacy and encouragement of teach- ing and learning of multiple languages within the same space (i.e., classrooms, schools, regions). This approach views each language as a closed and homogenous construct and encourages the

teaching of multiple independent languages in the same context. The data used to support this argument build mostly on the positive transfer among multiple languages. The second approach deconstructs the notion of a language as a finite construct, viewing it as an abstract notion that is used as a means for negotiating and creating meanings. Language, accordingly, is made up of hybrids and fusions and nonnative varieties that continuously cross over in creative ways with un- defined boundaries and open forms of negotia- tions (Canagarajah, 2006). Translanguaging, as it is termed by Garcia, Sylvan, and Witt (this issue), is one such example of moving freely within, be- tween, and among languages. These views also fol- low notions advanced by Makoni and Pennycook (2006) of "disinventing languages" as well as what Li Wei and Martin (2009) refer to as "languages that bleed into one another." Creese and Black-

ledge (2010), for example, examined such mul- tilingual varieties of immigrant students in com- plementary schools in the United Kingdom (see also Li Wei, this issue). There are clearly multi- ple ways of moving within a multilingual contin- uum. The unique aspect of these articles is that, together, they recognize and legitimize the mul- tilingual phenomena as part of language educa- tion.

Although dynamic, diverse, and constructive discussions of multilingual teaching and learn- ing are currently taking place within the language education field, the phenomenon is completely overlooked in the assessment field that continues

to view language as a monolingual, homogenous, and often still native-like construct. There seems

to be a lack of coordination between the two dis-

ciplines of teaching and testing. This is in spite of the fact that language tests need to build on an updated language construct of what it means to know a language as the fundamental step in the creation of tests of high construct validity. Yet, although there is ample and convincing research about multilingual constructs, it is intriguing to examine the reasons that language tests are still not addressing these issues, especially given their powerful effects on learning and their ability to dictate and perpetuate language realities in edu- cational institutions. Thus, overlooking the con- struct of multilingualism is likely to result in lan- guage tests of limited evidence of validity.

Traditionally, the field of language testing is un- derstood to consist of two major components: one focusing on the "what," which refers to the con- structs that need to be assessed (also known as "the trait") and the other pertaining to the "how" (also known as "the method"), which addresses the

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

420 The Modern Language Journal 95 (2011)

specific procedures and strategies used for assess- ing the "what." The trait is known to be defined by the language learning field, which provides defi- nitions of the essential elements of language that language testers can use to design and develop valid language tests. The "how," in contrast, is de- rived mostly from the field of testing and measure- ment that has, over the years, developed a broad body of theories, research, techniques, and prac- tices about testing and assessment. An examina- tion of the developments in the language testing and assessment discipline since the 1960s reveals, in fact, that its theories and practices have always been closely related to definitions of language proficiency. Thus, discrete-point testing viewed language as consisting of lexical and structural items so that the language test of that era pre- sented isolated items in objective testing proce- dures. In the communicative era, tests aimed to

replicate interactions among language users uti- lizing authentic oral and written texts, and in the performance testing era, language users were ex- pected to perform tasks taken from "real life lan- guage activities" (Shohamy, 2009a).

Accordingly, the introduction and wide use of a multilingual construct in much of the literature of language learning in the past decade should have had a more direct carryover to the field of lan- guage testing. Multilingualism should have pro- vided testers with new and broader definitions

of the language that needs to be addressed by testers in the creation of language tests. Such tests are needed, given the multilingual practices, es- pecially within the context of migration and glob- alization. It follows that there is a need to ad-

dress the multiple versions and perspectives of the "what" of languages, along with the approaches developed for the "how." Thus, the special focus on multilingualism today and the diverse con- texts in which language testing and assessment are currently anchored require the incorporation of these theories into the field of language test- ing. Testers need to be socially responsible and accountable to ensure ethicality and profession- alism. In other words, as language testers seek to develop and design methods and procedures for assessment (the "how") and its multiple facets and dimensions, they also need to become mindful of the emerging insights regarding the trait (the "what"). This is related to the power of tests, the responsibility that language testers hold, and their central functions in education, politics, and soci- ety. It is being realized that language testing is not occurring in homogenous, uniform, and isolated contexts but rather in diverse, multilingual, and multicultural societies, a reality that poses new

challenges and questions to testers with regard to what it means to know language (s) in education and society. Thus, the claim made here is that for tests to be construct-valid, they need to be based on a construct that follows current understand-

ings and theories of language. Given the multi- lingual approaches to teaching, learning, and un- derstanding of language, language testing poli- cies, procedures, and tasks need to reflect these approaches. I will now attempt to explain what I see as the roots of this continued monolingual view of language tests within a political and social context. This will be followed by some empirical data pointing to the cost of the continued use of monolingual tests for individual students, espe- cially those who are bilingual, as is the case with immigrants. All of these will lead to an initial pro- posal and recommendation for the adoption of different types of testing policies in various con- texts.

THE SOCIAL AND POLITICAL DIMENSIONS OF TESTS

In the past two decades, a sociopolitical per- spective of tests has been introduced (McNa- mara 8c Roever, 2006; Messick, 1994; Shohamy, 2001, 2006, among others). Accordingly, tests are no longer viewed only in scientific and tech- nical terms. Rather, there is strong attention to the uses of tests and their impact on peo- ple, education, and society. As a result, lan- guage testers are asked to deal with broader issues of the tests they develop and to exam- ine their impact and consequences as well as to understand the motivations for introducing tests. Tests, then, are not only viewed anymore as naïve measurement tools, but also as powerful devices contextualized within broad social, polit- ical, and economic realities. Tests and language go along with the arguments about the politi- cal dimensions of language itself, used mostly by central authorities as an ideological tool for the creation of national and collective identities

(Ricento, 2006; Shohamy, 2006). The use of monolingual tests needs to be contextualized within this political and social reality in which they operate.

Take for example the introduction of language tests for citizenship in a growing number of coun- ties in the past decade. It has been argued that these tests are used as devices to gatekeep immi- grants and asylum seekers and thus to exclude people whom the state is not interested (Ex- tra, Spotti, & Van Avermaat, 2009; Hogan-Brun, Mar-Molinero, & Stevenson, 2009; Shohamy,

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Disagree

Elana Shohamy 421

2009b). Other examples include the introduction of tests in educational systems often motivated by an agenda of perpetuating certain political ideolo- gies. These tests are given in monolingual national languages. It is understood today that language tests are used as disciplinary tools to create and impose language ideologies and policies accord- ing to the agendas and authority of the nation- state. It is the power of tests that enables those in authority to use them as a major tool to perpet- uate such agendas, given that those who are af- fected by these tests comply with the test demands, given the tests' consequences on their personal lives (Shohamy, 2001). Tests, then, serve as me- dia through which messages regarding national- ism are being communicated to test-takers, teach- ers, and educational systems regarding language priorities. Thus, by conducting language tests in a given language, messages are being transmit- ted regarding the priority of dominant languages while marginalizing others, in line with national ideologies and agendas. It is within this widespread practice of the use of tests as political tools by government and cen- tral agencies that the use of monolingual tests needs to be interpreted. The national ideologies of most nation-states worldwide are still driven by the promotion and perpetuation of "one nation, one language" for the sake of a national collective identity, a phenomenon that is even more domi- nant, given the large number of immigrants that move to many European countries and elsewhere. National languages still serve as symbols and de- vices to promote collective national identities. It is within this context that monolingual tests in national languages serve as institutional tools to perpetuate and impose such ideologies. Although there is some promotion of additional languages (i.e., foreign languages, mostly English), they are viewed as added languages and rarely have an equal status in relation to national languages. Al- though there are some content-based programs that will promote English, such as Content Lan- guage Integrated Learning (CLIL), given the role of English as an international language, this is rarely the case with other languages. In other words, the connection of national ideologies with national dominant languages and the continued role of language as a symbol of identity are mani- fested in and channeled to monolingual tests. Thus, although multilingual teaching and learning are currently promoted, encouraged, and practiced, especially in the European context, there are no voices that argue in favor of multi- lingual tests. Yet, as argued earlier, tests need to reflect such language definitions so as to ensure

construct validity. Even in the cases of immigrant students in schools, who are in the midst of the

long process of acquiring the dominant language of the new place they immigrated to, these lan- guages are still being viewed as liabilities and are being overlooked for the sake of learning the pre- ferred powerful and prestigious language of the nation. Rarely do these tests incorporate the lan- guages of "the others" and/or receive any recog- nition. This is evident not only in the fast-growing policies, as mentioned earlier, where immigrants are forced to pass language tests in national lan- guages in condition of residence and citizenship, but also in the international global tests such as the Programme for International Student Assess- ment (PISA) . This test requires all students to per- form exclusively via the national languages of the countries. Finally, it is important to examine the widespread instruments, such as the Common Eu- ropean Framework of Reference for Languages (CEFR) and the American Council on the Teach-

ing of Foreign Languages (ACTFL) guidelines, as the main criteria and yardsticks for judging the quality of the languages produced on tests. These are based exclusively on monolingual homoge- nous constructs that forbid any other languages to "smuggle in" or "bloom."

The conclusion of the aforementioned discus-

sion is that given their power, monolingual tests in national languages serve as tools that perpetu- ate the monolingual de facto policies of the na- tion. These policies, in return, have detrimental effects on learning. Thus, not only are the existing tests not congruent with the various multilingual approaches, they in fact work against these ap- proaches (Cheng, 2009; Davies, 2009; McNamara & Roever, 2006; Menken, 2009; Shohamy, 2001; Spolsky, 2009).

SOME DATA REGARDING THE COST OF MONOLINGUAL TESTS

As is the practice worldwide, immigrant stu- dents are being compared with native speakers on their academic achievement via tests in the

monolingual dominant language. Such is the case with the No Child Left Behind Act (2001) in the United States, in which all students are expected to be tested on their academic proficiency. This includes immigrant students who are in the pro- cess of acquiring the dominant languages of their new American residence and at the same time are

tested in English in the short time after their ar- rival in the United States. These second-language learners, it is argued, are bound to be marginal- ized in comparison to native speakers, given the

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Spain; Cenos; Basque country

422 The Modern Language Journal 95 (201 1)

length of time it takes to acquire a new language. The main point made here is that as long as such national tests will require nonnatives (i.e., immigrant students) to be like natives and to be measured on monolingual tests in the dominant language, these students will be not be able to achieve valid scores. As a result, hierarchies of im-

migrants and natives are being created and per- petuated, and messages regarding marginality are being delivered and reinforced. In other words, these monolingual tests bear costs for multilin- gual test-takers, as will be demonstrated in the empirical evidence presented next. First, monolingual tests overlook various re-

search findings that demonstrate that immigrants continue to employ their first languages (Lis) in various academic literacy situations, a long time after immigration, often even for a lifetime (Thomas 8c Collier, 2002) . The languages that im- migrants possess upon arrival in the countries to which they immigrate do play important roles in providing cognitive and educational advantages. Immigrants bring with them knowledge of the world, varied content, experiences, and interpre- tations. For example, in the domain of speaking, immigrants use mixed codes of LI accents, gram- mar, lexicon, syntax, discourse, and pragmatics. Similarly, in writing, language users continue to use LI syntax and vocabulary so much so that they almost always need to employ editors to improve and standardize their writing so it will be on par or close to that ornative speakers. Clearly, the use of a mixed code in writing and in other areas is still considered a liability. In the domains of lis- tening, contextualized information and schema provided in the LI are often needed in order to process oral information. Finally, in reading, in- formation about history, culture, and politics is needed for comprehension of most texts. Addi- tional sources and resources that rely on multi- modal sources are constantly in use in the form of images, sounds, and other clues. Even when decoding is practiced and words are read well, there is a need for cultural and contextual fa-

miliarity. A study by Haim (2010) demonstrated how significant transfers occur within three lan- guages, from the LI to the second language (L2) to the third language (L3) within Russian, Hebrew, and English (see also the article by Cenoz & Gorter, "Focus on Multilingualism," this issue). Strong predictions were found within the three languages in reading and writing according to years of residence. Clearly, the LI plays a role, as well as the L2 and the L3. All these language performances need to be incorporated into mul- tilingual tests.

In our own research (Levin, Shohamy, & Spol- sky, 2003; Levin 8c Shohamy, 2008), as mentioned in the preface to this article, immigrant students, irrespective of age, time of migration, and length of residence, are being compared with students born in Israel and are measured on the same

yardstick of monolingual Hebrew test. Yet, mul- tilingual users rarely reach language proficiency identical to those born into one language, as they are not on par with the native speakers. The very question as to whether immigrants can ever be compared to native speakers, when the test is conducted in a language that will take them a long time to acquire, requires special attention. There seems to be a need to address the channel

through which these students can best demon- strate the knowledge of content areas they pos- sess. By ignoring it, the very academic compe- tence gained by these students in their previous years of study in their countries of origin, where they obtained most of their academic learning, is overlooked. Yet, this knowledge is certainly part of their academic knowledge, regardless of the channel through which they can demonstrate it. Figures 1 and 2 display results that point to the gaps between immigrants and native speakers. It is shown that it takes immigrants from the former USSR 9-11 years to achieve the same academic level of a native speaker in mathematics and in Hebrew (Levin & Shohamy, 2008) . These findings were identical for all the three grades tested (5th, 9th, and 1 1 th) , whereas immigrants from Ethiopia could never achieve such high levels as the native speakers, not even in the second generation.

Yet, the main point raised here is whether this is even a legitimate question to ask, as it is clear that immigrants cannot perform as native speakers in a language they do not know. Moreover, immigrant students continue to interpret the knowledge via two languages for a long period of time after im- migration. Figure 3 thus portrays the results of a study in which students from the former USSR were tested in a bilingual Hebrew-Russian test. In this test, questions and distracters were presented in two languages versus a control group, which was tested in Hebrew only. The results showed that the students who received the tests in the two lan-

guages significantly outperformed those students who were subject to the monolingual "Hebrew- only" test, an advantage that lasted 12 years after migration.

Immigrant students therefore continue to rely on the language they acquired before immigra- tion as a valuable resource for processing the new academic knowledge. Yet, on the tests that are administered in most educational systems, this

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Abedi; Linquianti; Menken; Gottlieb ALL OF THEM

Discussion of CUP

Time to acquire; points to Israel same as Cummins work

Elana Shohamy 423

FIGURE 1

Grade 9 Hebrew Standard Scores, According to Years of Residence

FIGURE 2

Grade 1 1 Standard Scores, According to Years of Residence

knowledge is denied them. Wright and Xiaoshi (2008) demonstrated, for example, how students who performed well in math in their home coun- tries performed poorly in math in the new coun- try due to the new language. The knowledge was there, but the channel of expressing it was missing. Figure 4 demonstrates the notion that immigrants view academic knowledge via two perspectives - their own home language and the new language (s). However, upon immigration, they are expected to see with one eye only. More- over, that eye happens to be the weak one.

In Figures 5-7, we can see further evidence as to the degree of the obstacle that the new language poses to academic achievement. In the following

experiments, performance of students on math tests was compared in three conditions. One in- cluded math tests that included symbols versus words. Specifically, it was shown that the use of symbols such as graphs and images on a test, in- stead of words, increased the performance levels of students in math. The results are presented in Figure 5. It can be seen that when students were presented with visual and symbolic signs in rela- tion to just verbal information, they performed significantly better.

Lack of familiarity with the topics of the texts used on academic achievement tests provides yet another source that penalizes immigrant students on academic tests. In Figure 6, we can see that

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

424 The Modern Language Journal 95 (201 1)

FIGURE 3

Math Scores, According to Years of Residence: In the Two Conditions

FIGURE 4

A Portrayal of Pushing Immigrants to Use One Language Instead of Two

100 -г :

()0 _j

so i

♦ Hebrew ' Russian 70 | jT^^^S ! « Hebrew <>() | ^"^""*^^. / ^% ¡

Г)П j

■io i "--«' I ^( , i

l util l'util .4- 1 5-7 S .ind Г|) Year 1 Year 2

Years

Israel Former и Ì slop л I ;и:П;;<: ! <,,■>;* ■ (JS) USSR «{ fff >

(FSU)

*lstlMnguage 62 55 W Moí/í/e/ío/íer

* Difficulties in 69 56 3 ' Immigration

Less familial 58 ->7 'is

Topic s

Grade 5 (,iade9 (»rad, ! I

FSU ETH IS FSU F.TH is I Si I III

47 30 ''Л 42 15 57 1 I Tl

56 39 51 51 21 I* 4<! 25

65 45 61 6 H :И 67 50 :12

FIGURE 5

Visuals and Symbolic Versus Verbal

FIGURE 6

Familiar Content

Note. IS = Israel; FSU = Former Soviet Union;

ETH = Ethiopia.

when students were tested on familiar topics, such as immigration and language learning, they per- formed significantly better, in comparison to the identical tests which contained questions on un- familiar topics.

Another source of difficulty for immigrant stu- dents is related to cognitive processing. In Fig- ure 7, we can see that immigrant students, in all grades, performed significantly better when they were provided with cognitive guidance for pro- cessing math texts when it was presented in the dominant language being acquired.

Through statistical procedures such as Differ- ential Item Functioning (DIF), it is possible to identify the specific items that discriminate (i.e., differentiate) against students of different back- grounds, such as different language backgrounds. For example, in a study by Levi-Keren (2008), it was shown that by applying the DIF statistical methods and analyses, the very test items that differentiated negatively (and positively) between students from the USSR and Ethiopia and native speakers were tracked and identified. This was

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Elana Shohamy 425

FIGURE 7

Cognitive Guidance

Note. IS = Israel; FSU = Former Soviet Union;

ETH = Ethiopia.

accompanied by a follow-up study that employed "think-aloud" protocols to identify the specific cul- tural and cognitive elements that caused these students to perform better or worse. The find- ings identified factors such as lack of background knowledge, unfamiliar topic, lexicon, and cogni- tive processing that are responsible for low per- formance. It is expected that such results will feed into models that will be instrumental in develop- ing tests that are less biased against these students. Further work currently being conducted attempts to follow the process by which immigrants pro- cess monolingual texts, often showing that they do that using both the LI and the L2 (Logan-Terry & Wright, 2010). Such studies are relevant to var- ious types of multilingual students, immigrants, as well as various types of minority students and those who are born into one language, function in a number of other languages, and possess mul- tilingual competencies throughout their lives.

The aforementioned data and arguments demonstrate that the widely used tests that are based on a monolingual construct result in biased and often discriminatory scores, as many factors are not incorporated. Often, these monolingual tests are also based on the native variety of the language, whereas it is clear here that very few L2 learners can ever reach such a standard. This phe- nomenon is even more relevant with regard to En- glish, with its many native and nonnative varieties, as it is used as a lingua franca worldwide. Even in these cases, the national Lis of immigrants are generally being viewed as intrusions to the acqui- sition of Standard English, as is the requirement in most English language tests.

One context in which multilingual tests are rec- ognized is in test accommodations. This context allows for immigrant students to obtain assistance in performing on academic tests in their LI, such as with the aid of a dictionary, translation of vo- cabulary items, and translations in the body of the text. The rationale behind the accommodation

policy is that they are needed on a temporary ba- sis until the ultimate goal of perfect monolingual- ism is reached (Abedi, 2004, 2009). Yet, this very policy implies that immigrant students can per- form bilingually until they start ignoring their LI or ignore bilingualism and adopt monolingualism again in a different language. However, if, as the data suggest here, students in fact continue to use their LI for a long time, what is the real role of ac- commodation? One also wonders if the use of ac-

commodations does, in fact, contribute to mono-

lingualism. Current research on school learning examines various types of accommodation; there is a trend to view accommodations as reflecting the complex construct of processing test items for L2 learners. Thus, when a student processes test items using the LI and L2, it is viewed as a more valid construct and not as the "route to mono-

lingualism." Therefore, accommodations need to be viewed as integrated components of academic performance of bilingual students as L2 students continue to process language in two languages, as discussed earlier. This indicates that the view that

reliance on LI is only a temporary phase, used for a limited time only, should be rethought. There is an urgent need to think of a new term that will address the reality of how L2 learners process academic language on a permanent basis rather than as a temporary one. Accommodations need to reflect the wealth of factors that play a role in academic processing, not as temporary help but as an integral component of the newly defined language construct.

Given the aforementioned findings and ar- guments, a number of ideas for multilingual tests are proposed. These address multilin- gual competencies that many language speakers possess. Whether in the context of migration, mi- nority groups using multiple languages, or any other multilingual situations of students acquiring additional languages in school learning, these competencies need to be incorporated in new ways of using tests. These tests will more accurately reflect the reality of the multilingual construct. Al- though the principle of the multilingual construct is important, it clearly is different in diverse lan- guage learning contexts. Yet, in all cases, these types of tests are likely to enable manifestation of true knowledge and will enhance bilingualism and multilingualism as advantages rather than as liabilities.

As was noted earlier, a number of reasons can

account for this dominant phenomenon. One is that large-scale testing is ideological and politi- cal, so that tests are used as tools that define and

perpetuate the varieties that the nation and its educational system echo and voice and are

Grade 5 Grade 9 Grade 1 1

IS FSU ÍET IS к SI OH IS IS I III ¡H | Г

Procès 1 65 58 ! 40 .46 | 41 18 4-1 X' 20 HK . - ' ! Product | ! Product 57 I 30 22 j 23 04 32 28 Í 1

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Problematic

No paradigm shift

This is the new addition to this arg.

426 The Modern Language Journal 95 (2011)

FIGURE 8

Bilingual Construction of a Recipe

believed to serve the system. Then the testers themselves, mostly contracted and employed by national institutions, provide the devices to pur- sue and impose these agendas. Thus, testers are viewed mostly as technicians who follow the agen- das of the nations and provide the technical di- mensions of tests (i.e., items, analysis, reliability) while not arguing with the construct. In most cases, test institutions are branches of central gov- ernments or are contracted by them and thus help maintain these ideologies. In other words, testers do not design tests that are based on the real- ity of how languages are really used and learned by L2 learners, minorities, and even foreign lan-

guage learners, but rather take the side of the ideological institutions and therefore serve as ser- vants of the ideology that is in charge of one of the most powerful devices in modern societies: tests. Testers choose to overlook the diversity and complexity of the language construct, what lan- guages are, and continue to produce monolingual tests that reflect ideologies of those who decide how languages should be used.

Yet, language testers cannot just stand aside and ignore the role that tests play in creating monolingual policies. They should as- sume responsibility for their tests, especially be- cause tests determine the prestige and status of

My Hecipe for:

chode i r,Y->í)^:'b

^ fcigredieii& Kitchen Toóte: J 1-Skach- ^c i.f.cK4b;f - /VP

"' 2^ í>U)t - ?& 2.cKlťrCf piety ,V - *W*>

ok -t> Ь.чьЬ^к /„ "Take/ J(,4 .76Ь^ crcrpt

i !.'/

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Elana Shohamy 427

languages and are capable of suppressing lan- guage diversity. This finding was shown in a num- ber of studies such as that of Menken (2008a, 2008b) and others who pointed to the phe- nomenon whereby the introduction of monolin- gual tests, such as through NCLB, brought about a growing number of monolingual programs giving increased attention to the dominant languages and suppressing other languages that had been studied and used until the tests were introduced.

Thus, the use of native-like tests tends to standard-

ize languages and perpetuate correctness and pu- rity. The main argument here is that as long as language tests continue to be based on a limited and narrow view of language, these powerful tests will continue to contribute to a limited and nar-

row view of language.

SOME PROPOSALS FOR MULTILINGUAL ASSESSMENT

How can language tests best reflect the wealth of factors that play a role in academic processing and thus be more congruent with the various types of the multilingual constructs?

In principle, when tests adopt a broad and expanded view of assessment, such as when assessment is used for diagnostic, feedback, and learning purposes beyond the narrow view of test- ing as it is used in large-scale tests, then any mul- tilingual pedagogical strategy can be used as an assessment procedure. Thus, teaching and assess- ment are totally integrated and inseparable. One is embedded in the other and there are no clear

and distinct divides. This means that testing leads to teaching that brings about further assessment based on diagnosis, as is argued by the theories of dynamic assessment (Poehner, 2008). In these cases, the multilingual approaches are integrated in multilingual assessment tasks in natural and or- ganic ways.

Yet, as to large-scale assessment when testing is totally separable from teaching, which is the main critical focus of this article, one needs to follow the

multilingual approaches as outlined in this issue and as outlined earlier here. It was claimed that

there are two types of multilingual approaches: one where a number of homogenous languages are taught in one space, whereas the other focuses on the integration and mixing of a number of languages. These two approaches will have direct manifestations as to the strategies of testing and can actually be put on a continuum.

On one side of the continuum, language x is used for certain purposes, such as reading, whereas language y is used for writing, and lan-

guage z is used for discussions. In many classes in Arab schools in Israel, a text in history is read in English, students are asked to summarize the arti- cle in Hebrew, and the oral discussion takes place in Arabic, either Modern Standard or a spoken dialect. The extent to which these three (or four, including the Arabic varieties) languages, used in the same space, are actually kept homogenous and separate is probably too idealistic and unreal. On the other extreme side of the continuum, tests

are based on the approach in which a mixture of languages and open borders among them is a rec- ognized, accepted, and encouraged variety. This approach can be translated into multilingual tasks for which it is understood by the test-takers that mixing languages is a legitimate act that does not result in penalties but rather is an effective means of expressing and communicating ideas that cannot be transmitted in one language (see also Canagarajah, this issue). Figure 8 displays such a writing task - in this case, it is writing a recipe. The test-taker who authored the text wrote a very accurate recipe utilizing Hebrew and English. In cases like this, a monolingual rating scale would have penalized the student, as it violates the de- scriptions of "interference from first language." What is viewed in monolingual tests and criteria as interference is viewed here as a more effective

way to transmit language. The other example, which falls somewhere be-

tween the two approaches, is the one mentioned previously about the math tests for which the test questions are presented in Hebrew and Russian (a bilingual test). Both the math text and the test questions were presented in two versions: Hebrew and Russian. Although on the surface, this version of a test seems to follow the approach of multilingual tests with two homogenous lan- guages, when students were asked about the pro- cess they followed in responding to the text, they admitted to using a bilingual pattern of thinking while responding. Specifically, they claimed that they took some words from the Russian version, understood the syntax from the Hebrew version, and combined both in the process of meaning- making. This approach also shows that there are various strategies. They may still have used bilingual mixed processing strategies as well as monolingual ones while reading academic texts. This, as we mentioned previously, also happens in monolingual texts (Logan-Terry & Wright, 2010). There are those (Canagarajah, 2006) who claim that a focus on one language only is unrealistic, as the main goal should be ne- gotiations and communication. To reach that goal, many routes can be followed: monolingual,

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Shapers and tools of discourse

WHY ALWAYS MATH?!

428 The Modern Language Journal 95 (201 1)

bilingual, and multilingual, as well as other com- municative and negotiation strategies, including images and symbols, and other semiotic devices, as was demonstrated in the examples described in this article (see also Canagarajah, this issue). It is clear that the processing and production of

academic texts provide a useful tool for high-level academic performance. It is a challenge to the language testing profession to develop and invent tests and rubrics that will be based on a broader

multilingual construct of language.

CONCLUSION

One central question in the adoption of mul- tilingual assessment procedures is whether it is even legitimate to compare groups with different language backgrounds or perhaps regardless of language backgrounds, we should shift to content- based tests in which students may react in any means they find appropriate, but language should not be the main focus (i.e., challenging the no- tion of language tests in general, which are de- tached from content) . Can there be any test that is "language or languages"? Should we adopt sep- arate tests for bilingual and monolingual students to avoid unjust comparisons, as all monolinguals cannot perform on bilingual tests?

Rather than ask "how long?" there is a need to carry out research that looks deeper into mean- ing construction of bilingual/multilingual tests- takers in different settings. Academic knowledge is a complex construct and additional factors and dimensions provide resources for meaning con- struction: LI, cognitive skills, familiarity, previous experiences, and so forth. The data shown here point to the exclusion of the complex dimensions of academic language and thus provide invalid test scores. Language testing as a discipline has reached a point where it must examine its un- derlying constructs in line with developments in the field of language learning, teaching, and use. It is time for the language profession to think of a construct that will better match current re-

search and knowledge and not serve ideologies of nation-states that are interested mostly in ho- mogenous and monolingual citizens. These con- structs need to incorporate the specific and unique competencies that demonstrate the advan- tages that bilingual students have in a number of areas, in spite of not being proficient in the dom- inant language. Language testers should take the role of resistance 'from below' to imposed ide- ologies and invent creative ways of testing, which reflect the diverse populations in this day and

age and provide equal participation not limited to dominant languages and dominant people.

REFERENCES

Abedi, J. (2004). The No Child Left Behind Act and En- glish language learners: Assessment and account- ability issues. Educational Researcher, 33(1), 4-14.

Abedi, J. (2009). Utilizing accommodations in assess- ment. In E. Shohamy & N. H. Hornberger (Eds.), Encyclopedia of language and education: Vol. 7. Lan- guage testing and assessment (2nd ed., pp. 331-347) . Berlin: Springer.

Canagarajah, S. (2006). Changing communicative needs: Revised assessment objectives, testing English as an international language. Language Assessment Quarterly, 3, 229-242.

Cheng, L. (2009). Washback, impact and consequences. In E. Shohamy & N. H. Hornberger (Eds.), En- cyclopedia of language and education: Vol. 7. Lan- guage testing and assessment (2nd ed., pp. 349-364) . Berlin: Springer.

Creese, A, 8c Blackledge, A (2010). Translanguaging in the bilingual classroom: A pedagogy for learning and teaching? Modern Language Journal, 94, 103- 115.

Davies, A. (2009). Ethics, professionalism, rights and codes. In E. Shohamy & N. H. Hornberger (Eds.), Encyclopedia of language and education: Vol. 7. Lan- guage testing and assessment (2nd ed., pp. 429-444) . Berlin: Springer.

Extra, G., Spotti, M, 8c Van Avermaat, P. (2009). Language testing, migration and citizenship: Cross- national perspectives. London: Continuum.

Haim, O. (2010). The relationship between Academic Proficiency (AP) in first language and AP in second and third languages. Unpublished doctoral disser- tation, Tel Aviv University.

Hogan-Brun, G., Mar-Molinero, C, 8c Stevenson, R (Eds.). (2009). Discourses on language and integra- tion. Amsterdam: Benjamins.

Levi-Keren, M. (2008). Factors explaining biases in Math- ematic tests among immigrant students in Israel. Un- published doctoral dissertation, Tel Aviv Univer- sity, [in Hebrew]

Levin, T., & Shohamy, E. (2008). Achievement of im- migrant students in mathematics and academic Hebrew in Israeli school: A large scale evalua- tion study. Studies in Educational Evaluation, 34, 1-14.

Levin, T, Shohamy, E., 8c Spolsky, B. (2003). Academic achievements of immigrants in schools. Report sub- mitted to the Ministry of Education. Tel Aviv Uni- versity, [in Hebrew]

Li, Wei, 8c Martin, R (2009). Conflicts and tensions in classroom codeswitching [Special issue]. Bilingual Education and Bilingualism, 13.

Logan-Terry, A, 8c Wright, L. (2010). Making thinking visible: An analysis of English language learners'

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms

Elana Shohamy 429

interactions with access-based science assessment

items. AccELLerate!, 2(4), 11-14. Makoni, b., ¿к rennycook, A. (zUOb). Disinventing and

reconstituting languages. Clevedon, England: Mul- tilingual Matters.

McNamara, T., 8c Roever, C. (2006). Language testing: The social dimension. Maiden, MA: Blackwell.

Menken, K. (2008a). Editorial introduction 7.3. The- matic issue entitled No Child Left Behind and U.S.

Language Education Policy. Language Policy, 7(3), 191-199.

Menken, K. (2008b). English learners left behind: Stan- dardized testing as language policy. Clevedon, England: Multilingual Matters.

Menken, K. (2009). High-stakes tests as de facto lan- guage education policies. In E. Shohamy & N. H. Hornberger (Eds.), Encyclopedia of language and education: Vol. 7. Language testing and assessment (2nd ed., pp. 401-414). Berlin: Springer.

Messick, S. (1994). The interplay of evidence and con- sequences in the validation of performance assess- ment. Educational Researcher, 23, 13-23.

No Child Left Behind Act (2001). Public Law 107- 110 (January 8, 2002). United States Congress. http://www2.ed.gov/policy/elsec/leg/esea02/ 107-1 10.pdf

Poehner, M. E. (2008). Dynamic assessment. A Vygotskian approach to understanding and promoting L2 devel- opment. Berlin: Springer.

Ricento, T. (2006). An introduction to language policy: Theory and methods. Oxford: Blackwell.

Shohamy, E. (2001). The power of tests: A critical perspec- tive on the uses of language tests. Harlow, England: Pearson Educational.

Shohamy, E. (2006). Language policy: Hidden agendas and new approaches. London: Routledge.

Shohamy, E. (2009a). Introduction to Volume 7: Lan- guage testing and assessment. In E. Shohamy & N. H. Hornberger (Eds.), Encyclopedia of lan- guage and education: Vol. 7. Language testing and assessment (2nd ed., pp. xiii-xxii). Berlin: Springer.

Shohamy, E. (2009b). Language tests for immigrants: Why language? Why tests? Why citizenship? In G. Hogan-Brun, С Mar-Molinero, & P. Stevenson (Eds.), Discourses on language and integration (pp. 45-59). Amsterdam: Benjamins.

Spolsky, B. (2009). Language assessment in histori- cal and future perspective. In E. Shohamy 8c N. H. Hornberger (Eds.), Encyclopedia of lan- guage and education: Vol. 7. Language testing and assessment (2nd ed., pp. 445-454). Berlin: Springer.

Thomas, W., & Collier, V. (2002). A national study of school effectiveness for language minority students' long-term academic achievement: Final report. Project 1.1. Santa Cruz, CA: Center for Research on Edu-

cation, Diversity and Excellence (CREDE). Wright, W., & Xiaoshi, L. (2008). High-stakes math

tests: How No Child Left Behind leaves newcomer

English language learners behind. Language Policy, 7, 237-266.

This content downloaded from 165.123.34.86 on Thu, 26 Jul 2018 22:03:35 UTC All use subject to https://about.jstor.org/terms