Language Policy Research Unit (LPRU) | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Resources Journal of Language, Identity, and Education Policy Briefs Demographic Data Bibliography Virtual Library Book Notes Legal Resources Media Resources Scholarly Journals Internet Resources |
EPSL-0204-101-LPRU
"Accountability" Versus Science in
the
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Grade |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
11 |
Rate* |
|
|
1997-98 |
Oceanside |
12 |
9 |
8 |
6 |
9 |
4 |
9 |
5 |
2 |
3 |
5.4% |
|
Statewide |
19 |
14 |
15 |
14 |
16 |
12 |
15 |
10 |
8 |
10 |
7.0% |
|
|
1998-99 |
Oceanside |
26 |
15 |
16 |
16 |
16 |
12 |
15 |
9 |
6 |
6 |
6.6% |
|
Statewide |
23 |
18 |
17 |
16 |
18 |
14 |
17 |
11 |
9 |
11 |
7.6% |
|
|
1999-00 |
Oceanside |
32 |
22 |
23 |
19 |
20 |
13 |
18 |
11 |
6 |
8 |
4.1% |
|
Statewide |
28 |
21 |
20 |
17 |
19 |
15 |
18 |
12 |
9 |
11 |
7.8% |
|
|
2000-01 |
Oceanside |
32 |
22 |
19 |
16 |
16 |
12 |
15 |
8 |
7 |
7 |
17.8% |
|
Statewide |
31 |
23 |
21 |
18 |
21 |
16 |
19 |
12 |
9 |
11 |
9.0% |
(Source: California
Department of Education; Proposition 227 took effect in
1998-99.)
*Redesignation rates
represent the percentage of English language learners who are
redesignated as "fully English proficient" each
year.
One educational researcher, Kenji Hakuta of Stanford University, was quoted briefly by the Times, cautioning that no scientific conclusions about Proposition 227 could be drawn from these data. But it was Unz's interpretation that received the lion's share of attention. "The test scores these last two years have risen, and risen dramatically," he said. "Something has gone tremendously right for immigrants being educated in California" (Steinberg, 2000).
As it happened, Hakuta and some colleagues had already conducted an extensive analysis of the California test results, which went unmentioned in the Times article. Their conclusion: "Scores rose for all students, and in no clear pattern that could be attributable to Proposition 227" (Orr et al., 2000). White and minority, rich and poor, language-minority and native English-speakingvirtually all groups of students had improved their performance on the Stanford 9. At parents' request, about 12 percent of California's English learners remained in bilingual classrooms under Proposition 227. So the researchers sampled a cross-section of school districts, including those that had eliminated all bilingual programs in 1998, those that had continued some bilingual programs, and those that had never offered any bilingual programs. They found that average gains among English language learners were about the same in each.
Among numerous possible explanations for the pattern of rising scores, Hakuta (2000) cited a California initiative to reduce class size in the early grades, a movement toward higher standards and accountability, and more effective preparation for the Stanford 9 as teachers become more familiar with the test. This last factor may be especially important, since 1998the year before Proposition 227 took effectwas the first year of California's statewide testing program. As Krashen (2001) explains, "Typical test score inflation is about 1.5 to 2 points per year, which accounts for a great deal of the gains seen in grades 2-6 in California," both for English learners and for English-proficient students.
But what about Oceanside? English learners' scores did rise substantially there in the two years after the initiative passed. Could this mean that English immersion is working "miracles" in that district? Based on the Stanford 9 results, it is impossible to say. Numerous other explanations are equally plausible, however, and they have nothing to do with the dismantling of bilingual education. First, the school district started from a dismal position in 1998with reading scores sliding from the 12th percentile (grade 2) to the 6th percentile (grade 5) to the 2nd percentile (grade 10)well below statewide averages for English learners (see Table I). With intensive test preparation, such results can be improved significantly. Oceanside superintendent Ken Noonan (2000) has reported that, before Proposition 227, these students were taught entirely in Spanish for the first four years, sometimes longer. With no substantial exposure to English in the classroom, it's no wonder they did so poorly on English-language tests. Finally, there's the statistical phenomenon of "regression to the mean." As Hakuta (2001) notes, "Oceanside finally managed to drag its test scores from rock bottom up to the statewide average for EL students. This is not a story about excellence, hardly a miracle."
Subsequent events have cast further doubts on the role of Proposition 227 in raising the district's scores. A month after the laudatory New York Times story appeared, the California Department of Education (2000) cited Oceanside for violating the civil rights of LEP students. Among numerous infractions, investigators found that the district had no coherent English immersion program; that it was failing to train teachers in immersion methodologies; and that, after one year of the immersion treatment, most English learners had been arbitrarily reassigned to mainstream classrooms, where they received little if any help in overcoming language barriers. Soon after, the federal Office for Civil Rights reached similar conclusions.
In 2001, the reading scores for Oceanside's English learners leveled off in grades 2 and 3 and declined in grades 4 through 9 and 11. The only "good news" was posted in grade 10an increase from the 6th to the 7th percentile. What does this all mean? Probably not very much, just another regression to the mean, as Hakuta (2001) argues. But it does highlight another reason to dismiss the Stanford 9 scores for English learners as mostly meaningless. No confidence can be placed in these year-to-year comparisons because students may differ in ways that cannot be statistically controlled. For example, because of a recent wave of immigrants, this year's second graders may come from poorer, less educated backgrounds and start out with less knowledge of English than last year's second graders. Such factors are certain to affect performance on the Stanford 9, but with the data now available there is no way to adjust for them. Therefore, it is impossible to draw valid conclusions about year-to-year fluctuations in second grade scores.
Moreover, the English learner category itself is constantly changing. Students enter and exit at varying rates, depending on how much English they arrive with and how long it takes them to be "redesignated" as fully fluent in the language. Naturally those who have acquired more English will do better on English-language achievement tests than those who have acquired less English. This means that when students are redesignated as fluent in English, their scores are no longer counted in LEP group, usually lowering the overall average. In effect, a district's successes in teaching English count against it when Stanford 9 scores are calculated, even if students are doing well. Because there is no statewide gauge of English proficiencycriteria and procedures vary considerably among California districtstest results for English learners can be easily manipulated. For example, to boost average scores on the Stanford 9, all a clever administrator would need to do is slow down the official resdesignation of students as fluent in English, which would automatically retain many high-scorers in the LEP category. That stratagem could well account for Oceanside's "striking" performance reported in 2000. That year only 4 percent of these students were deemed ready for the mainstreamabout half the statewide average. Then, in 2001, test scores fell off as the district's redesignation rate jumped to nearly 18 percent.
It is also important to note that none of the year-to-year comparisons are based on the scores of individual students receiving distinct educational treatments. They are based on aggregate scores for school districts using greater or lesser amounts of students' native language for instruction. This is a crude way to analyze program alternatives, considering that there are no "pure" bilingual education districts in California, only districts where a percentage of English learnersusually a minorityhave been granted "waivers" of the English-only rule. (For other statistical anomalies, see Thompson et al., 2002.)
To sum up, there are excellent reasons to suspect that rising scores for English learners in Oceanside and other California districts have more to do with extraneous factors than with what is happening in the classroom. But who knows? Available data are insufficient to prove, or disprove, any hypothesis about the impact of Proposition 227 on English learners' achievement. What is needed is a truly controlled educational experiment that tracks the academic progress of individual children over several years. The California legislature has authorized a study along these lines, with interim results due to be reported soon. Unfortunately, in drawing conclusions about student outcomes, the researchers are relying on a single assessment tool: the Stanford 9.
None of this has inspired the New York Times or other media to reassess their verdict. According to the conventional wisdom, Proposition 227 remains a great success. Meanwhile, Ron Unz is urging voters and legislators elsewhere to follow California's leadin fact, to go even furtherin mandating English-only instruction. Bilingual programs could be restricted or banned outright in numerous states, largely on the basis of claims about the Stanford 9. A high-stakes test indeed.
How should defenders of bilingual education respond? There are two basic choices. Either we criticize this use of raw test scores as scientifically unsupportable. Or we use raw test scores to make contrary claims. Many of us have taken the latter course, and it's not hard to see why. Journalists tend to ignore the scientific arguments of Hakuta, Krashen, and others, which can be tedious to explain and rarely produce flashy headlines. Conversely, they give lots of attention and credence to the bold assertions of Ron Unzalways good copy. Advocates for bilingual education are tempted to respond in kind.
Recently I participated in a press conference sponsored by opponents of Unz's initiative in Massachusetts. Our side argued that, according to the latest Stanford 9 scores, English learners were losing ground in California. We pointed out that since 1998, in virtually every grade, the "achievement gap" has widened between these children and their English-speaking peers. In other words, scores of students overall are rising faster than those of LEP studentsthus demonstrating the failure of Proposition 227 (Note 2) (Vaishnav, 2002). This is a plausible claim. Based on everything I have heard anecdotally and read in ethnographic studies of the initiative's impact (e.g., Gándara et al., 2000), I expect that a scientific study will one day render such a verdict. Yet, alas, it cannot be proved with existing Stanford 9 data, for the reasons explained above. In particular, English language learners make up an inconsistent and unstable category. Raising or lowering the bar for English proficiency, or altering procedures for redesignating students, could have a significant impact on the achievement gap. Since these decisions are made separately by each of the nearly 1,100 school districts in the state, there is no way to control for them. Ergo, no valid conclusions can be drawn.
This illustrates the dilemma for bilingual education supporters today. Credibility is a valuable commodity for advocates and researchers alike, especially when political assets are in short supply. Using persuasive but scientifically disreputable arguments could easily squander what little advantage we have in the public debate. On the other hand, when voters want answers, merely dismissing test scores as irrelevant is likely to make us irrelevant. What's the solution?
Often overlooked in the debates over Proposition 227 are the numerous research studies, from California and other states, showing the effectiveness of well designed bilingual education programs. These far outnumber the studies showing the benefits of all-English immersion programs (see, e.g., August & Hakuta, 1997). There is no study, in this country or abroad, that reports any promise whatsoever for the one-year immersion model prescribed by Proposition 227. Under the circumstances, it's hardly surprising that Unz derides all research in the field as unproven "theory" and elevates Stanford 9 scores as the ultimate "reality." He knows that most journalists like a simple story-line, with few subplots or caveats, and he has constructed a clever one.
We who oppose him have been far less adept in making our case, despite the ample evidence at our disposal. No doubt this reflects our inexperience in media manipulation and political chicanery. Mainly, however, it demonstrates the low priority that bilingual educators and researchers have placed on making scientific findings accessible to the public. As a direct result, policies on how to teach English language learners are increasingly based on what is politically, not pedagogically, effective. All this must change, or we ourselves should be held accountable.
1. It turned out that because of a computation error, Vista's highest-scoring English learners were wrongly counted as fully English-proficient. Thus its average scores for LEP students were considerably higher than those the Times reported. Nevertheless, Unz has continued to use the erroneous results, insisting they had been "officially" reported (Zehr, 2000; Unz et al., 2001).
2. There is no question that the initiative has failed to deliver on its promise of teaching children English in one school year. Since 1998, the statewide redesignation rate has budged only slightly, from 7 percent to 9 percent, continuing an upward trend that began in the early 1990s. By the (absurd) standard Unz that used for judging bilingual education during the campaign, Proposition 227 had a 91 percent "failure rate" in teaching English last year.
References
August, D. & Hakuta, K. (Eds). 1997.Improving schooling for language-minority students: A research agenda. Washington, DC: National Academy Press.
Boston Globe. 2002. Threatening language. April 8.
California Department of Education. 2000. Report on the investigation of complaints against the Oceanside Unified School District. Sacramento, September 29.
Gándara, P., Maxwell-Jolly, J., Garcia, E., Asato, J., Gutierrez, K., Stritikus, T. & Curry, J. 2000. The initial impact of Proposition 227 on the instruction of English learners. University of California, Linguistic Minority Research Institute, April. [Online] Available: http://lmri.ucsb.edu/RESDISS/prop227effects.pdf
Hakuta, K. 2000. Points on SAT-9 performance and Proposition 227. Stanford University, August 22. [Online] Available:
Hakuta, K. 2001. Silence from Oceanside and the future of bilingual education. Stanford University, August 18. [Online] Available: http://www.stanford.edu/~hakuta/SAT9/Silence%20from%20Oceanside.htm
Krashen, S. 2001. Why did test scores go up in California? A response to Unz/Reinhard.NYSABE Newsletter, 1, (3), 21-23.[Online] Available:
Krashen, S. 2002. Letter to the Boston Globe [unpublished]. April 8.
Noonan, K. 2000. I believed that bilingual education was best ... until the kids proved me wrong. Washington Post, September 3.
Orr, J.E., Butler, Y.G., Bousquet, M. & Hakuta, K. 2000. What can we learn about the impact of Proposition 227 from SAT-9 scores? An analysis of results from 2000. Stanford University, August 15. [Online] Available:
Steinberg, J. 2000. Increase in test scores counters dire forecasts for bilingual ban. New York Times, August 20.
Thompson, M.S., DiCerbo,
K.E., Mahoney, K. & MacSwan, J. 2002. ¿Exito en
California?
A validity critique of language program evaluations and analysis
of English learner test scores. Education Policy Analysis
Archives 10 (7). [Online] Available:
http://epaa.asu.edu/epaa/v10n7/
Unz, R., Snow, C. & Randolph, T. 2001. Bilingual education: A necessary help or a failed hindrance. [Videotape]. Harvard Graduate School of Education, October 15.
Vaishnav, A. 2002. Backers step up bilingual ed fight. Boston Globe, March 13.
Zehr, M.A. Cause of higher Calif. scores sore point in bilingual ed. debate. Education Week, September 6.
|
Copyright © 2002 by James Crawford. All rights reserved. No permission is required for personal use of this article, or to quote it in research papers. But republication of this material in any form and for any purpose- including course use and Internet postings - is prohibited, except by permission of the author at jwcrawford@compuserve.com. Before writing, please read his permissions FAQ, http://ourworld.compuserve.com/homepages/JWCRAWFORD/copy.htm. |
|
| |