|
|
An Exploratory Analysis of School-Based Student Assessment SystemsFebruary 2000By Introduction | Method | Findings | Summary and Conclusions | Recommendations | References IntroductionAchievement testing has long been a major fixture of the American educational landscape. Tests are used to monitor the quality of school systems, evaluate education policies and programs, make important instructional decisions about students, and leverage major educational change (Baker & Linn, 1997; Linn, 1986). In recent years, student testingparticularly from the point of view of the teacherhas become increasingly complex as classrooms have become inundated by growing numbers of mandated tests, new forms of student assessment, and a host of educational reform initiatives that demand accountability and are linked to testing (Cizek, 1995). Educators, reformers, and researchers have begun to recognize some of the challenges schools face using multiple student assessments. For example, coordinating assessment activities and aligning them with a common "vision" for student achievement can be very demanding work for schools, particularly when, as research has consistently shown, few teachers and administrators have received formal training in assessment (Resnick & Resnick, 1989; Schafer & Lissitz, 1987; Sokoloff, 1987; Stiggins, 1995; Stiggins & Conklin, 1992; Wise, Lukin, & Roos, 1991). The national organizations, the research community, and state and local education agencies have attempted to provide guidance to schools regarding the use of student assessments. Nevertheless, it not well understood howor how wellschools are using these supports. This report addresses some of these issues by summarizing findings from an exploratory study of the school-based assessment practices in a sample of elementary, middle, and secondary schools. The purposes of the study were twofold: (1) to add to the growing base of knowledge about how schools use student assessment data obtained from multiple sources to inform important decisions about programs, instruction, and individual students; and (2) to identify and describe the factors and conditions that make schools' use of the student assessment data more probable and valuable. Specifically, four variables were examined in this study:
These variables can be used in the future as the basis for a more comprehensive evaluation of school-based student assessment practices in districts and states. In the near term, the findings summarized in this report are intended to help state, regional, and local education agencies "take stock" of their current capacity to assist schools in developing, implementing, and overseeing student assessment systems; and their ability to help decision makers maximally use the information garnered from these systems. First, we will summarize recent trends in state testing and the accountability uses of assessment results to provide a broader context for understanding the environment in which school-based student assessment systems are often situated. Background/ContextPolicymakers, educators, and the general public are looking once again to student assessment as a catalyst for education reform because of its reputed ability to leverage instructional improvement and to hold school systems accountable for their results. Recent mandates for improved student outcomes have shifted focus away from assessments that test students' "minimal" skills to those that emphasize high standards in skill areas that are considered necessary for success in today's technologically advanced society. Specifically, today's educational reforms are often linked to state frameworks that broadly specify the content and knowledge expectations for all students. In addition, state frameworks also serve as guidance for local curriculum development, textbook selection, teacher professional development, and evaluation of schools (Baker & Linn, 1997). The standards movement, which has origins in national education initiatives, such as America 2000 and its predecessor, Goals 20001, can be attributed to research on how children learn, the effects of minimum competency testing on instruction, and on the public's demand to hold schools accountable for student achievement (Darling-Hammond & Wise, 1985; Glaser & Silver, 1994; Herman, Dreyfus, & Golan, 1990; Herman & Golan, 1991; Madaus, 1988; Shepard, 1989). In response, states have, one by one, made student assessment the centerpiece of their school reform and improvement efforts. Today, 49 states have implemented a statewide exam46 of these measure student achievement relative to specified content and performance standards in core subjects such as reading, math, and science (Council of Chief State School Officers, 1998). Increasingly, state assessments are being linked to policies that hold school systems, teachers and administrators, and the students themselves accountable for students' performance. Students' scores on state assessments are, for example, reported in the newspaper and other media as a matter of public accounting; are used as a basis for determining rewards and sanctions for schools and their staffs; and are used in making decisions about whether to graduate students or promote them to the next grade. The increased use of state assessments as a basis for school improvement and accountability has placed new demands on schools that already use their own assessments to measure what their students know and can do. These school exams include (1) classroom assessments such as teacher-made quizzes and tests and informal observations of students' learning; (2) school assessments such as student writing portfolios or other projects that are used to unify the curriculum and assess student proficiencies in multiple subject areas; and (3) district assessments such as commercially purchased standardized achievement tests that can be administered under uniform conditions in many schools simultaneously. Thus, teachers and administrators can easily be inundated with information about the performance of their students. To prevent this, schools must implement assessment systems that are not only aligned and integrated with local curricula, instructional practices, and professional development strategies, but also contribute to the goal of increasing student achievement based on rigorous content standards. This is complex, demanding work that can take several years. Similarly, managing, synthesizing, interpreting, and using student assessment data obtained from a multifaceted assessment system can be a daunting task for educators, particularly when (a) the assessment system lacks coherence; (b) school staff do not have training or experience in student assessment; and (c) the time, attention, and energies of teachers and administrators are stretched to personal limits. MethodParticipantsA total of nine public schoolsfive elementary, three middle, and one high schoolwere selected for the study based on nominations made by Michigan State Board of Education staff. Specifically, they were asked to identify schools they thought represented a range of experiences relating to:
Additional measures were taken to maximize the representation of schools in this study. Selected schools were located in different regions of the state and were diverse in terms of the size of their student enrollments and key student characteristics (e.g., SES, ethnicity). Table 1 summarizes some of the key characteristics of schools that were visited by NCREL researchers. A total of 46 interviews were conducted in the nine sites. The interviewees included regular, bilingual, and Title I teachers; building principals; district curriculum and assessment coordinators; assistant superintendents; and, in one case, a district superintendent. Teachers were selected by their district's curriculum director or principal based on their prior experience with using student assessments and data. For example, some teachers were selected for interview because they had been involved in developing assessments for their school or district. Others were members of their school's improvement team, participated in local efforts to align the curriculum with standards, or served on district or state committees for content standards and benchmarks. In addition, at each school, some teachers were purposely chosen because they had little or no experience in any of these areas. NCREL is aware that it is not possible to "see the whole picture" without interviewing more educators in more schools. Moreover, we acknowledge the fact that schools and individuals were not selected randomly for the study. Quite the contrary, they were, in addition to the criteria shown above, identified by SEA staff based on their reputations, personal contacts with the agency, and/or general willingness to participate in studies like ours-criteria that may reflect personal biases. Nonetheless, we do maintain that the sample is adequate to explore issues relating to student assessment and is consistent with the formative and exploratory nature of this study. ProceduresTwo NCREL researchers visited each of the nine schools. Semistructured interview protocols (see Appendices A and B) were used to guide discussions with teachers, principals, and district-level staff. These interviews typically took one day to complete. In addition, school and district staff were asked to provide any documents they thought would help researchers better understand the student assessments they develop and/or use locally, as well as their efforts to set standards, align curriculum, or both. All study participants were assured that their identities and those of their schools would be kept confidential. Therefore, all direct quotations and documents used in this report are not traceable to their source by name. Table 1. Characteristics of Participating Schools
a E = Elementary, MS = Middle School, HS = High School; Rur = Rural, Urb = Urban, Sub = Suburban Data AnalysisResearchers' notes, direct quotations obtained from informants, and relevant documents collected from schools were analyzed for content and theme. One-page, school-level summaries of these notes are included in Appendix C. FindingsThe findings presented in this report are the result of cross-site analyses of interview and document review data obtained from nine schools in Michigan. The results of these analyses are presented according to key characteristics that, together, distinguish schools that have made significant progress in implementing school-based student assessment systems from those that are, to date, not as far along. For the sake of parsimony, we culled schools into two groups based on these characteristics and identified them as having "well-developed" (N=4) or "less-developed" (N=5) assessment systems. Specifically, schools with well-developed assessment systems distinguish themselves from schools with less-developed assessment systems because:
The majority of the findings section will focus on the schools with well-developed assessment systems in an attempt to describe not only what they are doing to develop these systems, but also how they are managing this challenging work. The schools with less-developed assessment systems were either not as advanced as the well-developed schools or were grappling with the challenges associated with implementing their own school-based assessment systems. The variability in the quality of the assessment systems at these schools and in the issues they faced was substantial. Detailing all of these differences is beyond the scope of this paper. Instead, a brief description within the section for each major finding will contrast how the characteristic was manifested in these less-developed schools and, where applicable, what important barriers related to the issue. Examples of school activities and quotations from informants will be used to illustrate findings throughout this report. Finding 1: Schools with well-developed assessment systems align local curriculum, standards, and assessments to state content standards.Each of the schools visited was aware of increased accountability to educators, policymakers, the community, and the media to meet state student achievement standards. States use large-scale standardized assessments to indicate the extent to which students are achieving the standards, which in turn reflects the impact of a school's educational programming on individuals and distinct demographic groups. One distinction between schools with well-developed and less-developed assessment systems was their response to rigorous state standards. Schools with well-developed assessment systems responded in an organized, methodical fashion to rising standards by improving alignment within their local curriculum while ensuring that local standards were consistent with those of the state. This foundation allowed for the identification of local assessments that could indicate student mastery of standards as they were taught. As a result, these schools had balanced assessment systems of which large-scale assessments were one, but not the primary, indicator of success.2 Aligned local assessment tools were designed to provide feedback to students, teachers, and parents regarding mastery of particular standards relatively rapidly and inexpensively when compared to statewide assessments. Examples of aligned local assessment tools and their impact on the teaching process will be described in greater detail later in this report (see Finding 4). The process of achieving alignment within these schools was lengthy, arduous, and collaborative. Schools reported spending several years creating time in already-busy schedules, increasing the responsibilities of teachers, and seeking specific professional development to accomplish the task. Typically, this work was viewed as a component of the schools' overall improvement efforts. Schools collaborated with their local school districts or their IEAs throughout the alignment process, and some schools also involved members of the community. In each well-developed system, alignment began with a detailed analysis of the local curriculum. This analysis required reflecting on several other sources of data, including the state content standards and results from state and local assessments for each subject area and grade, K-12. Therefore, schools with good assessment systems integrated their own history of assessment performance into the alignment and development process. The specific approach to this work varied, but the following four common activities (described in more detail below) tended to drive the process:
Curriculum analysis. This analysis involved a comparison between the local curriculum and state standards. Schools outlined where state content standards and benchmarks were being taught within subject areas and throughout the local K-12 continuum. Curriculum maps were used as tools by several schools to help organize the curriculum analysis process. The maps illustrated "gaps or evidence that certain state standards and benchmarks were not being addressed sufficiently in the local curriculum. They also documented overlaps covered excess from subject area to grade grade. Curriculum maps also provided data regarding the within-school alignment of curriculum goals as students moved through courses and/or grades, as well as the between-school/district and state/nation alignment of the local curriculum to state and national standards. Figure 1 is an adaptation from a curriculum map that existed at one of the schools studied. It illustrates in which courses specific arithmetic curriculum strands, content standards, and skills are taught.3 A series of generic curriculum strands, content standards, and skills are organized in the left-hand portion of the map; the right-hand portion illustrates the sequence of math courses in that particular school. In this example, the definitions of each strand, standard, and skill are not indicated and are not important for our purposes. However, schools that go through the curriculum mapping process would need their curriculum at-hand to make the map meaningful. Educators can view each academic skill on the map, which is embedded within a content standard and a curriculum strand, and can move across the courses to see in what course that skill is being taught. For example, Skill 1 within Content Standard 2 and Curriculum Strand II is taught in trigonometry and pre-calculus. Looking at the map in Figure 1, it is clear there are five gaps of specific skills in the math curriculum. Numerous skill overlaps are shown as well, indicating that some topics are not covered at all, while others are covered again and again. Realignment of the local curriculum. Schools made decisions to adjust the curriculum according to their hypotheses. They advised other educational stakeholders, such as parents and students, of new curricular standards through presentations and/or written documents. Alignment of the local assessment system. Schools ensured that local assessment tools measured state standards and benchmarks as they were represented in the revised curriculum. In some districts, schools worked together to standardize assessment tools for certain benchmarks at the classroom level. Again, specific examples of local assessment tools will be offered later in this report (see Finding 4). Figure I. Representation of a Curriculum Map gray = curriculum overlap
Reflection upon two data sources:
Schools compared characteristics of the curriculum to trends in student performance. This comparison often led to hypotheses regarding the curriculum's effect on assessment results. In looking at the data in Figure 1, it would be important to understand if gaps in the curriculum were related to low performance on assessment items related to those skills. This finding could lead to the hypothesis that lack of teaching those skills had a negative impact on student performance. However, if students performed well on assessment items related to those skills, it could be hypothesized that no instruction was merited for those skills. Furthermore, there may be a rational, curriculum-related reason that certain topics are covered in several courses; schools should consider assessment results and local goals and needs when reviewing curriculum gaps and overlaps. In each of the schools with well-developed assessment systems, the process of aligning the curriculum and adjusting the assessment system was done over an extended period of time, and both continued to be reflected upon and modified as local needs were discovered or as state criteria changed. Schools with less-developed assessment systems were either in the beginning stages of the alignment process or had yet to begin this work. Each of these schools was engaged in ongoing initiatives to improve student achievement and some were involved in initiatives aimed at improving student assessment. Yet because the schools were unable to articulate the connection of their assessment tools to state standards, to their local curriculum, and/or to each other, their assessment efforts did not operate in unison and hence did not compose an assessment "system." For example, one of the schools was in the beginning stages of aligning its curriculum and improving its collection and use of student achievement data. A school-improvement team had been charged with developing a vision for improvement and detailing changes to the assessment system. The present assessment practices were reportedly based on individual teacher interpretations of the standards and curriculum, and a variety of data collection tools (e.g., teacher-made tests, chapter tests from texts, teacher observations). Teachers remarked that they were all "doing their own thing"; that assessment at their school was not systematic; and that decisions about their curriculum and assessment tools were based more on philosophy than on data. Another school had gone through the alignment process with the aid of many of its teachers and had identified weaknesses in its curriculum (e.g., lack of consistency with state standards and benchmarks or over-/under-representation of topics). They were using large-scale assessment tools to identify high-risk students most in need of interventions, and fine-grained (i.e., individual) tools to intervene and track progress of individuals in accordance with several school initiatives. This school was moving in the right direction, yet seemed stuck at this point. Specifically, they had difficulty conceptualizing how to organize and use their various data sources to help them set goals and improve practices. Moreover, one teacher performing individual interventions with the highest-risk students did not know if teachers were using data generated by her sessions to guide their own work. Finding 2: Schools with well-developed assessment systems analyze assessment data to monitor student progress.Schools with well-developed assessment systems used multiple sources of data to evaluate their own longitudinal performance, as well as their performance relative to local and statewide trends. Quantitative analyses of state assessment data went above and beyond reviewing standard reports. Instead, schools found ways to individualize the information and/or format of the reports to meet their specific needs. In some cases this meant disaggregating state assessment data to reflect specific student populations or producing reports on individual student trends in a particular topic strand. Other assessment data, such as locally administered standardized achievement tests, computerized reading and math screenings, or developmental checklists were also included in quantitative analyses. To make this level of specificity in data analysis possible, the schools used at least one of the following resources:
As a result of continuously analyzing and reflecting upon data in a collaborative fashion, schools had many information feedback loops related to school and individual student performance. Each time they reflected upon data, they had an opportunity to make revisions and improvements. Therefore, none of the four schools with well-developed assessment systems reported relying primarily on large-scale, relatively infrequent assessments to inform their work. Rather, they integrated data from multiple sources to form a cohesive picture of school and student progress. Schools with less-developed assessment systems were not as advanced in their use of assessment data to monitor student progress. Stakeholders at these schools often pointed to their own lack of technical expertise as a barrier to using assessment data. Schools with less-developed assessment systems either lacked the resources needed to build this capacity in the staff (i.e., no data guru and/or number cruncher) or there was insufficient sharing and communication between those with these skills and those needing the information. Some administrators at these sites admitted to having difficulty poring over and making sense out of standard reports from state assessments, instead relying heavily on individual teacher quizzes, tests, and observations for guidance. Others talked about efforts to have teachers discuss student progress in various subject areas according to standardized rubrics, but had not begun to analyze the quantifiable, local data. Unable to take full advantage of student assessment data, these educators seemed to struggle more with increasing accountability and community attention to student achievement. Finding 3: Schools with well-developed assessment systems use state assessment data to check the validity of local assessment systems.Schools with well-developed assessment systems also used state standardized assessment results to validate local assessment tools. To determine the validity of student performance across various tools, outcomes from state assessments were compared with outcomes from locally administered standardized tests and other assessment tools (e.g., computerized reading and math screenings, reading "running records"). This comparison informed educators if student performance at the individual, classroom, and school level was consistent as measured by these different tools. State assessments also provided data that helped schools validate the alignment of their local curricula to state standards as well as the efficacy of their teaching practices. Schools found ways to aggregate data across several assessment instruments, which helped identify trends across levels of analyses (e.g., student, classroom, school). Following are four of the strategies used by schools with well-developed assessment systems to do this work, including three levels of data aggregation and standardized test item analyses:
Figure 2 shows an example of an item classification chart, a tool some schools use to analyze their performance on items appearing on their states' assessment. In this example, a school not only analyzed the performance of its students on individual items compared to the performance of students from other schools, it also explored student performance on "easy" (i.e., problems that most students answer correctly) and "hard" items (i.e., problems that most students do not answer correctly). The item classification chart plots items in four areas according to (a) a school's performance on an item relative to a comparison group (e.g., all schools in the state or district) and (b) the difficulty of the item (expressed as a percentage of all students in the state who answered the item correctly). Items that fall into each of the four areas on the chart have implications for instruction, though those that are located in Priority Areas One and Two tend to generate the most discussion, hypothesizing, and problem solving on the part of participating school staff. For instance, items that are located in Priority Areas One and Two may be further analyzed to examine students' responses to incorrect items. These analyses are typically referred to as "error analyses." For example, when students consistently select or write the same wrong answer, they may have learned the wrong information in class or have similar misconceptions about the nature of the problem, its answer, or both. On the other hand, when students make or select a variety of incorrect responses to a test item, they may have not been sufficiently exposed to the material tested by the item and could be guessing at its answer. Figure 2. Item Classification Chart
Priority Area One: Items that were easy for the comparison group but were very difficult for your school. Implications: These items test skills that were probably not taught in your school. Your school should consider focusing on teaching these skills. Priority Area Two: Items that range from easy to very difficult for the comparison group but were very difficult for your school. Implications: Items near the top of Area Two involve skills that were probably taught, but not effectively and therefore should be considered for further review. Items near the bottom of Area Two were difficult for the comparison school and your school, indicating perhaps that they are faulty. Priority Area Three: Items that range from easy to very difficult for the comparison group but were relatively easy for your school. Implications: Items that appear in Area Three are a lower priority for your school and should be addressed only after those appearing in Areas One and Two have been addressed. Priority Area Four: Items that range from easy to very difficult for the comparison group but were very easy for your school. Implications: Items in this area suggest that teaching and learning has been effective. If your students tend to score very high on the test, most items will appear in this area. If this is the case, you should prioritize by addressing the most difficult items first. Educators at schools with less-developed assessment systems discussed having difficulty relating to student performance data from standardized state assessments, which, they felt, did not accurately reflect the complex issues of their student populations. For example, some schools had large and diverse groups of students for which English was not their first language. The emerging ability to read and write English, they felt, introduced a confound for many students that was not sufficiently addressed by large-scale assessments and was not clearly reflected in the results as they were reported. One informant in this situation remarked that large-scale state assessments did not reflect "what we know about our students." That is, the students reportedly performed well on local, school-based assessments but did not perform well on the state, large-scale assessment. The validity of these perceptions, however, was unknown at these schools, as systematic comparisons of student performance across various assessment tools was not being done. Without these analyses, these schools were unable to rely on objective data to demonstrate their assertions or to suggest if specific changes in local assessment tools needed to take place. Finding 4. Schools with well-developed assessment systems use assessment data to evaluate the efficacy of local curriculum and instructional practices.Changes to instructional practices occur at the large-scale (i.e., school, classroom) and finer-grain levels. Schools with well-developed assessment systems used universal assessment tools or practices that supported instructional decisions at each of these levels. Like most schools, they used assessment tools at the large-scale level (such as standardized tests) that were familiar to all teachers, yet were administered infrequently. More uniquely, these schools also used or developed assessments at the finer-grained, individual level that were common throughout the school and were administered and analyzed frequently to gauge student progress. For example, one school created a continuum to document each child's progress through the developmental stages of reading from kindergarten through second grade (See Appendix D). This tool, adopted by the school based on its philosophy of continuous progress, is ideally geared to make fine-grained decisions about individual instructional needs. As children consistently demonstrate reading behaviors, teachers reflect this mastery on the continuum, which follows the child's progress along several dimensions (i.e., comprehension, word identification, attitude) from year to year. Each week, teachers meet to discuss individual student progress along the continuum and to ensure that they are in agreement in its use and interpretation. When the continuum reveals gaps in a student's reading skills, teachers hold conferences with the students and, if necessary, their parents, to problem solve and set instructional goals. Because reading skills vary and progress along the continuum occurs at different rates among students, teachers at this school also use the continuum for larger-scale decisions about classroom organization during reading instruction. Reading instruction occurs at a consistent time between the lower elementary grades, allowing for flexible groupings of children based on their current reading skills rather than their grade level. Hence, students at the "emergent" level will work with one teacher; those at the "beginning" level will work with another, and so on. As students raise their skill level, they "flex" (i.e., move) to the more challenging group. Reportedly, this assessment tool has not only allowed teachers to better tailor their reading instruction, but helped identify instructional needs in teachers (i.e., those whose teaching practices did not support the concept of continuous progress). The success of this tool motivated the third- through fifth-grade teachers to extend the continuum to these grades, improving the continuity and standardization of reading assessment throughout this elementary school. Teachers in another district worked together to identify two to three standard summative assessments for each district standard in each grade and subject area. After a student has shown mastery of a skill based on the teacher's formative assessment tools (such as observations, classroom tests, and so on), the teacher must use at least one of the districtwide summative assessment tools to "certify" mastery of the standard. Before teachers begin a lesson, they identify which standards are being addressed, what corrective actions they will take with students who do not show mastery of the standards, and what enrichment activities they will provide for students who show mastery more rapidly than others (see the Teaching for Mastery tool in Appendix D). Again, these tools help teachers individualize their instruction according to the different needs of their students and also facilitate flexible groupings within classes based on student's skills. They also ensure some consistency in how children are evaluated within classrooms throughout the district. Similarly, another school anticipated variable performance among students on formative assessments and designed the following instructional model to manage this: At appropriate times (e.g., after a lesson, after the completion of an instructional unit), students are assessed on the related standard(s). If the students demonstrate mastery of the standard(s), this is documented with an "M," and the students are moved on to an "enrichment" experience. Students who have not mastered the standard are given an "I" (i.e., incomplete) and are sent through a "reteaching loop," or supplemental instruction. This instruction includes extra work in class, after school, or at Saturday School. Data regarding each student's mastery on various standards are accumulated over the years and form a portion of the student's performance record related to promotional decisions. Each of these examples suggests a flexible approach to teaching based on individual student needs and an attempt to frequently assess and calibrate instruction to constantly changing student skills. Yet not enough is known about the content of the "corrective" and "enrichment" activities. Often, corrective experiences modify the duration and intensity of instruction. That is, they often reflect traditional educational decisions, such as to spend more time teaching a skill or to practice a skill more. Evidence of innovative instructional practices that work to resolve specific deficits in composite student skills is less common. Further investigation aimed at identifying specific teaching practices within these corrective and enrichment activities at these schools is merited. Because schools with less-developed assessment systems had not completed the alignment process and did not have staff with the capacity to perform in-depth analyses of their assessment data, they were less able to hypothesize causal relationships between their curriculum, their instruction, their local assessment results, and their state assessment results. As a result, individuals at these schools seldom mentioned the relationship between assessment results and modifications to instructional tools and practices. Finding 5. Schools with well-developed assessment systems limit the quantity of student assessments to those that are purposeful and aligned.Assessment costs schools time, effort, and money. Schools with well-developed systems identified which tools were of value to them and discontinued using those that were not. They struck a balance between the depth of the system, the richness of the data, and the cost and utility of the assessment system. The general philosophy driving educators at the school, coupled with findings from extensive analyses of assessment data, such as those previously described, informed their decision-making process regarding the use of individual tools. At the schools with well-developed assessment systems, educators continuously evaluated the efficacy of their overall system and the contributions of individual tools. When tools were deemed too costly, relative to the utility of the data they offered, they were dropped. For example, several schools reported discontinuing the local administration of certain commercially available standardized assessments. Some were investigating alternative large-scale assessments that were better aligned with local curriculum standards. Others discontinued the practice altogether. Schools with less-developed assessment systems made the effort to collect data on student achievement, often from multiple sources. Some of these schools were balancing several initiatives sponsored by different agencies, each initiative having its own set of assessment tools. Yet because the schools' approaches to assessment were not systematic and they were unable to effectively analyze assessment data, the payoff for these efforts was unclear. That is, at some of these schools, teachers and administrators reported putting great efforts into assessing student progress but doing little work to analyze, coordinate, and conceptualize the results. Paradoxically, some of these schools may have been "overassessing" their students. Finding 6. Schools with well-developed assessment systems allocate time for collaboration, reflection, and decision making based on student assessment data.Teachers frequently report having to juggle multiple responsibilities and to balance competing demands on their time. Nonetheless, teachers at schools with well-developed assessment systems performed much of the hands-on, time-consuming work during the alignment process. They also shared, compared, and discussed assessment data while forming hypotheses and solutions about their own curricular or instructional needs. This typically occurred through a combination of one-on-one meetings or regular small-group sessions throughout the school year. They also shared student assessment data with parents, usually through conferences that they led or by facilitating those that were student led. How, then, did they perform and have time for these formidable tasks? Teachers suggested that they could not have improved their assessment systems without additional support, due to their lack of experience with these tasks. Much of the support came from local school districts or IEAs and took the form of professional development with teachers. District leaders with expertise in this area provided conceptual frameworks, materials, and guidance for committees of teachers as they performed curriculum gap analyses, defined their local standards, aligned their local curriculum and assessment systems to state standards, and worked to understand student needs. Developing and maintaining these improvements were time-consuming processes. This work required creative scheduling, allowing time for teachers within schools and throughout districts to meet regularly. The strategies adopted by schools were simple, yet required flexible thinking, a willing staff, and, at times, union negotiation for more accommodating schedules and job descriptions. Table 2 outlines how the various schools managed to create time for teacher reflection and the concrete activities that occurred during that time. The commitment of districts and/or IEAs to building the capacity of school staff in their use of assessment data was consistent in each of the schools with well-developed student assessment systems. These "capacity-building models" varied slightly from site to site, and the name was not coined as such or visually represented. Yet because of their integral role in the use of assessment data, we have chosen to illustrate an approximation of the models as they were observed in these schools (see Figure 3). You will note that teachers are central in the models. Within the context of receiving training and support from external sources and their instructional leader, teachers do the hands-on work to design their assessment systems and are the primary decision makers regarding curricular and instructional interventions. As a result, participating teachers made great leaps in their own skills, were able to train others, and reported taking pride in their expanded roles. Other positive consequences of this work were reportedly increased communication throughout the system and mutual ownership of the curriculum and assessment system. Among schools with less-developed assessment practices, some struggled with the logistics of professional collaboration and lacked the leadership needed to provide professional development targeted toward improving their assessment systems. Several of the schools were unaware of where to turn for this additional support and had not identified ways to make time available to teachers for these activities. Perhaps most important, some of these schools had not yet embarked on the alignment process and therefore had not identified the need for collaboration. Other schools did embrace teacher collaboration and successfully made time for this activity. However, at some of these schools the activities included only a small group of teachers, not the entire staff. At others, teachers met regularly to discuss student progress and to problem solve regarding student needs. These schools were clearly progressing in the right direction, but lacked the assessment system and related data analyses to inform their decision making. Table 2. Scheduling Approaches for Teacher Collaboration
Figure 3. Building the Capacity of Data Consumers Throughout the System Summary and ConclusionsThis study describes the general natures and uses of school-based student assessment systems in a small sample of elementary, middle, and high schools. We have relied solely on descriptive analyses and presentation of data obtained during our visits to these schools. In order to make these findings clear to readers of this report, we separated the schools into two groups: those that had well-developed student assessment systems and those whose assessment systems were less-developed. We readily acknowledge that this dichotomy of schools risks oversimplification and some insensitivity to situational hardships at some schools that made it difficult for them to optimally use student assessment data. For instance, some teachers and administrative staff were implementing other school initiatives or reforms and, in so doing, may not have reflected on how student assessments could have assisted them in reaching their goals. Other schoolsparticularly those in poor and/or multilingual districtsmay not have had the staff or resources to simultaneously contend with these demographic challenges while developing and implementing an assessment system. Nevertheless, the strategy of characterizing the schools in this manner has been useful for bringing some of the broader findings to light. For instance, we found that schools in the well-developed group:
In contrast, schools in the less-developed group had made less progress in aligning the various elements of an assessment system (i.e., standards, curriculum, assessment tools) and had not articulated a clear purpose for the assessments they used. This lack of progress was compounded by the difficulty these schools had in analyzing and using their current assessment data to conceptualize strengths and needs at the student, classroom, and school level on an ongoing basis. Some of the schools in the less-developed group recognized the benefits of teacher collaboration and were facilitating this process. However, the teachers were not able to inform their work with assessment data to the same degree as teachers in the well-developed schools and did not benefit from the same level of capacity building from experienced, skilled leaders. Without the availability or understanding of their own assessment data, these schools had difficulty prioritizing which assessment tools to use and which to discontinue. As a result, some were practicing a "more is better" approach to assessment, using too many assessments with no clear connection. In short, these schools lacked what Cizek and Rachor (1994) have called a planned assessment system, "one in which each assessment activity that occurs is conducted for a well-articulated purpose; the benefits and beneficiaries of the assessment are clearly defined; the uses of the assessment information are real, tangible, and valued by the users; and the assessments are conducted in an efficient manner, ensuring that assessment time is not wasted gathering redundant information and reducing the time available for instruction" (Cizek, 1995, p. 247). Our intent for this report was simple: to find out more about how schools use the data they receive from multiple assessments and to identify and describe the factors, conditions, and supports that optimize schools' use of these data. These findings could be used to develop a framework for studying school-based student assessment practices more comprehensively (e.g., in states). This framework may include, for instance, the following five dimensions as a basis for determining (a) the extent to which schools vary along each and (b) howand how mucheach contributes to a coherent, school-based student assessment system:
In the near term, this report can be used by education agencies at different levels of the system to reflect on their own capacities and resources to assist schools in the development of coherent, effective student assessment systems. We conclude this report with some recommendations for how state, regional, and local educational agencies may proceed. These recommendations are organized and presented in four parts: (1) improving networks, (2) involving and supporting teachers, (3) formal professional development for teachers and school administrators, and (4) other resources and assistance to schools. RecommendationsImproving Networks
Involving and Supporting Teachers
Formal Professional Development
Other Resources and Assistance to Schools
ReferencesBaker, E. L., & Linn, R. L. (1997). Emerging educational standards of performance in the United States (CSE Technical Report 437). Los Angeles, CA: National Center for Research on Evaluation, Standards, and Student Testing. Cizek, G. J. (1995). The big picture in assessment and who ought to have it. Phi Delta Kappan, 77(3), 246-249. Cizek, G. J., & Rachor, R. E. (1994). The real testing bias: the role of values in educational assessment. NASSP Bulletin, 78(560), 83-93. Council of Chief State School Officers. (1998). Annual survey of state student assessment programs. Washington, DC: Author. Darling-Hammond, L., & Wise, A. E. (1985). Beyond standardization: State standards and school improvement. The Elementary School Journal, 85, 315-336. Glaser, R., & Silver, E. (1994). Assessment, testing, and instruction: Retrospect and prospect. In L. Hammond (Ed.), Review of research in education, Volume 20. Washington, DC: American Educational Research Association. Herman, J. L., Dreyfus, J., & Golan, S. (1990). The effects of testing on teaching and learning (CSE Technical Report No. 327). Los Angeles, CA: National Center for Research on Evaluation, Standards, and Student Testing. Herman, J. L., & Golan, S. (1991). Effects of standardized testing on teachers and learningAnother look (CSE Technical Report No. 334). Los Angeles, CA: National Center for Research on Evaluation, Standards, and Student Testing. Linn, R. L. (1986). Educational testing and assessment. American Psychologist, 41(10), 1153-1160. Madaus, G. F. (1988). The influence of testing on the curriculum. In L. N. Tanner (Ed.), Critical issues in curriculum, Eighty-seventh yearbook of the National Society for the Study of Education (pp. 83-121). Chicago, IL: The University of Chicago Press. Michigan Department of Education. (1996). Michigan curriculum framework. Lansing, MI: Author. Michigan State Board of Education. (1999). Handbook, Michigan Educational Assessment Program. Lansing, MI: Author. National Council on Education Standards and Testing. (1992). Raising standards for American education. Washington, DC: U.S. Government Printing Office. National Education Goals Panel. (1991). Gauging high performance. Washington, DC: U.S. Government Printing Office. Resnick, L., & Resnick, D. (1989). Tests as standards of achievement in schools. In The uses of standardized tests in American education: Proceedings of the 1989 ETS invitational conference. Princeton, NJ: Educational Testing Service. Schafer, D., & Lissitz, R.W. (1987). Measurement training for school personnel: Recommendations and reality. Journal of Teacher Education, 38(3), 57-63. Shepard, L. A. (1989). Why we need better assessments. Educational Leadership, 46(7), 4-9. Sokoloff, H. J. (1987). Decision-Oriented information systems: Leadership and implementation issues. Paper presented at the annual meeting of the American Educational Research Association, Washington, DC. Stiggins, R. J. (1995). Assessment literacy for the 21st century. Phi Delta Kappan, 77(3), 238-245. Stiggins, R. J., & Conklin, N. F. (1992). In teachers' hands: Investigating the practices of classroom assessment. State University of New York Press: Albany, NY. U.S. Department of Education. (1994). Goals 2000: Supporting community efforts to improve schools. Washington, DC: U.S. Government Printing Office. U.S. Department of Education. (1991). America 2000: An education strategy. Washington, DC: Author. Wise, S. L., Lukin, L. E., & Roos, L. L. (1991). Teacher beliefs about training in testing and measurement. Journal of Teacher Education, 37-42. Endnotes[1] (See also National Council on Education Standards and Testing, 1992; National Education Goals Panel, 1991; and U.S. Department of Education, 1991, 1994) [2] The large-scale state assessment tool that is referenced throughout is the Michigan Educational Assessment Program (MEAP), begun in 1969. The MEAP has evolved over the years in response to current research in learning, new areas of emphasis in curriculum and assessment, and increased interest in the performance and accountability of Michigan public schools (Michigan State Board of Education, 1999). The latest generation of the MEAP maintains its close relationship to the Michigan Curriculum Framework (MCF) (Michigan Department of Education, 1996), but has shifted its focus away from testing students' minimum, or basic, skills to an emphasis on problem solving and processes-skills Michigan considers essential for students' educational development. In addition to using data from the MEAP, Michigan encourages educators to use and review extensive local assessment data to measure student achievement of core content standards and benchmarks at classroom, school, and district levels (Michigan Department of Education, 1996). [3] Curriculum strands are clusters of related content standards representing disciplines or common themes. Content standards are descriptions of knowledge students should acquire in core academic subjects. Skills are descriptions of abilities students should acquire in core academic subjects. [4] On June 4, 1997, President Clinton signed into law the Individuals with Disabilities Education Act Amendments of 1997 (P.L. 105-17). The law introduced new requirements to complement such previous requirements as "free appropriate public education," "individualized educational programs (IEP)," and "least restrictive environment," which have reshaped special education in the past two decades.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Contact Us | Privacy Policy |