Assessment
Overview
- Why assess?
- What is assessment?
- How is information selected to be put in an assessment?
- How do we determine outcomes?
- What is authentic assessment?
- When do teachers assess students' attainment of outcomes?
- Four different times for assessment - diagnostic, formative, summative, and generative
- What is authentic assessment?
- How do the four ideas of assessment fit with the idea of a subject as multidimensional?
Why assess?
The most important task in schooling is deciding what students should know. As important as this is, it has been put in the backseat of the assessment bus, by narrowly defining what is currently being assessed. This is not the page to lament on that topic, but it is the place to emphasize - it is importance to consider the appropriateness of what we expect students to know before you begin an assessment process.
What is assessment?
Assessment is the collection of data. It is the measurement activities educators use to attempt to make valid inferences about students' knowledge, skills, and dispositions; as well as using those measurements and inferences to decide curricular aims, instructional strategies that are developmentally and academically appropriate, and if an instructional sequence was successful.
How is information selected to be put in an assessment?
After a topic or idea is identified to be assessed, the first thing to do is decide what students should know about it? What ideas are important to know conceptually? This is done by identifying or writing concepts.
Having identified what is believed important for students to know, the next step is to determine how to find out what students know about the concepts. The idea of what or how much a person knows in the assessment game is measurement and as with any kind of measurement there has to be a standard or unit to be used to create a ruler. Since concepts are mental ideas, and at the present time can't be seen, we can't compare them and hence can't measure them. The solution...
To determine what concepts a person knows and the degree of conceptual understanding he or she has of those concepts we observe students as they are doing something or an artifact they created. So in the assessment game concepts or ideas are associated to what a person can do that indicates their conceptualization of a particular concept, or what a person would do that would indicate lack of conceptualization of the concept. Therefore, if the concept involves different levels of undertanding, then something a person does could be described that would indicate that person's understanding at each of the levels. This information is known as - outcomes.
The information that describes what can be used to determine the different levels of understanding are collectively know as outcomes and one outcome would refer to what could be done to indicate one particular level of undertanding. Placing a student at a particular level is the measurement aspect. Just as the length of a room can be measured differently by different people the placing of students at different levels can also occur.
The greater the distinction between described levels, the easier the outcomes are for people to match to a student's behavior or artifact. The more agreement on the level of student understanding, the more accurate the measurement and hence a more accurate assessment.
The amount of agreemeent different observers have from one observer's measurement or level placement to anothers determines the assessment's validity. The manner in which an assessment is created, implemented, and scored all affect its validity. There are several ways to increase validity.
How do we determine outcomes?
For each topic or subtopic there are big ideas that capture the essence or power of related to the topic or subtopics. Eah topic must be unpacked or its powerful ideas identified.
Samples of unpacking | science ideas | science inquiry | measurement | pre number sense | number sense | place value | classification - math | additon and subtraction | multiplication and division | fractions, decimals, percents |
You could make a map, drawing, outline, chart to explain what a student at a specific grade level should be able to know (concept) and do (outcome) with respect to the selected standard.
Operational definitons are helpful in defining what students should be able to do and some rubrics are available that have that kind of information (classification rubric does).
After deciding what students should do, then determine a range of outcomes expected of students at a specific grade level or across grade levels. A concept map or chart may be helpful. Other samples can be found at AAAS site scroll to the bottom to view online Atlas of Science Literacy (also includes mathematics and social sciences examples).
A mini-lecture on how to create rubrics for science inquiry at the fourth grade level is at the end of this link. --->>>| Creating fourth grade inquiry rubrics |Other rubrics mathematics, algebra equality, problem solving, probability, cardinality,
Now that you know what is targeted for students to know and do; you would put together a sequence of events or activities that students could participate in to achieve understanding and be able to perform the outcomes at acceptable or exceptional levels. See the following list for suggestions on what and how to plan --->>>>> steps .
What is authentic assessment?
Authentic assessment where students perform tasks that demonstrate meaningful application of knowledge, skills, and dispositions. The closer the task is to what people face in the world as architects, machinists, doctors, mechanics, construction workers, designers, business people, politicians, parents, citizens, the more authentic the assessment. Examples of authentic assessments mathematics algebra - pattern recognition,
When do teachers assess students' attainment of outcomes?
If assessment is ongoing and continuous, then the when should be always. But, even if assessment is ongoing and continuous there is an element of time for assessment that has meaning to the assessor and the person being assessed with respect to self-assessment if nothing else. Those elements of time can be thought of in four ways when they are used to help make decisions to facilitate learning.
Four different times for assessment - diagnostic, formative, summative, and generative
Some thoughts on assessment. If assessment is ongoing and continuous, then can there be different types of assessment? Well, maybe not, but even if assessment is ongoing and continuous there is an element of time of assessment that has meaning to the assessor and the person being assessed with respect to self-assessment if nothing else. Those elements of time can be thought of in four ways when making decisions to facilitate learning. They are: diagnostic, formative, summative, and generative assessment.
While it's possible to take any assessment activity or questions and provide a pretty good explanation on how it would fit in any and all of the four categories, there are still important reasons to consider the four differently.
First, diagnosis.
The major characteristic to associate with diagnostic assessment is that it is preliminary. It is to probe into what is known before facilitating instruction. It usually precedes learning activities, but doesn't have to. Diagnostic assessment can come at anytime during a lesson. If during a lesson a question arises that depends on background information, the teacher can ask a diagnostic question to check the students' level of understanding, At the end of an activity a diagnostic question can be asked to inquire about the next day's activity to diagnose students' readiness.
The second, is formative.
This type of assessment is used to check the students' progress toward learning. This also can happen at anytime during a lesson and is usually understood as such.
The third, is summative.
This kind of assessment is usually associated with the time immediately after facilitating learning. However, what is that time frame? Is it at the end of a five minute mini-teach where summative assessment is to summarize what was learned in the five minutes? It could also be a time frame of an hour, day, week, month, or year.
One could argue that it is only summative if you are inclined to think the students understand the concepts and can perform the outcomes, other wise it could be considered still formative. Whatever, it is the the last assessment before the teacher decides to move to another topic. It could be the first summary check, or a question to double or triple check, and of course the assessment results will be within the range of acceptable or above.
The fourth and last is generative.
This assessment is to inquire into the students' understanding of being able to apply, use, adapt, alter, or join ideas that have been taught. The purpose of seeing how well students understand what they learned. Is the information so fragile that it is only usable within the scope of examples to which they are familiar and similar to what was presented during instruction, or are they able to use it in ways that were not presented and present a greater variety and need for analysis and synthesis.
Again, any one of these kinds can come at anytime of the lesson and only makes sense with respect to the purpose of the teacher within a sequence of facilitating learning. When planning a teacher can anticipate all four types of assessment that will be used through out a sequence for each concept within the sequence. The planning will prepare the teacher to interact with students and be ready to faciliate their learning in real time individualized for each student.
What is authentic assessment?
Authentic assessment includes the idea of continuous assessment as described above. However, some peoples definition goes further to include the kinds of tasks students perform. Tasks that must include a meaningful application of the knowledge, skills, and dispositions to be assessed. The closer the task is to what people face in the world as citizens, parents, families, architects, machinists, doctors, mechanics, construction workers, designers, business people, politicians, the more authentic the assessment.
Examples of authentic assessments mathematics algebra - pattern recognition,
How do the four ideas of assessment fit with the idea of a subject as multidimensional?
Imagine a piece of wool yarn as a sequence of learning with each hair as a concept we would find a possibility of four types of assessments on each hair. If the lesson included or relied on other concepts then the yarn would contain one hair for each concept, but the hairs would overlap depending on when they were needed to be used or worked on for construction. This would create, as is the case in yarn, hairs that start out in the yarn at different places. With hairs starting before and after others with some overlapping and others separated by other hairs along the length of the yarn. With the entire yarn representing mathematics instruction for a certain period of time.
Of course not all of the concept hairs are targeted for instruction. If the teacher knows students' existing knowledge includes observation, measurement, and graphing; then the concept hairs are in that part of the yarn, but they are not targeted and don't have assessment points on them. So if the teaher is pretty sure students don't know much about the function of seed parts, since previous fourth graders didn't and that is why it is listed in the curriculum, the concept hairs for functions would have assessment points on them.
When the teacher begins the sequence planned for plants somewhere along the yarn she will get to a place where she will ask students what a seed is (that is where she puts the start for her facilitating the concept of seed (hair of yearn). As the dicussion progresses she plans to ask if there are parts to seeds and begin the part of the sequence on plants that will focus on seed parts and their function (more concept hairs in the yarn). Hairs that represent seeds, seed coat, embryo, cotyledon, and functions like - seed coat protects, embryo is the baby plant, cotyledon provides the food,... whenever the time comes for that information to be needed or constructed the piece of yarn representing that time has those hairs inside it. All concepts that students need have a concept hair in the yarn, but only hairs that the teacher chooses will have assessment points on them.
Let's say the she also plans to help students create an experiment to investigate the purposes of the different seed parts as her instructional strategy. Since, students have done several experiments before the concepts aren't being introduced, but she plans on seeing if they can identify and suggest to experiment with all the different seed parts (identify variables hair with assessment marks). So there are hairs for inquiry, and all the hairs that represent the different steps and processes of inquiry and experimenting where they are needed in the yarn, but they don't have assessment marks. (To get the bad pun out of the way - Yeah I know, it's getting to be a hairy situation.)
She plans to guide students to the idea that they can disect seeds into parts and put each part and combinations of parts into germination chambers. Set them aside and chart the growth or lack of growth of the different parts in cm on a graph with a small polka dots representing the length of the various seed parts for each day of the experiment. When the data on a particular day has an interesting range she plans to use that to start to facilitate students' conceptual understanding of average of data points (hair) by asking students questions about how they could summarize and explain the data for that certain seed part on that particular day and using the dots to help students visualize how averages plotted on a graph represents all of the data points for that average.
That's a small slice of what happens in a quality science classroom with respect to assessment and decision making. The planning becomes, yes hairy, because there are and should be many concepts that are being facilitated simultaneously. The challenge for you is to include this complexity in your planning.
Dr. Robert Sweetland's Notes ©