On this page:

Assessment of Achievement Programme: Report of the Sixth AAP Survey of Science (2003)

« Previous | Contents | Next »

Listen

Assessment of Achievement Programme: Report of the Sixth AAP Survey of Science (2003)

3. Skills in Science - Investigating

3.1 The assessment process

3.1.1 The assessment tasks

Practical assessment is always more expensive and logistically more difficult to implement than is written assessment, especially in a large-scale time-constrained application like a national attainment survey. For this reason, relatively few investigating tasks - nine in total - were specially developed by teachers under the supervision of an academic staff member in the University of Strathclyde's Department of Education.

Each task was designed to allow pupils, working in groups, to plan, carry out and review/report a meaningful investigation set in a realistic context. Rating checklists were developed that in principle enabled field officers to assess a pupil's investigative behaviour, and to classify the pupil into one of two adjacent 5-14 levels with respect to each relevant attainment target. There were two tasks for every pair of adjacent levels A/B, B/C, C/D and D/E, and one further task which spanned attainment targets at Levels D, E and F. Perhaps inevitably, given the time normally needed to set up and to follow investigations in the biological sciences, the nine investigation tasks were virtually all set within the physical sciences, as the task names and investigation questions listed in Table 3.1 illustrate. Two of the tasks are overviewed in more detail in Table 3.2.

Table 3.1 The investigating tasks

Levels

Task name

Underlying question to be investigated

D/E/F

Heat

"Do different materials have different heat insulation properties?"

D/E

Sliding along

"How does an object's weight affect its deceleration rate?"

Burning

"What effect does burning have on different materials?"

C/D

Dissolve

"How can you speed up rate of dissolving?"

Keeping the Beat

"Does everyone's heart take the same time to relax after exercise?"

B/C

Melting

"Does heat have an effect on the state of an object?"

Magnets

"Does everything stick to a magnet in the same way?"

A/B

Changes

"How do different materials respond to being squashed?"

Floating and Sinking

"Does the size of an object affect its ability to float?"

Every task was designed to take 45 minutes to one hour of pupil time. The first 15 minutes were devoted to task orientation. Here the field officer would start by demonstrating some phenomenon - a ball rolling, a crystal dissolving, an object floating - and would engage the pupils in discussion about it, asking what they had observed, why they thought it had happened, and so on. In this way, the pupils would be guided towards the question that they were to investigate.

After being shown the resources available to them, the pupils were to be given worksheets and invited to make notes in the 'planning' section of these before starting work. The field officer would observe the pupils at work, occasionally asking questions and recording observations on associated checklists.

Table 3.2 Overview of one Level A/B task and one Level D/E task

'Sliding along' - Level D/E

The purpose of this task for P7 and S2 pupils was to investigate "the effect the weight of an object has on the time it takes to come to a stop". The field officer set the scene by rolling a toy car across the floor, and asking the pupils what had happened, why the car slowed down, and what they thought might affect how quickly it slowed down. Possible explanations were posted on a planning board, and each pupil was invited to suggest a question that the group might investigate.

During the ensuing discussion the field officer directed the pupils to investigate the effect of weight on deceleration. They were shown the resources they had available for use, asked about safety issues, and asked to discuss briefly what it means to carry out a fair test. They were then handed worksheets, invited to complete the planning section right away and told to make notes as they worked. They were advised that they would be observed as they carried out the investigation, and that they might be asked questions now and again.

The orientation session lasted around 15 minutes, and was followed by around 20 minutes of investigation activity, during which time the field officer observed the pupils and recorded observations and judgments on the checklist. At the end of the investigation, the field officer asked the pupils individually what they had concluded, whether they would change anything to improve the investigation if they could repeat it, and whether there was any other question they would like to investigate.

'Changes' - Level A/B

The purpose of this task for P3 pupils was to investigate "how different materials respond to being squashed". The usual 15-minute task preparation period began with the field officer throwing and bouncing a rubber ball, and asking the pupils to predict what would happen if they squashed it. The field officer then squashed the ball several times, with different levels of force, and asked the pupils to describe what they saw, and why they thought it happened. Possible explanations from the pupils were posted onto a planning board.

The question to be investigated was then given to them, and they were shown the resources available to them for use: foam rubber ball, plasticine strip, small plastic bottle, among other objects/materials. As in all the tasks, they were asked about safety issues, and then given a worksheet to use while they planned and carried out their investigation. The field officer observed the pupils over the next 20 minutes or so as they worked, occasionally asking questions and completing checklists. Again in common with other tasks, at the end of the active period pupils were individually asked for their conclusions, whether they would do the task differently if they had the chance to repeat it, and what other investigation they would be interested in carrying out.

This practical activity was intended to last about 20 minutes, at the end of which pupils were individually asked what their conclusions were, whether they were happy with the investigation, if not how they would improve it should they be able to do it again, and what other questions they might like to investigate in the future.

3.1.2 The rating checklists

As pupils worked together on their investigation, they were to be individually rated for evidence of the attainment of relevant attainment targets, using a task-specific checklist. For ease of use, the rating checklists were designed to hold records for up to four different pupils, i.e. for all the pupils in the investigating group.

Each rating checklist had three sections, corresponding with the three strands comprising this attainment outcome - 'Preparing for tasks', 'Carrying out tasks' and 'Reviewing and reporting on tasks'. Within each section were listed the associated attainment targets at the levels relevant to the particular task. Table 3.3 overviews the checklist items for the Level A/B task, 'Changes', described in Table 3.2. The 'Changes' checklist is typical of all others, in the sense that one of the two levels has more attainment targets listed than the other, reflecting exactly the strand-target framework in the National Guidelines.

Table 3.3 The structure of the rating checklist for the Level A/B task 'Changes'

Attainment targets

Level A

Level B

Preparing for the task

  • Make suggestions and contribute to the planning of simple practical explorations
  • Make suggestions and contribute to the planning of simple practical investigations
  • Make suggestions about what might happen
  • Recognises when a test or comparison is unfair

Carrying out the task

  • Carry out simple observations and measurements
  • Record observations in a simple form
  • Carry out simple observations and measurements
  • Record observations in a simple form
  • Use simple equipment and techniques to make observations and measurements

Reviewing and reporting

  • Participate in the presentation of findings through visual displays and oral reports
  • Answer simple questions about what happened
  • Participate in the presentation of findings through visual displays and oral reports
  • Answer simple questions about what happened
  • Answer questions on the meaning of the findings

Through observing the pupils at work, talking to them as they worked, and reviewing any notes or other materials they had prepared whilst carrying out their assigned investigations, the field officers were to assess each of the pupils in the investigating group, deciding which attainment targets had been demonstrated, and at what levels. They were also to record whether the pupil concerned had managed to show the evidence unaided or with some prompting. In addition, they were to note occasions when they felt that they could not make an assessment, either because the pupil did not respond to the task or because technical problems arose.

The investigating tasks were trialled informally by the task developers, but full-scale trialling occurred within the survey itself. The consequence of this is that the checklists had not been fully tested for applicability in the field before their use in the survey, and neither had any rater agreement trials been possible. Albeit unavoidable in the circumstances, this situation was unfortunate, given the potential value of this inherently interesting exploration of practical investigation skills. These limitations should be borne in mind when the performance data presented later in this chapter are reviewed, as should the effect of the structure of the task itself on level attainment.

3.1.3 Task administration

For cost and logistic reasons, the tasks were administered in a subsample only of the main sample of survey schools: 87 schools at P3, 85 at P5, 94 at P7 and 90 at S2 (these figures represent just over half the survey schools at each primary stage and around two-thirds of those at S2). Although the 'practical' schools were drawn from across the country, they were not selected entirely at random: two important criteria for involvement were (i) that the school should have sufficient pupils at the stage concerned to justify a day visit by two field officers, and (ii) that it should be within easy travelling distance of the field officers' home bases.

Of the nine tasks, sets of three were administered at each stage (see Table 3.4). Two of the three were used uniquely at a particular stage, but one was used at two consecutive stages in order to allow stage comparisons across three rather than two levels.

Table 3.4 Task distribution by stage

Levels

Task name

P3

P5

P7

S2

E/F

Heat

tick

D/E

Sliding along

tick

Burning

tick

tick

C/D

Dissolve

tick

Keeping the Beat

tick

tick

B/C

Melting

tick

Magnets

tick

tick

A/B

Changes

tick

Floating and Sinking

tick

The tasks were administered in the schools by 148 field officers, who were practising primary teachers and secondary science teachers released by their schools and authorities for seven days each - one day for task orientation, five days for assessment and one day for debriefing. They were given their task orientation in June 2003, in groups of 18-25 at various locations in Scotland. Field officers were paired for school visits, and each pair visited five different schools in their own geographical regions within their allocated 2-week period, administering the same two investigating tasks throughout. In addition to setting up the tasks and managing the assessment sessions, the field officers completed their rating checklists on the spot.

In the selected schools, the pupils who attempted the investigating tasks were those who had already participated in the assessment of Knowledge and understanding in science. Pupils typically undertook their assigned investigations in groups of four, and two such groups were assessed in most of the schools involved.

3.1.4 Reporting skills attainment

In the ideal case, pupils might have been rated at the same level across the various attainment targets included in each checklist, preferably without assistance from the field officer. Had that happened as a general case then pupils could have been classified into appropriate 5-14 levels for their investigation skills. But, as most teachers could have predicted, this did not happen. Individual pupil profiles typically demonstrated a mixed pattern of level judgments, some merited without prompting and some awarded only after help had been given. In addition, there were often gaps in the rating profiles, where the field officers could not make a judgment, because the pupil concerned had not produced any, or enough, evidence of the skill in question, or because there had been a rare technical problem of some kind, or for some other reason. Given this, the only clear option for data analysis was to look across pupils at a stage, to see what proportions were judged as having demonstrated the various attainment targets at the different levels within the three strands. The attainment data to be presented in this chapter are therefore quite different in nature from those presented in Chapter 2 for Knowledge and understanding.

Finally, a word of caution: given the particularly novel nature of the tasks, the relatively few tasks used at each stage, the administration of these tasks in fewer schools than was the case for the written tasks, the fact that the schools involved were not randomly selected, and the fact that no formal piloting had taken place ahead of time, the practical assessment should be considered very exploratory, and its findings, interesting as they are, should be considered indicative rather than robustly generalisable.

3.2 Overview of pupils' attainments

3.2.1 The attainment picture across the stages

Table 3.5 presents the global findings from this exploration of pupils' investigation skills, while Figure 3.1 illustrates the general attainment picture.

The 'prompt' column in Table 3.5 indicates the proportions of pupils, on average, who needed assistance from the field officer before being able to demonstrate rateable behaviours that could be judged to be at one level or another (these pupils do not feature in the 'level' columns). 'No rating' indicates the proportions of pupils, on average, who could not be rated at all because they did not demonstrate the behaviour concerned, with or without field officer help (in a small proportion of cases field officers indicated that there had been a technical problem). The data indicate that on average a quarter to a third of the pupils were able to demonstrate achievement of the attainment target at one level or the other after having been assisted by the field officer, while 10-25% of pupils could not be rated on individual attainment targets for whatever reason. The proportions varied from one task to another, but the variation was not great.

As far as the attainment figures in Table 3.5 are concerned, it should be noted that these are not intended to represent the percentages of pupils 'secure' at the levels concerned, or showing 'basic skills' or 'considerable strengths' - the investigation data produced here do not lend themselves to this kind of analysis and interpretation. Rather, at each 5-14 level the table shows the proportions of pupils judged to have demonstrated attainment targets at that level without help from the field officer, averaged over all attainment targets at the level concerned (typically 8-10 in each task), and also averaged across all three tasks used at the stage in question.

Table 3.5 The general pattern of investigation skills attainment

(% pupils demonstrating attainment targets, averaged over targets and tasks)

Stage

Average pupils/task

Level attained without help

Prompt*

No rating**

A

B

C

D

E

F

S2

216

48

31

24

24

18

P7

198

24

33

7

28

24

P5

190

34

40

5

32

10

P3

194

7

53

11

31

8

No. of attainment targets in each task

5

9

8

10

6

5

9-10

9-10

* This indicates that a judgment was made, but only after field officer prompting helped the pupil.

** 'No rating' means that a pupil did not demonstrate evidence of the attainment target, with or without help.

Thus, at P3 we see a figure of 53% under Level B. This indicates that on average, over the nine Level B attainment targets that featured in each of the three tasks used at this stage, 53% of the pupils were judged as having demonstrated the behaviour concerned, without assistance from the field officer. The tasks concerned were 'Magnets', 'Changes' and 'Floating and sinking', and the nine attainment targets over which ratings are averaged are those shown in Table 3.3 for 'Changes'. The 53% at P3 compares with 34% of the pupils at P5, after averaging over the same nine Level B attainment targets rated in each of their three tasks ('Keeping the beat', 'Melting' and 'Magnets').

Figure 3.1 The attainment picture across stages

(% pupils attaining level-based targets, averaged over 8-10 attainment targets and three tasks per stage)

chart

Despite possible questions about rater reliability, the attainment data show clear evidence of stage progression in investigation skills. In particular, we see that, on average, around one-third of the S2 pupils were rated as having demonstrated the attainment target skills at Level E, with just under half having shown evidence of skills attainment at Level D. Around one-third of the P7 pupils showed target skills attainment at Level D, 40% of the P5 pupils did the same for Level C, and over half the P3 pupils were deemed to have shown appropriate evidence of target skills attainment at Level B.

Table 3.6 presents corresponding data for the nine individual tasks, and confirms the general picture, despite some variation from one task to another.

Table 3.6 The pattern of skills attainment task by task

(% pupils demonstrating attainment targets, averaged over attainment targets)

Task

Levels

Stage

Pupils

Level attained without help

A

B

C

D

E

F

Heat

D/E/F

S2

184

60

38

24

Sliding

D/E

S2

198

43

25

Burning

D/E

S2

265

40

31

Burning

D/E

P7

186

39

7

Dissolve

C/D

P7

211

25

34

Keeping the beat

C/D

P7

196

24

27

Keeping the beat

C/D

P5

222

36

5

Melting

B/C

P5

163

33

37

Magnets

B/C

P5

184

35

46

Magnets

B/C

P3

182

59

11

Changes

A/B

P3

248

3

57

Floating & sinking

A/B

P3

151

12

44

There was no consistent pattern of difference at any stage across the three strands: 'Preparing for tasks', 'Carrying out tasks, 'Reviewing and reporting tasks'.

3.2.2 Gender comparisons

There was no evidence of any gender difference in investigation skills at any stage, even at the level of individual attainment targets.

3.3 Summary

Just over half the survey schools at the primary stages and around two-thirds of the survey schools at S2 participated in the practical assessment of pupils' investigation skills in science, that is 80-90 schools at each stage. In most schools two groups of four pupils were involved, the two groups working on different assigned investigations.

Nine investigation tasks were newly developed for use in the survey. Each task had an accompanying rating checklist containing attainment targets at two adjacent 5-14 levels, which field officers completed in real time as they observed and questioned the investigating pupils. Sets of three tasks were administered at each stage, two of the three being used at one particular stage and the third being used at two adjacent stages. The tasks were administered in the schools by 148 field officers, who were practising primary teachers and secondary science teachers released by their authorities for survey participation.

Across the stages, and averaging over attainment targets and tasks, a quarter to a third of the pupils had to be given assistance by the field officers before they could show enough evidence of relevant skills for them to be judged at one level or the other for each of the attainment targets included in the checklists; 10-25% of the pupils could not be judged at a level at all, because they did not show evidence of the behaviour concerned with or without prompting from the field officer.

The attainment data provided by those pupils who could be judged show clear evidence of stage progression in investigation skills. Again averaging over attainment targets at a level within and across tasks, around one-third of the S2 pupils were rated as having demonstrated the attainment target skills at Level E, with just under half having shown evidence of skills attainment at Level D. Around one-third of the P7 pupils showed target skills attainment at Level D, and 40% of the P5 pupils did the same for Level C. Over half the P3 pupils were deemed to have shown appropriate evidence of target skills attainment at Level B.

There was no evidence in the attainment data of any gender difference in investigation skills.

« Previous | Contents | Next »

Page updated: Thursday, March 24, 2005