On this page:

Assessment of Achievement Programme: Report of the Sixth AAP Survey of Science (2003)

« Previous | Contents | Next »

Listen

Assessment of Achievement Programme: Report of the Sixth AAP Survey of Science (2003)

4. Core skills assessment

4.1 Introduction

The core skills of reading, writing, numeracy, problem solving and using information technology were assessed in this survey. Problem solving was assessed as an integral part of science investigation skills, and the findings are presented in Chapter 3. Here we describe the nature and outcomes of the assessment of reading, writing, numeracy and ICT skills, extending the overview given in Chapter 1.

Fifteen reading tasks were newly developed for use in the survey, three at each of Levels A to E. The 'scientific' topics that featured in the tasks represented in equal numbers the three Knowledge and understanding outcomes. At Levels C, D and E each reading task was accompanied by an associated writing task, which focused on the same topic. There were thus nine writing tasks in total.

Six different ICT tasks were also newly developed for use, each task carefully set in a 'scientific' context. These were administered in schools by trained field officers (see Chapter 1 for general information about the practical assessment). In addition, 15 numeracy tasks were administered in the survey. These originated from the 2002 Social Subjects survey, where they each formed an integral component in larger 'enquiry skills' tasks. Some of the numeracy tasks were used in identical form in the science survey, save for an occasional title change. Others were re-set in scientific contexts. The numeracy tasks were embedded in the science test booklets described in Chapter 2, one numeracy task per booklet.

All the survey schools - 155 primary schools at each stage and 130 secondary schools - participated in the assessment of reading, writing and numeracy. Each school provided up to 10 randomly selected pupils for numeracy assessment (these pupils were those who attempted the science test booklets), and up to eight randomly selected pupils for reading and writing assessment. Just over half the primary schools and two-thirds of the secondary schools, that is 80-90 schools at each stage, also participated in the assessment of pupils' ICT skills (and science investigation skills).

4.2 Reading

4.2.1 The reading tasks

Each of the 15 reading tasks administered in the survey took the same general form as those typically used in AAP English surveys, viz. a source text, or series of related texts, accompanied by a set of comprehension questions. In this case, however, the texts focused on science-relevant topics or issues, as the titles in Table 4.1 suggest. At Levels C, D and E, each reading task was accompanied by a writing task, focusing on the same topic. Table 4.2 describes one task at each of Levels A, C and E, and the writing tasks associated with them.

Table 4.1 Topics featured in the reading and writing tasks*

Energy and Forces

Earth and Space

Living things and the Processes of Life

Level E

Light

Elements and Metals

Soil

Level D

Gravity

Recycling

Seeds

Level C

Friction

Metals

The Eye

Level B

Energy

Paper

Plants

Level A

Electricity

The Moon

Goldfish

* Writing tasks accompanied reading tasks at Levels C, D and E only

Table 4.2 Overview of three reading tasks and two writing tasks

'Soil' - Level E

The 600-word text, ostensibly written by a newspaper Environment Editor, focuses on soil biodiversity. It provides information about the prevalence of known organisms and micro-organisms in the soil in various types of habitat and in various parts of the world, and describes a UN research project planned to search for currently unknown species. A total of 31 test items are presented in three sections: Section A comprises short-response items, Section B presents a series of sentence completion items, and Section C is an 18-gap summary completion. Section D, the writing task, invites pupils to "Write a short report explaining how poor countries could benefit from further research into the area of soil biodiversity".

'Friction' - Level C

The 400-word text explains what friction is, gives everyday examples of its effects, and discusses the value of engine lubrication. Six sections of test items follow, presenting 20 items in total. Section A invites pupils to circle three correct descriptions of friction from among six possibilities. Section B invites pupils to give two examples from the text where friction was helpful and two where friction caused problems. Sections C and E are summary completions, while Sections D and F comprise sentence completion items. Section G, the writing task, invited pupils to "Write a short report on some of the ways that friction affects our lives. Also, think of other everyday examples of how friction affects us."

'The Moon' - Level A

A short text of approximately 200 words gives information about the moon's relative size, composition and orbit, and is followed by 17 test items in four sections. Section A presents three circles of different radius, which the pupils must identify as sun, moon and earth. Section B contains six sentence completion items. Section C presents a word box, and pupils are to circle the three things they 'would find on the moon'. Section D is an assisted (through word boxes) 7-gap summary completion exercise.

The number of test items varied across tasks, from 17-18 per task at Level A rising to 30-35 per task at Level E, and featured a variety of different formats, including multiple choice, summary completion, sentence completion, and short response. Items sharing common formats were grouped into sections, as illustrated in the task examples given in Table 4.2.

Reading tasks were classified by 5-14 level on the basis of the perceived level of their source texts. No prior science knowledge was needed in order to respond successfully to the tasks, although inevitably prior knowledge will occasionally have helped.

Tasks were presented to pupils in the form of test booklets. A single reading task comprised a booklet at Levels A and B, while a reading task and its associated writing task comprised a booklet at Levels C, D and E. Individual pupils attempted two test booklets, one at each of two consecutive levels: P3 pupils attempted reading tasks at Levels A and B, P5 pupils attempted tasks at Levels B and C, P7 pupils attempted tasks at Levels C and D, and S2 pupils attempted tasks at Levels D and E.

Completed booklets were processed centrally. Pupils' responses to the reading tasks were transferred onto specially designed transcription sheets for later keyboarding and machine marking, as described in Chapter 1. In total, just under 5000 scripts were analysed, emanating from over 2500 pupils in over 400 schools (some primary schools were too small to supply the 10 pupils needed for science assessment plus the eight pupils needed for core skills assessment, hence the lower numbers of schools involved here).

4.2.2 Pupils' reading attainment

The attainment results for reading were produced by applying the usual 65% cut-off score to the pupils' total task scores, as described in Chapter 1, and then averaging the proportions of 'secure' attainers over the three tasks at each level. The resulting proportions of pupils classified as 'secure' at the level concerned (the level of the task) are given in Table 4.3.

As Table 4.3 shows, reading attainment was relatively good at P3, with over 80% of the pupils demonstrating secure attainment at Level A, and over half demonstrating secure attainment at Level B. P5 attainment is a little lower at the levels assessed at this stage: 70% showed secure attainment at Level B and just under 40% at Level C.

Table 4.3 Reading attainment by stage *

(% pupils correctly answering 65% or more marks per task ,averaged over the three tasks at each level)

Stage

Level A

Level B

Level C

Level D

Level E

S2

57

43

P7

63

37

P5

70

38

P3

84

55

* Margins of error for the estimated proportions vary between 3 and 4 percentage points.

P7 and S2 performance is rather similar at the two levels assessed at these stages: around 60% of the P7 pupils showed secure attainment at Level C and around 40% did so at Level D, with the same picture emerging for S2 at Levels D and E. Interestingly, the reading results for P7 and S2 in this science survey are in line with the results reported for the AAP English Language survey conducted in 2001 (P3 and P5 did not feature in that survey), in which reading was assessed using very similar tasks but set in more everyday contexts (see Table 4.4). At Level E the attainment results are identical, at Level D they are close, and even at Level C the sample difference in favour of the English survey does not reach statistical significance.

Table 4.4 Reading attainment at P7 and S2: 2001 English survey and 2003 Core Skills survey

(% pupils correctly answering 65% or more marks per task, averaged over the tasks at each level*)

Level C

Level D

Level E

2001

2003

2001

2003

2001

2003

S2

58

55

43

43

P7

72

63

41

36

* Different tasks were used in the two surveys

Interestingly also, there is little evidence in Table 4.5 of the gender gap usually found in favour of girls in reading. Indeed, this time it is the boys who have produced the slightly better performances than the girls. None of the sample differences, however, reaches statistical significance.

Table 4.5 The reading attainments of boys and girls

(% pupils achieving 65% or more marks per task, averaged over three tasks at a level)

Level A

Level B

Level C

Level D

Level E

S2

Boys

59

43

Girls

56

42

B-G

3

1

P7

Boys

64

39

Girls

62

35

B-G

2

4

P5

Boys

72

39

Girls

69

36

B-G

3

3

P3

Boys

83

52

Girls

85

57

B-G

-2

-5

4.3 Writing

4.3.1 The writing tasks and writing evaluation

Each of the nine writing tasks was included in a reading booklet as a final "section", following the sections of test questions that explored pupils' understanding of the stimulus text (see the two examples described in Table 4.2). At P5, each pupil was invited to produce one piece of writing, relating to their Level C reading task. At P7 and S2 each pupil should have produced two pieces of writing, relating, respectively, to the two reading tasks they attempted at Levels C and D (P7) or Levels D and E (S2). This scale of testing resulted in a very large number of individual pieces of writing. In order to make the scale of writing evaluation manageable, a sample of just over 2500 pieces of writing was selected for writing evaluation. The sample was drawn at random, and included examples of writing from all nine tasks at the stages at which each individual task was used. Around 100 different schools at P5 and around 150 different schools at P7 and S2 each provided between one and 12 completed booklets, across all tasks at the relevant stage.

Pupils' writing productions were marked in autumn 2003, by 75 of the field officers who had undertaken the practical assessments in the survey. Since problems had arisen during the marking of writing in the 2002 Social Subjects survey, when an attempt was made to apply an adapted form of the National Criteria for writing, it was decided to use an holistic evaluation scheme here, based on 'best fit' descriptions (see Table 4.6).

The writing evaluators were given some minimal orientation for the writing evaluation activity after their post-survey debriefing. During the meeting they reviewed a small sample of scripts as a group, applying the holistic evaluation scheme. In the days following the meeting, and with access to exemplar materials on the SQA 5-14 web site, they each independently evaluated 50 scripts, drawn from several different tasks at a stage.

Because the evaluation of writing is inevitably a subjective process, the issue of rater agreement is always a concern. It cannot be assumed that different evaluators judging the same script will come to the same decision about writing relevance or quality, whatever the nature of the evaluation scheme that is used. In order to explore this issue here, a rater agreement study was incorporated into the general writing evaluation exercise, in that roughly every other piece of writing was independently evaluated by four different evaluators. The results of this exploratory study are shown in Figure 4.1.

Figure 4.1 Distribution of agreement rates over all evaluator pairs and writing tasks

chart

As expected, it was relatively rare for all four evaluators to award the same attainment level to the same pieces of writing. The highest rate of unanimous agreement for any task was 18% and the lowest was zero. The average inter-rater agreement among pairs of evaluators, was 40% overall (the highest agreement rate per task was 60% and the lowest 7%); this fell to 33% for S2 writing, which was judged by secondary science teachers. Among other problems, the evaluators mentioned difficulty in judging sparse writing, difficulty in deciding content relevance, and difficulty in judging writing quality independently of the nature and accuracy of the relevant content. Averaging over all tasks, the proportion of scripts that could have been given a level classification based on 'majority vote' was 79%.

Table 4.6 The 'best fit' descriptions used to judge pupils' writing

Level F

In response to a Level F task, the writing begins to convey discernment. Ideas are logically and clearly organised throughout and are well linked and supported with appropriate detail. There is a well-developed, effective conclusion, where appropriate. Paragraphs, where relevant, are used correctly throughout. There is appropriate variety in sentence structure and sentences are accurately constructed, linked and punctuated. Spelling is accurate in the main.

Level E

The writing begins to convey discernment. Ideas are logically and clearly organised throughout and are well linked and supported with appropriate detail. There is a well-developed effective conclusion, where appropriate. Paragraphs, where relevant, are used correctly throughout. There is appropriate variety in sentence structure and sentences are accurately constructed, linked and punctuated. Spelling is accurate in the main.

Level D

Ideas are described in detail and are logically and clearly organised throughout. The writing includes relevant and consistent supporting detail. There is a simple but effective conclusion, where appropriate. Paragraphs, where relevant, are used correctly for the most part. There is some variety in sentence structure and most sentences are punctuated accurately. Most of the words needed for the task are accurately spelt.

Level C

The writing conveys a clear sense of ideas that are in the main organised logically without significant omissions/repetition. There is a simple conclusion or rounding off, where appropriate. Some attempt is made to paragraph writing. In the main, the punctuation supports what has been written. Less commonly used words are spelt with increasing confidence and accuracy.

Level B

The writing conveys a main idea with sufficient information to make the message clear. The information is mostly organised logically. Common linking words are used to organise ideas into sentences (e.g. and, then, but, so, that) and punctuation is beginning to support what has been written. An increased range of commonly used words is spelt accurately.

Level A

The writing conveys one or two details, which are linked and mostly relevant. Common linking words are used to organise ideas (e.g. and, then). A capital letter and a full stop are used to mark at least one sentence. Commonly used words are spelt accurately.

4.3.2 Pupils' writing attainment

Table 4.7 presents the results of the writing evaluation exercise, while Figure 4.2 illustrates the picture of pupil attainment. The summary data are based on one single evaluator judgment per script. In the case of those scripts that were independently judged by more than one evaluator in the rater agreement study - just under half the total - the judgments of one only of the evaluators concerned have been used in these analyses, this individual having been selected at random.

In the event, the evaluators found that there was no writing to evaluate in almost 20% of the booklets they were asked to review. Just under 10% more of the scripts contained an insufficient amount of writing to permit a level judgment. The pupils concerned had presumably either run out of time before finishing their test booklets - the writing followed the reading - or were simply not motivated to make the effort to produce much, if any, writing. The presentation of the writing tasks as final sections within the reading tasks could have been a contributory factor here. Pupils might have viewed the task as an extended response reading task or as a restricted response writing task, and either way might not have felt inspired to generate much writing, or any writing, particularly under survey conditions. The proportions of pupils who failed to produce any writing at all varied across tasks, ranging from one in ten for 'Metals', a task used at P5 and P7, to around one in four for 'Light', one of the three tasks used at S2. Over all tasks, a higher proportion of the boys than of the girls produced no text at all (20% versus 14%).

An additional 4% of the scripts offered writing whose science content was considered irrelevant (while it was intended to be the quality of the writing that was judged, and not the accuracy or relevance of the content, the teacher evaluators, particularly the secondary science teachers, found it difficult to separate the two). The end result is that the evaluators were able to make judgments about writing quality for just 70% of the scripts they reviewed (see Table 4.7 and Figure 4.2).

Table 4.7 Writing attainment: Level distributions by stage and gender*

(% pupils classified into each 5-14 level, based on rateable scripts)

Subgroup

Tasks

Scripts

Level A

Level B

Level C

Level D

Level E

Level F

S2

6

612

20

29

28

16

6

1

P7

6

880

27

40

25

8

1

0

P5

3

332

43

41

14

2

0

0

Boys

9

911

19

25

15

6

1

<1

Girls

9

913

19

25

18

8

2

<1

All tasks

9

1824

19

25

16

7

2

0

* Nine writing tasks featured in the survey: three tasks were used at P5 and P7, three more were used at P7 and S2, and three further tasks were used at S2 only.

Table 4.7 and Figure 4.2 provide clear evidence of stage progression in writing. While more than 40% of the rateable writing produced by P5 pupils was rated at Level A, the corresponding proportions at P7 and S2 are markedly lower, at fewer than 30% and 20%, respectively. The proportions rated at Level C or higher at the three stages are 16%, 34% and 51%, respectively. However, the data present a relatively bleak picture of writing attainment at the target levels for P7 and S2 pupils (Levels D and E, respectively): fewer than 10% of the rated scripts at P7 and S2 were judged to show evidence of writing at the target level or higher.

Figure 4.2 The picture of writing attainment P5 to S2 *

chart

Interestingly, among those pupils who produced enough relevant writing to be given a level rating, there is no evidence in Table 4.7 of any gender difference in writing quality.

The fact that a high proportion of the survey pupils attempted both reading and writing tasks means that it is possible to explore the relationship between the two skills. Not surprisingly, given the rather poor performance the pupils in general demonstrated in writing, in the majority of cases pupils' judged writing attainments were lower than their reading attainments, with a minority showing the reverse pattern. Just over a quarter of the pupils were classified into the same level for both reading and writing.

4.4 Numeracy

4.4.1 The numeracy tasks

Each of the 45 pencil and paper enquiry skills tasks specially developed for use in the 2002 Social Subjects survey comprised three subtasks, between them featuring three different types of information resource: one task was based on a text, another focused typically on a table or chart of some kind (the 'numeracy' subtask), while the third featured a map, drawing, diagram, photograph or whatever. Among the numeracy subtasks, several were identified that could be used unchanged or that could be used after minor modifications to wording to render the context more 'scientific'.

This represented an opportunity to explore the possible effect of subject context on numeracy performance whilst at the same time providing information about numeracy skills within the science survey. In response, it was decided to seize the opportunity to capitalize on previous experience and to re-use some of the numeracy tasks. Fifteen different tasks were eventually selected for re-administration in the science survey, three at each of Levels A to E (the level of any numeracy task is simply the level of the original parent social subjects enquiry skills task - the skill demands in the numeracy tasks were designed to be at or below the level of the parent task). Table 4.8 overviews the tasks, while Table 4.9 offers brief descriptions of one task at each of Levels, A, C and E.

Every task comprised a number of atomistic test items, the number per task varying from four to ten. For the most part, the items required pupils to identify highest or lowest values, to count values, and to transfer values from one presentational form into another (chart to summary completion paragraph, table to bar chart, etc). In a very small minority of items, pupils were required to add or subtract values, to compare values, or to describe in words the trend shown in a set of data, usually in graphic form. Every item was scored dichotomously.

Table 4.8 The numeracy tasks

Task name

'Level'

Originating parent task from the 2002 Social Subjects survey

Context

Travel Then and Now

A

Travel Then and Now

Same

Bird Table

A

Egyptians

Changed

Keep Fit

A

Guy Fawkes

Changed

The Wright Brothers

B

The Wright Brothers

Same

The Science Centre

B

Nursery Closure

Changed

Children in Need

B

Children in Need

Same

Nature Trail

C

Vikings

Changed

Water

C

Water

Same

Forest Trail

C

Pool Closure

Changed

Our Environment

D

Sri Lanka

Same

Water Quality

D

River Study

Same

Genetically Modified Food

D

Fireworks

Changed

Infectious Diseases

E

The Great Plague

Same

Coastal Erosion

E

Sculpting the Earth

Same

Food Complaints

E

Advertising

Changed

Table 4.9 Overview of three numeracy tasks

'Food Complaints'

- adapted from the 2002 Social Subjects task 'Advertising' (Level E)

This task features a data table, ostensibly showing the numbers of complaints to the Food Standards Agency in 1998 and 1999, along with the numbers of complaints that were investigated or not investigated, with resulting decisions or reasons for non-investigation. Pupil understanding was assessed using a 10-gap summary completion task, with two of the gap items requiring subtractions.

'Water'

- previously used in the 2002 Social Subjects survey (Level C)

This task also features a table of data, this time water consumption rates in the UK. Related questions require pupils to identify the activity that consumes the most water (toilet flushing), to sum water usage for particular types of consumption (drinking, cooking and gardening), and to transfer values from the table into a partially completed bar chart.

'Bird Table'

- adapted from the 2002 Social Subjects task 'Egyptians' (Level A)

'Bird Table' features a pictograph showing 'different kinds of birds that Primary 3 saw feeding in the bird table in the playground'. One question requires pupils to transfer data from the pictograph into a partially completed frequency table, while others require them to count the total number of birds spotted, both before and after having updated the pictograph to reflect additional bird observations.

A numeracy task was inserted into every science test booklet. At P3 the numeracy tasks were at Levels A and B, at P5 they were at Levels B and C, at P7 they were at Levels C and D, and at S2 they were at Levels D and E. Pupils' responses to the tasks were processed as described in Chapter 1, and machine marked.

4.4.2 Pupils' numeracy performance

Table 4.10 and Figure 4.3 present the results of the numeracy assessment, in the form of average item facilities over all items and all pupils at each level and each stage (note that these data have not been produced by applying cut-off scores to test scores, and they do not therefore represent percentages of pupils attaining particular levels in numeracy).

Table 4.10 Numeracy performance P3 to S2

(average item facility over all items at a stage and level)

Stage

Level A

Level B

Level C

Level D

Level E

S2

67

54

P7

78

55

P5

78

73

P3

62

61

Figure 4.3 Numeracy attainment P3 to S2

(average item facility over all items and all pupils at a stage and level)

chart

In general, Table 4.10 confirms that pupils' numeracy skills are good, the average item facility being over 50% in every case, reaching particularly high levels at P5 and P7 for the items at Levels B and C, respectively. Even where items demanded computational skills, facility values were relatively high: for example, item facilities of over 60% at P3 for simple pictograph counts in the Level A task 'Bird Table' (described in Table 4.9), 45-50% at P3 for three addition/subtraction items in the Level A tasks 'Travel Then and Now' and 'Keep Fit'; 55-65% at P3 and 75-85% at P5 for two addition items in the Level B task 'Children in Need'; 60-80% at P5 and 65-90% at P7 for four addition/subtraction items in the Level C tasks 'Nature Trail' and 'Water'; 35-55% at S2 for three subtraction items in the Level E task 'Food Complaints'.

The only cross-stage anomaly related to a pie chart item in the Level D task 'Genetically Modified Foods'. The pie chart showed the results of a fictitious opinion survey among a sample of 1000 people. Pupils were asked first what was the most popular opinion on GM foods, and were then asked what opinion on GM foods a quarter of the people held. These were well answered. Then came the calculation item: 'How many more preferred further testing to banning them altogether?'. This item required pupils to identify the proportion of individuals who were in favour of further testing (answer: 20%) along with the proportion who were in favour of banning GM foods altogether (answer: 12%), and then subtracting the one proportion from the other (in the answer line they were clued by the % symbol simply to difference the proportions - to get 8% - and not to calculate the actual number of people this would translate to). Over a third of the P7 pupils (37%) correctly answered the question compared with 28% of the pupils at S2 (the corresponding figures were 63% and 53%, respectively, in the social subjects survey); 39% of the P7 pupils and 56% of the S2 pupils, in almost equal proportions, gave 12% or 20% as their answer, that is one or other of the two percentages that should have been subtracted.

The pattern of numeracy attainment over the different levels and stages is similar in both the science and the social subjects surveys. Interestingly, however, there is a consistent, if small, difference in performance across the board in favour of the social subjects survey (see Figure 4.4).

Figure 4.4 A comparison of numeracy attainment in the Science and Social Subjects surveys

(average item facility over all items and all pupils at a stage and level)

chart

The performance differences at any particular stage and level are not large, but the pattern of consistency is noteworthy. It is tempting to conclude that the different contexts in which the tasks were set (science versus social subjects) have influenced motivation levels and hence attainment. Non-response rates were indeed generally higher in the science survey than in the social subjects survey, particularly at P7 and S2. Since the numeracy tasks were embedded in the science booklets the generally higher task non-response rates in science will have affected the numeracy tasks as well as the science tasks. This will almost certainly have contributed to the lower numeracy performances for the science context. This issue of subject context effect is worthy of further research.

There were no differences in the numeracy performances of boys and girls in this science survey or in the previous survey of social subjects.

4.5 Using information technology

4.5.1 The tasks and their administration

The six ICT tasks used in the survey were developed with reference to the 5-14 Guidelines for ICT 9, and focused to varying degrees on the strands 'Using the technology', 'Creating and presenting', 'Searching and researching' and 'Communicating and collaborating'. While the tasks themselves were not assigned to specific 5-14 levels, three of the tasks assessed attainment targets at Levels B and C, and another three assessed attainment targets at Levels D and E.

Every task was set in a scientific context, and, while none required any prior science knowledge or understanding the contexts, were selected to cover the three outcomes 'Understanding Energy and Forces', 'Understanding Earth and Space' and 'Understanding Living Things and the Processes of Life' (see Table 4.11). The three Level B/C tasks featured contexts considered by the task developer to be suitable for pupils in P3 and P5, and the three Level D/E tasks were considered suitable for use with pupils in P7 and S2. In practice, in order to offer a comprehensive basis for cross-stage attainment comparisons, the Level B/C tasks were used at P7 as well as at P3 and P5, and the Level D/E tasks were used at P5 as well as at P7 and S2.

Table 4.11 The 'scientific' contexts of the ICT tasks related to Knowledge and understanding outcomes

Levels

Energy and Forces

Earth and Space

Living Things and the Processes of Life

D/E

Streamlined

Burning

Snails

B/C

Magnetz

Waterworld

Senses

Each task comprised a series of activities based on a set of web pages and other resources held on CDROM, and was planned to take about 30 minutes of testing time. The activities included modifying graphics and editing text (using Word), entering data into a spreadsheet and manipulating that data (using Excel), sending an email message and carrying out and interpreting a simulated web search. Table 4.12 overviews one of the Level B/C tasks and one of the Level D/E tasks.

The tasks were administered in around half the survey schools by trained field officers (see Chapter 1 for details of field officer numbers and training). Pupils worked on laptop computers provided by the field officers, and they were given a few minutes to familiarise themselves with these before the assessments began, using the mouse to control the cursor.

In each participating school, up to eight pupils were involved in the ICT assessment, each pupil working independently on one or two different randomly assigned tasks. After the brief laptop familiarisation, the pupil under assessment was given the relevant task sheet and asked to follow the instructions. If the pupil was struggling, the field officer directed him/her to the next question or brought the session to a close. Individual activities were rated as 'completed successfully', 'completed with some support', 'no real attempt', 'technical problem' or 'ran out of time'. At the end of the task the pupil's task sheet was collected in and filed with the checklist for later analysis.

Table 4.12 Overview of two ICT tasks

'Burning' - Level D/E

On the instructions of the field officer, and after a few minutes of familiarisation with the laptop computer, the pupil was first asked to access a web page (simulated on CD), and then to move from page to page using the mouse ('Navigating a document'). Next, the pupil was instructed to enter a table of data into an Excel spreadsheet and to produce an appropriate graph to display the data ('Managing data'). The third set of activities focused on web site use: the pupil had to identify the URL of a displayed web site (again simulated on CD), and then look for related web sites using an appropriate keyterm search ('Searching and researching'). Finally, the pupil was asked to prepare a short email message to send to a friend ('Communicating and collaborating').

'The Senses' - Level B/C

This task began like all the others. On the instructions of the field officer, and after a few minutes of familiarisation with the laptop computer, the pupil was asked to access a simulated web page, and to move from page to page using the mouse ('Navigating a document'). Then the pupil was told to complete a table of data in a Word file, and to format the table, centring text in columns ('Managing data'). The final activity required the pupil to identify the URL of a simulated web site and to indicate how an accessed web page might be printed ('Searching and researching').

Before the attainment results are presented, some of the problems that arose during this innovative exercise merit comment. Firstly, as already noted, pupils were constrained to work with the laptops provided by the field officers, despite the fact that such a laptop environment was probably unfamiliar to most of them. Secondly, the tasks were only available for use on PCs, so that those pupils who were not familiar with the PC environment would have been disadvantaged. Thirdly, the text handling and data manipulation tasks required use of Word and Excel, software which was probably unfamiliar to most of the younger pupils. Fourthly, there were technical difficulties involved in leaving a simulated web page to edit text in Word or manipulate data in Excel and then returning to the web page. Field officers were advised to assist pupils should they meet problems in this sense. Finally, the text on some of the simulated web pages was not clear, the page images having been reduced to make space for task instructions to be shown alongside, and this might have made it difficult for pupils to answer some of the questions.

4.5.2 Pupils' ICT performance

Table 4.13 presents the overall ICT performance results, while Figure 4.5 illustrates the picture. Averaging over the activities and tasks used at any one stage, Table 4.13 records the percentages of pupils who completed the activities successfully without help from the field officer, the percentages who completed the activities successfully with some prompting from the field officer, the percentages of pupils made no real attempt to complete the activities, and the percentages of pupils who hit technical problems, or ran out of time.

Table 4.13 Response profiles across stages for the ICT tasks*

(% pupils classified into each response type, averaged over activities within tasks and over tasks)

Level

Stage

Unaided

Aided

No attempt

Other**

D/E

S2

63

17

11

9

P7

50

20

20

10

P5

42

17

29

12

B/C

P7

77

9

10

4

P5

68

12

16

4

P3

48

15

29

7

* There were 400-450 pupils per task at P3 and S2,and typically 200-250 at P5 and P7

** 'Other' includes running out of time, meeting a technical problem, or not being rated for whatever other reason

Figure 4.5 Response profiles across stages for the ICT tasks

(200-450 pupils per stage per task)

chart

As expected, Table 4.13 provides evidence of a clear attainment progression from one stage to another. On average, almost half the P3 pupils managed to complete the various Level B/C activities unaided, rising to almost 70% at P5 and over 75% at P7. For the activities at Levels D/E, just over 40% of the P5 pupils managed these without help, rising to 50% at P7 and over 60% at S2. Around a third fewer pupils completed activities without aid as they moved from Levels B/C to Levels D/E (P5 and P7 only). The 'No real attempt' figures follow the same pattern. These range from 10% of pupils to 30%, depending on the stage and level of skills demand: at P5 and P7, where pupils faced skills demands at both level groupings, activity non-response rates doubled from Levels B/C to Levels D/E.

As far as the four activity groups are concerned, viz. 'Navigating a document', 'Managing data', 'Searching and researching' and 'Communicating and collaborating' (in practice 'email communication' here), Table 4.14 illustrates the general picture. Over 90% of the pupils at all stages and in all the tasks managed to use the mouse to open a web page and then to follow links to access other web pages (90-92% at P3, 90-98% at P5, 95-98% at P7, 97-98% at S2). Success in the other skill areas was lower and very variable from one activity to another within general activity types.

Table 4.14 Unaided success rates across types of ICT activity*

(% pupils averaged over activities within tasks and over tasks)

Level

Stage

Navigating a document

Managing data

Searching & researching

Communicating

D/E

S2

97

53

63

65

P7

97

33

47

50

P5

96

24

37

25

B/C

P7

96

67

79

58

P5

94

59

66

46

P3

91

35

35

26

* There were 400-450 pupils per task at P3 and S2, and typically 200-250 at P5 and P7; unaided success rates are averaged over between two and 11 individual activities per category

The majority of pupils at all stages, but rising with age, were able to enter text or data correctly into a given spreadsheet table (50-70% at P3 rising to over 80% at other stages). Much lower proportions were able to create a table from scratch before entering information into it (tested only in the tasks given to pupils at P5, P7 and S2). Around 10% of the P3 pupils, rising to 30-50% at P5 and P7, were able to centre text or data in columns without help (around half the P3 pupils and around 30% of those at P5 and P7 made no attempt at this activity, which was not assessed in the tasks used at S2).

In two of the tasks used at P5, P7 and S2, pupils were asked to produce an appropriate graph to re-present their table of data. In both cases the results were the same: just under 5% of the P5 pupils managed to produce a graph without help, rising to around 10% at P7 and around 35% at S2 (60-70% of the P5 pupils made no attempt to produce a graph, compared with 45% at P7 and just under 20% at S2). In principle the most challenging activity of all, two of the Levels D/E tasks required pupils to perform some computation on their table of data, finding a difference in one case and calculating an average in the other. Fewer than 5% of the S2 pupils and none of the pupils at P5 or P7 attempted to perform the calculations using Excel's function facility. Instead, 65% of the P5 pupils and around 80% of the pupils at P7 and S2 subtracted the numbers in question mentally, while 6% of the P5 pupils, around 20% of the P7 pupils and around 40% of the S2 pupils mentally calculated the average they needed. High proportions of pupils made no attempt to produce the required numbers by either method.

Of the various web site searches and emailing activities included in the tasks, most were reasonably well done, with 25-50% of the P3 pupils, and 40-80% of others, succeeding without help, the proportions varying with activity, level and stage. Three activities proved more difficult than most in this group. In one email activity pupils were to enter a particular email address into the Cc. box in the message header, and were expected to suggest using the attach facility to send a letter to a friend: 60-80% of the P3 and P5 pupils made no attempt to do either, compared with around half the P7 pupils. Around 10% of the P3 pupils, 20% of the P5 pupils and 35% of the P7 pupils did manage to enter the copy address successfully without help, but only 1% of the P3 pupils, 7% of the P5 pupils and 23% of the P7 pupils suggested using the attach facility. A similar pattern emerged when pupils at P5, P7 and S2 were asked to suggest three appropriate keywords when embarking on a given (simulated) web search.

There was no indication in the performance data of any differences in the ICT skills of boys and girls.

4.6 Summary

In addition to problem solving, which featured naturally here, the core skills of reading, writing, numeracy and using ICT were assessed in this 2003 Science survey. Reading, writing and numeracy were assessed using pencil and paper tasks, whilst ICT skills were assessed practically, with pupils working on laptop computers supplied by itinerant trained field officers. All the tasks used were carefully set in 'scientific' contexts.

Reading attainment was good at all stages. Over 80% of the P3 pupils were 'secure' at Level A (gaining at least 65% of the marks on their Level A task) and over half were secure at Level B; around 70% of the P5 pupils were secure at Level B and around 40% were secure at Level C. Roughly 60% of the pupils at P7 and at S2 were secure at Levels C and D, respectively, and around 40% were secure at Levels D and E, respectively. The results at P7 and S2 are in line with those reported for reading attainment in the 2001 English Language survey. In contrast with expectations, there were no apparent gender differences in reading in this survey.

The assessment of writing (conducted at P5, P7 and S2 only) produced two disappointing results. Firstly, an inter-rater agreement study highlighted the continuing problems that are inevitably faced when trying to evaluate pupils' writing. Despite training in the use of an holistic evaluation scheme, and access to exemplification materials, the inter-rater agreement rate was just 40% on average across pairs of raters and over several different writing tasks (this means that, on average, two different raters would allocate the same level classification to the same piece of writing in just 40% of cases). Secondly, the pupil attainment picture that emerged revealed evidence of relatively poor writing skills at all stages. In particular, fewer than 10% of the P7 and S2 pupils produced writing that was rated as being at the target levels for these stages (Levels D and E, respectively), and fewer than 15% of the P5 pupils produced writing deemed to be at Level C or higher. For most of the pupils involved in the reading and writing assessment, writing attainment was lower than reading attainment; just 25% of the pupils assessed in reading and writing were classified at the same level in both. Finally, while proportionally more of the boys than of the girls failed to produce any writing in the survey (20% versus 14%), there were no gender differences in the quality of the writing that was produced and evaluated.

Pupils' numeracy skills, on the other hand, appear to be quite well established. On average, more than half the pupils at every stage successfully answered their numeracy items, even where these involved computation. The numeracy tasks used here were derived from those used in the previous Social Subjects survey, and were used in the science survey in identical form or with simple changes to wording to render the contexts more 'scientific'. A comparison across surveys revealed slightly better performances across all stages and tasks in the social subjects survey than in the science survey, the lower achievements in the science survey reflecting generally higher non-response rates for all tasks in the science test booklets in which the numeracy tasks were embedded. There were no gender differences in numeracy attainment in either survey.

Despite a variety of problems associated with the exercise, the practical ICT assessment produced evidence of well-established skills in some areas and of clear progression elsewhere from one stage to another. Over 90% of the pupils at all stages, even P3, were skilled at opening documents and navigating through web pages. Around a third of the P3 pupils were also able unaided to add information to existing spreadsheet tables, compared with almost 60-70% of the pupils at other stages - at all stages lower proportions of pupils could centre information within columns. At P5, P7 and S2, where these skills were tested, rather low proportions of pupils managed to produce graphs to illustrate their tables of data, and none, or almost none, knew how to use the formula facility in Excel to carry out simple calculations on their data. Around a third of the P3 pupils were at ease simulating web searches while a quarter were able unaided to prepare to send an email; at the other stages the proportions were higher, ranging from 50% to 80%, depending on the activity and the stage. As was the case for the other core skills, there were no gender differences evident in the ICT attainment data.

« Previous | Contents | Next »

Page updated: Thursday, March 24, 2005