« Previous | Contents | Next »
Listen
Assessment of Achievement Programme:
Report of the Sixth AAP Survey of Science (2003)
1. Introduction
1.1 Overview
The 2003 Science Survey was the sixth survey to be
carried out in this curriculum area within the Assessment
of Achievement Programme (AAP), and the second subject
survey to incorporate the assessment of core skills
attainment - the first being the Social Subjects Enquiry
Skills Survey of 2002. The aims of the survey were to:
- assess the science attainment of pupils in P3, P5,
P7 and S2;
- assess the core skills attainment of pupils in P3,
P5, P7 and S2;
- compare attainment across the four stages and
between boys and girls;
- report attainment in terms of 5-14 levels whenever
possible;
- explore pupils' informed attitudes in science;
- provide a learning context against which to reflect
on the attainment and attitude findings.
The 2003 survey differed from previous AAP surveys in
science in three principal ways: pupils'
Knowledge and understanding in science is reported
with reference to 5-14 levels; science attainments have
been assessed for pupils as young as P3; and an attempt has
been made to explore the development of informed attitudes
in science. Two important events led to the first two of
these innovations. Firstly, in 2000 the National Guidelines
for Environmental Studies were revised
1, with the result that
Knowledge and understanding in science is now,
like other outcomes in this and other subjects, organised
into strands, levels and attainment targets rather than key
features and stage bands. Secondly, in 2002 Social Subjects
joined English Language, Mathematics and Science in the
AAP's subject coverage, demanding a shift from a 3-year to
a 4-year subject cycle and inviting a parallel shift in
focus from P4, P7 and S2 to P3, P5, P7 and S2.
The survey involved pupils in maintained and independent
schools across Scotland (special schools were excluded),
and testing took place in May and June 2003. Schools and
pupils were randomly selected for survey participation in
such a way that the tested pupils represented the national
cohort at the stage concerned. In total, just under 9000
pupils in just under 600 schools took part in the survey:
around 2000 pupils in around 155 primary schools at each
primary stage, and just over 2000 S2 pupils in 130
secondary schools. Just over half the pupils attempted
pencil and paper science tasks, while the remainder
participated in core skills reading and writing assessment.
The 'science' pupils in over half the survey schools also
took part in practical science assessments, while the 'core
skills' pupils in these same 'practical' schools also
undertook ICT tasks or participated in group discussions.
Full details of the sampling strategy and the resulting
samples are given in Appendix B.
To provide a context for the attainment and attitude
findings, samples of participating schools at P5, P7 and S2
were invited to complete questionnaires about the resources
available to them for science and about their science
provision, and their sample pupils were invited to complete
questionnaires seeking their views about their science
learning experience and exploring their attitudes to
topical issues in science and technology.
1.2 The assessment framework and approaches to
assessment
The assessment framework for the 2003 survey was in part
established by a Science Reference Group, which comprised
subject specialists, practising primary and secondary
science teachers and other education professionals, and
which met for the first time in August 2002.
1.2.1 Knowledge and understanding
The three
Knowledge and understanding attainment outcomes in
the 5-14 Environmental Studies framework are shown below,
with their respective strands:
Understanding Earth and Space:
- Earth in space
- Materials from Earth
- Changing materials
Understanding Energy and Forces:
- Properties and uses of energy
- Conversion and transfer of energy
- Forces and their effects
Understanding Living Things and the Processes of
Life:
- Variety and characteristic features
- The processes of life
- Interaction of living things with their
environment
The survey was designed to assess and to report P3
attainment at Levels A and B, P5 attainment at Levels B and
C, P7 attainment at Levels C, D and E, and S2 attainment at
Levels D, E and F, for science as a whole.
Given the various constraints that national attainment
surveys inevitably operate within, it would not be possible
to assess and to report pupils' level-based attainment for
each outcome separately. However, following the advice of
the Reference Group, the three outcomes were equally
covered within the entire set of tasks administered to
pupils in the survey, so that some comment on comparative
performance is possible.
Over time attainment comparisons are offered on the
basis of tasks previously administered in the 1999 survey
and re-used on this occasion. Given that P4 no longer
features in the AAP survey programme, and given that Level
F did not feature at S2 in the 1999 survey, it was accepted
that temporal comparisons would be possible at P7 and S2
only, at Levels C (P7 only), D and E.
1.2.2 Skills in science -
investigating
The three strands that comprise investigation skills in
science are:
- Preparing for tasks
- Carrying out tasks
- Reviewing and reporting on tasks
In the 1999 science survey, pupils' skills attainment in
science was explored using pencil and paper tasks
2. For the 2003 survey, the Reference Group strongly
advised that this strategy be abandoned in favour of skills
exploration through pupil participation in purposeful
practical investigations set in meaningful contexts. One
consequence of this decision, given the financial and
logistic challenges associated with implementing practical
assessment on a national scale in schools, is that pupils'
skills attainment at each stage is reported here on the
basis of relatively few practical investigations. Another
consequence is that there can be no comparison of skills
attainment between 1999 and 2003.
1.2.3 Developing informed attitudes
Three strands comprise this outcome in the National
Guidelines:
- A commitment to learning
- Respect and care for self and others
- Social and environmental responsibility
Pupils' attitudes to topics and activities in science
have been explored in earlier surveys, and were explored
again here. In this survey the pupil questionnaire enquiry
incorporated an exploration into pupils' knowledge of and
views about topical issues in science. In addition, a
deliberate attempt was made to look specifically at the
development of informed attitudes, an outcome of science
education included in the National Guidelines but never
before explored in any systematic way within a national
survey. The Reference Group agreed that it was appropriate
to begin addressing what up until now has been a gap in
survey coverage, but at the same time advised against any
attempt to assess the informed attitudes of individual
pupils, even for the purpose of contributing anonymously to
a population estimate. For this and other reasons the
second vehicle for informed attitude exploration was focus
group discussion.
1.2.4 Core skills
Pupil performance in the following core skills was also
explored within the survey:
- Communication (reading and writing)
- Numeracy
- Using information technology
- Problem solving
The survey was explicitly designed to assess and report
pupils' reading and writing attainments, along with their
numeracy and ICT skills, while problem solving would be
addressed naturally through performance in the science
investigation tasks. Whenever possible, attainment in the
core skills was assessed within a science context.
1.3 Task development
1.3.1 Knowledge and understanding
Pupils'
Knowledge and understanding in science was
assessed using relatively short pencil and paper tasks. The
survey design called for a total of 360 such tasks,
comprising 60 tasks at each of Levels A to F, with an equal
representation of the three outcomes. AAP tasks that had
been used in the 1999 survey furnished around one-fifth of
the required tasks. Following the National Guidelines in
operation in 1999, each task used in the survey that year
had been classified into one of the three attainment
outcomes and into one of three stage bands: P1-P3, P4-P6,
P7-S2. In late 2002, in preparation for the 2003 survey,
these tasks were reviewed. They were judged for continued
framework relevance, revised where necessary to improve and
standardise layout, and newly classified into 5-14 levels.
The result was an expected shortfall in required numbers at
every level, along with an outcome imbalance.
An appeal was made to the exemplification materials
developed by a SEED-funded project team working in the then
Department of Curriculum Studies in Northern College, now
the Faculty of Curriculum Studies in the University of
Aberdeen, to provide a further source of assessment tasks
3. The group that had produced the materials comprised
university academics and practising teachers. Following the
brief they were given, the task developers designed tasks
for use by teachers in classrooms for formative purposes.
Not all the tasks were therefore in a suitable form for
direct use in a large-scale time-constrained survey such as
this one, and none had been subjected to formal large-scale
field trialling. Nevertheless, tasks from this source,
often in modified form, served to fill the gaps.
1.3.2 Practical science investigations
Nine practical investigation tasks were newly developed
for survey use, by a group of six teachers supervised by a
staff member in the Faculty of Education in the University
of Strathclyde. Each task was designed to allow pupils,
working in groups, to plan an investigation, carry out the
planned investigation, and report/review the
investigation.
Individual tasks spanned two adjacent levels, in the
sense that checklists were developed that in principle
enabled assessors to classify pupils into one level or the
other for each relevant attainment target, on the basis of
their investigative behaviour. With the exception of Levels
E/F, for which there was a single task, there were two
tasks for every pair of adjacent levels. The investigations
were trialled informally by the developers, with small
numbers of pupils in local schools. Full-scale piloting
occurred within the survey itself.
1.3.3 Reading and writing
New task development for reading and writing was
organised by the Scottish Qualifications Authority (SQA).
Fifteen reading tasks, three at each of Levels A to E, were
developed by practising teachers for survey use. Each task
comprised a text and a series of comprehension questions,
resembling in form the reading tasks typically used in AAP
English Language surveys and in National Assessments. Task
topics were 'scientific', e.g. 'Seeds', 'Electricity' and
'Recycling', and topic selection ensured balance across the
three
Knowledge and understanding outcomes at each
level. Any dependence on prior topic knowledge was
minimised. The texts had been selected and the tasks
developed to fit particular 5-14 levels. Level
classifications were independently confirmed by teacher
validators, and the tasks were put through national
pre-tests before survey use.
At Levels C, D and E, the reading tasks were each
complemented by an associated writing task, which focused
on the same science-relevant topic.
1.3.4 Numeracy
The 15 numeracy tasks used in this survey were tasks
that had previously been developed for use in the 2002
Social Subjects survey, or were adaptations of these. In
that previous survey, numeracy tasks were embedded within
larger level-based enquiry skills tasks, and related to the
overall themes of the parent tasks
4, which might be 'Egyptians', 'River study',
'Advertising', or whatever. The tasks typically presented
pupils with tables or charts, and required them to transfer
some or all of the given information into different
presentational forms, or to use it to answer simple
questions, sometimes involving calculation.
Some of the numeracy tasks were re-usable in the science
survey without modification. In others, context and content
that were not 'scientific' enough could be modified without
changing task format or skill demands. Thus, 'River study'
became 'Water quality', with no changes to the task itself,
'Egyptians' became 'Bird table', with bird species
replacing tomb jewellery in graph and questions, and
'Advertising' became 'Food complaints', otherwise
unchanged.
1.3.5 ICT tasks
Six ICT tasks were commissioned from an independent
consultant, and installed on CDs by Learning and Teaching
Scotland (LTS). The interactive tasks featured exercises in
word processing, spreadsheet and database use, internet
searching and web page browsing (using simulated CD-held
web pages). Three tasks were set in science-relevant
contexts appropriate to pupils in P3 and P5, and focused on
ICT skills at Levels B and C in the ICT National Guidelines
5, while the other three were set in contexts
appropriate for pupils in P7 and S2, and demanded ICT
skills at Levels D and E.
The development of these tasks benefited from previous
experience, since similar tasks had been developed and used
in the 2002 Social Subjects survey, and lessons were
learned from the successes and failures of that survey.
However, there was insufficient time available between task
development and survey implementation for formal
pre-testing, so that these tasks, like the science
investigations, were essentially piloted within the survey
itself.
1.3.6 Group discussions
Three topics were identified by SEED personnel as
appropriate for 5-10 minute focus group discussions that
might allow pupils to show evidence of informed attitudes
in science. The topics had not only to be perceived by
pupils as being 'scientific', but they would also have to
hold interest for them and have some clear current
relevance. The topics would also need to be capable of
provoking discussion among pupils, allowing different
points of view to be argued. This was no easy challenge,
but the three topics finally selected were 'A school
garden' (P3/P5), "Where have all the sparrows gone?"
(P5/P7/S2), and 'Recycling' (P7/S2).
It was intended that trained field officers would
introduce the discussion topics to small groups of pupils,
and would animate and generally manage the ensuing
discussions, rating pupils in real time for evidence of
informed attitudes in science, using criteria-based
checklists.
1.4 Task administration and marking
All the survey pupils were involved in some form of
written assessment, and around half also participated in
some form of practical assessment. The survey design
anticipated that wherever possible 10 pupils in each school
- the 'science' sample pupils - would be involved in the
assessment of
Knowledge and understanding in science, while a
further eight pupils - the 'core skills' sample pupils -
would be involved in the assessment of reading and, at
Levels C, D and E, writing. In a subset of schools - the
'practical' schools - those pupils who had attempted pencil
and paper science tasks would also take part in practical
science investigations, while those pupils who had
attempted reading and writing tasks would also attempt an
ICT task or participate in a group discussion. Typically,
therefore, 18 pupils in each school were involved in one
way or another in the survey.
The pupils' own teachers organised and supervised
written test sessions, but the practical assessments were
the responsibility of itinerant field officers (see section
1.4.3). Once the written testing had taken place, the
survey schools returned completed scripts to SEED for
processing and marking. Field officers completed their
assessments of pupils as they engaged in the practical
activities.
1.4.1 Knowledge and understanding
Pupils at P3 and P5 were to be assessed at two levels
only (A and B for P3; B and C for P5), whereas pupils at P7
and S2 were to be assessed at three levels (C, D and E at
P7; D, E and F at S2). However, there was no possibility
here of targeting tests to pupils, in such a way that every
pupil would be given a test at or close to that pupil's
current level. Rather than risk any possible distress
should a pupil be faced with a single-level test at too
high a level, or boredom and demotivation when faced with a
single-level test at too low a level, it was decided to mix
levels within each test booklet. For similar reasons it was
also decided that every booklet would feature all three
outcomes. In addition, each booklet would contain a single
numeracy task.
The 360
Knowledge and understanding tasks, and the 15
numeracy tasks, were therefore distributed among a number
of different test booklets at each stage in such a way that
every pupil was faced with a collection of tasks from the
two or three levels appropriate for their stage assessment,
and spanning all three outcomes (see Appendix B for further
detail). At P3 and P5, the test booklets comprised 12
science tasks each, plus a numeracy task, and were expected
to take 30-40 minutes to complete. At P7 and S2, test
booklets were longer, at 18 science tasks plus a numeracy
task, and were expected to take 50-60 minutes to complete.
Individual pupils were intended to attempt two different
booklets, with a time interval between test sessions to be
decided by the school.
Completed scripts were processed centrally in a series
of 'response transcription' meetings organised by SEED. In
these meetings, pupils' responses were transferred onto
specially designed transcription sheets, for later
keyboarding and machine marking (Chapter 2 offers further
details).
1.4.2 Reading and writing
Reading tasks at Levels A and B were also presented to
pupils in the form of test booklets, one task forming a
single booklet. At Levels C, D and E, reading tasks were
accompanied by their associated writing tasks. There were
thus six 'reading only' booklets and nine 'reading and
writing' booklets in total, with three booklets at each
level. It was intended that each pupil would attempt two
different reading booklets, the two booklets being at
different levels and featuring topics from two different
Knowledge and understanding outcomes (though not
requiring any prior topic knowledge). This meant that
individual P3 pupils would attempt booklets at Levels A and
B, and individual P5 pupils would attempt booklets at
Levels B and C. At P7, pupils faced booklets at Levels C
and D, while pupils at S2 attempted tasks at Levels D and
E.
Reading booklets were designed to have the same general
time requirement as the science booklets, and it was
expected that schools would organise both types of
assessment to occur in the same test sessions. Completed
reading booklets, like completed science booklets, were
processed centrally, with response transcription forms
completed by students for later keyboarding and machine
marking. Random samples of pupils' writing were selected
for marking by the teachers who had served as field
officers in the survey (further details are given in
Chapter 4).
1.4.3 Practical tasks and group
discussions
Just over half the survey schools at P3, P5 and P7, and
approximately two-thirds of those at S2, took part in the
practical assessments. Typically, in each school eight
pupils undertook science investigations, in groups of four.
In addition, three or four pupils participated in a group
discussion, and up to eight pupils individually attempted
ICT tasks.
A total of 28 authorities provided the 148 practising
primary and secondary science teachers who served as field
officers. The field officers worked in pairs, half at P3/P5
and half at P7/S2, spending five days visiting schools
located within reasonable travelling distance of their home
areas. They spent one day in each of their five assigned
schools, setting up and supervising the practical sessions,
managing the group discussions, and rating pupils
on-the-spot, using checklists to record observations and
judgments (Chapter 3 provides full details). In preparation
for this work, the field officers were given a day of task
orientation in the period early May to early June 2003.
Their involvement in the practical assessment was formally
ended in a debriefing day held in the period late May to
late June 2003, but half of these practising teachers
contributed further by volunteering to be involved in
marking pupils' writing.
1.5 Reporting pupil attainment
Pupils'
Knowledge and understanding attainment, as well as
their reading and writing attainment, is reported in terms
of the percentage of pupils at each stage who were deemed
to have attained specific 5-14 levels. For
Knowledge and understanding and for reading,
attainment decisions were based on the application of
judgmental cut-off scores, whereas pupils' writing was
classified into an appropriate level through application of
'best fit' criteria. For investigation skills in science
and for ICT skills, cut-off scores have not been applied to
test scores, but pupil performance is instead averaged over
test items or behaviours classified at particular levels,
and average facility values or rating distributions are
given.
The resulting attainment findings are presented in
Chapters 2, 3 and 4. Chapter 2 focuses on the assessment of
Knowledge and understanding, Chapter 3 presents
the results of the skills assessment in science, and
Chapter 4 presents the results of the core skills
assessment.
Chapter 5 presents findings from the pupil questionnaire
enquiries, including the assessment of pupils' informed
attitudes, while Chapter 6 presents findings from the
school questionnaire. Finally, Chapter 7 summarises and
reflects on the principal findings of the survey, and
addresses some of the issues raised.
« Previous | Contents | Next »