Scottish Household Survey: Annual Report - Results from 2007

Annex 1 - Using the information in this report

HOW DATA ARE DISPLAYED IN TABLES

All tables are presented in the format "dependent variable by independent variable" where the independent variable is being used to examine or explain variation in the dependent variable. Thus, a table titled 'housing tenure by household type' shows how housing tenures vary among different household types. Where the tables show column percentages, the dependent variable is shown in the rows and the columns show the independent variable. Where the tables show row percentages, this is switched and the dependent variable is shown in the columns. Some summary tables combine three dimensions, for example agreement by age within statement. These are shown as cell percentages.
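As an illustration of the two layouts, the sketch below computes column percentages for a table with the dependent variable in the rows, then transposes the table and computes row percentages. The counts, category names and function names are invented for the example and are not taken from the survey. In both layouts the percentages sum to 100% within each category of the independent variable.

```python
# Hypothetical counts (not survey data): rows are housing tenure
# (dependent variable), columns are household type (independent variable).
counts = {
    "Owner occupied": {"Single adult": 30, "Family": 70},
    "Rented": {"Single adult": 50, "Family": 50},
}


def column_percentages(table):
    """Express each cell as a percentage of its column total."""
    columns = list(next(iter(table.values())))
    totals = {c: sum(row[c] for row in table.values()) for c in columns}
    return {
        r: {c: 100 * row[c] / totals[c] for c in columns}
        for r, row in table.items()
    }


def row_percentages(table):
    """Express each cell as a percentage of its row total."""
    return {
        r: {c: 100 * v / sum(row.values()) for c, v in row.items()}
        for r, row in table.items()
    }


def transpose(table):
    """Swap rows and columns, so the dependent variable moves to the columns."""
    columns = list(next(iter(table.values())))
    return {c: {r: table[r][c] for r in table} for c in columns}


# Dependent variable in rows: column percentages.
col_pct = column_percentages(counts)

# Dependent variable in columns: row percentages on the transposed table.
row_pct = row_percentages(transpose(counts))

# The same figure appears in both layouts.
print(col_pct["Rented"]["Single adult"], row_pct["Single adult"]["Rented"])
```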

All tables have a descriptive and numerical base showing the population or population sub-group examined in it. While all results have been calculated using weighted data, the bases shown give the unweighted counts. It should therefore be noted that the results and bases presented cannot be used to calculate how many respondents gave a certain answer.
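The following sketch shows why a weighted percentage multiplied by an unweighted base does not recover the number of respondents. The respondents and weights are invented for the example; the survey's actual weighting scheme is not reproduced here.

```python
# Hypothetical respondents (not survey data): (answered_yes, weight).
respondents = [(True, 0.5), (True, 0.5), (False, 2.0), (False, 1.0)]

# The base shown under a table is the unweighted count of respondents.
unweighted_base = len(respondents)

# The published percentage is calculated from the weights.
weighted_yes = sum(weight for yes, weight in respondents if yes)
weighted_total = sum(weight for _, weight in respondents)
weighted_pct = 100 * weighted_yes / weighted_total

# Number of respondents who actually answered yes.
actual_yes = sum(1 for yes, _ in respondents if yes)

# Multiplying the weighted percentage by the unweighted base gives a
# different figure from the actual count.
implied_yes = weighted_pct / 100 * unweighted_base

print(weighted_pct, actual_yes, implied_yes)
```

In this invented case the two respondents who answered yes carry low weights, so the weighted percentage implies fewer respondents than actually gave that answer.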

For area-based independent variables, the combined total of sub-areas is described as 'Scotland'. For all other independent variables the combined total of sub-groups is described as 'All'.

In general, percentages in tables have been rounded to the nearest whole number. Zero values are shown as a dash (-), values greater than 0% but less than 0.5% are shown as 0% and values of at least 0.5% but less than 1% are rounded up to 1%. Columns or rows may not add to 100% because of rounding or where multiple responses to a question are possible.
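The display rules above can be sketched as a small function. The function name is hypothetical, and the treatment of exact halves above 1% (rounded up here) is an assumption, since the report states only that values are rounded to the nearest whole number.

```python
import math


def format_percentage(value):
    """Format a percentage following the display rules described above.

    Assumption: exact halves (for example 12.5) are rounded up; the report
    does not state which convention is used for these cases.
    """
    if value < 0:
        raise ValueError("percentages cannot be negative")
    if value == 0:
        return "-"   # true zeros are shown as a dash
    if value < 0.5:
        return "0%"  # greater than 0% but less than 0.5%
    # Covers 0.5% to under 1% (shown as 1%) and ordinary rounding above that.
    return f"{math.floor(value + 0.5)}%"


# Rounded values need not add to 100%: each of these rounds down to 33.
column = [33.3, 33.3, 33.4]
print([format_percentage(v) for v in column])
```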

In some tables, percentages have been removed from columns and replaced with '*' where the base on which percentages would be calculated is less than 100. These data are judged to be insufficiently reliable for publication.
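A minimal sketch of this suppression rule is given below. The function name and the sample figures are hypothetical; only the threshold of 100 comes from the text above.

```python
MINIMUM_BASE = 100  # threshold stated in the report


def display_column(percentages, base):
    """Return display strings for one column of percentages.

    Every percentage in the column is replaced with '*' when the
    unweighted base for that column is below the minimum.
    """
    if base < MINIMUM_BASE:
        return ["*" for _ in percentages]
    return [f"{pct:.0f}%" for pct in percentages]


print(display_column([42.3, 57.7], base=250))  # base is large enough
print(display_column([42.3, 57.7], base=60))   # base too small, suppressed
```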

VARIATIONS IN BASE SIZES FOR TABLES

Because the questionnaire is administered using CAPI (computer-assisted personal interviewing), item non-response is kept to a minimum. Bases nevertheless fluctuate slightly due to small amounts of missing information (for example, where the age or gender of household members has been refused and derived variables such as household type rely on this information).

Some questions are asked of a reduced sample and the bases are correspondingly lower. In 2007, some questions were streamed or changed in the course of the year and again the base size is lower. A footnote is included below a table where this occurs.

The sample base annex (Annex 3) gives details of frequencies and bases for the main dependent variables.

INCOME IMPUTATION

One section of the questionnaire is substantially affected by missing information. In the section on household income, approximately 33% of respondents either refuse to answer the questions or are unable to provide information that is sufficiently reliable to report, for example because there are no details of the level of income received for one or more components of their income. After the survey, statistical analysis of the characteristics of households where income is available allows income data to be imputed for households where income data are missing. After imputation, missing income data are reduced to only 3%-4% of households (see Annex 2 for more details).

The income information in the report includes the income of the Highest Income Householder (HIH) and their spouse or partner (where there is one). Income from employment, pensions and benefits, and income from other sources, are included. The income of other household members is only included if it represents 'other' income for the HIH or spouse, i.e. the other household member contributes to household resources by paying 'dig money'.

The current income information collected through the SHS is only intended to provide estimates by income band. The survey asks for income only for use as a "background" variable when analysing other topics, or for selecting the data for particular sub-groups of the population (such as the low paid) for further analysis. The SHS cannot be used as a source of figures on average income or average earnings. (See Scottish Household Survey: Methodology 2007, footnote 70, for further details.)

STATISTICAL SIGNIFICANCE

All survey data have a degree of error associated with them because they are based on a sample of the population. Any proportion measured in the survey has an associated sampling error, usually expressed as ±x% at the 95% confidence level. Technically, all results should be quoted in this way. However, it is less cumbersome to report each result as a single percentage, which is the convention adopted in this report. Where sample sizes are small or comparisons are made between sub-groups of the sample, the sampling error needs to be taken into account. There are formulae to calculate whether differences are statistically significant (i.e. they are unlikely to have occurred by chance) and Annex 4 provides a simple way to estimate if differences are significant.
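For illustration only, the sketch below applies the standard textbook formulae for the sampling error of a proportion and for the difference between two proportions. It assumes simple random sampling and is not the method set out in Annex 4. The function names and the design_factor parameter are hypothetical; a value greater than 1 would widen the intervals to allow for a more complex sample design, and no value for this survey is given here.

```python
import math

Z_95 = 1.96  # standard normal value for a 95% confidence level


def margin_of_error(pct, base, design_factor=1.0):
    """Approximate ± sampling error, in percentage points, for a percentage.

    Assumption: simple random sampling unless a design factor is supplied.
    """
    p = pct / 100
    return 100 * Z_95 * design_factor * math.sqrt(p * (1 - p) / base)


def difference_is_significant(pct_a, base_a, pct_b, base_b, design_factor=1.0):
    """Test whether two independent percentages differ at the 95% level."""
    p_a, p_b = pct_a / 100, pct_b / 100
    standard_error = design_factor * math.sqrt(
        p_a * (1 - p_a) / base_a + p_b * (1 - p_b) / base_b
    )
    return abs(p_a - p_b) > Z_95 * standard_error


# A result of 50% on an unweighted base of 1,000.
print(round(margin_of_error(50, 1000), 1))

# The same five-point difference on large and small bases.
print(difference_is_significant(50, 1000, 55, 1000))
print(difference_is_significant(50, 200, 55, 200))
```

The final two calls show why small bases matter: the same difference between two sub-groups can be significant on bases of 1,000 but not on bases of 200.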

Page updated: Thursday, August 7, 2008