****
Scottish Executive*Consultations  

Making it work together
* * *
* Home | Topics | About | News | Publications | Consultations | Search | Links | Contacts | Help *
*
 

< Previous | Contents | Next >

EFFECTIVE AND COST-EFFECTIVE MEASURES TO REDUCE ALCOHOL MISUSE IN SCOTLAND: A LITERATURE REVIEW

CHAPTER SEVEN SCREENING AND DETECTION

Summary

This chapter reviews the effectiveness and cost-effectiveness evidence relating to screening for and detection of problem drinking and alcohol abuse. The main findings are;

  • AUDIT is more effective in detecting at risk, hazardous or harmful drinking;
  • CAGE is superior in detecting alcohol abuse or dependence;
  • T-ACE and TWEAK are more sensitive and specific for screening pregnant women;
  • laboratory tests do not perform well as screening instruments;
  • telemarketing has been shown to be cost-effective in marketing screening and brief intervention packages to GPs in Australia; and
  • specialist workers were shown to be cost-effective as screeners in a UK general hospital.
  • Evaluating the performance of screening tests is hampered by the lack of an agreed gold standard.

Introduction

7.1 This chapter concerns the screening instruments that are available for the detection of problem drinking, alcohol abuse and dependence and the laboratory tests that are available either to confirm such results or to monitor abstinence. The value of screening instruments lies in the detection of people for whom a brief intervention or more intensive treatment may be of benefit. Therefore, the important issue is the accuracy of the information that is provided. The studies reviewed are summarised briefly in tables 7.1 (effectiveness) and 7.2 (cost-effectiveness) at the end of the chapter.

Screening questionnaires

Types of intervention

7.2 A variety of questionnaires have been developed and the main instruments evaluated in the literature are summarised in chart 7.1. Most have been developed for screening purposes although two are used to assess the extent of alcohol problems (S-MAST and SADD). There is some overlap in the type of questions asked, such as those relating to the need to cut down on drinking or needing a drink to start the day (eye-opener). Other variations on these instruments may also be in use.

Chart 7.1 Summary of main instruments in use

AUDIT

(Alcohol Use Disorders Identification Test)
10 questions which aim to detect hazardous drinking

CAGE

(Cut down Annoyed Guilty Eye-opener) 4 questions that aim to detect alcohol abuse and dependence

Health Screening Survey and Questionnaire
PRIME-MD

general questionnaire including alcohol.
( Primary Care Evaluation of Mental Disorders) screens for mental health and alcohol use disorders.

S-MAST

(Short Michigan Alcoholism Screening Test (also B-MAST)) 13 item self completion assessment instrument

SADD

(Short Alcohol Dependence Data Questionnaire) 15 item self completion assessment of dependence severity

T-ACE

(Tolerance Annoyed Cut down Eye-opener)

Trauma Scale

5 questions on alcohol related trauma

TWEAK

(Tolerance Worried Eye-opener Amnesia Kut down) screens for alcohol problems in pregnant women

Effectiveness

7.3 The accuracy of screening instruments is assessed in terms of both sensitivity and specificity. Sensitivity refers to the ability of the instrument to identify true positives (for example, 80% sensitivity implies that 8 out of 10 people will be identified). Specificity refers to the accuracy in excluding true negatives (for example, 80% specificity implies that for every 10 people without the condition of interest, 8 will be excluded). A good test is both sensitive and specific but there is usually a trade off between these performance measures.

7.4 An overall assessment of performance is given by ROC curves, which plot sensitivity against 1 minus specificity. The area under the ROC curve (AUROC) provides a measure of test performance with 1 equalling a perfect test and 0.5 indicating that the test provides no useful information. Only one review reports these figures and they were only available for two studies (Bradley et al 1998). The AUROC score for CAGE was 0.84, for AUDIT the range was 0.86 — 0.94 and for TWEAK the scores were 0.89 - 0.90.

7.5 The performance of the screening questionnaires is very variable and some are better at detecting certain levels of alcohol misuse than others. For general screening purposes, AUDIT is more effective in detecting at risk, hazardous or harmful drinking (sensitivity 51%-97%; specificity 78%-96%), whilst CAGE is superior for detecting alcohol abuse and dependency (sensitivity 43%-94%; specificity 70%-97%). These two tests perform better than other screening tests (Fiellin et al 2000). Screening performance also depends upon the choice of cut-off point. A lower cut-off point will tend to increase sensitivity and reduce specificity.

Population groups

7.6 One review has considered the use of questionnaires with women (Bradley et al 1998). AUDIT, CAGE and TWEAK were the optimal test for detecting alcohol dependence in women but sensitivity in female populations may be lower using traditional cut-off points. CAGE was found to be more sensitive in samples of black women than white women. TWEAK appears to be optimal for detecting heavy drinking or alcohol abuse and dependence in racially mixed female populations. T-ACE and TWEAK are more sensitive and specific than CAGE for screening pregnant women. The full Michigan Alcoholism Screening Test (MAST) has been found useful for psychiatric settings (Teitelbaum and Mullen 2000.)

Setting

7.7 Most screening takes place in outpatient and primary care settings. Comparisons of screening test performance across settings have not taken place.

Providers

7.8 Comparisons of different persons administering questionnaires were not reported in the reviews. Some cost-effectiveness information is provided below (para 7.16).

Quality and relevance of evidence

7.9 The assessment of the screening instruments is limited by the lack of an agreed diagnostic standard against which to confirm the screening results. A range of criterion standards has been used in the studies reviewed and differences in the performance of screening instruments may be due to the choice of criterion standard (Fiellin et al 2000).

Laboratory tests

Types of intervention

7.10 There is a range of routine and more specialised blood tests available to confirm evidence of problem drinking or alcohol abuse. The specific tests available for sustained alcohol abuse are;

  • GGT

gamma glutamyltransferase is an indicator of liver injury

  • CDT

assay carbohydrate deficient transferrin identifies men drinking 5 or more units per day for a year or more

Effectiveness

7.11 Laboratory tests are not useful in screening for alcohol related problems (Fiellin et all 2000; NIAAA 2000). They have a role in monitoring the treatment of alcohol disorders. One review of studies comparing CDT assay with GGT concluded that in studies up to June 1998 commercially available CDT assay tests were not significantly better than GGT as markers of excessive alcohol use (Scouller et al 2000).

Cost-effectiveness

7.12 The economic literature with respect to screening for alcohol problems is very limited. The sequence of events through screening, referral for treatment and eventually changes in patient outcomes means economic analysis in this field is seldom undertaken. The key economic study regarding screening centres on the marketing of the Drink-less package in New Zealand. There is also limited UK evidence.

Drink-less

7.13 McCormick et al (1999) investigated cost-effective methods of encouraging general practitioners to take up screening and early intervention for problem use of alcohol in a marketing trial. The authors analysed the costs and consequences of marketing techniques designed to encourage GPs to receive an early intervention and screening package for the problem use of alcohol. An RCT of New Zealand GPs, comparing mail, telemarketing and personal marketing was employed for the 'Drinkless' early intervention and screening package developed with the WHO collaborative study for early interventions for 'at risk' alcohol consumption. GPs in New Zealand were identified from a database. From a total of 369 GPs, 186 were assigned to mail marketing, 87 to telemarketing and 96 to personal marketing. Of the 369, 40 were either ineligible or un-contactable.

7.14 The costs of the marketing exercise included promotional material, postage, telephone charges, travel costs (time and transport), receptionist time and waiting time. Direct mail costs per doctor were estimated at $5.11, telemarketing $2.92 and personal marketing $16.54. Outcomes were measured by the number of GPs willing to receive the package. Telemarketing appeared to be the most cost-effective means of persuading GPs to receive and employ the Drink-less package. Sixty-four of the 87 GPs contacted agreed to receive the package. The cost per doctor receiving was $3.97. Direct mail cost $10 per doctor agreeing and personal marketing cost $20.36.

Quality and relevance of evidence

7.15 McCormick's study shows a cost-effective means of disseminating information but information about how effective the Drink-less intervention is in practice is required. Little is known about whether the GPs actually used the packages. A follow up would be needed as receiving a package does not amount to using it, and thus health improvements could be negligible. More information on the treatment costs is provided in chapter 8.

Cost-effectiveness of alternative providers.

7.16 Tolley and Rowland (1991) investigated the cost-effectiveness of screening patients for alcohol problems at York District Hospital, UK. Over a 21-month period, doctors, nurses and a specialist worker screened orthopaedic and medical admissions to the Hospital. The costs of the intervention were calculated by using the mid-points of the salary ranges for the relevant grades and computing a cost for each screening based on the hourly rate of pay. The costs of screening were £0.10 for a nurse, £0.11 for a doctor and £0.15 for a specialist worker. As a percentage of admissions (positive screenings in parentheses), nurses screened 48% (7.8%), doctors 27% (9.8%) and the specialist worker 21% (12.5%). In terms of the lowest cost per positive screening, doctors cost £1.17, compared to £1.20 for the specialist worker and £1.29 for nurses.

7.17 The authors concluded that the specialist worker had a higher effectiveness rate defined by the percentage of positive screenings made, whilst nurses managed to screen a higher percentage of all admissions than doctors or the specialist. The authors conducted a marginal cost analysis based on the additional positive cases identified per 1000 screenings of different combinations of health care professionals delivering the interventions. By employing a specialist worker compared to a nurse, the specialist worker would identify an additional 88 cases per 1000 admissions at a marginal cost per positive case identified of £0.56. However, it should be noted that other considerations are important. Hospital managers may decide that the cost of employing a full time specialist is too great compared to the additional cases detected. Furthermore it may mean that a specialist cannot devote any time not screening to other duties. Nurses may be able to use any spare time more productively. The study provides some evidence as to the relative cost-effectiveness of screening for alcohol problems, but evidence is far from conclusive and likely to be affected by staffing levels and other resource issues within the health care provider.

Quality and relevance of evidence

7.18 The crucial shortcomings of both studies are the outcomes employed. The initial screening process identifies positive and negative screens, of which positives are referred for further treatment, yielding a further chain of events before successful health outcomes can be quantified. However, evaluations of screening programmes to date have only considered the first link, and the process through to patient outcomes is not covered by the current literature.

Further research

7.19 The available screening instruments appear to work well but evaluating their performance is hampered by the lack of an agreed diagnostic gold standard. Research to resolve this issue would be useful. Continuing research on the performance and role of laboratory tests will be needed as they are developed. There is a lack of research relating to comparisons of screening settings and providers.

Table 7.1 Summary of Screening and Detection Effectiveness Studies Reviewed

Author(s) and Date

Search Strategy

Inclusion / Exclusion Criteria

Quality assessed

Number of studies

Target Group

Main Findings

Bradley KA, Boyd-Wickizer J, Powell SH, Burman ML. Alcohol screening questionnaires in women: A critical review. JAMA 1998; 280(2): 166-71.

Yes

Studies included compared a brief alcohol screening questionnaire with an appropriate criterion standard for heavy drinking or alcohol abuse or dependence in a generalisable, clinical population of US women. Studies using self-administered questionnaires with self-reported measures of alcohol consumption. were excluded.

Yes

9 studies evaluating 8 questionnaires

Women

The CAGE questionnaire was relatively insensitive in predominantly white female populations. The TWEAK and AUDIT questionnaires have performed adequately in black and white women, using lower cut off points than usual. Referring women to all-female alcohol treatment programs may improve dropout rates and outcomes.

Fiellin DA et al Screening for alcohol problems in primary care: a systematic review. Archives of Internal Medicine. 2000; 160(13): 1977-89.

Yes

Included studies comparing a screening method to a criterion standard , reporting performance characteristics and carried out in a primary care setting

Yes

37 studies

Various

AUDIT was most effective in detecting at risk, hazardous or harmful drinking (sensitivity 51%-97%; specificity 78%-96%). CAGE was superior for detecting alcohol abuse and dependency (sensitivity 43%-94%; specificity 70%-97%). These two methods performed consistently better than other screening tests.
The conclusions are limited by the lack of an agreed criterion standard and this may partly explain conflicting or inconsistent results.

National Institute on Alcohol Abuse and Alcoholism, 10th Special Report to the US Congress on Alcohol and Health June 2000

         

Questions about quantity and frequency of drinking have high sensitivity in detecting people drinking above recommended limits. Patients may understate their drinking.
CAGE is sensitive and specific in identifying alcohol abuse and dependence but misses some at risk drinkers. AUDIT has sensitivity of 50% to 80% depending on the population and cut off score. It may miss borderline risks and binge drinkers. Longer questionnaire may be less suitable as primary screen. Health Screening Survey and Questionnaire has adequate sensitivity and specificity in primary care settings. Telephone assisted computer administration of PRIME-MD is more sensitive than face to face administration by a clinician. The trauma scale is more sensitive than laboratory tests in detecting problem drinking and specific in ruling out problem drinking. T-ACE and TWEAK are more sensitive than CAGE for pregnant women. Laboratory tests are not useful as primary screening tools.

Scouller K et al Should we use carbohydrate-deficient transferrin instead of gamma-glutamyltransferase for detecting problem drinkers? A systematic review and meta-analysis 2000 Clinical Chemistry; 46(12): 1894-1902

Partially reported

Included reports presenting original data from which sensitivity and specificity of CDT could be calculated and data on drinking behaviour

Yes

110 studies.

Men, Women

In studies prior to June 1998, results using commercially available CDT assays were not significantly better than GGT as markers of excessive alcohol use in paired studies. Other assay methods appear promising but further studies are required.

Teitelbaum, L and Mullen, B. The validity of the MAST in psychiatric settings: a meta-analytic integration. Journal of Studies on Alcohol. 2000; 61 : 254-61

Yes

Included studies evaluating the full version of MAST, using corresponding cut-off scores, conducted in a psychiatric setting and reporting sufficient data.

Yes

9 studies

Psychiatric patients

The weighted mean average sensitivity was 87.7% and the weighted mean average specificity was 68.1%. This suggests that MAST is less effective at screening out those who do not have an alcohol-related disorder than in identifying those who do have such a disorder. Five of the 9 studies evaluated MAST in an alcohol treatment setting and the effects were significantly stronger.

< Previous | Contents | Next >

* * *
* Home | Topics | About | News | Publications | Consultations | Search | Links | Contacts | Help *
Crown Copyright | Privacy policy | Content Disclaimer | General enquiries