Key messages
• The results across studies were very different, but we don't know why. This means that the Hospital Depression and Anxiety Scale Anxiety subscale (HADS-A) may be better or worse if applied in practice.
• However, based on the combined results, in practice, a lot of people would be labelled as positive when they are not, which could put more pressure on the healthcare system.
Why is the accurate detection of anxiety disorders important?
Anxiety disorders are quite common but often go undetected, even in people who would benefit from treatment. Through the screening process, people are divided into two groups: those who test negative and those who test positive. Those with positive results need further evaluation. The final diagnosis is made by a competent clinician. However, the screening process can give incorrect results. Not detecting an anxiety disorder when it is there is called a false-negative result. This might mean missing the chance for timely treatment. A false positive is a result that incorrectly shows an anxiety disorder when it is not there. This can cause a burden for patients and the public health system because of unnecessary worry, further testing, and treatments. Screening for anxiety covers different conditions, which are summarised by the term 'any anxiety disorder' (AAD). These include, amongst others, generalised anxiety disorder (GAD) and panic disorder. In our review, we look at these three conditions.
What is the 'HADS-A' subscale?
The Hospital Anxiety and Depression Scale (HADS) is a questionnaire. It was created to detect anxiety and depression in people with medical problems. It has two sections: the Depression subscale (HADS-D) and the Anxiety subscale (HADS-A). Each subscale has seven questions. People answer the questions on a scale from 0 to 3. After answering all the questions, the scores are added up to get a total score. Total scores at or over a specified score (the 'cutoff'), suggest the presence of anxiety disorder. The recommended HADS-A cutoff is 8 or higher for possible anxiety (or 11 or higher for definite anxiety). The HADS-A allows for simple and quick results, so individuals with high HADS-A scores can be referred for further evaluation.
What did we want to find out?
We aimed to find out how well the HADS-A can tell whether an adult has an anxiety disorder or not.
What did we do?
We searched for studies that used HADS-A to detect anxiety. Then, we combined the results of these studies.
What did we find?
This review included results from 67 studies with 18,467 participants. Fifty-four studies had information on HADS-A in detecting AAD, 35 studies had information on GAD and 10 on panic disorder.
What were the main results of the review?
The combined results for AAD alone showed that if HADS-A is administered to 1000 individuals and 170 of them have confirmed AAD, then:
• of the 325 people who tested positive for AAD, 199 would be incorrectly labelled as having AAD (false positives) and 126 would be correctly labelled positive (true positives).
• Out of the 675 people who tested negative, 44 would be incorrectly labelled as not having AAD (false negatives) and 631 would be correctly labelled negative (true negative).
What are the limitations of the evidence?
The examples above are from the combined results of all studies. Yet, the results across all studies were very diverse. Also, there were problems with how most of the studies were done. Finally, not all studies gave enough information for us to say whether they also included participants with mental health complaints. Therefore, we are not sure if the HADS-A will always match the combined results above.
How up to date is this evidence?
The evidence is current up to 10 July 2024.
Read the full abstract
Despite being highly prevalent mental health conditions, anxiety disorders frequently go undiagnosed, prompting the use of questionnaires for anxiety screening as a potential solution. This review summarises the test accuracy of the Hospital Anxiety and Depression Scale Anxiety subscale (HADS-A) for screening purposes.
Objectives
To assess the test accuracy of the HADS-A in screening for any anxiety disorder (AAD), generalised anxiety disorder (GAD) and panic disorder in adults, and to investigate how the test accuracy varies by sources of heterogeneity and across all cutoffs.
Search strategy
We searched Embase, MEDLINE, PubMed-not-MEDLINE subset and PsycINFO from 1990 to 10 July 2024. We checked the reference lists of included studies and review articles.
Selection criteria
We included studies in adults in which the HADS-A was administered cross-sectionally alongside structured or semi-structured clinical interviews, allowing the creation of 2x2 tables. We excluded case-control studies, studies with a time gap exceeding four weeks between administering the HADS-A and the reference standard, and studies with diagnostic criteria based on the Diagnostic and Statistical Manual of Mental Disorders Third Edition or earlier versions. We also excluded studies involving people who were recruited based on mental health symptoms.
Data collection and analysis
At least two review authors independently decided on the eligibility of the articles, extracted data, and assessed the methodological quality of the included studies using Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2). For each target condition, we present the sensitivity and specificity of each study along with 95% confidence intervals (CIs). For the primary analyses, we used bivariate models to obtain summary estimates for the recommended HADS-A cutoff score of 8 or higher (≥ 8); if the bivariate models did not converge, we used multiple thresholds models. For the secondary analyses, we obtained summary estimates for all cutoffs using bivariate and multiple thresholds models. From the multiple thresholds model, we derived the summary estimates of all available cutoffs from the summary receiver operating characteristic (SROC) curve and the area under the curve (AUC) as a measure of overall accuracy. We explored sources of heterogeneity using meta-regression models.
Main results
We identified 67 studies, encompassing data from 18,467 participants that were available for the analyses. Fifty-four studies contributed to the analyses of HADS-A for detecting AAD, 35 for GAD, and 10 for panic disorder. The median prevalence of AAD, GAD and panic disorder was 17%, 7% and 6%, respectively. The included studies showed a wide spectrum of clinical and methodological differences.
We considered the overall risk of bias to be low in 19 studies. The most frequent limitations were related to non-consecutive patient selection and to post-hoc cutoff determination. The applicability was of low concern across three domains in nine studies. The main limitations of applicability were the presence of prediagnosed anxiety (prior to undergoing HADS-A) or the fact that this information was not collected or reported.
The estimates of both sensitivity and specificity varied strongly between studies. With regard to the recommended cutoff ≥ 8, the HADS-A subscale demonstrated a summary sensitivity of 0.74 (95% CI 0.70 to 0.78) and a summary specificity of 0.76 (95% CI 0.73 to 0.79) for detecting AAD; a summary sensitivity of 0.82 (95% CI 0.76 to 0.87) and a summary specificity of 0.74 (95% CI 0.70 to 0.77) for detecting GAD; and a summary sensitivity of 0.80 (95% CI 0.69 to 0.88) and a summary specificity of 0.66 (95% CI 0.55 to 0.76) for detecting panic disorder. Results from the multiple thresholds model showed an AUC of 0.81 (95% CI 0.79 to 0.82) for detecting AAD, 0.82 (95% CI 0.80 to 0.84) for GAD and 0.81 (95% CI 0.77 to 0.85) for panic disorder.
The observed heterogeneity remained largely unexplained, except for the investigations of heterogeneity with regard to GAD, which showed that the setting had a significant impact on specificity; and prevalence and the reference standard had a significant impact on sensitivity. With respect to panic disorder, a formal heterogeneity assessment was not feasible.
Authors' conclusions
The use of the HADS-A for screening purposes with a cutoff ≥ 8 in an exemplary cohort of 1000 individuals with an AAD prevalence of 17% would result in 675 individuals testing negative, of whom 44 would be false negatives, while 325 would test positive. Of these, 199 would be false positives, potentially straining the available healthcare resources.
However, caution is warranted in interpreting the review findings, as the strength of evidence was limited by the risk of bias, concerns regarding applicability and substantial, unexplained heterogeneity. The use of estimates derived from clinical populations in which HADS-A is applied would be a reasonable approach. However, subgrouping by clinical population is currently unfeasible due to the limited number of studies per population category. This represents an area of further exploration in future research. The unexplained heterogeneity makes it challenging to reliably predict the results of future studies. Given these limitations, the universal use of the HADS-A with a cutoff ≥ 8 for screening in different settings and populations is currently questionable.