Skip to main content
Journal of Gerontological Nursing, 2018;44(12):35–43
Published Online: by:13


The aim of the current study was to report findings about delirium detection when ward nurses screened for delirium in patients with cognitive impairment using the Delirium Observation Screening Scale (DOSS) in comparison to the Confusion Assessment Method (CAM). A secondary analysis was performed of research data collected in 2010 at a Swiss tertiary university hospital. During the first 5 days after admission, patients 70 and older with cognitive impairment were screened for delirium using the DOSS. Throughout patients' hospital stay, research assistants also completed the CAM on a daily basis. A total of 138 patients who did not have delirium initially participated in the study. Of these patients, 44 (32%) developed delirium with a median duration of 3 days (Q1 = 1.25; Q3 = 5.00). Ward nurses correctly identified delirium using the DOSS in 56% of cases (sensitivity) and no delirium in 92% of cases (specificity). Although the DOSS was 100% correct in detecting patients with hyperactive delirium, the identification rate decreased to 60% for patients with mixed delirium subtype and 38% for patients with hypoactive delirium. Delirium screening using observational methods may be insufficiently sensitive and should be supplemented with a formal attention test. [Journal of Gerontological Nursing, 44(12), 35–43.]


Delirium is an acute, severe neuropsychiatric syndrome characterized by disturbances in arousal, attention, and cognition (American Psychiatric Association [APA], 2013). There is a significant association between the development of incident dementia and the worsening of dementia after delirium (Davis et al., 2012; Davis et al., 2017). The non-detection of prevalent delirium in older patients presenting to emergency departments is associated with a 30.8% mortality rate within 6 months, which is double the mortality rate for patients with detected and treated delirium (Kakuma et al., 2003). The hazard ratio for post-discharge mortality is 1.62 in patients who develop delirium during hospitalization compared to patients who do not develop delirium (Witlox et al., 2010). It is assumed that the persistence of delirium symptoms, typically when delirium is overlooked and untreated, is associated with a reduced threshold for new episodes of delirium, thus causing prolonged neuropsychiatric symptoms (Witlox et al., 2010). The reasons for non-detection include lack of systematic delirium screening (Grossmann et al., 2014) and/or the inappropriate use of validated assessment tools (Inouye, Foreman, Mion, Katz, & Cooney, 2001).

Schuurmans, Shortridge-Baggett, and Duursma (2003) were the first to develop a delirium screening scale based on observation and established delirium definitions from criteria of the fourth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV; APA, 2000). The Delirium Observation Screening Scale (DOSS) comprises 13 items within seven domains (the number of items per domain is shown in parentheses): consciousness (1), attention/concentration (3), thinking (1), memory/orientation (3), psychomotor activity (3), mood (1), and perception (1). Each item counts as 1 point, with scores ranging between 0 (cognitively normal) and 13 (most severe delirium). A mean score ≥3 over a 24-hour period indicates delirium. Items were expressed using nurses' conversational language, such as “Dozes during conversation (or activities)” or “Is easily or suddenly emotional.” Internal consistency was high (Cronbach's alpha = 0.93 to 0.96) and predictive validity compared with DSM-IV criteria or diagnosis by a geriatrician was good (Gemert van & Schuurmans, 2007).

The DOSS is being increasingly used in different settings and in several countries. It has been validated for use in cardiac surgery (Koster, Hensens, Oosterveld, Wijma, & van der Palen, 2009), palliative care (Detroyer et al., 2014), and community hospice patients (Jorgensen, Carnahan, & Weckmann, 2017). It has also been validated in an acute care setting in the United States (Gavinski, Carnahan, & Weckmann, 2016). Currently, validation in British nursing homes is ongoing (Teale et al., 2016). In 2016, the DOSS was implemented in the Delirium Early Monitoring System on a psychogeriatric ward for patients with moderate to severe dementia in northeast England (Rippon et al., 2016).

Despite good performance in the different validation studies of the DOSS, a recent study revealed poor sensitivity of the DOSS in routine daily practice of clinical nurses (Numan et al., 2017). One important characteristic of the study by Numan et al. (2017) was that it was not performed in a center/population for which the instrument was developed. In addition, the study investigated the performance of the DOSS beyond a validation study setting. Therefore, because the current authors had comparable data generated outside a validation study, a secondary analysis of an existing dataset was performed (Hasemann, 2013).

The aim of the current study was to assess the performance of delirium detection based on observation when ward nurses use the DOSS in daily routine practice to screen for delirium in older hospitalized patients with cognitive impairment compared to a structured assessment using the Confusion Assessment Method (CAM; Inouye et al., 1990).



The current diagnostic performance study was a secondary data analysis of a larger program of research focused on a newly developed algorithm, DemDel, to prevent and manage delirium in patients with mild to major neurocognitive disorder (i.e., cognitive impairment) (Hasemann, 2013). The program of research comprised a 1-day training session for ward nurses to administer the intervention that focused on delirium prevention, including an intensive systematic screening schedule for cognitive impairment and delirium, as well as comprehensive delirium management (Hasemann et al., 2016). The session comprised interactive teaching sessions with case studies on nonpharmacological measures as well as the appropriate use of antipsychotic medications, role playing, and video examples for training of the assessments.

Setting and Participants

Data were collected in 2010 from four wards of a medical department at a Swiss tertiary university hospital (203 of 669 total beds). Patients were included in the study if they were newly admitted from their own home or a nursing home, were age 70 or older, and showed signs of cognitive impairment (Hasemann et al., 2016). Cognitive impairment was defined as a score of <27 (of 30) on the Swiss version of the Mini-Mental State Examination (SMMSE; Monsch et al., 1995) or <4 (of 7) points for the Swiss version of the clock drawing test (CDT; Thalmann et al., 2002). Patients were excluded if they were: (a) aphasic; (b) non-native speakers of German; (c) visually or hearing impaired; (d) in an advanced terminal illness state, stuporous, or comatose; (e) delirious on admission or experiencing symptoms due to the withdrawal of alcohol or benzodiazepine agents; (f) ill with a neurological disease, such as stroke or seizures; and/or (g) admitted to an oncology service.


The primary outcome was the development of delirium, which was diagnosed with the short version of the CAM, which comprises four items, to rate the following criteria: (a) acute onset and/or fluctuating course; (b) inattention; (c) disorganized thinking; and (d) altered level of consciousness. All items were rated dichotomously as either absent or present. The CAM algorithm for the diagnosis of delirium required both the first and second and either the third or fourth criteria to be present. To increase the sensitivity of the current study, the “or” option was used for the first criterion (acute onset and/or fluctuating course) (Inouye, 2003). When 1,071 pooled datasets with CAM assessments were validated against the diagnoses of experts (i.e., geriatricians, psychiatrists, neuropsychologists, and advanced practice nurses), the sensitivity of the CAM was 94% (95% confidence interval [CI] [0.91, 0.97]) and the specificity was 89% (95% CI [0.85, 0.94]), using DSM-III, DSM-IIIR, DSM-IV, or International Classification of Diseases, tenth revision criteria (Wei, Fearing, Sternberg, & Inouye, 2008). In patient samples with a higher proportion of dementia, specificity was lower (0.63 to 0.84) (Laurila, Pitkälä, Strandberg, & Tilvis, 2002). The rating of the CAM was based on a structured interview with the SMMSE, the comprehension subtest of the Cognitive Test for Delirium (Hart et al., 1996), and the Digit Span Test (Leung, Lee, Lam, Chan, & Wu, 2011) in the current study to measure inattention. To rate delirium subtype, the motor subscale of the Delirium Rating Scale Revised 98 (DRS-R-98) was used (Trzepacz, Baker, & Greenhouse, 1988).

The secondary outcome was the development of delirium subtype. Patients were classified as having hypoactive delirium if the psychomotor item scores were consistent for “motor retardation,” and as having hyperactive delirium if scores were consistent for “motor agitation.” All other variants were classified as mixed delirium. Patients were stratified into four different delirium risk groups using the approach of Inouye, Viscoli, Horwitz, Hurst, and Tinetti (1993). The following criteria contributed to delirium risk: an abnormal blood urea nitrogen/creatinine ratio ≥18 (Inouye et al., 1993); vision impairment (i.e., corrected binocular near vision worse than 20/70 as measured by the bedside Jaeger vision test) (Bates & Bickley, 1999); illness severity score >16 on the Acute Physiology and Chronic Health Evaluation (Knaus, Draper, Wagner, & Zimmerman, 1985); and cognitive impairment as described above. Cognitive impairment was measured with the SMMSE and CDT. Comorbidity was measured with the Charlson Comorbidity Index (Charlson, Pompei, Ales, & MacKenzie, 1987).

Data Collection

Research assistants assessed patients on a daily basis using the CAM and DRS-R-98. In addition, a chart review was undertaken, and nurses' and physicians' notes were screened for descriptions of delirious behavior such as fluctuation of symptoms. CAM assessments were conducted before ward nurses' DOSS scores were collected.

Statistical Analysis

Analyses of frequencies or measures of central tendency were performed using SPSS version 19 for descriptive statistics. Descriptive variables were scrutinized with the Kolmogorov-Smirnov test. Normally distributed descriptive data were reported with mean scores and standard deviations; non-normally distributed data were reported as medians with 25th (Q1) and 75th (Q3) percentiles. Binary classifiers, such as sensitivity or specificity, were calculated using R version 3.2.1 to calculate 95% CI and the diagnostic accuracy with the area under the curve (AUC) analysis. Exact binomial confidence limits were calculated for sensitivity, specificity, and positive and negative predictive value according to Collett (1999). The CI for positive and negative likelihood ratios is based on formulae provided by Simel, Samsa, and Matchar (1991). To compare ward nurses' DOSS scores with research assistants' CAM assessments, the highest DOSS score per day was compared with the dichotomous CAM result (delirium yes/no). A cutoff score ≥3 of 13 on the DOSS was used to determine whether patients had delirium. If a DOSS was not completed on a given day, that day was excluded from calculations of DOSS performance.

Ethical Considerations

Cognitive competence was an important issue in the current research, as the ability to consent declines with progression of cognitive impairment (Okonkwo et al., 2008). An initial informal conversation was started in which patients were asked about their understanding of their current situation. Further, patients were asked to sign the informed consent after the first assessment. This approach was intended to provide better understanding of the consequences that arose from being involved in the study (Hasemann, 2013). If patients demonstrated signs of disorientation with regard to their situation, they were considered incompetent. In these cases, patients' family members were asked to state whether patients' participation in the study would comply with their presumed wishes. Family members then signed the written proxy informed consent, as approved by the local ethics board (Hasemann, 2013).


From January to August 2010, 138 patients age 70 and older with cognitive impairment were included in the current secondary analysis. Mean patient age was 81.8 years, and 55.1% of patients were female (Table 1). Initially, no patients had delirium; however, 44 (31.8%) patients developed delirium with a median duration of 3 days (Q1 = 1.25; Q3= 5.00). Mixed delirium subtype was predominant (61.4%), followed by hypoactive (34.1%) and hyperactive (4.5%) subtypes. Ward nurses administered the DOSS in 104 of 138 patients, which corresponded to an adherence rate of 75.4%. The delirium rate in non-screened patients was 20.6% (seven of 34 patients). As a result, seven (15.9%) of 44 patients with delirium were missed due to nonadherence (Figure).

Table 1

Table 1 Patient Characteristics (N = 138)

Gender (n, %)
  Female76 (55.1)
  Male62 (44.9)
Mean age (years) (SD) (range)81.8 (6.3) (70 to 98)
Length of hospital stay (days) (median [Q1; Q3])10.3 (7; 14.6)
Initial SMMSE score (median [Q1; Q3] [range])22 (20; 25) (1 to 29)
CDT (median [Q1; Q3] [range])4 (4; 5) (0 to 7)
CCI (median [Q1; Q3] [range])2 (1; 3) (0 to 10)
Age-adjusted CCI (median [Q1; Q3])5.5 (4; 7)
Delirium risk (median [Q1; Q3])1 (1; 2)
Delirium risk in patients with diagnosed delirium (median [Q1; Q3])2 (1; 3)

Note. Q1 = 25th percentile; Q3 = 75th percentile; SMMSE = Swiss version of the Mini-Mental State Examination; CDT = Clock Drawing Test; CCI = Charlson Comorbidity Index.


Nurses' adherence to performance of the Delirium Observation Screening Scale (DOSS).

Primary Outcomes

Table 2 shows the performance calculations of completed DOSS screenings in comparison to CAM assessments. On a day-to-day basis in 104 patients, 355 comparisons of ward nurses' DOSS screenings with research assistants' CAM assessments revealed that ward nurses correctly identified delirium episodes in 56% of cases (sensitivity) and no delirium in 92% of cases (specificity). Among bedside nurses' delirium assignments with the DOSS, 73% were classified as having delirium by the CAM (positive predictive value) and 83% of no delirium assessments were confirmed by research assistants (negative predictive value). The probability that bedside nurses classified a patient with delirium correctly was 6.67 times higher than misclassifying a patient without delirium as having the condition (positive likelihood ratio). Conversely, bedside nurses' probability of misclassifying a patient with delirium as not having delirium was 0.48 times lower than classifying a patient without delirium as not having delirium (negative likelihood ratio) (Table 2).

Table 2

Table 2 Results of Nurses' Assessments Using the Delirium Observation Screening Scale (DOSS)

VariableDelirium Subtype (Value [95% CI])
Observations (n)231305187355
Sensitivity0.38 [0.18, 0.62]0.60 [0.48, 0.70]1.00 [0.01, 1.00]0.56 [0.46, 0.66]
Specificity0.93 [0.89, 0.96]0.93 [0.89, 0.96]0.95 [0.90, 0.97]0.92 [0.87, 0.95]
Positive likelihood5.71 [2.72, 12.02]8.88 [5.28, 14.94]18.60 [0.97, 1.00]6.67 [4.28, 10.39]
Negative likelihood0.66 [0.47, 0.93]0.43 [0.33, 0.56]0.00 [0.00, NaN]0.48 [0.39, 0.60]
Positive predictive value0.36 [0.17, 0.59]0.77 [0.64, 0.86]0.09 [0.00, 0.41]0.73 [0.62, 0.83]
Negative predictive value0.94 [0.90, 0.97]0.86 [0.81, 0.90]1.00 [0.97, 1.00]0.83 [0.78, 0.88]
Delirium According to the CAMObservations (n)
YesNoYesNoYes NoYesNo
DOSS delirium rating = yes8 1449151105821
DOSS delirium rating = no1319633208017646230
Patients (n)126724671673767

Note. CI = confidence interval; NaN = not a number; CAM = Confusion Assessment Method.

Secondary Outcomes

DOSS screenings of patients with hyperactive delirium were 100% correct (AUC = 99.7%); however, the identification rate decreased to 60% in patients with mixed delirium (AUC = 84.8%) and 38% in patients with hypoactive delirium (AUC = 70.7%). Regarding one rating of a patient with hyperactive delirium, the opinion of the research assistants and bedside nurses differed. Research assistants rated the resolved delirium as “no delirium,” whereas the bedside nurses labeled the patient as still having delirium according to their DOSS screenings, which led to a specificity of 95% for patients with hyperactive delirium. The specificity in patients with mixed delirium and hypoactive delirium subtypes was 93% in both cases. False scorings (i.e., false negative or false positive scorings) occurred in 5% of patients with hyperactive delirium, 47% of patients with mixed delirium, and 69% of patients with hypoactive delirium. DOSS–CAM comparisons revealed a DOSS score of 0 (i.e., no delirium) among seven (33.3%) of 21 comparisons in patients with hypoactive delirium and 11 (13.4%) of 82 comparisons in patients with mixed delirium.


Although nurses' screenings with the DOSS revealed high specificity, the overall sensitivity was low, particularly in patients with hypoactive delirium and partially in patients with mixed delirium. Furthermore, the adherence rate to conduct the screenings was 75.4%, which was lower than expected. Specificity of the DOSS was high overall as well as for subtypes, and the positive likelihood achieved in the total comparison and subtype analysis of values >5 (Table 2) is considered clinically useful (Jaeschke, Meade, Guyatt, Keenan, & Cook, 1997). However, with the exception of the hyperactive delirium group, no other subgroup or the total sample achieved acceptable negative likelihood ratios due to false negative DOSS screenings. Furthermore, the low AUC in the hypoactive delirium subgroup revealed that the DOSS has limitations in the prediction of delirium for this patient population.

Concerning the diagnostic performance comparisons, the results of the current study revealed several weaknesses when nurses rated delirium based on observation using the DOSS in patients with cognitive impairment. A global sensitivity rate of 56% is poor, considering that nurses had sufficient time to rate patients' behaviors over a time frame of approximately 9 hours. This finding is consistent with that of a study by Numan et al. (2017), in which only 32% of patients with delirium were detected. The best correspondence between research assistants' CAM assessments and ward nurses' DOSS scores was achieved in patients with hyperactive delirium, and a moderate correspondence was found in patients with mixed delirium. In patients with hypoactive delirium, the detection rate was weak (38%). This result is consistent with findings of Schofield, Tolson, and Fleming (2012), whose critical discourse analysis revealed that “the hyperactive presentation of delirium became for nurses a defining feature of delirium and excluded the equally undesirable hypoactive or ‘quiet’ presentation” (p. 173). As underdetection occurred mainly in patients with mixed and hypoactive delirium, there might be a systematic problem underlying the poor scores of these delirium subtypes. One problem may be that inattention is difficult to detect without formal assessment, as one third of the DOSS scores were 0 in patients with hypoactive delirium. Patients with hypoactive delirium are characterized as having psychomotor retardation in terms of decreased speed of actions or decreased speech (Meagher et al., 2014). Thus, it is more difficult to detect delirium symptoms in this group of patients. In a content analysis of nursing documentation, Steis (2009) could not detect any patient with delirium superimposed on dementia (DSD) because nurses did not use the terms “delirium” or “acute confusion” to describe these patients. Instead, they used orientation criteria to describe patients' mental status.

A similar effect was observed in a study that used the Intensive Care Delirium Screening Checklist (ICDSC), an observational delirium screening instrument for critical care patients (van Eijk et al., 2009). Sensitivity for delirium was 43% when nurses used the ICDSC to screen for delirium (van Eijk et al., 2009). When Inouye et al. (2001) investigated nurses' recognition of delirium and its symptoms, they found four independent patient risk factors for under-recognition. The strongest risk factor was hypoactive delirium. In the study by Inouye et al. (2001), nurses rated delirium based only on observation. The poor detection rates for hypoactive and mixed delirium subtypes may therefore be a consequence of relying on observation instead of performing a structured assessment for appraising patient cognitive status. Furthermore, nurses had difficulty in correctly appraising inattention (15%), disorganized thinking (26%), and altered level of consciousness (15%), which are key features of delirium (Inouye et al., 2001). In addition, Inouye et al. (2001) argued that nurses “tend to label patients as delirious when their behavior made them difficult to care for” (p. 2471), following the argument of Palmateer and McCartney (1985) that nurses tend to describe patients with cognitive deficits as “having more difficulty with self-care and fewer positive social interactions” (p. 12). Nurses' tendency to label patients with cognitive deficits with descriptions of undesirable behavior is consistent with the observations of Schofield et al. (2012), who described nurses' construction of confused older adults “as running, moving, and falling bodies” and that “the discursive framings of patients mainly focused on the movement and speed with which they left the confines of the ward” (p. 172).

In addition, as the current sample comprised patients with pre-existing cognitive impairment, there may have been difficulty in differentiating between normal aging, dementia, and DSD. Richardson et al. (2017) stated that the detection of DSD is usually achieved by collateral history. As the DOSS does not rely on collateral history information, there might be a weakness in detecting acute changes in cognition, as this criterion is not part of the DOSS.

Comparing subgroups (i.e., patients with hypoactive and hyperactive delirium), the detection rates were approximately double those found in the study by Fick, Hodo, Lawrence, and Inouye (2007). This improved detection rate may be due to the training sessions that nurses attended in the current study. However, it is difficult to explain why the sensitivity was, at 94.4%, significantly better in the validation study by Schuurmans, Donders, Shortridge-Baggett, and Duursma (2002) than in the current study (56%). One explanation might be the different settings. Interestingly, higher specificity of 92% was obtained in the current study compared to 77% in the study by Schuurmans et al. (2002). This finding may be attributed to the fact that nurses tend to label patients with altered mental status as experiencing dementia rather than delirium or DSD (Day, Higgins, & Koch, 2008). This labeling may explain the high negative predictive values in the hypoactive delirium subgroup of the current study.

The screening adherence rate of 75.4% in the current study is slightly higher than average adherence rates in implementation studies in which systematic screenings were introduced with different scales. Adherence rates in 13 reviewed studies ranged from 14% to 100%, with a median rate of 75% (Trogrlic et al., 2015).

As delirium detection via observation has limitations, the question arises of whether structured delirium assessment (i.e., asking patients questions about their cognitive status several times per day over several days) would provide a better alternative. Richardson et al. (2017) suggested a combined arousal–attention assessment for diagnosing DSD. However, the current evidence reveals that nonadherence rates remain similar when nurses ask structured questions. Leuenberger, Fierz, Hinck, Bodmer, and Hasemann (2017) investigated adherence to an alcohol withdrawal management program and demonstrated that nurses conducted only 50% of the required structured assessments. Nurses had to rate patients' withdrawal symptoms using the Clinical Institute Withdrawal Assessment of Alcohol Scale (CIWA-Ar; Sullivan, Sykora, Schneiderman, Naranjo, & Sellers, 1989). Nurses had to rate patients' tremors or ask patients whether they had experienced hallucinations. Interestingly, some nurses who reported having performed the CIWA-Ar assessments stated that they did not ask patients the questions about hallucinations or tremors as required to avoid confronting patients with offending questions (Soldi, Hasemann, & Frei, 2015). This finding is consistent with research by Agar et al. (2012). It seems that some nurses tend to avoid discussing stigmatized topics with patients. However, regarding the current study, in the interviews with patients in the parent study, patients with cognitive impairment were open to cognitive testing (Hasemann, 2013).

The discussion of the pros and cons of observation versus structured assessment using questionnaires remains inconclusive. On the one hand, the arguments by Schuurmans et al. (2003) to favor delirium detection through observation are weakened by the poor sensitivity of this procedure in real-life situations, as demonstrated by the current study. On the other hand, nonadherence to structured assessments reveals similar gaps. Neither method appears superior to the other. The DOSS is well-designed, reflects DSM-IV criteria, and provides a score that allows nurses to demonstrate the fluctuating course of delirium, which is helpful during discussions with other providers, particularly during lucid phases of delirium. The advantage of the DOSS lies in its questions, which closely reflect nurses' usual language. In the case of patients who are calm and/or drowsy, nurses should formally rate core features, such as inattention, to improve their delirium detection. Using the Months Backwards Test to screen for inattention, delirium may be ruled out in less than 30 seconds (Hasemann et al., 2018). A fairly new instrument, the 4AT, seems promising (Bellelli et al., 2014). The 4AT asks four questions regarding patient age, date of birth, place, and current year, and asks the patient to recite the months of the year in reverse order. Features of fluctuation and arousal are based on observation. In 2016, the 4AT was administered to measure the nationwide point prevalence of delirium in 108 acute and 12 rehabilitation wards of 161 invited wards in Italian hospitals on “Delirium Day,” resulting in an adherence rate of 74.5% (Bellelli et al., 2016). It would be interesting to see whether this approach would enhance acceptance for formal screening.


Due to low scores in the hyperactive delirium subgroup, some estimates could not be calculated with great precision, resulting in the non-estimation of the positive likelihood ratio. This ratio would have enhanced the interpretation of the results had it been possible to include patients with more severe dementia in the study. Greater refusals for study participation were experienced from family members of patients with end-stage dementia. Their concern was that the study may be an additional stressor to the already-stressful experience of dementia. Involving patients with cognitive impairment who do not have the capacity to provide informed consent is a well-known challenge inherent in delirium research with the risk of selection bias (Adamis, Martin, Treloar, & Macdonald, 2005; Holt, Siddiqi, & Young, 2008). As a consequence, the reliability of bedside nurses' DOSS scores in patients with severe dementia remains unknown. A further challenge for nurses may have been the ability to differentiate between existing and new cognitive impairment. This inability may have also contributed to the low sensitivity of the DOSS in the current patient sample. In addition, neurology and oncology patients were excluded, as both patient groups had different standard operating procedures on how delirium should be medically managed.

Although the CAM is widely accepted and extensively validated in different settings for assessing delirium, it is not the optimal reference standard to compare the performance of different scales. The optimal standard would have been DSM-IV criteria. The CAM performs best when compared to DSM-IV criteria (Laurila et al., 2002), which are the basis of the DOSS. Nevertheless, this fact may have contributed to the lower sensitivity in some cases. However, thorough daily assessments using the SMMSE and DRS-R-98 provided a sound, informed basis to rate the CAM.

Interrater reliability was not formally tested; therefore, a potential risk for measurement bias may have occurred. However, research assistants were not allowed to perform ratings independently unless there was agreement of at least 90%.

As the current study was conducted in patients with cognitive impairment, generalization of the results to patients without baseline cognitive impairment is not recommended.


The current study sought to compare nurse observations based on the validated DOSS. Findings provide evidence that the DOSS can identify patients without delirium with a specificity of 92% in patients with cognitive impairment. However, the low sensitivity of the detection of mixed and hypoactive delirium and DSD raises the question of whether the addition of a formal attention test to the DOSS could improve sensitivity. Future research should focus on improving the recognition of hypoactive delirium features on an observational basis.

  • Adamis D., Martin F.C., Treloar A., Macdonald A.J. (2005). Capacity, consent, and selection bias in a study of delirium. Journal of Medical Ethics, 31, 137–143.10.1136/jme.2002.000919

    Crossref MedlineGoogle Scholar
  • Agar M., Draper B., Phillips P.A., Phillips J., Collier A., Harlum J., Currow D. (2012). Making decisions about delirium: A qualitative comparison of decision making between nurses working in palliative care, aged care, aged care psychiatry, and oncology. Palliative Medicine, 26, 887–896.10.1177/0269216311419884

    Crossref MedlineGoogle Scholar
  • American Psychiatric Association. (2000). Delirium, dementia, and amnestic and other cognitive disorders. In Diagnostic and statistical manual of mental disorders (4th ed., text rev., pp. 135–147). Washington, DC: Author.

    Google Scholar
  • American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.). Washington, DC: Author.

    CrossrefGoogle Scholar
  • Bates B., Bickley L.S. (1999). Bates' guide to physical examination and history taking (7th ed.). Philadelphia, PA: Lippincott.

    Google Scholar
  • Bellelli G., Morandi A., Davis D.H., Mazzola P., Turco R., Gentile S., MacLullich A.M. (2014). Validation of the 4AT, a new instrument for rapid delirium screening: A study in 234 hospitalised older people. Age and Ageing, 43, 496–502.10.1093/ageing/afu021

    Crossref MedlineGoogle Scholar
  • Bellelli G., Morandi A., Di Santo S.G., Mazzone A., Cherubini A., Mossello E., Trabucchi M. (2016). “Delirium Day”: A nationwide point prevalence study of delirium in older hospitalized patients using an easy standardized diagnostic tool. BMC Medicine, 14, 106.10.1186/s12916-016-0649-8

    Crossref MedlineGoogle Scholar
  • Charlson M.E., Pompei P., Ales K.L., MacKenzie C.R. (1987). A new method of classifying prognostic comorbidity in longitudinal studies: Development and validation. Journal of Chronic Diseases, 40, 373–383.10.1016/0021-9681(87)90171-8

    Crossref MedlineGoogle Scholar
  • Collett D. (1999). Modelling binary data (2nd ed.) Boca Raton, FL: Chapman & Hall/CRC.

    Google Scholar
  • Davis D.H., Muniz-Terrera G., Keage H., Rahkonen T., Oinas M., Matthews F.E., Brayne C. (2012). Delirium is a strong risk factor for dementia in the oldest-old: A population-based cohort study. Brain, 135, 2809–2816.10.1093/brain/aws190

    Crossref MedlineGoogle Scholar
  • Davis D.H., Muniz-Terrera G., Keage H.A., Stephan B.C., Fleming J., Ince P.G., Brayne C. (2017). Association of delirium with cognitive decline in late life: A neuropathologic study of 3 population-based cohort studies. JAMA Psychiatry, 74, 244–251.10.1001/jamapsychiatry.2016.3423

    Crossref MedlineGoogle Scholar
  • Day J., Higgins I., Koch T. (2008). Delirium and older people: What are the constraints to best practice in acute care?International Journal of Older People Nursing, 3, 170–177.10.1111/j.1748-3743.2008.00115.x

    Crossref MedlineGoogle Scholar
  • Detroyer E., Clement P.M., Baeten N., Pennemans M., Decruyenaere M., Vandenberghe J., Milisen K. (2014). Detection of delirium in palliative care unit patients: A prospective descriptive study of the Delirium Observation Screening Scale administered by bedside nurses. Palliative Medicine, 28, 79–86.10.1177/0269216313492187

    Crossref MedlineGoogle Scholar
  • Fick D.M., Hodo D.M., Lawrence F., Inouye S.K. (2007). Recognizing delirium superimposed on dementia: Assessing nurses' knowledge using case vignettes. Journal of Gerontological Nursing, 33(2), 40–47.

    LinkGoogle Scholar
  • Gavinski K., Carnahan R., Weckmann M. (2016). Validation of the delirium observation screening scale in a hospitalized older population. Journal of Hospital Medicine, 11, 494–497.10.1002/jhm.2580

    Crossref MedlineGoogle Scholar
  • Gemert van L.A., Schuurmans M.J. (2007). The Neecham Confusion Scale and the Delirium Observation Screening Scale: Capacity to discriminate and ease of use in clinical practice. BMC Nursing, 6, 3.10.1186/1472-6955-6-3

    Crossref MedlineGoogle Scholar
  • Grossmann F.F., Hasemann W., Graber A., Bingisser R., Kressig R.W., Nickel C.H. (2014). Screening, detection and management of delirium in the emergency department—A pilot study on the feasibility of a new algorithm for use in older emergency department patients: The modified Confusion Assessment Method for the Emergency Department (mCAM-ED). Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine, 22, 19.10.1186/1757-7241-22-19

    Crossref MedlineGoogle Scholar
  • Hart R.P., Levenson J.L., Sessler C.N., Best A.M., Schwartz S.M., Rutherford L.E. (1996). Validation of a cognitive test for delirium in medical ICU patients. Psychosomatics, 37, 533–546.10.1016/S0033-3182(96)71517-7

    Crossref MedlineGoogle Scholar
  • Hasemann W. (2013). Effects of the interdisciplinary Basel delirium and dementia prevention and management program DEMDEL (PhD thesis). Glasgow Caledonian University, Glasgow, Scotland.

    Google Scholar
  • Hasemann W., Grossmann F.F., Stadler R., Bingisser R., Breil D., Hafner M., Nickel C.H. (2018). Screening and detection of delirium in older ED patients: Performance of the modified Confusion Assessment Method for the Emergency Department (mCAM-ED). A two-step tool. Internal and Emergency Medicine, 13, 915–922.10.1007/s11739-017-1781-y

    Crossref MedlineGoogle Scholar
  • Hasemann W., Tolson D., Godwin J., Spirig R., Frei I.A., Kressig R.W. (2016). A before and after study of a nurse led comprehensive delirium management programme (DemDel) for older acute care inpatients with cognitive impairment. International Journal of Nursing Studies, 53, 27–38.10.1016/j.ijnurstu.2015.08.003

    Crossref MedlineGoogle Scholar
  • Holt R., Siddiqi N., Young J. (2008). The ethics of consent in delirium studies. Journal of Psychosomatic Research, 65, 283–287.10.1016/j.jpsychores.2008.05.023

    Crossref MedlineGoogle Scholar
  • Inouye S.K. (2003). The Confusion Assessment Method (CAM). Training manual and coding guide. Retrieved from

    Google Scholar
  • Inouye S.K., Foreman M.D., Mion L.C., Katz K.H., Cooney L.M.. (2001). Nurses' recognition of delirium and its symptoms: Comparison of nurse and researcher ratings. Archives of Internal Medicine, 161, 2467–2473.10.1001/archinte.161.20.2467

    Crossref MedlineGoogle Scholar
  • Inouye S.K., van Dyck C.H., Alessi C.A., Balkin S., Siegal A.P., Horwitz R.I. (1990). Clarifying confusion: The confusion assessment method. A new method for detection of delirium. Annals of Internal Medicine, 113, 941–948.10.7326/0003-4819-113-12-941

    Crossref MedlineGoogle Scholar
  • Inouye S.K., Viscoli C.M., Horwitz R.I., Hurst L.D., Tinetti M.E. (1993). A predictive model for delirium in hospitalized elderly medical patients based on admission characteristics. Annals of Internal Medicine, 119, 474–481.10.7326/0003-4819-119-6-199309150-00005

    Crossref MedlineGoogle Scholar
  • Jaeschke R.Z., Meade M.O., Guyatt G.H., Keenan S.P., Cook D.J. (1997). How to use diagnostic test articles in the intensive care unit: Diagnosing weanability using f/Vt. Critical Care Medicine, 25, 1514–1521.10.1097/00003246-199709000-00018

    Crossref MedlineGoogle Scholar
  • Jorgensen S.M., Carnahan R.M., Weckmann M.T. (2017). Validity of the Delirium Observation Screening Scale in identifying delirium in home hospice patients. American Journal of Hospice and Palliative Care, 34, 744–747.10.1177/1049909116658468

    Crossref MedlineGoogle Scholar
  • Kakuma R., du Fort G.G., Arsenault L., Perrault A., Platt R.W., Monette J., Wolfson C. (2003). Delirium in older emergency department patients discharged home: Effect on survival. Journal of the American Geriatrics Society, 51, 443–450.10.1046/j.1532-5415.2003.51151.x

    Crossref MedlineGoogle Scholar
  • Knaus W.A., Draper E.A., Wagner D.P., Zimmerman J.E. (1985). APACHE II: A severity of disease classification system. Critical Care Medicine, 13, 818–829.10.1097/00003246-198510000-00009

    Crossref MedlineGoogle Scholar
  • Koster S., Hensens A.G., Oosterveld F.G., Wijma A., van der Palen J. (2009). The Delirium Observation Screening Scale recognizes delirium early after cardiac surgery. European Journal of Cardiovascular Nursing, 8, 309–314.10.1016/j.ejcnurse.2009.02.006

    Crossref MedlineGoogle Scholar
  • Laurila J.V., Pitkälä K.H., Strandberg T.E., Tilvis R.S. (2002). Confusion assessment method in the diagnostics of delirium among aged hospital patients: Would it serve better in screening than as a diagnostic instrument?International Journal of Geriatric Psychiatry, 17, 1112–1119.10.1002/gps.753

    Crossref MedlineGoogle Scholar
  • Leuenberger D.L., Fierz K., Hinck A., Bodmer D., Hasemann W. (2017). A systematic nurse-led approach to withdrawal risk screening, prevention and treatment among inpatients with an alcohol use disorder in an ear, nose, throat and jaw surgery department: A formative evaluation. Applied Nursing Research, 33, 155–163.10.1016/j.apnr.2016.11.013

    Crossref MedlineGoogle Scholar
  • Leung J.L., Lee G.T., Lam Y.H., Chan R.C., Wu J.Y. (2011). The use of the Digit Span Test in screening for cognitive impairment in acute medical inpatients. International Psychogeriatrics, 23, 1569–1574.10.1017/S1041610211000792

    Crossref MedlineGoogle Scholar
  • Meagher D., Adamis D., Leonard M., Trzepacz P., Grover S., Jabbar F., Maclullich A. (2014). Development of an abbreviated version of the Delirium Motor Subtyping Scale (DMSS-4). International Psychogeriatrics, 26, 693–702.10.1017/S1041610213002585

    Crossref MedlineGoogle Scholar
  • Monsch A.U., Foldi N.S., Ermini-Funfschilling D.E., Berres M., Taylor K.I., Seifritz E., Spiegel R. (1995). Improving the diagnostic accuracy of the Mini-Mental State Examination. Acta Neurologica Scandinavica, 92, 145–150.10.1111/j.1600-0404.1995.tb01029.x

    Crossref MedlineGoogle Scholar
  • Numan T., van den Boogaard M., Kamper A.M., Rood P.J.T., Peelen L.M., Slooter A.J.C. (2017). Recognition of delirium in postoperative elderly patients: A multicenter study. Journal of the American Geriatrics Society, 65, 1932–1938.10.1111/jgs.14933

    Crossref MedlineGoogle Scholar
  • Okonkwo O.C., Griffith H.R., Copeland J.N., Belue K., Lanza S., Zamrini E.Y., Marson D.C. (2008). Medical decision-making capacity in mild cognitive impairment: A 3-year longitudinal study. Neurology, 71, 1474–1480.10.1212/01.wnl.0000334301.32358.48

    Crossref MedlineGoogle Scholar
  • Palmateer L.M., McCartney J.R. (1985). Do nurses know when patients have cognitive deficits?Journal of Gerontological Nursing, 11(2), 6–7, 10–12, 15–16.10.3928/0098-9134-19850201-05

    LinkGoogle Scholar
  • Richardson S.J., Davis D.H.J., Bellelli G., Hasemann W., Meagher D., Kreisel S.H., Morandi A. (2017). Detecting delirium superimposed on dementia: Diagnostic accuracy of a simple combined arousal and attention testing procedure. International Psychogeriatrics, 27, 1585–1593.10.1017/S1041610217000916

    CrossrefGoogle Scholar
  • Rippon D., Milisen K., Detroyer E., Mukaetova-Ladinska E., Harrison B., Schuurmans M., Teodorczuk A. (2016). Evaluation of the delirium early monitoring system (DEMS). International Psychogeriatrics, 28, 1879–1887.10.1017/S1041610216000983

    Crossref MedlineGoogle Scholar
  • Schofield I., Tolson D., Fleming V. (2012). How nurses understand and care for older people with delirium in the acute hospital: A critical discourse analysis. Nursing Inquiry, 19, 165–176.10.1111/j.1440-1800.2011.00554.x

    Crossref MedlineGoogle Scholar
  • Schuurmans M.J., Donders R.T., Shortridge-Baggett L.M., Duursma S.A. (2002). Delirium case finding: Pilot testing of a new screening scale for nurses. Journal of the American Geriatrics Society, 50, S3.

    MedlineGoogle Scholar
  • Schuurmans M.J., Shortridge-Baggett L.M., Duursma S.A. (2003). The Delirium Observation Screening Scale: A screening instrument for delirium. Research and Theory for Nursing Practice, 17, 31–50.10.1891/rtnp.

    Crossref MedlineGoogle Scholar
  • Simel D.L., Samsa G.P., Matchar D.B. (1991). Likelihood ratios with confidence: Sample size estimation for diagnostic test studies. Journal of Clinical Epidemiology, 44, 763–770.10.1016/0895-4356(91)90128-V

    Crossref MedlineGoogle Scholar
  • Soldi M., Hasemann W., Frei I.A. (2015). Patients' and professionals' experiences with a programme to prevent an alcohol withdrawal syndrome - A case study (unpublished Master's thesis). Institute of Nursing Science, University of Basel, Basel, Switzerland.

    Google Scholar
  • Steis M.R. (2009). Nurse recognition of delirium superimposed on dementia in hospitalized older adults (PhD dissertation). Pennsylvania State University, College Park, PA.

    Google Scholar
  • Sullivan J.T., Sykora K., Schneiderman J., Naranjo C.A., Sellers E.M. (1989). Assessment of alcohol withdrawal: The revised clinical institute withdrawal assessment for alcohol scale (CIWA-Ar). British Journal of Addiction, 84, 1353–1357.10.1111/j.1360-0443.1989.tb00737.x

    Crossref MedlineGoogle Scholar
  • Teale E., Young J., Siddiqi N., Munyombwe T., Harrison J., Schuurmanns M. (2016). Study protocol-investigation of the Delirium Observation Screening Scale (DOSS) for the routine detection of delirium in the care home setting: A prospective cohort study. BMJ Open, 6, e009615.10.1136/bmjopen-2015-009615

    Crossref MedlineGoogle Scholar
  • Thalmann B., Spiegel R., Stähelin H.B., Brubacher D., Ermini-Fünfschilling D., Bläsi S., Monsch A.U. (2002). Dementia screening in general practice: Optimised scoring for the clock drawing test. Brain Aging, 2(2), 36–43.

    Google Scholar
  • Trogrlic Z., van der Jagt M., Bakker J., Balas M.C., Ely E.W., van der Voort P.H., Ista E. (2015). A systematic review of implementation strategies for assessment, prevention, and management of ICU delirium and their effect on clinical outcomes. Critical Care, 19, 157.10.1186/s13054-015-0886-9

    Crossref MedlineGoogle Scholar
  • Trzepacz P.T., Baker R.W., Greenhouse J. (1988). A symptom rating scale for delirium. Psychiatry Research, 23, 89–97.10.1016/0165-1781(88)90037-6

    Crossref MedlineGoogle Scholar
  • van Eijk M.M., van Marum R.J., Klijn I.A., de Wit N., Kesecioglu J., Slooter A.J. (2009). Comparison of delirium assessment tools in a mixed intensive care unit. Critical Care Medicine, 37, 1881–1885.10.1097/CCM.0b013e3181a00118

    Crossref MedlineGoogle Scholar
  • Wei L.A., Fearing M.A., Sternberg E.J., Inouye S.K. (2008). The confusion assessment method: A systematic review of current usage. Journal of the American Geriatrics Society, 56, 823–830.10.1111/j.1532-5415.2008.01674.x

    Crossref MedlineGoogle Scholar
  • Witlox J., Eurelings L.S., de Jonghe J.F., Kalisvaart K.J., Eikelenboom P., van Gool W.A. (2010). Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: A meta-analysis. JAMA, 304, 443–451.10.1001/jama.2010.1013

    Crossref MedlineGoogle Scholar

We use cookies on this site to enhance your user experience. For a complete overview of all the cookies used, please see our privacy policy.