+44 20 3868 9735
Aims and objectives: A number of validated questionnaires are routinely used to screen specific populations for obstructive sleep apnea (OSA) including the STOP, STOP-Bang, Berlin and Epworth sleepiness scales. These questionnaires depend on subjective questions which cannot be independently confirmed. The subjective questions also result in high sensitivity and low specificity as they are generally resulting from OSA. The aim of the study was to identify verifiable and independently measurable risk factors and increase specificity to limit the number of polysomnography evaluations (PE) and lower healthcare cost.
Methods: A retrospective data collection of patients (N=164) enrolled for PE was performed which included the results of STOP, STOP Bang, Berlin and Epworth questionnaires as well as demographic and health related variables. OSA was defined as an AHI>=15 obtained from an overnight PE. Sensitivity and specificity of each questionnaire as well as for combinations of other, independently verifiable factors (IVF) was calculated. A new questionnaire was devised including the IVFs and data was prospectively collected from patients undergoing PE (N=209).
Results: The retrospective analysis identified age>50, male, BMI>30, alcohol consumption >21 per week, collar circumference>16 inches (40 cm), diabetes, use of antidepressants and high blood pressure as the most influential factors. Prospective data collection was performed and analysis resulted in a new scale with a cut off of 3 based on the following equation: OSA=(2*BMI>30)+(Age>50)+(Male)+(neck>16)+(diabetes)+(alcohol>21unit/week). For every 100 patients with OSA, the total number enrolled for PE based on each screening tool were respectively for STOP 92 enrolled of whom 41 were diagnosed and 1 patient missed, for STOP-Bang 94 enrolled, 42 identified and 1 missed, Berlin 83 enrolled, 36 identified and 7 missed, Epworth 46 enrolled, 22 identified and 20 missed and our new screening tool 65 enrolled, 35 identified and 8 missed.
Conclusion: In a high risk population of patients referred for PE we identified independently verifiable factors associated with OSA and with only 2/3 of patients enrolled for PE, we identified most OSA cases while keeping the number of missed cases down.
Obstructive sleep apnea; Polysomnography; Epworth sleepiness scales; Merlin Tool
Increased awareness amongst general practitioners, nonrespiratory physicians and the general population of the risks associated with obstructive sleep apnea (OSA) has led to an increase in the volume of referrals requesting polysomnography evaluations in our sleep clinic. There are a number of validated questionnaires which are routinely used to screen specific populations for OSA including the STOP, STOP-Bang, Berlin and Epworth sleepiness scales [1-3]. These questionnaires include subjective questions which cannot be independently confirmed. For instance, snoring which has a higher chance of being reported when the patient has a partner. These subjective questions result in a high sensitivity and low specificity as they are often indicative of OSA.
The aim of the presented study was to identify verifiable and independently measurable risk factors to increase specificity and limit the number/prioritize patients for polysomnography exams (PSG). These improved risk factors will improve independent identification of patients that will likely need treatment which in turn will lower healthcare cost. Our study population included patients whom already had been referred to the sleep clinic and therefore can be considered as a high risk population.
A retrospective data collection recorded all information from patients presenting at the sleep clinic with suspected OSA during 2013. For each patient, the following variables were collected: age, sex, blood pressure, BMI, smoking, alcohol intake, neck circumference, history of asthma, COPD, diabetes and other comorbidities, use of antidepressants and other medications. Each patient also filled out the STOP (-bang), Berlin and Epworth sleepiness scales. Based on the above retrospective data collection and subsequent analysis, a prospective group was then enrolled and our Merlin tool was used to screen for OSA.
Moderate OSA was defined as an AHI>=15 obtained from an overnight PSG. Sensitivity and specificity of each questionnaire as well as for combinations of independently verifiable factors (IVF) were calculated and compared to obtain the most advantageous combinations to maximise specificity without compromising sensitivity too much. Numbers of false positive patients (questionnaire indicates OSA but AHI<15) and false negative patients (questionnaire dismisses OSA but AHI>+15) were calculated to compare the number of missed cases. A diagnostic odds ratio (dOR) was calculated using a 2 by 2 table. The dOR indicates how many times more often a positive test result occurs among patients with the condition of interest compared to patients without the condition. Data was entered in Excel and statistical analysis was performed using IBM-SPSS v22 for Windows.
Ethical approval was sought and obtained from the Health Service Executive, University Hospitals Galway Ethical committee.
Data was collected retrospectively from a total of 164 patients who were already enrolled for PSG. Data collection included the results of STOP, STOP Bang, Berlin and Epworth questionnaires as well as demographic and health related variables (Table 1).
Table 1: Demographics.
|N||%||Of whom needed treatment||%|
|Age ≥ 50||190||61||85||45|
|Neck ≥ 16 cm||211||70||99||47|
|ORO ≥ 3||56||39||33||59|
The prevalence of patients requiring treatment for OSA was 35.5%. The STOP questionnaire had a sensitivity of 93.2% and specificity of 6.8%, the STOPBANG questionnaire had a sensitivity of 100% and specificity of 5.1%, the Berlin questionnaire 98.3% and 11.3% respectively and the Epworth questionnaire resulted in 70.2% sensitivity and 39.8% specificity. Based on different combinations of IVFs, BMI, age, sex, neck circumference, diabetes and alcohol consumption, antidepressant use and high blood pressure were identified as the most influential IVFs.
This score was subsequently optimized using prospectively collected data of all new patients undergoing PSG (N=209) in the sleep clinic. Prevalence of OSA requiring treatment was 42.6%. The final Merlin score was based on the following factors each accounting for 1 point except for BMI >30 which was given 2 points: age>50, male sex, BMI>30 (2 points), alcohol consumption>21 per week, collar circumference>16 inches (40cm). A Merlin score equal to or over 3 was indicative of OSA. The Merlin tool reached a sensitivity of 77.3% and specificity of 48.2% while the other tools changed slightly (Table 2).
Table 2: Overview of the sensitivity and specificity of each tool as well as the dOR at the indicated prevalence.
|Screening tool||Sensitivity %||Specificity %||Number enrolled for PSG||Number with OSA||Number missed||dOR||CI|
Merlin Score=(2*BMI>30)+(Age>50)+(Male)+(neck>16)+ (diabetes)+(alcohol>21unit/week).
For every 100 patients with OSA, the total number enrolled for PSG based on each screening tool were respectively 92 for STOP of whom 41 (45%) were diagnosed and 1 patient missed, 94 for STOP-Bang of whom 42 were diagnosed (45%) and 1 missed, 83 for Berlin of whom 36 (43%) diagnosed and 7 missed, 46 for Epworth of whom 22 (48%) were diagnosed and 20 missed and our new Merlin tool that enrolled 65 patients of whom 35 (54%) diagnosed and 8 missed. Table 2 shows an overview of the sensitivity and specificity of each tool as well as the dOR at the indicated prevalence. In the presented population, with a high prevalence of OSA, only the dOR of the Merlin tool is significant (3.2 (1.6-6.2)) while the other dORs all include 1 in their confidence interval (no difference). This indicates that in the presented population, the chance of a positive diagnosis based on the Merlin tool give you a 3.2 higher chance of being diagnosed with OSA. The outcome of the other tools is no different between those who do and those who do not have OSA.
Questionnaires for evaluation of OSA are used routinely as part of the sleep history in most sleep centers in the world. Due to the subjective nature of these questionnaires the effectiveness for identifying patients with OSA limits their clinical use. Efforts have been made to try and increase the specificity and sensitivity by modifying questionnaires in the past with marginal improvements.
There have been a number of studies which sought to assess the diagnostic accuracy of the various sleep questionnaires. El-Sayed et al.  looked at an Egyptian population of 234 patients and evaluated the sensitivities and specificities of the questionnaires. For moderate-severe OSA (AHI >15 and =/<30 n=177) they found the following STOP BANG (Sensitivity 97.74%, Specificity 3.7%), STOP (Sensitivity 94.35%, Specificity 25.93%), Berlin (Sensitivity 95.48%, Specificity 7.41%) and ESS (Sensitivity 75.7%, Specificity 48.15%). This population specific study’s results were similar to our own in that the Epworth score had the highest specificity but the lowest sensitivity.
Hsiao-Yean Chiu et al. in 2016  performed a meta-analysis to assess the diagnostic accuracy of the various sleep questionnaires according to severity of OSA. For moderate-severe OSA (AHI>15 and =/<30 n=177) they found the following STOP BANG (Sensitivity 90%, Specificity 36%), STOP (Sensitivity 89%, Specificity 32%), Berlin (Sensitivity 77%, Specificity 44%) and ESS (Sensitivity 47%, Specificity 62%). They found for studies which included a sleep clinic specific population there was a higher pooled sensitivity for the questionnaires. They found that the diagnostic odds ratio for the STOP BANG questionnaire was superior to the Berlin, ESS and STOP questionnaires.
We aimed to simplify the triage process by identifying independently identifiable factors which would improve diagnosis. We found a score of 3 or more on the Merlin tool helps independent diagnosis and increased the chance that a patient suspected of sleep apnea would have the condition with the need for treatment.
Limitations of our study are that the patients enrolled are already deemed high risk as they are attending a sleep clinic and referred for PSG, which is reflected in the high prevalence of OSA. The population size is also small and it would be interesting to see if using the Merlin tool in general practice would improve the triage of patients into the sleep clinic.
In a high risk population of patients referred for PSG we identified independently verifiable factors associated with OSA and only enrolled 65 out of every 100 of patients for PSG without an alarming increase in the number of missed cases. The other tools generally enrolled more than double the number of patients for PSG, resulting in a significant increased economic cost. A subsequent prospective conformation of our tool is currently being performed with the aim to maximize OSA identification while keeping costs down. Our current routine outpatient waiting time to be seen in a sleep clinic is approximately 18 months. With the implementation of the Merlin tool improved triage sleep referrals can be accomplished prioritizing those that will likely need treatment and reducing waiting times.
Citation: Scott A, Vellinga A, Geehan M, Ahmed M, Mulloy E, Gilmartin JJ (2020) Improving the Identification and Triage of Patients with Obstructive Sleep Apnea Who Require Treatment: The Merlin Tool for High Risk Populations. J Sleep Disord Ther 9:313. doi: 10.35248/2167-0218.104.22.1683
Received: 28-Feb-2020 Accepted: 03-Aug-2020 Published: 10-Aug-2020 , DOI: 10.35248/2167-0222.214.171.1243
Copyright: © 2020 Scott A, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited