Introduction

Several types of involuntary limb movements may occur during sleep. These movements may be isolated in time and occur in response to external triggers (e.g., noise and light) or seemingly spontaneously. Other limb movements may occur repetitively in series. Periodic limb movement during sleep (PLMS) is characterized by repetitive involuntary flexion/extension of the big toe and ankle, and occasional flexion of the knee and hip [1]. The American Academy of Sleep Medicine (AASM) defines a leg movement (LM) as an increase of ≥ 8 μV in the electromyogram (EMG) voltage of the anterior tibialis above its resting voltage with a duration of 0.5 – 10 s [1]. According to the AASM and the World Association of Sleep Medicine (WASM), to be qualified as a PLMS event, a single limb movement has to occur in a series of at least 4 events with regular intervals of 5 – 90 s [1, 2]. The etiology of PLMS is not fully understood, but available evidence suggests that the occurrence of PLMS is controlled by central nervous system (CNS) pacemakers, which govern rhythmic activities like locomotion and respiration. Dopaminergic dysfunctions may play a role in the occurrence of PLMS [3, 4]. The occurrence of PLMS is often associated with repeated arousals during sleep, which may lead to impaired sleep quality and daytime sleepiness [5, 6]. PLMS may be a side effect of using medications like antidepressants [7]. In addition, PLMS is frequently observed in patients with neurological disorders (e.g., Parkinson’s disease [8]) or other sleep-related disorders (e.g., restless legs syndrome and obstructive sleep apnea [OSA]) [2, 9].

OSA is a common sleep-related breathing disorder, which is characterized by recurrent episodes of apnea and/or hypopnea, resulting from complete or partial obstruction of the upper airway during sleep [10]. Repetitive limb movements in OSA may cause a diagnostic problem as they may not only be PLMS but could also be the consequence of respiratory events (viz., apnea, hypopnea, and respiratory effort-related arousal [RERA]). Therefore, in 1993, the American Sleep Disorders Association (ASDA) excluded LMs at the end of respiratory events from being considered as PLMS [11]. In 2006, the WASM first defined LMs that occurred from − 0.5 s to + 0.5 s relative to the end of respiratory events as respiratory-related leg movements (RRLMs) [12]. On the other hand, the AASM defined a time interval from 0.5 s preceding respiratory events to 0.5 s following the respiratory events for RRLMs [1]. However, both RRLM criteria were based on experts’ opinions. In a previous clinical study from Manconi et al., the probability of LM was found to increase during an interval of − 2.0 s to + 10.25 s around the end of respiratory events, rather than at the beginning or middle of respiratory events [13]. Based on this finding, the latest (2016) recommended WASM criterion is to score LMs that occur during a period between 2.0 s before the end and 10.25 s after the end of respiratory events as RRLMs [4]. The AASM still uses the original scoring criterion for RRLM [14] (Fig. 1).

Fig. 1
figure 1

Scoring criteria for RRLM. The black block is a respiratory event, and the red, green, and blue blocks are the time windows to score RRLMs based on the AASM, original WASM, and recommended WASM criteria, respectively. AASM = American Academy of Sleep Medicine; s = seconds; WASM = World Association of Sleep Medicine

Application of either the WASM or AASM criteria may directly affect LM-related diagnosis and treatment outcomes. The possibility has been suggested that, by using the original WASM and AASM scoring criteria for RRLM, the number of periodic leg movements (PLMs) in patients with OSA is considerably overestimated, while the number of RRLMs is considerably underestimated [13]. If a large amount of RRLMs is misrecognized as PLMs, a patient might be misdiagnosed with periodic leg movement disorder (PLMD). However, recommended treatments for PLMD (e.g., lifestyle changes [15] and medication [16, 17]) will not be effective in reducing RRLMs. On the other hand, the scoring of RRLMs may also overrule potential LM-related diagnoses, such as restless leg syndrome and PLMD, especially in patients with severe OSA. This is because that such patients present with many respiratory events during sleep and many adjacent LMs are consequently scored as RRLMs, even though in nature they may not be driven by respiratory events and just happen to co-occur with respiratory events [18]. To optimally distinguish RRLMs from other LMs, firstly, the temporal association between LMs and respiratory events needs to be further quantified, extending previously reported that LMs are more frequent at the end of respiratory events than at the beginning or middle of respiratory events [13, 19]. In addition, the recommended WASM criterion mainly focuses on LMs at the end of respiratory events, while the AASM criterion scores LMs during respiratory events as RRLMs. The quantitative difference in scoring RRLM between the two criteria is unclear. Therefore, the first aim of the present study was to describe the temporal association between LMs and respiratory events in OSA. We hypothesized that, in addition to the end of respiratory events, LMs can also be commonly observed at the beginning and during respiratory events. The second aim was to quantify the difference in scoring RRLM between the AASM criterion and the recommended WASM criterion. We hypothesized that, compared with the AASM criterion, more LMs will be scored as RRLMs based on the recommended WASM criterion.

Methods

Participants

Participants in this retrospective, observational study were selected from among adult patients (> 18 yrs) who visited the Department of Clinical Neurophysiology of the OLVG (Amsterdam, The Netherlands) between July 2020 and December 2021 for snoring and potential OSA. Included patients were those diagnosed with OSA by polysomnography (PSG; apnea-hypopnea index [AHI] > 5 events/hour), presented with > 10 LMs of any type per hour of sleep. All participants provided informed consent for using their data for scientific purposes. This study was approved by the local ethical committee (study number: WO 19.079).

Polysomnography

Ambulatory PSG recordings (SOMNOscreen Plus, Randersacker, Germany) were performed in accordance with the standard protocol of the hospital. All PSG recordings were manually scored according to the standard criteria (AASM, version 2.6, 2020) [14] by an experienced sleep technician to determine, amongst others, apnea, hypopnea, RERA, and LM. Standard scoring of RRLMs was based on the AASM criteria. LMs that occurred during a period from 0.5 s preceding respiratory events (apnea, hypopnea, or RERA) to 0.5 s following respiratory events were scored as RRLMs. For this study, we also scored RRLMs according to the recommended WASM criterion. To that end, data were exported and reanalyzed using R (version 4.0.5, Vienna University of Economics and Business, Vienna, Austria). A LM was scored as an RRLM if any part of the LM lies within the interval of − 2.0 s to + 10.25 s around the end of a respiratory event [13].

Visualization

Based on the exported PSG data, hypnograms were constructed using the R package ‘plotly’ (version 4.10.0) [20]. Hypnograms included sleep stage (viz., wake, non-rapid eye movement sleep stage 1–3 [N1-3], and rapid eye movement sleep [REM]), LM (viz., isolated leg movement [ILM; not body position-related], PLM, and RRLM), respiratory event (viz., apnea, hypopnea, and RERA), arousal, and sleep position (viz., upright, supine, left lateral decubitus position, right lateral decubitus position, and prone).

Quantification

For each respiratory event, the occurrence of LMs was quantified over three time windows relative to the respiratory event (viz., from 20 s [13] before the onset of respiratory event to the onset of respiratory event, from the onset to the end of respiratory event, and from the end of respiratory event to 20 s after the end of respiratory event). Hence, it was possible that a LM is counted both as a LM at the end of the preceding respiratory event and as a LM at the beginning of the following respiratory event. Notably, because a respiratory event can be either longer or shorter than 20 s, to ensure the comparability between the data from these three time windows, the duration of respiratory events and the number of LMs during respiratory events were normalized to 20 s. For example, for a 10 s respiratory event during which three LMs occurred, the duration of the respiratory event and the number of LMs during this respiratory event would be normalized to 20 s by multiplying by two. After normalization, the number of LMs during this respiratory event was six. Both the original data and the normalized data would be used for further statistical analyses. These procedures were repeated for each participant. In addition, for each participant, the number of RRLMs that were identified based on the AASM criterion and the recommended WASM criterion was calculated. All the above-mentioned procedures were performed using R.

Statistics

In descriptive statistics, nominal and ordinal data were presented as percentages. For continuous data, Kolmogorov–Smirnov test was used to check if the data was normally distributed. Normally distributed data were presented as mean ± standard deviation (SD). To better present the characteristics of the data in this study, non-normally distributed data were presented both as mean ± SD and as median (interquartile range; IQR). In the present study, as multiple respiratory events can be extracted from the recording of a single participant, a multilevel dataset was obtained, i.e., respiratory events (level 1) were nested within individuals (level 2). To adjust for the fact that multiple respiratory events from one participant were not independent of each other, generalized linear mixed model (GLMM) was used to investigate the difference between the number of LMs that occurred before, during, during (normalized), and after respiratory events. To quantify the difference in scoring RRLM between the AASM criterion and the recommended WASM criterion, paired t-test was used for normally distributed data, otherwise Wilcoxon test was used. Significance level was set as 0.05. Analyses were conducted using the IBM SPSS Statistics 27 software package (IBM Corp, Chicago, USA).

Results

A total of 88 eligible patients visited the Department of Clinical Neurophysiology of the OLVG between July 2020 and December 2021 for snoring and potential OSA. Of these 88 patients, 34 were excluded due to an AHI of less than 5, 19 were excluded due to less than 10 LMs/h, and 3 were excluded due to technically failed PSG. After exclusions, 32 participants were included in the present study. The characteristics of participants are shown in Table 1.

Table 1 The characteristics of participants

Temporal association between LMs and respiratory events

From the 32 participants, based on the AASM criteria, a total of 7,177 respiratory events were identified and there were 8,205 LMs that were time linked to these respiratory events. Of the 8,205 LMs, 3346 (41%) occurred before respiratory events, 1323 (16%) occurred during respiratory events, and 3536 (43%) occurred after respiratory events. In statistical analyses, it was found that LMs were significantly more frequent after respiratory events, followed by before respiratory events, and were rare during (both with and without normalization) respiratory events (P < 0.01; Table 2). It should be noted that the mean and median of LMs in Table 2 refer to each individual respiratory event. Although the difference in the number of LMs before and after each respiratory event seems minor (0.46 and 0.49, respectively), the difference in the total number of LMs before and after respiratory events can be major and significant when patients present with hundreds of respiratory events during sleep, and especially when also considering the high SDs, which are 0.67 and 0.69, respectively. In hypnograms, it can be observed that respiratory events often occurred in a series. As a consequence, a majority of LMs were between two consecutive respiratory events (Fig. 2).

Table 2 The difference between the number of leg movements before, during, and after a respiratory event
Fig. 2
figure 2

The occurrence of leg movements between two consecutive respiratory events. The upper panel shows a patient’s hypnogram. The two blue boxes a and b are two example time windows, during which leg movements are observed at the end of consecutive respiratory events. The example time windows are enlarged in the lower panel. CA = central apnea; ILM = isolated leg movement; LMA = leg movement arousal; MA = mixed apnea; N1-3 = non-rapid eye movement sleep stage 1–3; O2A = SpO2 arousal; OA = obstructive apnea; PLM = periodic leg movement; PLMA = periodic leg movement arousal; RA = respiratory arousal; REM = rapid eye movement sleep; RERA = respiratory effort-related arousal; RRLM = respiratory-related leg movement; SA = snoring arousal

Difference in scoring RRLM between the AASM criterion and the recommended WASM criterion

Compared with the AASM criterion, more LMs were scored as RRLMs based on the recommended WASM criterion (“New WASM RRLM” in Fig. 3). Most of the LMs that were newly identified as RRLMs based on the recommended WASM criterion were those between two consecutive respiratory events, which were scored as PLMs or ILMs based on the AASM criterion. On the other hand, a small number of LMs that occurred during respiratory events and were scored as RRLMs based on the AASM criterion were excluded from RRLMs based on the recommended WASM criterion (“Non WASM RRLM” in Fig. 3). Statistical analysis confirmed that, in general, compared with the AASM criterion, more LMs were scored as RRLMs based on the recommended WASM criterion (P = 0.01; Table 3). A point to note is that some respiratory events did not occur in a series and seemed isolated. In that case, adopting the recommended WASM criterion may hamper PLM quantification. Specifically, if one in a series of four repetitive limb movements is labeled as RRLM, the other three do not validate as PLMs anymore. An example of such case is shown in Fig. 4, where a PLM following an isolated respiratory event was identified as a RRLM based on the recommended WASM criterion. As a consequence, the three LMs adjacent to the new RRLM did not meet the criteria for PLMS and would be scored as three ILMs.

Fig. 3
figure 3

The difference in scoring RRLM between the AASM criterion and the recommended WASM criterion in cases where leg movements mainly occur between consecutive respiratory events. The upper panel shows a patient’s hypnogram. The two blue boxes a and b are two example time windows, during which many AASM criterion-identified PLMs and ILMs between two consecutive respiratory events are scored as RRLMs based on the recommended WASM criterion. The example time windows are enlarged in the lower panel. CA = central apnea; ILM = isolated leg movement; LMA = leg movement arousal; MA = mixed apnea; N1-3 = non-rapid eye movement sleep stage 1–3; Non WASM RRLM = leg movement that is scored as RRLM based on the AASM criterion but is excluded from RRLM based on the recommended WASM criterion; New WASM RRLM = leg movement that is not scored as RRLM based on the AASM criterion but is identified as RRLM based on the recommended WASM criterion; O2A = SpO2 arousal; OA = obstructive apnea; PLM = periodic leg movement; PLMA = periodic leg movement arousal; RA = respiratory arousal; REM = rapid eye movement sleep; RERA = respiratory effort-related arousal; RRLM = respiratory-related leg movement; SA = snoring arousal

Table 3 The difference between the number of RRLMs identified based on the AASM and the recommended WASM criteria
Fig. 4
figure 4

Adopting the recommended WASM criterion disrupts PLMS. The upper panel shows a patient’s hypnogram. The blue box a is an example time window, during which an AASM criterion-identified PLM is scored as a RRLM based on the recommended WASM criterion. As a consequence, the three adjacent AASM criterion-identified PLMs will be scored as three ILMs, because they do not fulfill the criterion to be scored as PLMs anymore. This example time window is enlarged in the lower panel. CA = central apnea; ILM = isolated leg movement; LMA = leg movement arousal; MA = mixed apnea; N1-3 = non-rapid eye movement sleep stage 1–3; Non WASM RRLM = leg movement that is scored as RRLM based on the AASM criterion but is excluded from RRLM based on the recommended WASM criterion; New WASM RRLM = leg movement that is not scored as RRLM based on the AASM criterion but is identified as RRLM based on the recommended WASM criterion; O2A = SpO2 arousal; OA = obstructive apnea; PLM = periodic leg movement; PLMA = periodic leg movement arousal; RA = respiratory arousal; REM = rapid eye movement sleep; RERA = respiratory effort-related arousal; RRLM = respiratory-related leg movement; SA = snoring arousal

Discussion

This study aimed to: i) describe the temporal association between LMs and respiratory events in patients with OSA; and ii) quantify the difference in scoring RRLM between the AASM criterion and the recommended WASM criterion. It was found that LMs were significantly more frequent after respiratory events, followed by before respiratory events, and were rare during respiratory events. In addition, compared with the AASM criterion, more RRLMs were identified based on the recommended WASM criterion.

The first finding of this study is in line with previous studies [13, 19] and is partially confirmed by the second finding. Specifically, compared with the AASM criterion, the recommended WASM criterion shortens the time window to score LMs before and during respiratory events as RRLMs, while it extends the time window to score those at the end of respiratory events as RRLMs. It turned out that, compared with the AASM criterion, significantly more RRLMs were scored based on the recommended WASM criterion, suggesting that LMs are more frequent at the end of respiratory events than before and during respiratory events, and that there is an association between respiratory events and LMs.

Given that respiratory events often occurred in a series, LMs were frequent between two consecutive respiratory events. It can be observed in the present study that, by using the recommended WASM criterion, most of LMs between consecutive respiratory events were scored as RRLMs. This is consistent with the authors’ opinion that when scoring RRLMs, all LMs between two consecutive respiratory events should be taken into consideration, especially in the absence of other periodic or patterned LMs during sleep. This is because it is not reasonable to score some of the LMs between two consecutive respiratory events as respiratory-related, while scoring the others as non-respiratory-related. This suggests a different pathogenesis between these LMs, but no factor other than the respiratory event seems to be related to the occurrence of LMs during the short and specific period of time between two consecutive respiratory events.

It is also possible that, in addition to RRLMs, there are other periodic or patterned LMs during sleep, especially during periods without respiratory events (e.g., Fig. 3). This suggests another potential pathogenesis behind LMs. To determine if and when the scoring of other periodic or patterned LMs, such as PLMs, should take priority over the scoring of RRLMs, or the other way around, the first question is whether or not there is a PLMS dominance. This means that, in some patients, compared with respiratory events, PLMS is a stronger or prioritized mechanism driving most or all LMs throughout the night. Thus, as suggested in previous studies that some PLMs may happen to synchronize with respiratory events [18], the occurrence of LMs is not related to respiratory events, even though some LMs may seem to be temporally related to respiratory events. This theorem is partially supported by previous findings that the number of PLMs increased or remained stable after successful OSA-related treatment [5, 21, 22], because OSA-related treatment cannot eliminate PLMs. However, in future studies, in addition to PLMS, which is a subdivision of LMs, attention should also be given to the change in the number of total LMs. Only when the number of total LMs remains stable after OSA-related treatment can it be confirmed that there is a PLMS dominance and OSA-related treatment does not affect LMs. In such studies, patients should be stratified based on AHI, which was reported to be associated with LMs [6, 23]. It is also because that, with the increase of AHI, respiratory events may play an increasing role in LMs and gradually become the dominant mechanism of LMs. In addition, it is also suggested to control for other LM-related factors, such as sleep stage, age, and gender [24, 25], which may further explain previous conflicting findings on the effect of OSA-related treatment on LMs, because even studies on patients with the same severity of OSA reported conflicting findings [21, 22]. If the presence of PLMS dominance is confirmed, further studies are needed to distinguish patients with PLMS dominance from others in order to facilitate treatment-related decision making. If no PLMS dominance is found and respiratory events always take priority over PLMS, then the authors would like to suggest to further delineate the time window to score RRLMs in a clinical evidence-based manner. This might be done by testing the effect of successful OSA-related treatment on the number of the recommended WASM criterion-based RRLMs and the number of PLMs and ILMs. If the number of PLMs and ILMs remains stable before and after successful OSA-related treatments, it means that most RRLMs are identified using the recommended WASM criterion. If not only the number of RRLMs but also the number of PLMs and ILMs decreased (not necessarily statistically significant) with successful OSA-related treatments, it suggests that many RRLMs are still mis-scored as PLMs or ILMs even based on the recommended WASM criterion. Then, further studies are needed to investigate whether LMs during respiratory events should also be scored as RRLMs. In this way, it can be demonstrated which LMs are truly respiratory-related and should be scored as RRLMs.

Previous studies found that LMs were frequent in the time window from -2.0 s to + 10.25 s around the end of respiratory events [13, 19]. In this study, we observed that many LMs during respiratory events were excluded from RRLMs based on this proposed time window. Although LMs during respiratory events are relatively rare, in future studies, attention should be given to the potential difference between LMs during respiratory events and those after respiratory events. Another point that needs to be discussed is arousal, which is imperative to RRLMs but was not taken into consideration in this study in order to keep this study focused. Previous studies found that RRLMs were more frequent when arousals were present, and the probability of RRLM doubled (from 26 to 64% [19] and from 23 to 53% [18], respectively) when an arousal was present at the end of a respiratory event. Another study on elderly men confirmed that RRLM% (viz., number of RRLMs/number of apneas and hypopneas) increased with the increase of arousal index [26]. However, it is still unclear whether arousals and RRLMs are directly associated with each other or whether their association is mediated by a shared provoking factor or mechanism. Further studies are needed to investigate the association among arousals, OSA events, and LMs, and to determine where the appropriate starting point is for addressing these complex and often co-existing issues.

This study has several limitations. First, to avoid excluding many participants due to insufficient LM and to avoid drawing conclusions based on chance events, 10 LMs of any type per hour of sleep was used as the best compromise for including participants in the present study. So, compared with previous studies [27, 28], this study included patients with more LMs and PLMs, and the findings may be limited to the current patient profile. Second, the effect of LM-related factors, such as age, gender, and arousal on the presence of LM was not investigated. Third, due to the reduced clinical capacity during the Covid-19 pandemic, only 32 participants were included in the study. The findings in this study should be confirmed in further studies with a larger sample size.

Conclusion

LMs are more frequent after respiratory events than before and during respiratory events, and more RRLMs may be scored based on the recommended WASM criterion than based on the AASM criterion. However, to facilitate the diagnosis and treatment of LM-related conditions, more studies are needed to define the appropriate time window for scoring RRLMs.