FMCW Radar-Based Measurement Principles
Figure 1 shows the system schematic for simultaneous video and radar vital sign monitoring. The FMCW device selected in this experiment was a mm-wave radar (IWR1843, Texas Instruments) with an operational frequency range of 77–81 GHz. Of the three transmitting and four receiving antennas on the radar, only the most closely spaced transmit/receive (TX/RX) antenna pair was used. Chirps are generated by a waveform generator and transmitted via a self-oscillation circuit, a mixer and a pre-amplifier to the transmit antenna. The received signals are passed through a low-noise amplifier, a low-pass filter, the digital signal processing unit and the analogue-to-digital converter (ADC) before further processing on the computer.
The principles of FMCW radar have been explained in detail in multiple studies9–12. Conventionally, FMCW radar processing performs a range FFT on sets of chirps to extract distance information and generate a range-bin map. In vital sign measurement, the amplitude of chest movement is 4–12 mm, while the amplitude of heart pulses ranges from 0.1–0.5 mm21. To resolve these small-scale movements, the phase shift \({\Delta }\phi\) between two consecutive chirp signals is calculated first.
The displacement \({\Delta }R\) of the object can then be calculated as:
\({\Delta }R=\frac{\lambda }{4\pi }{\Delta }\phi\)
(1)
where \(\lambda\) is the central wavelength of the radar.
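Equation (1) can be sketched directly in code. The 79 GHz centre frequency below is an assumption derived from the radar's 77–81 GHz sweep, chosen only for illustration.

```python
import numpy as np

C = 3e8  # speed of light, m/s

def phase_to_displacement(delta_phi, f_centre=79e9):
    """Eq. (1): displacement from the phase shift between consecutive chirps."""
    lam = C / f_centre           # central wavelength (~3.8 mm at 79 GHz)
    return lam / (4 * np.pi) * delta_phi
```

A full 2π phase wrap corresponds to λ/2 ≈ 1.9 mm, so millimetre-scale chest motion spans multiple wraps while sub-millimetre heart pulses remain well within one, which is why unwrapping precedes the rate estimation.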
As shown in Fig. 2a, the primary processing step is phase unwrapping on the phase term of a set of chirps to recover the correct phase information. A 20-second sliding window is then applied to compute vital signs second by second, as shown in Fig. 2b. As in previous studies, phase randomness and spike noise are removed by computing the phase difference and applying energy-based thresholding11,12.
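The unwrap, difference and spike-removal steps can be sketched as below; the 3-sigma amplitude threshold is an illustrative choice, not the exact energy threshold of refs. 11 and 12.

```python
import numpy as np

def preprocess_phase(raw_phase, spike_factor=3.0):
    """Unwrap the chirp-to-chirp phase, difference it, and suppress spikes.

    spike_factor * std is an illustrative stand-in for the
    energy-based threshold used in the cited studies.
    """
    phase = np.unwrap(raw_phase)           # remove +/- pi wrap discontinuities
    dphi = np.diff(phase)                  # differencing suppresses slow drift
    thresh = spike_factor * np.std(dphi)   # amplitude threshold for spikes
    dphi[np.abs(dphi) > thresh] = 0.0      # zero out spike noise
    return dphi
```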
Afterwards, a digital fourth-order Butterworth filter is applied in the 0.2–0.6 Hz band for respiration rate and the 1–4 Hz band for heart rate extraction, respectively. The resulting waveforms represent the heartbeat and breathing patterns in the data segment, as shown in Fig. 2c.
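The two band-pass stages can be sketched with SciPy. The 20 Hz slow-time sampling rate is an assumption for illustration, and zero-phase `filtfilt` is used here as a convenient stand-in for whatever causal implementation the pipeline uses.

```python
import numpy as np
from scipy.signal import butter, filtfilt

def band_filter(x, low, high, fs, order=4):
    """Fourth-order Butterworth bandpass (applied zero-phase via filtfilt)."""
    b, a = butter(order, [low, high], btype="bandpass", fs=fs)
    return filtfilt(b, a, x)

fs = 20.0                                   # assumed slow-time sample rate, Hz
t = np.arange(0, 20, 1 / fs)                # one 20-second window
x = np.sin(2 * np.pi * 0.3 * t) + 0.3 * np.sin(2 * np.pi * 1.2 * t)
breathing = band_filter(x, 0.2, 0.6, fs)    # respiration band
heartbeat = band_filter(x, 1.0, 4.0, fs)    # heart band
```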
As breathing and cardiac cycles tend to be periodic, the vibration frequency can be extracted by applying a second FFT. The data is zero-padded with three times its own length (to four times the original size) to provide more data points in the resulting spectrum. Conventionally, the largest spectral magnitudes of the resulting FFT spectrum correspond to the heart rate (HR, heartbeat frequency) and breathing rate (BR, breathing frequency). However, due to spectral noise caused by motion disruption, the largest magnitude may not always produce the correct heart rate estimate. Here we propose a novel image segmentation-based method to address this issue.
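The conventional peak-picking baseline described above can be sketched as:

```python
import numpy as np

def dominant_rate_bpm(x, fs):
    """Largest spectral peak after zero-padding with 3x the data size."""
    n = 4 * len(x)                                  # original length + 3x zeros
    spectrum = np.abs(np.fft.rfft(x - np.mean(x), n=n))
    freqs = np.fft.rfftfreq(n, d=1 / fs)
    return 60.0 * freqs[np.argmax(spectrum)]        # Hz -> BPM
```

For a clean 20-second segment this returns the correct rate; it is exactly the estimator that becomes unreliable under motion disruption, motivating the segmentation-based alternative below.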
Video-based Measurement Principles
Intensity-based face video vital sign extraction stems from photoplethysmography (PPG), a common and low-cost optical technique. Since light is more strongly absorbed by blood than by the surrounding tissues, the periodic changes in blood flow can be detected by PPG sensors as changes in the intensity of light22. Thus, the subtle changes in light intensity on human skin can be captured by a digital camera. The face-video vital sign extraction process is illustrated in Fig. 2.
It has been shown that facial regions around the forehead and cheeks tend to be more reliable for cardiovascular pulse signal extraction23. Areas close to the eyes and mouth are less suitable as they are likely to be affected by facial muscle movement17. Breathing pattern signals are most reliable around the chest area, corresponding to ventilation movement. As shown in Fig. 2a, the signals for HR and BR are extracted from a manually selected face region-of-interest (ROI) and a box ROI on the chest area, respectively.
First, spatial averaging is employed to improve the SNR of the raw signal containing cardiovascular pulse information and enhance the subtle colour changes5–8: the pixel values of each colour channel in the selected ROI are averaged for each video frame to overcome sensor and quantisation noise. Unlike conventional PPG-based devices that use near-infrared light, a colour camera captures the blood volumetric variation in three colour channels. Because haemoglobin and oxyhaemoglobin in blood both absorb more strongly in the green channel than in the red channel24, the green channel is selected to deliver the optimal SNR, as shown in Fig. 2b. The same 20-second sliding window is applied.
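The spatial-averaging step can be sketched as follows; the `(T, H, W, 3)` RGB frame layout and the ROI tuple are assumed conventions for illustration.

```python
import numpy as np

def green_channel_trace(frames, roi):
    """Spatially average the green channel inside an ROI, frame by frame.

    frames: array of shape (T, H, W, 3) in RGB order (assumed).
    roi:    (top, bottom, left, right) pixel bounds of the selected region.
    """
    top, bottom, left, right = roi
    patch = frames[:, top:bottom, left:right, 1]    # green channel only
    return patch.astype(float).mean(axis=(1, 2))    # one sample per frame
```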
The obtained signal waveform comprises "AC" components that are synchronous with each heartbeat and breath. The slowly varying "DC" baseline corresponds to subtle changes in illumination and head motion, present even in a strictly controlled environment. Thus, a detrending filter is required to suppress the low-frequency, non-stationary trends of the raw signal10,13. This filter is effectively a high-pass filter with negligible latency. A moving-average filter is then applied to remove random noise caused by sudden light-intensity changes or motion in the frame sequence.
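The detrend-then-smooth stage can be sketched as below. SciPy's linear `detrend` is a simple stand-in for the detrending filter of refs. 10 and 13, and the 5-sample moving-average window is an illustrative choice.

```python
import numpy as np
from scipy.signal import detrend

def clean_trace(x, win=5):
    """Remove the slow 'DC' baseline, then smooth frame-to-frame noise.

    Linear detrending here approximates the cited detrending filter;
    win is an assumed moving-average length.
    """
    x = detrend(x)                         # suppress slow baseline drift
    kernel = np.ones(win) / win            # moving-average smoothing
    return np.convolve(x, kernel, mode="same")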
Finally, the same Butterworth bandpass filters described in the FMCW radar signal processing section are applied to generate the heart and breathing waveforms, as shown in Fig. 2c. The HR and BR oscillations are the most periodic components in their frequency bands, so they can be located as the most prominent power magnitudes in the spectrum after applying the FFT. As with the FMCW radar, motion disruption and changes in illumination cause the spectral peaks to lack continuity and accuracy in video-based HR and BR measurement.
Graph-based Image Segmentation Estimation Method
This study applies the STFT to the acquired 20-second data segments for both the camera and FMCW radar measurements. After converting the frequency unit from Hz to BPM, the spectra from the data segments form an STFT spectrogram that visualises the vibration signal strength, as shown in Fig. 3. Applying an image segmentation algorithm to the STFT spectrograms compensates for disruptions and reduces the impact of external factors compared with the conventional signal processing method.
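The spectrogram construction can be sketched with SciPy's STFT; the 20 Hz sampling rate and the Hann window that `stft` uses by default are assumptions for illustration.

```python
import numpy as np
from scipy.signal import stft

def bpm_spectrogram(x, fs, win_sec=20, step_sec=1):
    """STFT magnitude spectrogram with the frequency axis in BPM."""
    nper = int(win_sec * fs)                        # 20-second window
    hop = int(step_sec * fs)                        # one new column per second
    f, t, Z = stft(x, fs=fs, nperseg=nper, noverlap=nper - hop)
    return f * 60.0, t, np.abs(Z)                   # Hz -> BPM
```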
The estimation method we adopted is graph-based image segmentation19,20. The method was originally developed for segmenting retinal layers in cross-sectional Optical Coherence Tomography (OCT) images19. As the algorithm generalises to segmenting layered structures, it is an ideal method for extracting the vital signs in STFT spectrograms.
To summarise the segmentation method: each spectrogram is treated as an image whose pixels are represented as the nodes of a graph, connected by edges. To cut the graph into segments, a route between designated start and end nodes is found by assigning weights to the edges.
In the literature, graph weights are often derived from both the geometric distance and the intensity difference between graph nodes25. However, due to zero-padding in the FFT and filtering, transitions between adjacent pixels in the spectrograms are smooth, allowing us to consider only the intensity difference for weight calculation. Notably, the intensity of a pixel corresponds to its spectral magnitude. In the spectrograms, the signals to be extracted are layer-like and primarily horizontal. Therefore, the intensity difference \({w}_{xy}\) can be represented by the vertical gradients of the image:
\({w}_{xy}= 2-\left({g}_{x}+ {g}_{y}\right)+{w}_{min}\)
(2)
where \({g}_{x}\) and \({g}_{y}\) are the vertical gradients of the image at nodes \(x\) and \(y\), respectively, \({w}_{xy}\) is the weight assigned to the edge between nodes \(x\) and \(y\), and \({w}_{min}\) is the minimum weight of the graph, added for system stabilisation.
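Equation (2) translates directly to code. Normalising the gradients to [0, 1] is an assumed convention carried over from the OCT formulation, not stated explicitly here.

```python
def edge_weight(g_x, g_y, w_min=1e-5):
    """Eq. (2): edge weight from the vertical gradients at nodes x and y.

    With gradients normalised to [0, 1] (an assumption), weights range
    from ~w_min on strong layer boundaries up to 2 + w_min in flat
    background, so the shortest-path search favours the layer.
    """
    return 2.0 - (g_x + g_y) + w_min
```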
An optimal path is formed when the sum of the assigned weights is minimal, which in this case is found using Dijkstra's algorithm26. The path is passed through a median filter and can then be taken as the extracted vital sign. Since the spectrograms are likely to contain a single-layered structure after the pre-extraction processing, no search-region limitation is necessary. The segmentation-based method can be applied to spectrograms generated by both the radar and the camera to extract vital sign readings. The processing flow is demonstrated in Fig. 4.
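The whole segmentation step can be sketched end to end: vertical gradients, Eq. (2) weights on left-to-right neighbour edges, Dijkstra's algorithm between virtual start/end nodes, then a median filter. The neighbourhood wiring (each pixel connects to its three right-hand neighbours) and the virtual-node connections follow the OCT formulation only loosely and are illustrative.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import dijkstra
from scipy.signal import medfilt

def extract_layer(img, w_min=1e-5):
    """Minimum-weight left-to-right path through a spectrogram (sketch)."""
    rows, cols = img.shape
    g = np.gradient(img.astype(float), axis=0)      # vertical gradient
    g = (g - g.min()) / (np.ptp(g) + 1e-12)         # normalise to [0, 1]
    src, dst, data = [], [], []
    for r in range(rows):
        for c in range(cols - 1):
            for dr in (-1, 0, 1):                   # three right-hand neighbours
                r2 = r + dr
                if 0 <= r2 < rows:
                    src.append(r * cols + c)
                    dst.append(r2 * cols + c + 1)
                    data.append(2.0 - (g[r, c] + g[r2, c + 1]) + w_min)
    n = rows * cols
    start, end = n, n + 1                           # virtual start/end nodes
    for r in range(rows):                           # cheap entry/exit edges
        src += [start, r * cols + cols - 1]
        dst += [r * cols, end]
        data += [w_min, w_min]
    G = csr_matrix((data, (src, dst)), shape=(n + 2, n + 2))
    _, pred = dijkstra(G, indices=start, return_predecessors=True,
                       directed=True)
    node, path = pred[end], []
    while node != start:                            # backtrack the optimal path
        path.append(node // cols)                   # keep the row per column
        node = pred[node]
    return medfilt(np.array(path[::-1], dtype=float), kernel_size=3)
```

On a spectrogram, the returned row indices per time column map directly to frequency (and hence BPM) values, giving one vital sign reading per second.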