Kurihara 1 Yoshihiro Maeda 2 Daisuke Sugimura 3 Takayuki Hamamoto 1 1 Tokyo University of Science 2 Tokyo Metropolitan University 3 Shibaura Institute of Technology ICIP 2024 TP1.L5.5
attention because of its convenience ⚫ Principle: color change analysis on the facial videos • Blood volume pulse (BVP) associated with the cardiac pulse causes skin color changes → BVP signal and HR can be estimated Non-contact HR estimation using cameras 1 HR: 68 bpm RGB video Temporal variations G B R BVP signal Blood vessel Skin Cardiac pulse Analysis time→
is low signal-to-noise ratio 1. BVP signal is subtle signal (typically less than 2 bits) 2. Large noise due to fluctuations of ambient illumination Problem of video-based HR estimation 2 Camera Ambient light 1. Subtle BVP 2. Illumination noise Illumination Small amplitude Large amplitude BVP
[1] • BVP has quasi-periodic temporal characteristics → BVP can be modeled using conservative systems • BVP propagates over the facial region with the same timing → BVP can be modeled as spatially similar Previous method 1: Physiological modeling of BVP 3 video observation time Temporal characteristic Cardiac pulse Spatial characteristic BVP propagation spatial [1] Kurihara+, Spatio-Temporal Structure Extraction of Blood Volume Pulse Using Dynamic Mode Decomposition for Heart Rate Estimation, Access 2023
invisible NIR light → Stable NIR signals independent to illumination noise ⚫ Flexible integration of RGB/NIR videos based on background illumination analysis [2] → Achieve robustness in a various illumination scene Previous method 2: Ambient illumination modeling 4 RGB video NIR video Controlled illumination RGB video NIR video Uncontrolled illumination [2] Kurihara+, Non-Contact Heart Rate Estimation via Adaptive RGB/NIR Signal Fusion, TIP 2021
Overview of our algorithm 6 NIR video RGB video Flexible integration based on illumination model RGB/NIR weight calculation patch group NIR HR likelihood computation based on physiological model RGB HR likelihood computation based on physiological model Integrated HR likelihood HR posterior at 𝜏-1 HR posterior at 𝜏 MAP estimation State transition
signals: project onto the chrominance space → Mitigate influences of the noise components Obtaining multiple observation signal 7 RGB video RGB Observation RGB signal ←patch ←patch Chrominance analysis[6] B R G NIR video NIR Observation ←patch time→ time→ time→ patches
• Assess whether RGB or NIR signals is more reliable [2] » Uncontrolled scene : High correlations → NIR is reliable because of stable NIR light » Controlled scene : Less correlations → RGB is reliable because of skin and blood characteristics Flexible RGB/NIR integration: overview 11 Face Back Controlled : RGB is reliable Face Back Uncontrolled : NIR is reliable [2] Kurihara+, Non-Contact Heart Rate Estimation via Adaptive RGB/NIR Signal Fusion, TIP 2021
and background regions • In uncontrolled scene: coefficient becomes high RGB facial signal is difficult to estimate HR → Assign more weight on NIR likelihood Flexible RGB/NIR integration: compute flexible weight 12 RGB likelihood (priority in controlled) NIR likelihood (priority in uncontrolled) Flexibly integrated likelihood RGB video G face G back Pearson Correlation Correlation analysis × ×
HR posterior using integrated HR likelihood and HR prior based on particle filter framework HR estimation based on MAP inference 13 HR HR posterior at 𝜏 − 1 HR posterior at 𝜏 HR prior at 𝜏 State transition RGB/NIR integrated HR likelihood Apply Bayes' theorem MAP estimation