1. Introduction
While pulse oximetry is ubiquitous in medicine and consumer health, the strict foundations for this methodology are not well established. As such, it relies mostly on empirical methods, which is particularly true for remote photoplethysmography (rPPG).
The typical approach in pulse oximetry is to use two wavelengths and calculate the oxygenation using the so-called ratio-of-ratios,
R.
Here,
AC and
DC refer to the amplitude of the AC component and the DC components of the measured signal; 1 and 2 refer to different wavelengths. Typically, the wavelengths are selected in red in the spectrum’s near-infrared (NIR) ranges, where oxyhemoglobin and deoxyhemoglobin have very different absorptions. For example, 660 nm and 940 nm are commonly used in commercial pulse oximeters [
1].
The dependence of peripheral oxygen saturation (
SpO2) on the “ratio-of-ratios”
R in pulse oximetry is typically modeled using the linear relationship (see, for example, [
1])
Here, C1 and C2 are factors determined using a calibration procedure.
The same approach is used in rPPG pulse oximetry. For example, Humphrey et al. [
2] observed pulse waveforms at 760 and 880 nm on 10 healthy volunteers and concluded that they could be used to extract SpO2 remotely. Shao et al. [
3] found that the use of orange (611 nm) and NIR (880 nm) provides the best SNR for remote PPG pulse oximetry. Verkruysse et al. [
4] showed that a single universal calibration curve with acceptable spread between individuals could be achieved by using 660 nm (red) and 840 nm (NIR) light. Much effort is dedicated to using only the visible spectrum range for remote pulse oximetry, as it allows for much simpler setups, like smartphone cameras. For example, Moco and Verkruysse [
5], in addition to red (675 nm) and NIR (840 nm), used green (580 nm) as a potential substitution for NIR lights. They extracted the arterial blood oxygen saturation (SpO2) of 46 healthy adults. They found that SpO2 can be calibrated under controlled conditions with red and green light, but the accuracy is less than that of SpO2, as estimated in the usual red–NIR window.
However, we have identified a critical gap in the current knowledge. Namely, there is no sound analytical justification for an ability to extract tissue oxygenation in rPPG geometry.
In particular, while the use of “ratio-of-ratios” is typically explained using the Beer–Lambert model, oxygenation cannot be derived directly from physical and physiological considerations of light absorption in oxy- and deoxyhemoglobins, based on the Beer–Lambert law [
6]. In particular, the ability to extract oxygenation is based on the primary assumption that the optical pathlengths for both wavelengths are identical. While it may be a reasonable assumption for traditional (transmissive mode) pulse oximeters [
7], it is definitely not the case in remote PPG, where pathlengths linearly depend on the penetration depths [
8], which are very different for red and infrared light [
9,
10]. Therefore, in practice, even transmissive mode commercial pulse oximeters use an empirical relationship, where the relationship between
R and
SaO2 is determined experimentally for each type of pulse oximeter sensor by calibration.
This lack of accurate models can be partially attributed to the fact that even the origin of observed pulsations is far from clear [
4]. There are multiple contradictory theories, from traditional volumetric (absorption-based) to scattering-based, which assume rouleau formation and disintegration during the cardiac cycle [
11].
This uncertainty is particularly obvious for remote PPG (rPPG), which has a very shallow sampling depth. In particular, while contact pulse oximetry uses spatially resolved measurements, which sample tissues at different depths, the rPPG is an imaging geometry that samples primarily the epidermis and the papillary dermis.
This paper aims to lay the foundations for the theoretical framework, which explains the possibility of extracting tissue blood oxygenation in rPPG geometry. The primary contribution of the article is the model, which directly links the peripheral oxygen saturation with the experimentally measurable value (ratio-of-ratios) in rPPG geometry. In addition, we explored the possibility of assessing the origin of the pulsating volume from experimental data.
2. Materials and Methods
Photoplethysmographic techniques assess changes in the blood volume caused by pulse propagation. We will refer to the variable blood volume in microcirculation during the cardiac cycle as a pulsatile volume. More specifically, we will denote an excess blood volume over diastolic volume as a pulsatile volume V. Generally, V can be expressed as volume per vessel (μm3). In this case, the distribution of the pulsatile volumes can be characterized by surface density ρ (L/mm2). Alternatively, V can be expressed as volume per unit of skin surface area, (μm3/mm2). Moreover, V depends on time V(t). However, we will skip the time dependence for compactness.
2.1. Analytical Model
Let us consider the following volumetric model of blood pulsations in rPPG geometry: A single pulsatile volume,
V, is located at a certain depth,
Z. This pulsatile volume impacts light propagation. To account for this impact, we can consider a tissue with diastolic blood distribution as a base-case scenario approximated by a homogenous blood distribution. In addition to that, pulsatile blood volumes are present. As these volumes are small, we can consider them perturbations and follow the perturbation approach developed by Saiko et al. [
12]. This approach can be summarized as the following: Firstly, we find the light distribution for the homogeneous semi-infinite space. Then, we consider a light-absorbing heterogeneity with excessive absorption coefficient
δμa and volume
V, buried at some depth
Z as a perturbation. We represent this heterogeneity as a negative point source and look for the light distribution caused by this negative light source. The overall light distribution will be the sum of homogeneous and point-source-induced contributions.
In particular, if the inhomogeneity is located at (0,0,
Z), then the fluence rate at any point on the surface of the tissue surface (here, we assume cylindrical coordinates) in the presence of mismatched boundary can be found as follows [
12]:
Here, φ(Z) is the fluence rate for the homogeneous media (unperturbed solution), r is the distance on the surface of the tissue from the projection of the defect to the surface (), , ,, and , where r10 is the coefficient of reflection of diffuse light on the border of tissue and air.
Unlike [
12], which focused on detecting a single or double inhomogeneity, we are interested in the combined effect of multiple pulsatile volumes distributed homogeneously in the dermis. Thus, we can change perspective, select an observation point (which, for convenience, can be at the beginning of coordinates), and sum contributions from all heterogeneities in the tissue to the fluency rate at this point.
Here, ρ is the 2D inhomogeneity density in the plane.
After fairly straightforward integration by applying
u =
r2,
a +
u ->
u, and
substitutions, we will obtain
As the pulsatile volume
V changes in time, Equation (5) is directly associated with an
AC component of the rPPG signal. As such, we can try to estimate the
AC/
DC ratio, which is routinely used in PPG measurements. We notice that the
DC component will have a homogeneous fluence rate at the surface
φ(0). Thus, the
AC/
DC ratio can be found as
Here, φ(0) and φ(Z) are homogeneous fluence rates at the surface and the pulsatile volume’s depth, accordingly. A light-absorbing defect is characterized by excessive absorption coefficient δμ, maximum pulsatile volume, Vmax, depth Z, and surface density ρ.
Wide Beam Diffuse Illumination
The homogeneous fluence rate within the tissue depends on the illumination. For example, the homogeneous fluence rate can be found using a diffuse approximation for collimated and diffuse illumination.
For a practical case of wide beam diffuse illumination
(see, for example, [
13]). Thus, for certain practical applications, Equation (6) can be simplified into
Consequently, the “ratio-of-ratios”, which is used for blood oxygen saturation calculations, can be written as
Subindexes 1 and 2 refer to different wavelengths.
In linear approximation on
μeffh, we will have a much simpler expression.
Note that R in Equations (8) and (9) explicitly (and quite strongly) depends on Z. Thus, Equations (8) and (9) can potentially be used to extract the depth of pulsations from experimental data.
2.2. Simulations
To verify the model, we performed simulations using Equation (8). The following parameters were used.
2.2.1. Absorption
In the absence of melanin, the absorption of the bloodless tissue can be modeled as the background absorption of human flesh [
14]:
where
(mm
−1). Here, the wavelength λ is measured in (nm).
In the presence of blood (dermis), the absorption of the dermis can be modeled as a combination of background-, oxyhemoglobin-, and deoxyhemoglobin-related absorption.
Here,
c is the blood volume fraction in the dermis,
SO2 is the tissue oxygen blood saturation, and HbO2 and RHb refer to oxyhemoglobin and deoxyhemoglobin, respectively. The absorption coefficients for oxyhemoglobin- and deoxyhemoglobin are wellknown [
15]. Blood typically occupies around 0.4% of the physical volume of the papillary dermis [
16].
However, the tissue oxygen blood saturation will be average between arterial and venous compartments. Thus, Equation (10) can be rewritten taking into account relative volumes of the arterial and venous compartment: νa and νv (νa + νv = 1)
SaO2 and
SvO2 are the arterial and venous compartment oxygenations, respectively. Assuming no accumulation of blood into the papillary plexus, the relative blood volume will be approximately equal between compartments, ν
a = ν
v = 1/2. Thus, we can write
where
2.2.2. Scattering
The reduced scattering coefficient for the dermis and epidermis also follows the power law [
17]
, with
k = 1.3. With this power law, we can set a reference value at a particular wavelength and simulate its dependence on the wavelength. In particular, we can assign values at 633 nm [
18] for the living epidermis (
μs’ = 9 mm
−1) and reticular dermis (
μs’ =5 mm
−1), which represent the bulk of the tissue in healthy epidermis and dermis, respectively. Assuming normal skin, where the stratum corneum is thin, we can ignore the presence of the stratum corneum. Thus, we can write
.
2.2.3. Other Parameters
The refractive index of the tissue depends on the wavelength. However, this dependence is small; we will ignore it in our calculations. As such, the refractive index was set to n = 1.42. The depth of the pulsating volume was set in the papillary dermis (Z = 0.5 mm).
The model depends on two variables:
SaO2 and
SvO2. To simplify interpretation, the assumption was made that the oxygen’s constant fraction (0.3) is extracted during gas exchange in capillaries irrespective of initial oxygenation. Thus,
As a result, all obtained results depend on a single variable, SaO2.
2.2.4. Test Scenarios
While it is typically assumed that the pulsatile volume is located on the arterial side, there is no definitive justification. While the arterial side is characterized by the larger changes in the blood pressure, it is also characterized by much smaller compliance than the venous compartment. Thus, it is hypothetically possible that the pulsatile volume is located in the venous compartment.
Thus, to understand the origin of the photoplethysmographic signal, it is necessary to estimate the depth of the pulsations and the compartment where these pulsations occur: arterial or venous. Thus, we have simulated two scenarios: (a) the pulsatile volume is located on the arterial side, and (b) the pulsatile volume is located on the venous side.
In the first scenario, the pulsatile volume is characterized by the arterial oxygenation
SaO2. Thus,
In the second scenario, the pulsatile volume is characterized by the venous oxygenation
SvO2. Thus,
4. Discussion
We derived an explicit analytical relationship connecting blood oxygenation with rPPG geometry’s ratio-of-ratios. We also fitted the obtained function with a linear function. Our first finding was that a linear function in the considered oxygenation range (SaO2 = 70–100%) provides an extremely good fit (R2 = 0.98–0.99) for all considered R/IR pairs.
We can compare our results with experimental data from the literature. In particular, Verkruysse et al. [
4] calibrated rPPG oximetry with illumination at 660 and 840 nm by video recordings of the foreheads of 41 healthy adults subjected to normoxic, hypoxic, and low-temperature conditions. They obtained a
SO2 = 118 − 45.9
R fitting function. These values are very close to our out-of-the-box prediction for
C1 and
C2 for the 660/840 pair 119.1 and 41.8, respectively (see
Figure 1A).
We have analyzed the sensitivity of the
C1 and
C2 to model parameters. The model is insensitive to changes in volume blood fraction c and refractive index n. However, it is very sensitive to the depth of the pulsatile volume
Z. In particular, for
Z = 0.6 mm, for
C1 and
C2, we obtained 119.2 and 45.2, respectively, which is very close to values estimated by Verkruysse et al. [
4]. Depending on the epidermis thickness, this depth can be in the papillary dermis, subpapillary vascular plexus, or upper portion of the reticular dermis. However, we can estimate more accurately as measurements were taken from the forehead. For example, Jeong et al. [
19] found that the epidermal thickness on the forehead is 0.334 ± 0.157 mm. As the thickness of a papillary dermis is approximately 300–400 μm [
20], we can conclude that the pulsatile volume is located in the lower part of the papillary dermis or subpapillary vascular plexus.
Another conclusion from this experimental comparison is that pulsatile volume most likely resides on the arterial side. In particular, the venous side’s origin would be characterized by a much smaller coefficient
C2 (see
Figure 1B vs.
Figure 1A).
Based on these estimations, the pulsatile volume is located in the arterial compartment of the papillary dermis’ lower part or subpapillary vascular plexus. Potentially, it can be an upper portion of ascending arterioles. However, a subpapillary vascular plexus origin is much more likely as it most likely has the largest compliance.
The shortcoming of the proposed model is that it does not account for the impact of the stratum corneum and skin tone. In particular, there is growing evidence that skin tone may impact the accuracy of pulse oximetry measurements [
21]. Similarly, the stratum corneum thickness may impact the accuracy of pulse oximetry measurements. In particular, while the thickness of the dermis is quite uniform across all body parts, the epidermis thickness varies between glabrous and non-glabrous skin [
22], primarily due to the differences in stratum corneum thickness. Moreover, the stratum corneum thickness can be even higher in corns and calluses. However, we excluded epidermis effects (skin tone and stratum corneum thickness) from consideration as semi-quantitative analysis shows that their impact is nonmaterial. In particular, we can write the ratio-of-ratios
R as
Here,
I1 and
I2 refer to the intensity of the red and infrared light on the sensor, operator Δrefers to changes between systole and diastole. Then, using the Beer–Lambert law for multilayer tissue (
); here,
Li is the pathlength in the
i-th layer), we can write
Here, μa,1,i, μa,2,i and Δμa,1,i, Δμa,2,i are absorption coefficients for red and IR wavelengths in the i-th layer and changes in absorption coefficient between diastole and systole for red and IR wavelength in the i-th layer, accordingly. Similarly, L1,i, L2,i and ΔL1,i, ΔL2,i are the mean optical path for red and IR wavelengths in the i-th layer and changes in the mean optical path between diastole and systole for red and IR wavelengths in the i-th layer, accordingly. Thus, mean optical paths (MOP) are MOP1 = ∑L1,i and MOP2 = ∑L2,i.
From Equation (17), we can see two types of contributions. In the first term in the nominator and denominator, one can expect that only tissue layers where Δμa,1,i and Δμa,2,i (pulsatile volume) are non-zero contribute to R. The second term in the nominator and denominator accounts for the change in the mean optical path between diastole and systole.
As the surface layers (stratum corneum and living epidermis, where melanin is synthesized) do not contribute to the first terms and are unlikely to have a significant contribution to the second terms, one would expect that surface layers should not provide a meaningful contribution to Equation (17). Thus, it should not have a noticeable impact on the ratio-of-ratios.
R. However, it does not hold true if the surface layer significantly impacts the passage of light to the dermis (either strong scattering in thick stratum corneum or strong absorption in the melanin layer). In this case, the light does not sample the pulsatile volume, and pulse oxygenation cannot be estimated altogether. Alternatively, the signal is so weak (and noisy) that accurate oxygenation estimation is impossible. For example, it was estimated that oxygenation cannot be accurately extracted for calluses thicker than 1.5 mm [
23]. Another effect of the thickening of the epidermis layer is an increase in the pulsatile volume depth,
Z, and a corresponding increase in the
C2 coefficient (see Equation (A6) for details). Thus, rPPG oxygenation measurement should be taken from skin areas with similar epidermal thicknesses once calibrated. Considering these corner cases, we can conclude that the epidermis should not impact our analysis for light skin tone and not very thick stratum corneum (e.g., non-glabrous skin). However, in the general case, a thorough analysis is required.
In summary, we can conclude that our simple estimation gave us quite realistic values. However, further analysis of the model’s sensitivity to different model parameters is required. In future work, we also plan to validate our results with Monte Carlo simulations, which are a de facto gold standard in tissue optics.