1. Introduction
Swallowing is a reflectional process that requires effective coordination among respiration, tongue pressure, submental muscle reaction, and pharyngeal movement on the swallowing path. The swallowing path is typically divided into three stages: oral, pharyngeal, and esophageal. In the oral stage, food (or liquid) is formed into a bolus, and the extrinsic and intrinsic muscles of the tongue constrict to push the bolus toward the pharynx. In the pharyngeal stage, respiration is briefly inhibited and the vocal folds close with the contraction of the submental muscles to prevent the bolus from entering the oral cavity and airway. In the esophageal stage, the bolus descends into the esophagus, and respiration is resumed. Numerous diseases, such as neurological disease, neuromuscular disorder, chronic indigestion disorder, gastroesophageal reflux disease, or cancer of the head and neck, might impair this coordination and cause swallowing dysfunction [
1,
2]. A typical symptom of swallowing dysfunction is residual food or liquid on the swallowing path, necessitating additional swallowing. This symptom may occur at any stage of the aforementioned diseases and lead to sensorimotor disorder, choking, aspiration, or potential complications. If the swallowing dysfunction is not treated adequately, those complications may deteriorate and result in dehydration, malnutrition, choking injuries, aspiration pneumonia, or even death [
3].
Tongue pressure is a crucial factor in the initial stage of the swallowing path. Tongue pressure results from the tongue pressing against the hard palate, contributing substantially to bolus formation and manipulation as well as the safe transporting of food from the oral cavity to the pharynx [
4,
5]. Insufficient tongue pressure may lead to poor masticatory performance and a deterioration or loss of safe and smooth swallowing [
6,
7]. Tongue pressure is also a good predictor of the presence of oral-stage swallowing impairment [
8]. It has been proved that neurogenic disorders, such as a stroke or Parkinson’s disease, can lead to deficits in the sensory and motor functions of the tongue [
9]. This is the major cause of oral and pharyngeal residues. Therefore, sufficient tongue pressure becomes a determining factor for safe swallowing.
Related studies have proposed several methods to evaluate tongue pressure. Yano et al. [
10] used a specific air ball device (TPM-01, JMS Co., Hiroshima, Japan) to investigate the effect of tongue-strengthening exercise on the suprahyoid muscles. When using this device, the participants were required to use the anterior part of the tongue to press against the hard palate. Fukuoka et al. [
11] measured tongue pressure by using a pressure sheet (Swallow-SCAN, Nitta, Osaka, Japan) attached to the palate. Efficient swallowing, however, relies on more than only a single factor; effective coordination among the tongue, nasal, oral, and pharyngeal structures is essential. Therefore, developing a multisensor approach for monitoring the correlation between tongue pressure and swallowing function is necessary.
Current standard approaches of swallowing function assessment are based on optical devices. The most widely used technique is video fluoroscopic swallowing study (VFSS) [
12,
13]. The VFSS device uses X-ray video to monitor the structural swallowing events, especially those related to the hyoid bone and thyroid cartilage movements. However, the device can only detect the laryngeal motions, and it has the risk of radiation exposure. Another method is fiberoptic endoscopic evaluation of swallowing (FEES) [
14]. During FEES, a fiberoptic endoscope is inserted through the nasal cavity to the pharynx to obtain images of the swallowing process. It is an invasive assessment tool that carries some risk of injury for patients and may cause discomfort. Both VFSS and FEES must be performed in a specific lab or hospital, which obviously results in time and space limitations. Patients with poor mobility who require swallowing assessment through these two methods frequently face almost insurmountable challenges. Moreover, tongue pressure is difficult to measure through optical approaches.
Practitioners tend to prefer swallowing assessments that employ noninvasive sensors, resulting in a test that is more convenient, safe, and free of radiation. Li et al. [
15,
16] evaluated biomedical coordination during oropharyngeal swallowing by using a noninvasive sensing system consisting of a tongue pressure sensor sheet between the tongue and hard palate, surface electromyography (sEMG) electrodes on the surface of the submental muscles, and a bend sensor with a microphone on the throat. These researchers synchronized all data to identify the temporal coordination among those structures. Their studies, however, only addressed healthy male participants. Furthermore, they only examined the correlation of muscle activity with tongue pressure during swallowing, thus lacking an analysis of the correlation of tongue pressure with nasal or oropharyngeal parameters. Murakami et al. [
7] investigated the correlations between tongue pressure, hyoid movement, and suprahyoid muscle activity. They proposed a T-shaped sensor sheet to measure tongue pressure with sEMG and VFSS. Their study, however, only addressed the oral strategies (e.g., how to squeeze the gels) but not the swallowing path. Other related studies [
17,
18,
19] have used multisensors to investigate the coordination between swallowing and respiration, but they have not considered tongue pressure. Since tongue and pharyngeal organs play crucial roles in bolus propulsion, understanding disorder of the individual organs involved in oral cavity and pharyngeal swallowing is not enough to ascertain the condition of swallowing impairment. It is necessary to explore the time-based correlation of the movements among the various organs.
The goal of this study was to integrate tongue pressure measurement into a noninvasive, multisensor method and evaluate the correlation of this measurement with other swallowing functions. Toward this goal, an air bulb was placed between the anterior tongue and the hard palate to detect the tongue pressure. The air pressure in the bulb was measured to obtain the force of the tongue squeezing the bulb during swallowing. The air bulb was used to reduce discomfort in swallowing compared with the use of a pressure sheet adhered to the hard palate. Moreover, the pressure of the air bulb was collected synchronously with other sensors, including those for nasal air flow, submental muscle sEMG, and thyroid cartilage excursion, to reveal the correlations among them on the swallowing path. A multisensor analytic model was also proposed to measure all swallowing parameters from the sensor waveforms such that a physician can monitor swallowing patterns in real time. This noninvasive, multisensor model is particularly suitable for the bedside assessment of swallowing function.
A total of 39 healthy young and middle-aged individuals participated in this study to evaluate the effectiveness of the proposed approach. Each participant was instructed to swallow 1, 3, 5, and 10 mL of water. The data of all swallowing parameters were recorded synchronously for correlation analysis. The results indicated that, of all the parameters, tongue pressure has the strongest correlation with the submental muscles.
3. Experimental Results
In this section, the verification results of the autodetection program are first presented. The detection accuracy of the proposed method is then verified by comparing the detected time of each event with that obtained through a manual approach performed by two clinical physicians monitoring the signal waveform. Subsequently, the effects of the factors (i.e., sex, age, and water volume) that might affect swallowing behavior are reported for each sensor. Finally, the correlation between the tongue pressure and the other sensors in measuring the swallowing parameters is revealed.
Three-way analysis of variance (ANOVA) was performed to examine the main effect and correlations of the factors (i.e., sex, age, and water volume) on the swallowing parameters, namely sEMG duration, nasal airflow duration, laryngeal excursion movement, and tongue pressure. Pearson’s coefficients were also calculated to identify the strength of correlations between any two parameters. SPSS version 25 (IBM, SPSS Inc., Chicago, IL, USA) was used for all statistical analyses. The level of significance was set to p < 0.05. All the data tested in this study were based on nonparametric statistics.
3.1. Verification of the Autodetection Program
Table 4 summarizes the error rates of the autodetection program compared with the manual detection results for each sensor in the tests with various volumes of water. The results indicated that the proposed method had the lower error rates (all smaller than 1%) in sEMG detection. The error rates increased evidently for the nasal airflow and FSR sensors, ranging from 0.9% to 7.46%. This increase was attributed to the waveforms for nasal airflow and FSR being more complicated than those for the other two signals, making detection more difficult. Nevertheless, the verification results confirmed that the proposed method is sufficiently adaptable to trace different waveforms with acceptable error rates.
3.2. Results of sEMG
Figure 9 reveals the comparison of the sEMG onset time (
TsEMG) by sex and age. The male participants exhibited a slightly longer
TsEMG than did the female participants (
Figure 9a). By contrast, a marked difference was observed between the young and old groups for all volumes of water, with the participants in the old group having a longer
TsEMG on average (
Figure 9b). The longer sEMG onset time indicated that the participants spent more time in the oral stage pushing the water backward to the larynx. ANOVA was used to evaluate which factor (i.e., sex, age, or water volume) had a main effect on
TsEMG (
Table 5). For each single factor, only age presented an obvious difference (i.e.,
p = 0.047), whereas the other two factors did not. Further evaluation of the interaction effects between different combinations of two factors confirmed that the age factor affected sEMG performance with different volumes of water (i.e.,
p = 0.007 for Age*Volume), whereas the sex factor did not. This finding indicates that age has a more critical effect on swallowing behavior than do the other factors, according to the sEMG measurement.
3.3. Results of Nasal Airflow
Figure 10 reveals the results of the nasal airflow pause duration (
TNAir) by sex and age. The male participants again exhibited a longer respiration pause than did the female participants (
Figure 10a); however, a similar longer duration was not observed in those in the old group (
Figure 10b). The young and the old groups presented highly similar results. ANOVA results are provided in
Table 6. The sex factor was confirmed to have a main effect on
TNAir (
p = 0.017), but this effect was not observed for the age factor (
p = 0.172). This result is inconsistent with those obtained for sEMG measurement, where age had a main effect.
The volume of water was another factor that had a main effect on
TNAir (
p = 0.002;
Table 6). In
Figure 10a,b, the length of
TNAir increased with the volume of water. This is to be expected because a larger volume of water requires a longer airflow pause for safe swallowing. The only exception was the 1 mL water, which had a pause duration highly similar to that observed for the 3 mL volume. This is reasonable because most participants used greater force for a longer period to swallow a small volume of water.
3.4. Results of FSR
The thyroid cartilage movement presented a W-shaped waveform. Thus, in this section, the total excursion time (TTC1 + TC2) is evaluated first. Subsequently, the durations of the first phase (TTC1) and the second phase (TTC2) of the waveform are examined to determine which one dominated the total excursion time.
A. Thyroid cartilage total excursion time (TTC1 + TC2)
Figure 11 illustrates the total excursion time of the thyroid cartilage movement for each volume of water by sex and age. A clear difference was observed between the male and female participants swallowing 3 mL and 5 mL of water, and the female participants had a shorter total excursion time (
Figure 11a). For the 1 mL and 10 mL volumes, the male and female participants required approximately equal time to swallow the water. By contrast, in terms of age, those in the old group spent more time on the 1 mL, 3 mL, and 5 mL tests on average than did those in the young group (
Figure 11b). Only in the 10 mL test did the young group spend slightly longer swallowing than did the old group.
Table 7 reports the ANOVA results for each factor. Sex, age, and water volume had a major effect on the total excursion time, but these factors had no interaction effects.
B. Thyroid cartilage first phase and second phase duration (TTC1, TTC2)
Figure 12 reveals the results of the first phase duration (
TTC1) in the thyroid cartilage excursion. The sex and age factors did not result in significant differences in any of the tests. The ANOVA results in
Table 8 confirm this result; no factors had main effects or interaction effects with other factors on
TTC1.
Nevertheless, the second phase duration (
TTC2) of the thyroid cartilage excursion differed evidently between the sexes and between the age groups (
Figure 13). The male participants and those in the old group had a longer second phase duration on average, as shown in
Figure 13a,b, respectively. The ANOVA results in
Table 9 confirm that the durations of
TTC2 differed significantly between the sexes, age groups, and water volumes. This finding indicates that the second phase duration (
TTC2) clearly dominated the total excursion movement of the thyroid cartilage.
3.5. Results of Tongue Pressure
A. Tongue pressure onset duration (Ttongue)
Figure 14 presents a comparison of the results of the tongue pressure onset duration (
Ttongue) by sex and age. The male participants on average required an evidently longer onset duration than the female participants did in all tests (
Figure 14a). This finding indicates that the male participants required more time to swallow the water during the oral stage. The same result was noted among those in the old group (
Figure 14b). Those in the old group exhibited an even longer tongue pressure onset time than did the young group for safe swallowing. The ANOVA results in
Table 10 support this result; significant differences were observed between the sexes and between the age groups. For different water volumes, however, significant differences were not observed between the sexes or between the age groups.
B. Tongue pressure peak value (Ptongue)
Figure 15 illustrates a comparison of the tongue pressure peak value (
Ptongue) by sex and age. The tongue pressure peak values differed considerably from the tongue pressure onset durations in that the male participants (
Figure 15a) and those in the old group (
Figure 15b) exhibited lower peak values than did the female and young participants, respectively. This finding may be attributed to a compensation mechanism in which those with lower tongue strength take longer when swallowing to ensure that the water is swallowed smoothly. ANOVA revealed significant differences between the sexes and between the age groups (
Table 11).
3.6. Correlations between Tongue Pressure and the Other Sensors
Finally, the correlation between tongue pressure and other parameters was evaluated. Pearson’s correlation was used to examine the strength of the relationship between tongue pressure onset duration (Ttongue) and sEMG onset duration (TsEMG), nasal airflow pause duration (TNAir), and thyroid cartilage excursion duration (TTC1+TC2). Pearson’s r values are reported, and the correlation was considered strongly positive if 0.5 < r < 1. The p values of the ANOVA test were used to evaluate how well the result rejected the null hypothesis—that is, that no relationship exists between the two compared parameters. The results were considered statistically significant when p < 0.05.
Table 12 illustrates the comparison results of tongue pressure onset duration with other parameters. The results revealed a strong positive correlation between the tongue pressure onset duration (
Ttongue) and the sEMG duration (
TsEMG), with
r values ranging from 0.532 to 0.717. Regarding the ANOVA, the
p values between
Ttongue and
TsEMG were smaller than 0.01 in all tests, confirming the strong positive correlation. However, Pearson’s test revealed no obvious correlation between the tongue pressure onset duration (
Ttongue) and nasal airflow pause duration (
TNAir) or thyroid cartilage excursion duration (
TTC1 + TC2) parameters (
Table 12).
Figure 16 displays the grouped scatter plots of the parameter
Ttongue with the other three parameters (i.e.,
TsEMG,
TNAir, and
TTC1+TC2).
4. Discussion
Swallowing involves numerous physiological reactions, and thus, the coordination between the parts of the swallowing process is crucial. Many studies have addressed the coordination of the swallowing process by measuring laryngeal movement, respiration, and submental muscle movement. Multisensor approaches have become an emerging trend because such tests are noninvasive, radiation free, and convenient [
15,
16,
17,
18,
19]. This study used an air bulb to integrate the tongue pressure measurement into a multisensor model. A slope-based autodetection program was designed to help physicians mark the events and measure the parameters for each signal. The verification results revealed that the poorest error rate of the proposed method was less than 5.5% (
Table 4). The correlation of the tongue pressure, as well as the onset duration, with other swallowing parameters, including the nasal airflow pause duration, submental muscle sEMG onset time, and thyroid cartilage excursion time, was also investigated. Factors affecting swallowing, namely sex, age, and volume of water, were also evaluated to verify the sensitivity of the proposed multisensor model.
The goal of developing the autodetection method is to help the physicians to identify the onset events of each signal precisely in real time for clinical or bedside swallowing assessment. To the best knowledge of the authors, there are few studies addressing the autodetection issue. Therefore, the verification of the autodetection was performed via manual marking in this study. To ensure the correctness of the manual judgement, two experienced clinical physicians participated the verification and the average of two results were used as the comparison basis. Moreover, by the design of the sliding window scanning, a small variation of the signal could be filtered out without affecting the detection. If a large variation happens to change the trend of the signal and cause a misdetection, the cursor function can help the physicians to fix the error immediately. The experimental results showed that the autodetection can effectively help the physicians to screen the signals in low error rates. According to the previous studies [
17,
18,
19], 5 mL to 10 mL is a suitable range for swallowing. Thus, most subjects can swallow 5 mL water within stable signals, and as a result, have the lower error rates for autodetection.
The experimental results revealed that both sex and age had significant influences on pharyngeal stage swallowing ability, whereas sex had a greater impact on the nasal airflow pause duration and the thyroid cartilage excursion time; moreover, age had a greater impact on the sEMG onset time and the thyroid cartilage excursion time. These findings are supported by those of a related study [
23]. In addition, a failure of bolus propulsion results in residue in the oral cavity and pharyngeal organs. Laryngeal penetration and risk of aspiration are frequently seen, not only among patients with impaired swallowing, but among the elderly population as well [
24]. The current study expanded on the findings of the aforementioned study by revealing that both sex and age factors affect the oral-stage swallowing, including the tongue pressure onset duration and the tongue pressure peak value. The present results indicated that the male participants had slower tongue movements because of their longer tongue pressure onset durations and smaller tongue pressure values than those of female participants; furthermore, the same effects were observed between those in the old and young groups, where the old participants had longer tongue pressure onset durations and smaller tongue pressure values.
The influence of water volume was also investigated in this study. Sayaca et al. noted that exercise that involves swallowing different volumes of water prevented pulmonary complications due to aspiration and improved the quality of life in older individuals [
25]. The current study further investigated which parameters are affected by the volumes of water. The results illustrated that the nasal airflow pause duration and the thyroid cartilage excursion time increased with the volume of water swallowed, whereas the sEMG onset time and the tongue pressure onset duration did not exhibit such obvious increases. Another major finding in this study is that swallowing 1 mL of water requires a sEMG onset time, nasal airflow pause duration, tongue pressure onset duration, and tongue pressure peak value equal to or even longer than those required when swallowing 3 mL or 5 mL of water. These findings may help physicians to design water swallowing training strategies with different volumes that are tailored to a patient’s requirements.
In this study, the signal waveform was translated into a sequence of state transitions for correlation analysis. Each transition from a different state can be considered a change of the trend from the signal. Therefore, marking the appearing time of each state transition can help the proposed autodetection program to identify the onset, offset, or the turnaround time on the waveforms. The time-based parameters as well as the correlations among different signals can be measured accordingly.
Further critical results were obtained by evaluating the correlations between the tongue pressure and other swallowing parameters. The results revealed that the tongue pressure onset duration and peak value had strong positive correlations with the submental muscle sEMG. This is because when swallowing, hyoid bone movement causes the tongue to contract. Simultaneously, the submental muscles act to assist in pushing the bolus backward to the larynx. This coordination plays an essential role in the oral-stage swallowing. This finding can help physicians to develop a substitute measurement based on tongue pressure for patients in which measurement of the submental muscles is unfeasible because of oropharynx lesions or disease. On the other hand, the results showed that the tongue pressure onset duration did not have strong positive correlations with the thyroid cartilage excursion duration, or the respiration pause duration. While drinking a liquid substance or swallowing food bolus, the tongue plays another important role to seal the posterior oral cavity in holding the water or bolus temporally. After the water leaves the elevation of the tongue, the smooth swallowing relies more on pharyngeal constrictor contractions, thyroid cartilage movements, and respiration pause as well, than the tongue muscles. This is why no strong correlations can be observed between the onset durations of the tongue and the thyroid cartilage or the nasal airflow.
The usage of an air bulb distinguishes this study from related studies [
4,
6,
7,
8,
9,
10,
11,
26], which have employed flat resistive or capacitive pressure sensor sheets. The air bulb consisted of soft rubber materials without sharp edges or corners that may damage the oral structure. The air bulb also did not hamper the swallowing of water in the test. However, a problem regarding the air bulb size was observed: only a single size bulb was obtained and used in this study. For participants with a smaller oral cavity, the bulb size may limit the testing volume of water. The usage of different bulb sizes may be considered in future work.
Table 13 illustrates the comparison of the air bulb method with other sensor sheet methods. This study also addressed only the swallowing of room-temperature water. Foods with diverse consistencies, such as jelly or pudding, may be considered for testing participants’ chewing and swallowing ability with semisolid food.