Introduction

Esophageal cancer is the eighth most frequent cancer worldwide and the sixth leading cause of cancer-related deaths [1]. Radical resection of the esophagus with lymph node dissection has been the mainstay of treatment for locally advanced esophageal cancer [2]. This invasive treatment causes serious postoperative complications in nearly 60% of patients, including anastomotic leakage, strictures, and vocal fold immobility due to recurrent laryngeal nerve damage, pneumonia, and dysphagia [3]. Dysphagia and impaired swallowing function increase the risk of pneumonia [4] and mortality [5]. Thus, adequate management of dysphagia is crucial to achieve better health outcomes and quality of life after esophagectomy.

Clinically, dysphagic symptoms can be assessed using a videofluoroscopic swallowing study (VFSS) [6] and fiberoptic endoscopic examinations of swallowing (FEES) [7]. Previous research has reported aspiration in up to 49% [8,9,10,11] and residue in the vallecula and pyriform sinus in 45% and 80% of the patients postoperatively [12]. Furthermore, kinematic analyses of the postoperative VFSS [13, 14] demonstrated reduced displacement of the hyoid bone [8, 15, 16], reduced laryngeal elevation [9, 17], delayed swallowing reflex [8], and decreased opening of the upper esophageal sphincter [15]. Among these abnormalities, delayed swallowing reflex [8], reduced laryngeal elevation [8, 17], and vocal fold immobilit y [11, 18] are associated with postoperative aspiration.

Postoperative dysphagia has, however, been subject to several limitations in previous studies. First, VFSS was not performed preoperatively [12]. Thus, it is unclear whether the reported postoperative dysphagia was a newly developed complication of the surgical intervention or a worsening of the preexisting age-dependent decline in swallowing function, namely, presbyphagia [18, 19]. In addition, VFSS was not repeated in the long term. Thus, it is also unknown whether the reported swallowing impairment improves and if patients can return to a regular diet over time [18]. Second, most previous studies have reported swallowing symptoms, such as aspiration or pharyngeal residue only, and have not analyzed swallowing functions comprehensively using a standard tool, such as the Modified Barium Swallow Study Impairment Profile (MBSImP™) [6]. Furthermore, few studies have examined the relationship between pathological VFSS findings and postoperative aspiration pneumonia [9]. Thus, the impact of the identified swallowing abnormalities after esophagectomy on the development of pneumonia is uncertain.

This study longitudinally analyzed the changes in MBSImP scores, swallowing kinematic measurements, swallowing-related symptoms, and diet levels in patients undergoing esophagectomy. We then examined the association between the swallowing pathophysiology identified in the VFSS and aspiration pneumonia after esophagectomy.

Methods

Setting

This retrospective observational study was conducted at a teaching hospital in Tokyo, Japan. Written informed consent was obtained from all participants. The institutional review board approved the study protocol (Ref. No. 2018016NI).

Participants

We used the data of consecutive patients who underwent curative esophagectomy for esophageal cancer between June 2017 and January 2020. We included patients who completed the laryngoscopic examination and VFSS at three time points: before, two weeks, and three months after surgery.

VFSS Protocol

Otolaryngologists performed VFSS in the lateral projection with the patient upright at 1/30 s intervals before, two weeks after, and three months after the surgery. A hypoosmotic, nonionic, iodinated contrast agent (Iohexol; Omnipaque , Daiichi Sankyo Co., Ltd. Tokyo, Japan) was delivered via a syringe. A length reference marker (coin, 20.0 mm in diameter) was attached under each participant’s chin to calibrate the VFSS measurements. VFSS was performed using different volumes and consistencies of liquid contrast agents. According to the clinical decision of the examiner upon observing penetration-aspiration events, the evaluation of the different volumes and consistencies was discontinued to maximize patient safety and reduce the risk of adverse effects related to VFSS, since the materials could not be ejected by cough. VFSS recordings were archived as movie files in a hospital data storage system. In this study, we copied the movie files of the participants and de-identified them to allow subsequent analyses to be blinded. Although different volumes and consistencies of the test food were used for clinical VFSS, we only analyzed the data from the 5 mL of moderately thickened liquid swallows. The 5 mL moderately thickened liquid contrast agent (150–300 mPas) was swallowed in the head neutral position with no compensatory strategies [20] as all patients completed this trial. There was a considerable amount of missing data for thin liquids, following the decision to stop the trials for swallowing safety.

Data Collection

Patient Characteristics

We captured the participants' demographics, disease status, and treatment characteristics through a chart review of the hospital’s electronic medical records. Preoperative nutritional status was determined using Onodera's prognostic nutritional index (PNI: 10 × albumin concentration [g/dL] + 0.005 × total lymphocyte count per mm3), which has been used to predict surgical outcomes of esophageal cancer [21]. Aspiration pneumonia was diagnosed by attending physicians based on a combination of infiltrates on chest radiography, clinical pneumonia symptoms, and laboratory criteria [22]. We analyzed the medical records during a follow-up period of three months after surgery to determine if aspiration pneumonia had occurred.

Longitudinal Data

VFSS: The MBSImP

An MBSImP-certified speech-language therapist (SLT) scored the MBSImP to assess overall swallowing impairment. We excluded bolus preparation/mastication from the oral impairment (OI) components because we used only a liquid test bolus. The OI score was calculated as the sum of five of the six original components. We also excluded pharyngeal contractions from the Pharyngeal Impairment (PI) components because we did not perform the VFSS in the anteroposterior view in all patients. Thus, the PI score was the sum of nine of the ten original components.

VFSS: Swallowing Kinematic Measurements

We performed one temporal[13] and three displacement measurements[14] to analyze swallowing kinematics using a software application (Swallowtail, Belldev Medical, Arlington Heights, IL, USA). The following four measurements were collected:

  • The period between the time when the bolus arrives at the pharyngoesophageal segment (BP1) and the first onset of hyoid movement leading to swallowing (H1) [13]

  • The maximum displacement of the hyoid during swallowing (Hmax) [14]: The difference between the hyoid position at rest, when the bolus is in the anterior oral cavity and before any posterior movement of the bolus occurs, and the position where the hyoid is maximally elevated relative to the anterior corner of the second and fourth cervical vertebrae

  • Maximum opening of the pharyngoesophageal segment (PESMax) [14]: The most significant expansion of the site designated as the PES (narrowest gap in the lateral view of the upper esophagus between C4 and C6) during swallowing

  • Pharyngeal Constriction Ratio (PCR) [23]: The ratio of the area of the pharynx at rest and when it is maximally constricted.

VFSS: Swallowing Findings

Two assessors, the SLT and a physiatrist, independently judged the participants' levels of penetration/aspiration using the penetration-aspiration scale [24] (PAS score: 1: normal; 8: material enters the airway and passes below the vocal folds, and no effort is made to eject it). The raters were blinded to the participants' information and timing of the recorded VFSS assessment (before, two weeks after, or three months after surgery). Disagreements between the two assessors were resolved through discussion. The PAS scores were then categorized into three levels: regular (PAS score ≤ 2), penetration (3 ≤ PAS score ≤ 5), and aspiration (6 ≤ PAS score ≤ 8) [25]. The timing of penetration or aspiration was recorded before, during, and after swallowing [26]. We extracted the pharyngeal residue score from the MBSImP to determine the severity of pharyngeal residue. A score of 0 indicated “complete pharyngeal clearance,” and a score of 4 indicated “minimal to no pharyngeal clearance.”

Vocal Fold Immobility

We examined records of laryngoscopic examinations performed by otolaryngologists before, two weeks after, and three months after surgery. Vocal fold mobility was categorized as normal, unilateral, or bilateral.

Diet Levels

We examined the dietary levels of the participants before, two weeks after, and three months after surgery. Diet levels were graded using the ten-point Food Intake Level Scale (FILS; Level 1: no swallowing training except for oral care; Level 10: no dietary restriction, and the patient ingests three meals orally) [27]. The FILS scores were then categorized into three levels: no oral intake (FILS level ≤ 3), oral intake and alternative nutrition (4 ≤ FILS level ≤ 6), and oral intake alone (7 ≤ FILS level ≤ 10) [27].

Statistical Analyses

Statistical analyses were performed using SAS (version 9.4; SAS Institute, Inc. Cary, NC, USA). Descriptive statistics were performed for continuous (means and standard deviations) and ordinal (medians and ranges) variables for patient demographic data. Categorical variables were presented as counts and percentages. The Friedman and post hoc Dunn–Bonferroni tests were performed to compare the ordinal repeated measurements across the three time points in the MBSImP, PAS, and FILS scores. The repeated measures of ANOVA with Bonferroni post hoc analyses were performed to analyze the longitudinal changes in VFSS kinematic measurements parametrically. The non-parametrical Wilcoxon’s rank sum test was performed to determine the association between postoperative aspiration pneumonia, pathological MBSImP scores, and swallowing kinematic measurements. The Fisher’s exact test was performed to determine the association between vocal fold immobility and aspiration pneumonia. A p-value of less than 0.05 was considered statistically significant.

Results

Participant Characteristics

Fifty-one patients underwent subtotal esophagectomy during the study period. Twenty-nine patients completed the laryngoscopic examination and the VFSS before, two weeks after, and three months after surgery and, thus, were included in this study. Twenty-two patients were excluded from this study as they did not complete the laryngoscopic examination and VFSS at the three time points. The majority of the study participants underwent subtotal esophagectomy, followed by reconstruction with a gastric tube conduit and cervical anastomosis. The posterior mediastinal route was usually used for the reconstruction. All the participants provided informed consent.

Table 1 displays the participant characteristics. The mean age of the participants was 64.4 ± 9.0 years. The mean preoperative PNI was 49.3 ± 5.4. Even though the procedures for subtotal esophagus resection varied, the sternohyoid and sternothyroid muscles were divided in all patients. The participants were nil per os for an average of 10.2 ± 7.0 days. Nine participants (31.0%) developed aspiration pneumonia after surgery. Pneumonia events occurred between 2 to 19 days postoperatively.

Table 1 Participant characteristics (n = 29)

Longitudinal Analyses

VFSS: The MBSImP

Table 2 shows the distribution (median and interquartile range) of the MBSImP scores for swallowing 5 mL of a thickened liquid of 29 participants at three time points: before, two weeks after, and three months after surgery. The global test demonstrated that the Oral Impairment Score was stable across the three time points (p = 0.28), and the Pharyngeal Impairment Score changed significantly over time (p <  0.0001). Table 3 shows the post hoc pairwise analyses of the MBSImP components with significance in the global test. The Pharyngeal Impairment Score decreased two weeks after surgery (p = 0.0002) but returned to the baseline level three months after surgery (p = 0.08). A similar trend was observed for tongue base retraction and pharyngeal residue.

Table 2 The MBSImP component scores before, two weeks after, and three months after esophagectomy (n = 29)
Table 3 The Post-hoc pairwise analyses of MBSImP scores with significance in the global test (n = 29)

VFSS: Swallowing Kinematic Measurements

Table 4 shows the changes in the kinematic measurements for a 5 mL thickened liquid swallowed by 29 participants at the three time points. The global test demonstrated that the maximum hyoid displacement (p < 0.0001) and PCR (p = 0.0068) significantly changed over time. Table 5 displays the post hoc pairwise analyses of the kinematic measurements. The timing measure (H1-BP1) and maximum PES opening remained stable. The PCR increased after surgery (p = 0.0016) and returned to preoperative levels after three months (p > 0.99). Hyoid displacement significantly decreased after surgery (p < 0.0001) and did not return to baseline after three months (p = 0.0016).

Table 4 The kinematic measurements before, two weeks after, and three months after esophagectomy (n = 29)
Table 5 Post-hoc pairwise analyses of kinematic measurements (n = 29)

VFSS: Swallowing Findings

Table 6 shows the changes in swallowing findings regarding symptoms before and after surgery. None of the participants showed penetration or aspiration before surgery. Two of 29 participants (6.9%) showed penetration, and four of 29 participants (13.8%) demonstrated aspiration two weeks after surgery. Penetration remained in three participants (10.3%), and no participant presented with aspiration three months after surgery. As for the timing of airway invasion, all nine penetration and four aspiration events observed in the three time points occurred “during” swallowing. All the penetration and aspiration events occurred when the participants’ maximum laryngeal elevation seemed to be achieved. Table 7 shows the results of the pairwise analysis of the swallowing findings. No significant differences in the PAS were found across the groups.

Table 6 Changes in longitudinal data (n = 29)
Table 7 Post-hoc pairwise analyses of longitudinal data

The pharyngeal residue in the MBSImP increased significantly two weeks after surgery (p = 0.0003) but returned to the baseline level three months after surgery (p = 0.07, Table 3).

Vocal Fold Immobility

All the participants showed normal vocal fold movements before surgery. Twenty of 29 participants (69.0%) presented with unilateral vocal fold immobility, and seven of 29 (24.1%) developed bilateral vocal fold immobility two weeks after surgery. Three months after surgery, symptoms remained unilaterally and bilaterally in 13 (44.8%) and 3 participants (10.3%), respectively (Table 6). Pairwise analysis indicated that vocal fold mobility significantly worsened two weeks after surgery (p < 0.0001; Table 7) and did not return to the baseline level three months after surgery (p = 0.003).

Diet Levels

Before surgery, 96.6% of participants were on a full oral diet. Of the participants, 48.3% achieved a full oral diet two weeks after surgery; all participants returned to a full oral diet three months after surgery (Table 6). The drop in diet levels was significant two weeks after surgery (p = 0.0143) but returned to baseline levels three months after surgery (p > 0.99; Table 7).

Swallowing Pathophysiology and Postoperative Aspiration Pneumonia

Table 8 demonstrates the association between pneumonia status and the significant declines in MBSImP or kinematic measurements two weeks after surgery. Increased pharyngeal residue in the MBSImP group after surgery was associated with postoperative aspiration pneumonia (p = 0.05), while postoperative vocal fold immobility was not associated with aspiration pneumonia (p = 0.18).

Table 8 The association between the significant postoperative decline and aspiration pneumonia

Discussion

This study delineated the longitudinal changes in swallowing physiology, dysphagic symptoms, and diet levels in patients undergoing esophagectomy. This study identified an association between pharyngeal residue and aspiration pneumonia after esophagectomy.

First, our preoperative VFSS assessment revealed that the swallowing functions of patients with esophageal cancer were intact before esophagectomy despite the assumption that presbyphagia [19] or sarcopenic dysphagia [28,29,30] may already exist owing to advanced age or malnutrition in patients with esophageal cancer preoperatively [18]. The study participants were younger (mean age = 64.4 years) than those with presbyphagia, which typically develops after 80 years of age [31]. In addition, they had a better nutritional status (mean PNI 49.3) than patients with sarcopenia preoperatively (mean PNI 46.0) [31]. Thus, preoperatively older or less-nourished patients may still be at risk of symptomatic or asymptomatic swallowing deficits, which could be exacerbated after surgery. Therefore, preoperative VFSS assessment remains valuable for identifying patients with baseline dysphagia and elevated risk of postoperative dysphagia.

Second, our longitudinal assessment demonstrated recovery of swallowing impairments after esophagectomy. Initiation of swallowing and PES opening were unchanged after surgery. Tongue base retraction and PCR worsened two weeks after surgery but returned to baseline levels three months after surgery. However, once impaired after surgery, hyoid displacement during swallowing and vocal fold immobility did not fully recover after three months.

The temporary decline in tongue base retraction and pharyngeal squeezing indicated by PCR were unexpected changes after surgery, as the surgical procedure involved in esophagectomy or lymphadenectomy does not typically affect these swallowing mechanisms. However, as previously reported, it is possible that exposure to physical stress during invasive surgery and prolonged postoperative nil per os status overwhelmed the participants' functional reserves, causing temporal weakness in the lingual or pharyngeal muscles [19]. Consequently, bolus transport was tentatively affected, increasing pharyngeal residue after esophagectomy. Another potential reason for the tentative impairment in pharyngeal squeezing is iatrogenic damage to the ansa cervicalis and nerve rootlets during neck dissection, although the impact of neck dissection on pharyngeal movement was likely limited [32, 33].

Contrary to the decline in tongue base retraction and PCR, the reduced hyoid displacement during swallowing and vocal fold immobility is likely due to the direct impact of surgical procedures on swallowing mechanisms and, therefore, is persistent. In cervical lymph node dissection during esophagectomy, the infrahyoid muscles are either retracted or partially divided [34]. The damaged infrahyoid muscles become scarred, thereby hindering the lifting of the hyolaryngeal complex [9]. Scar formation around the trachea may also produce a counterforce against the hyolaryngeal elevation [17]. Furthermore, damage to the recurrent laryngeal nerve during lymphadenectomy increases the risk of vocal fold immobility after esophagectomy [35]. All the study participants underwent lymphadenectomy, which may be a reason for the higher incidence of vocal fold immobility observed in our study compared to that in previous reports in which esophagectomy was performed without lymphadenectomy (12.7% [8], 14.9% [11]). These surgery-induced changes might have allowed postoperative airway invasion in the participants. As previously reported, reduced hyolaryngeal elevation [8, 17] and vocal fold immobility [11, 18] were associated with aspiration after esophagectomy.

Despite persistent deficits in the movement of the hyoid and vocal folds, no aspiration events occurred three months after surgery. One possible reason for this favorable outcome is that the participants may have acquired maneuvers for safe swallowing, such as super supraglottic swallow [36] or chin-tuck swallow [37]. Kumai et al. reported that patients who underwent esophagectomy with three-field lymph node dissection had already acquired a chin-down swallow at postoperative VFSS without specific guidance and thus showed lower PAS scores than those who did not [38]. Our longitudinal research added that the compromised airway protective mechanism could continue for up to three months, emphasizing the importance of postoperative education for patients with compensatory strategies for airway protection.

Finally, our study found that pharyngeal residue, among other swallowing-related abnormalities, identified through a thorough swallowing assessment, is the single postoperative change associated with aspiration pneumonia. The pharyngeal residue, specifically the residue in the vallecula, is related to post-swallow aspiration [39]. However, in our study, all penetration and aspiration events were observed during swallowing when the participants’ maximum laryngeal elevation seemed to be achieved; these were probably due to incomplete airway closure. No aspiration of post-swallow residue was observed during VFSS, which might be due to the limitation of the test bolus volume to 5 mL. As such, accumulated residue with larger bolus in real-life meals can result in aspiration, thereby leading to aspiration pneumonia. Therefore, managing pharyngeal residues is essential to decrease penetration and aspiration after esophagectomy.

The study did not specifically analyze the location of the residue. However, the significant postoperative impairments identified in this study are known contributors to pharyngeal residue [40]: reduced tongue base retraction, reduced pharyngeal contraction, and decreased hyoid displacement. This study could not examine the longitudinal changes in several other factors, such as hyolaryngeal approximation or pharyngeal shortening during swallowing, which might affect postoperative residue. Further kinematic analyses may reveal the detailed contributions of swallowing mechanisms to pharyngeal residue, which may, in turn, help identify potential targets for postoperative swallowing exercises.

The current study has several limitations, similar to other retrospective observational studies [41]. First, the results were based on a limited sample size from a single-center setting, hampering generalizability and limited univariate statistical analyses. For example, most participants had thoracic esophagus cancers. Other participant characteristics, such as pathological stage, surgical procedure, or reconstruction routes, may have influenced postoperative swallowing impairments. To our knowledge, no other comprehensive longitudinal analyses of the kinematics and dysphagic symptoms have been published. Therefore, our study provides unique findings regarding dysphagia associated with esophagectomy. Second, our inclusion criteria required three-time VFSS data, potentially limiting patients with severe medical conditions and prolonged serious dysphagia who could not undergo VFSS from participating in the study. Third, the analysis was restricted to swallowing of only 5 mL of thickened liquid in this study due to missing data regarding thin liquid trials resulting from VFSS discontinuation for swallowing safety. Thin-liquid swallowing trials might have revealed more penetration and aspiration events, leading to additional findings of dysphagia after esophagectomy. Similarly, decreased UES openings may be observed for larger bolus volumes, as UES opening is volume dependent [14]. However, as this study was performed postoperatively in an acute hospital setting, swallowing safety during the VFSS was prioritized. Finally, observer bias may be involved in assessing the MBSImP since only one SLP served as a rater. However, to minimize bias, we ensured that the scoring was performed by an MBSImP-certified rater. We also blinded the two raters when assessing the PAS.

Future large-scale studies are warranted to examine longitudinal changes in swallowing function, kinematics, and symptoms of other dysphagia-related variables. Studies should also investigate factors associated with patients with severe dysphagia using various test boluses after esophagectomy.

Conclusion

This study demonstrated the longitudinal changes in swallowing physiology using VFSS and diet levels in patients undergoing esophagectomy. Preoperative swallowing function was intact, while postoperative vocal fold immobility, MBSImP parameters (tongue base retraction and pharyngeal residue), and kinematic measurements (hyoid displacement and PCR) worsened postoperatively. Of these changes, reduced hyoid displacement and vocal fold immobility remained after three months; however, no aspiration events occurred, and all patients returned to a regular diet. Postoperative pneumonia was associated with pharyngeal residue, highlighting the importance of postoperative education to improve swallowing safety and efficiency.