The relationship between biomechanics of pharyngoesophageal segment and tracheoesophageal phonation

Zhang, Teng; Cook, Ian; Szczęśniak, Michał; Maclean, Julia; Wu, Peter; Nguyen, Duong Duy; Madill, Catherine

doi:10.1038/s41598-019-46223-7

Download PDF

Article
Open access
Published: 05 July 2019

The relationship between biomechanics of pharyngoesophageal segment and tracheoesophageal phonation

Scientific Reports volume 9, Article number: 9722 (2019) Cite this article

1819 Accesses
6 Citations
Metrics details

Subjects

Abstract

This study examined the relationship between biomechanical features of the pharyngoesophageal (PE) segment, acoustic characteristics of tracheoesophageal (TE) phonation, and patients’ satisfaction with TE phonation. Fifteen patients using TE phonation after total laryngectomy completed the Voice Symptom Scale (VoiSS) and underwent acoustic voice analysis for cepstral peak prominence (CPP) and relative intensity. High resolution manometry (HRM) combined with videofluoroscopy was used to evaluate PE segment pressure and calculate the pressure gradient (ΔP), which was the pressure difference between the upper oesophagus and a point two centimetres above the vibrating PE segment. The upper oesophageal sphincter (UOS) minimal diameters were measured by Endolumenal Functional Lumen Imaging Probe (EndoFLIP). HRM detected rapid pressure changes at the level of the 4th – 6th cervical vertebra. CPP, relative intensity, and ΔP were significant predictors of satisfactory TE phonation. ΔP was a significant predictor of CPP and intensity. Minimal UOS diameter was a significant predictor of relative intensity of TE phonation. In two patients with unsuccessful TE phonation, endoscopic dilatation subsequently restored TE phonation. These findings suggest that sufficient ΔP and large UOS diameter are required for satisfactory TE phonation. Endoscopic dilatation increasing UOS diameter may provide a new approach to treat unsuccessful TE phonation.

The clinical efficacy of relocation pharyngoplasty to improve retropalatal circumferential narrowing in obstructive sleep apnea patients

Article Open access 07 February 2020

Heonjeong Oh, Hyung Gu Kim, … Hyun Jik Kim

Comparative analysis of high-speed videolaryngoscopy images and sound data simultaneously acquired from rigid and flexible laryngoscope: a pilot study

Article Open access 14 October 2021

Wioletta Pietruszewska, Marcin Just, … Ewa Niebudek-Bogusz

A study of acoustic characteristics of voluntary expiratory sounds produced before and immediately after swallowing

Article Open access 14 January 2022

Shoma Hattori, Shinji Nozue, … Koji Takahashi

Introduction

Verbal communication is one of the most important forms of human interaction¹. In normal laryngeal phonation, the vocal folds produce a voice signal with well-defined harmonic structures² which are the key component of normal voice quality³. In patients with advanced laryngeal cancer who are treated by total laryngectomy, the mechanisms of sound production change permanently. Studies have shown that patients experience a reduced quality of life as a result of their communication impairment after total laryngectomy⁴.

The tracheoesophageal (TE) voice is currently the gold standard for voice restoration in laryngectomy patients⁵. This involves surgical placement of a silicone valve called the TE voice prosthesis between the trachea and the oesophagus. When the tracheostoma is intentionally occluded for phonation, the voice prosthesis allows one-way pulmonary air passage from the trachea through to the oesophagus⁶. The airflow then sets the cervical oesophagus and lower hypopharynx into vibration, generating an acoustic signal^7,8. This vibrating part, referred to as the pharyngoesophageal (PE) segment, is located in the lower third of the neck, corresponding to cervical vertebrae from C5 to C7⁹. Given that voice characteristics depend on biomechanics of the vibration source¹⁰, the voice output of TE phonation is influenced by structure and function of the PE segment as a result of the surgical techniques and the patient’s specific anatomy¹¹. Research has attempted to investigate the relationship between various characteristics of the PE segment and the resulting TE phonation¹². Currently, there is limited understanding of how variations in PE dimension, vibratory characteristics, air pressure, and airflow affect TE phonation. Whilst perceptual voice quality, patient satisfaction and quality of life outcomes related to TE phonation have been documented¹³, establishing objective outcome measures such as appropriate biomechanical or acoustic analysis is essential for multidimensional evaluation of the efficacy of any interventions to improve the quality of TE phonation.

Unlike laryngeal phonation where the vocal folds can be observed using laryngoscopy, the whole PE segment cannot be visualized directly except for the visible neoglottis that can be seen via high-speech endoscopy¹⁴ and videostrobosopy¹⁵. Therefore, the PE segment has been assessed by a number of parameters such as vibratory characteristics⁶, dimension¹⁶, position of the PE segment prominence related to the anterior¹⁷ and posterior pharyngeal walls¹⁶, intraluminal pressure¹⁸, and PE geometry¹⁹. A range of methods have been used to investigate the characteristics of the vibrating PE segment during TE phonation including videofluoroscopy¹⁷, manometry²⁰, and acoustic analysis²¹.

Videofluoroscopy

This method has proven useful in identifying the location⁹ and measuring the dimensions¹⁶ of the PE segment. It has also been used to correlate the anatomical features of the PE segment and the resulting perceptual voice quality²². van As et al.¹⁷ found significant correlation between TE voice quality and the minimal distance between the PE segment prominence and anterior pharyngeal wall at rest and during phonation. In contrast, Takeshita et al.¹⁶ suggested that TE voice quality was correlated with the anteroposterior distance between the PE segment prominence and the posterior pharyngeal wall in both resting and phonation. As such, there is conflicting evidence regarding the exact association between the quality of TE phonation and fluoroscopic dimensional parameters.

Manometry

Manometry studies of PE biomechanics in laryngectomees have been performed to investigate the pressure features of the PE segment that may influence TE phonation. Takeshita et al.¹⁶ found the intraluminal pressure (mmHg) at rest and during TE phonation to be 13.1 and 25.5 in good, 17.55 and 36.41 in moderate, and 4.44 and 40.46 in poor TE speakers. This study suggested that the difference in pressure between rest and phonation should be small for efficient TE phonation. Aguiar-Ricz et al.²³ used manometry to compare the pressure of the upper oesophageal sphincter (UOS) at rest and phonation between successful and unsuccessful TE speakers. They found that at rest, the mean UES pressure was 11.83 mm Hg for successful esophageal speakers and 9.92 mm Hg for unsuccessful esophageal speakers and with no significant difference between groups during phonation. These findings imply that the role of intraluminal pressure of the PE segment has not yet been fully understood. The key factors including the pressure gradient across the vibrating segment (ΔP) and PE geometry and their associations with TE phonation have not yet been systematically investigated. This can be examined through use of high-resolution pharyngeal manometry (HRM) combined with concurrent videofluoroscopy, which provides fast signal acquisition and accurate biomechanical measurements during TE phonation.

Acoustic voice analysis

Acoustic analysis of a recorded sound signal is often used because of the ease and non-invasive nature of sound recording in clinics and correlation with perceptual voice measures²⁴. Objective analysis of sound signals including fundamental frequency, frequency and pitch perturbation, spectral characteristics and relative intensity analysis can be obtained using tools such as the Computerised Speech Lab (CSL)²⁵ and Praat²⁶.

To ensure accurate and meaningful acoustic analysis, signal typing must be undertaken to identify the level of noise in the signals. In 1995 Titze²⁷ classified acoustic signals into three types. Type 1 signals were nearly-periodic that did not show qualitative changes and strong modulations or subharmonics (i.e. the energy level of these components, if present, were below that of the fundamental frequency, f0). Type 2 signals contained qualitative changes (e.g. bifurcations) or modulations and subharmonics with energy level approaching that of f0. This signal type did not have an obvious single f0 in the signal. Type 3 signals contained no obvious periodic structure. This signal type was further defined by Sprecher et al. as chaotic with a finite dimension²⁸. Sprecher et al.²⁸ also added Type 4 signals i.e. signals that contained an infinite dimension; In spectrograms, there was smearing of energy across a wide range of frequencies similarly to broadband white noise.

It has been proven that analysis of perturbation in voice signals dominated by chaotic and stochastic noise is unreliable^28,29. For example, Type 3 and Type 4 signals²⁸ have been identified as being inappropriate for analysis of perturbation (shimmer and jitter) and harmonics-to-noise ratio (HNR). In recent times, a new measure, cepstral peak prominence (CPP) and its smoothed measure (CPPS)³⁰ have been used to analyze Type 3 and 4 signals as it is not dependent on f0 tracking³¹. The CPP is measured as the difference in amplitude between the cepstral peak and the value on a linear regression line directly below the peak relating cepstral frequency (i.e. quefrency) to cepstral magnitude³¹. The CPP is able to provide a reliable assessment of dysphonia for voice with high levels of noise^32,33 and has been demonstrated to have a strong correlation with TE voice quality³⁴. In laryngeal phonation, it has been shown that CPP values can vary across vocal tasks³⁵, intensity³⁶, and acoustic analysis programs³⁷.

Acoustic features, such as the maximum phonation time, voice sound intensity, fundamental frequency, perceptual evaluations, and HNR have been well studied in this population^8,38,39,40. These studies reveal that in spite of a significant difference from normal laryngeal phonation, TE phonation is the closest to normal phonation compared with other alaryngeal rehabilitation methods⁴¹.

Measurement of diameter of the PE segment

The recently developed impedance based Endolumenal Functional Lumen Imaging Probe (EndoFLIP) positioned in the PE segment offers a more direct and precise measurement of the PE geometry⁴². EndoFLIP has been used as a novel technology to study the oesophagogastric junction/lower oesophageal sphincter to measure the outcome of fundoplication and Heller’s myotomy surgeries⁴³, to determine the consistent stoma size during gastric banding⁴⁴, to examine oesophageal narrowing due to stenosis⁴⁵, and to assess mechanical competence of the gastroesophageal junction^46,47. No study has used this method to measure the PE diameter.

To date, no studies have systematically investigated the factors that influence the quality of TE phonation using the combination of the self-evaluation, fluoroscopy, pressure evaluation and acoustic analysis. Therefore, the aims of this study were: (1) to describe the features of TE phonation using patient’s self - rating, acoustic analyses, and dynamic manometric measurements; (2) to investigate the relationship between the patient’s self-rating of satisfaction toward their TE phonation and acoustic measurements and dynamic manometric characteristics during TE phonation; and (3) to examine the effects of changes in biomechanical properties of the PE segment on TE acoustic results.

Materials and methods

Ethical approval

The study protocol was approved by the Human Research Ethics Committee of the South Eastern Sydney Local Health District of New South Wales Health (HREC/12/POWH/452). Written informed consent was obtained from all participants to participate in this study. The study was implemented in accordance with relevant ethical guidelines and regulations. The measurement procedures used in this study conformed to the standards set by the latest revision of the Declaration of Helsinki.

Participants

Fifteen patients participated in this study with mean age of 67 (standard deviation, SD = 7; range = 55 to 77), 14 were male, one was female. Patients were recruited through the Departments of Gastroenterology, Speech Pathology and Radiation Oncology at St George Hospital, and the Laryngectomee Association of New South Wales, Australia. Patients were only included if they had undergone total laryngectomy surgery at least 12 months prior to this study and were communicating using TE phonation at the time of the study. Patients were excluded if they had any history of local tumour recurrence or any neurological disorder potentially associated with dysphagia, such as a prior cerebrovascular accident, Parkinson’s disease, or myopathy.

Study procedures

Pressure measurement of vibrating PE segment

High resolution manometry (HRM) combined with concurrent videofluoroscopy⁴⁸ was used to investigate the biomechanical properties of the PE segment during TE phonation. With participants seated upright, the manometry catheter (Unisensor USA Inc., Portsmouth, NH, USA), with diameter 3.6 mm incorporating 25 solid-state pressure sensors at 1-cm spacing, was inserted transnasally to span the UOS after topical anaesthesia (lignocaine 10%). Fluoroscopic videos were acquired (MultiDiagnost Eleva; Philips, Best, The Netherlands) and recorded concurrently with HRM using an MMS Solar GI system (Software Version 8.21o; MMS, Enschede, The Netherlands).

Participants swallowed 2 mL of EZ-HD barium (Bracco UK Limited, Woodburn Green, High Wycombe, UK) to enhance visualization of the PE segment before phonation. Participants were then required to take a deep breath and count numbers steadily in one breath and then phonate a prolonged/a/with the stoma closed with videofluoroscopic examination. The anatomical locations of interest were identified by locating the corresponding sensor positions in the fluoroscopic images. The PE pressure gradient (ΔP) was calculated as the difference between the pressure in the upper oesophagus and a point 2 cm above the vibrating PE segment during prolonged/a/(Fig. 1).

Figure 1 shows a representative recording of HRM and concurrent videofluoroscopy during TE phonation. By counting the HRM sensors on the fluoroscopic cine-loops, pressure changes at several anatomical positions of interested were localised during phonation. Pressurisation zones at sensors 21 and 18 respectively indicated the closure of the soft palate and the movement of the base of the tongue during TE phonation. Sensor 13 at the superior hypopharynx measured the atmospheric pressure, which was the reference point to calculate the pressure gradient across the PE segment.

At the bottom of the pressure map, pressurisation of the cervical oesophagus was identified during phonation. Rapid pressure changes demonstrated the anatomical location of maximal vibration of the PE segment, situated between the fourth and sixth cervical vertebrae (C4–C6), and covering the inferior hypopharynx and the UOS (Fig. 1, between the red dashed lines). The movement of the PE wall was also revealed on the concurrent fluoroscopic recording.

Diameter measurement of UOS

The Endolumenal Functional Lumen Imaging Probe (EndoFLIP) was used to capture accurate measurement of UOS diameters¹⁹. The minimal diameter of the EndoFLIP balloon is zero when the lumen is completed obstructed by a passive stricture. Prior to each study, the probe was calibrated at body temperature by filling the bag with 0.2% saline within a calibration block containing a set of cylindrical lumens with their surface areas ranging from 50 to 616 mm² and the pressure transducers were calibrated at 0 and 75 mmHg⁴⁷. The patients, under sedation (fentanyl, midazolam, and propofol), were then kept in the right lateral position to minimise the influence of gravity on the balloon and involuntary contraction of the UOS in response to balloon insertion.

The EndoFLIP catheter was placed transorally into the oesophagus and withdrawn until the bag was centred at the UOS (Fig. 2B). Bag position was also confirmed by partially filling the balloon (10 mL) and observing an hourglass shape on the EndoFLIP screen. When the catheter was held in place, the balloon was deflated and the patient had a brief (2 min) habituation period. It was followed by 30 mL ramp distension (rate 60 mL/min) to obtain the minimal diameter of the PE segment. The PE segment geometry was monitored in real time to ensure the bag remained in position; a repositioning was necessary if any suspected migrations were observed^47,49.

To clinically treat UOS strictures and pharyngeal dysphagia, endoscopic dilatation was performed in order to increase the minimal diameter of the UOS. Detailed protocols on the endoscopic dilation can be found in our previous publication⁵⁰. In one patient, the EndoFLIP measurement was repeated after the dilatation.

Self-rating of TE voice impairment

The subjective assessment of voice impairment was obtained by using a validated questionnaire known as the Voice Symptom Scale (VoiSS)⁵¹. The “Impairment” subscale of the questionnaire (VoiSS-I) was selected as the tool for assessing the degree of self-reported satisfaction towards TE phonation. This scale was selected as it had the most rigorous development process compared with other self-rating scales⁵². Previous research has found it to have high sensitivity, specificity, and efficiency in rating the impact of voice problem on the patient’s life⁵³. This scale is also simple for patients to use.⁵¹. The VoiSS was regarded as psychometrically the most robust and extensively validated self-report voice measure⁵⁴.

Acoustic analysis of TE voice

The acoustic recordings of TE voice were collected using the CSL Model 4500²⁵ at least one week after the UOS diameters measurement to avoid possible effects of this procedure on the PE segment mucosa e.g. oedema that could affect TE phonation. A cardioid (directional) microphone attached to the CSL was placed 20 cm horizontally from the mouth to record the TE voice⁵⁵. Sitting in the upright position, patients were required to take a deep breath, then cover the stoma, and initially count steadily at one number per second. After the initial count step, another deep breath was taken and patients were asked to phonate the vowel/a/as long as possible with the stoma covered. The patient was required to repeat the task three times. The voice signal was recorded at 44.1 kHz sampling rate and saved in *.wav file.

The Praat acoustic analysis program (version 5.1.02)⁵⁶ was used to generate spectrograms of the prolonged/a/using settings described in Sprecher et al.²⁸ as follows: Hamming window, window length = 50 ms, time step = 0.002 seconds, frequency step = 5 Hz, and a dynamic range of 40 dB. Signal typing was performed visually by the sixth author, by comparing the spectrograms to the exemplar signal types described in Sprecher et al.²⁸.

CPP (in decibel, dB)⁵⁷ was measured using the Analysis of Dysphonia in Speech and Voice (ADSV)⁵⁸ and relative intensity (dB) was measured using the intensity analysis function. CPP was used as it has been recommended in the assessment of voice quality not suitable for more traditional analysis such as noise-to-harmonic ratio, jitter and shimmer⁵⁹. Previous research has found a strong correlation between this measure and TE voice quality³⁴. Acoustic analyses for these measures were performed using the whole vowel duration. The mean vowel duration (second, s) was 5.9 s, SD = 3.8, minimum = 2.0 s, maximum = 14.3 s. The final acoustic value for each measure was averaged across the three trials. This study only used this vowel to allow correlation calculation with other dynamic measurements which were also analysed on this vowel.

Statistical analyses

Statistical analyses were performed using Prism version 4.0 for Windows⁶⁰. Descriptive statistics was used to describe biomechanical and voice measures. Linear regression analysis was used to assess the relationship between ΔP and minimal diameter of the PE segment and TE voice impairment (VoiSS-I), CPP, and relative intensity. The relationship between VoiSS-I scores and acoustic measures (CPP and relative intensity) was also examined using linear regression. In all statistical calculations, a significance level of p < 0.05 was used. Bonferroni correction was not implemented given the preliminary and exploratory nature of this study.

Results

Pressure characteristics of PE segment

Fourteen patients were able to undergo the manometry protocol. By counting the pressure sensors from the fluoroscopic image (Fig. 1), the location of the vibration segment during phonation could be identified over a region covering the inferior hypopharynx and the UOS (sensor 9), corresponding to the cervical vertebrae C4 – C6. Additionally, the soft palate was closed during phonation (sensor 21), with movement of the base of tongue (sensor 18) and unpressurised superior hypopharynx (sensor 13). Figure 3 shows HRM examples of successful (Fig. 3A) and unsuccessful (Fig. 3B) TE phonation with insertion of voice prosthesis along with concurrent videofluoroscopy. Fluoroscopically, in successful TE speakers rapid movement of the PE wall was observed from the fluoroscopic video, whereas in unsuccessful TE phonation no such rapid movement was observed. Manometrically, successful TE speakers demonstrated oesophageal pressurisation and rapid pressure changes at the inferior hypopharynx and the UOS (Fig. 3A, between the red dashed lines). In laryngectomees with failed TE phonation, no rapid pressure changes at the hypopharyngeal region and UOS were present despite oesophageal pressurisation (Fig. 3B). From HRM examination, the ΔP was averaged across 10 data point for each patient. The average ΔP (n = 14) during TE phonation of/a/varied from 15.0–78.0 mmHg across patients (Table 1).

Table 1 Descriptive statistics of all measures in the present study.

Full size table

UOS diameters

Table 1 shows the mean, SD and other statistical parameters of the minimal diameter of the PE segment obtained from seven laryngectomees who underwent voice recordings. The UOS minimal diameter was obtained while the patient did not phonate. Two laryngectomees in the study initially with no TE phonation, even with the insertion of a voice prosthesis, underwent endoscopic dilatation for treating pharyngeal dysphagia. After dilatation, the patients were able to communicate using TE phonation. In order to understand this phenomenon, endoscopic dilatation was performed on another laryngectomee with failed TE phonation despite the insertion of a voice prosthesis. Minimal diameters of the PE segment (minimal UOS diameters) pre- and post-dilatation were measured by EndoFLIP. Post-dilatation, the minimal UOS diameter increased from 6.4 mm to 7.3 mm (Fig. 4) and the patient was capable of TE phonation.

VoiSS and acoustic findings

VoiSS questionnaires were completed by 15 participants with the mean (SD) score of the VoiSS-I subscale being 31.6 (12.9).

In 12 patients who had voice recordings, 2 had Type 2 signals, 6 had Type 3 signals, and 4 had Type 4 signals (see Appendix). Signal type 4 is dominated by stochastic noise with smearing of energy across a wide range of frequencies²⁸. Given these signal types in the voice recordings, f0 was not measured. The vowel CPP of the TE voice ranged from 0.3 dB to 4.8 dB, with a mean (SD) value of 1.9 (1.4) dB.

Relationship between biomechanics and voice measures

Linear regression was used to determine whether biomechanical measures were significant predictors of voice outcomes in TE phonation. Figure 5 shows linear regression lines representing the relationship between CPP, ΔP, and relative intensity and VoiSS-I scores. This figure showed that VoiSS-I scores were statistically significantly predicted by CPP (Y = −0.7X + 44; R² = 0.49; p = 0.011), ΔP (Y = −0.4X + 49; R² = 0.35; p = 0.025), and relative intensity (Y = −0.9X + 89; R² = 0.35; p = 0.043). These regression equations showed that satisfactory TE phonation (i.e. low VoiSS-I score) was associated with a high CPP (Fig. 5A), high ΔP (Fig. 5B), and high intensity (Fig. 5C).

Data from 11 laryngectomees who had both HRM and TE voice recording were used to calculate the linear regression to evaluate the relationship between ΔP and acoustic measures. The results showed that ΔP was a significant predictor of both CPP (Fig. 6A: Y = 0.05X −0.05; R² = 0.38; p = 0.042) and relative intensity (Fig. 6B: Y = 0.4X + 51; R² = 0.51; p = 0.013) of TE phonation. Larger ΔP was associated with higher CPP and greater intensity. In addition, relative intensity was also a significant predictor of CPP (Fig. 6C: Y = 0.1X − 5; R² = 0.44; p = 0.019), that is, higher intensity would result in higher CPP in TE phonation.

Figure 7 shows regression lines and equations representing the relationship between the minimal UOS diameter and CPP and relative intensity of TE phonation. Although a large minimal UOS diameter tended to be associated with a high CPP result, no significant linear relationship between these two measures was found (p = 0.06, Fig. 7A). The minimal UOS diameter was a significant predictor of relative intensity (Y = 3.6X + 29; R² = 0.73, p = 0.015); the larger the UOS diameter, the greater intensity of TE phonation (Fig. 7B).

Discussion

Understanding the characteristics of the PE segment that determine successful TE phonation is important in the rehabilitation of phonation in laryngectomy patients. This study was an attempt to correlate patient’s satisfaction towards TE phonation and acoustic characteristics of TE phonation to a number of biomechanical measures of the PE segment collected in the context of clinical examination and treatment of PE stricture. The methods used in this study include HRM, videofluoroscopy and EndoFLIP, which provided data on location of the PE segment, intraluminal pressure inside the PE segment, and minimal UOS diameter. The combined assessment was deemed necessary to provide a comprehensive assessment of the PE segment biomechanics.

Pressure characteristics of the vibrating PE segment

The simultaneous use of videofluoroscopy and manometry⁶¹ is useful in providing information on the location, pressure, and function of the PE segment. To our knowledge this is the first study to combine HRM and videofluoroscopy to identify the vibrating PE segment during phonation in laryngectomees. The segment was observed to correspond to C4–C6, which is the location of the UOS and inferior hypopharynx. This was higher than the previously reported C5 to C7⁹. It is possible that a larger number of sensors would allow detecting the parts of PE segment that can be otherwise missed if a smaller number of sensors are used. This is an advantage of HRM in examining PE function.

Two major phenomena were observed. Firstly, HRM and videofluoroscopy showed pressurisation and rapid pressure changes in this segment, which were not observed in patients with unsuccessful TE phonation. This showed the essential nature of the vibration of the PE walls on TE voice production. This further supported a previous view that TE phonation should be regarded as an aerodynamic-myoelastic rather than merely aerodynamic event⁶². However, voluntary control over this may not be similar to that in laryngeal phonation.

Secondly, the present study showed that sufficient ΔP is essential in generating satisfactory TE voice in total laryngectomees (i.e., lower voice impairment self-rating and improved acoustic outcomes). There was a wide range of pressure gradients across the PE segment during intelligible phonation (between 15.0 and 78.0 mmHg across patients), implying a wide tolerance in tonic pressure following laryngectomy. To our knowledge, this is the first study to measure ΔP in laryngectomy patients with TE phonation. In individuals with a normal larynx, an oscillation threshold pressure from the lungs will be required to generate the inferior-superior pressure gradient² across the vocal folds for phonation. This pressure gradient induces fast airflow through the vocal folds, creating pressure changes on the surface of vocal fold tissues, which leads to a self-sustained tissue vibration and sound generation⁶³. From both the theoretical basis and our experimental data, the muscular pressure gradient across the PE vibration segment during phonation may be an important factor that determines TE phonation. Measurement of air pressure and flow in the PE segment during TE phonation would be required to investigate this hypothesis. Regression analyses confirmed the relationship between self-rated VoiSS scores and CPP, relative intensity, and pressure gradient across the vibrating segment. Thus, ΔP might be a valuable predictor of satisfactory TE voice production.

The results indicated that the ΔP might be regarded as an indirect estimate of the PE segment resistance and hence TE phonation efficiency. In normal phonation, laryngeal airway resistance is determined by air pressure and airflow rate⁶⁴ both of which can be intentionally controlled for. In TE phonation, assuming that the same rule applies, the resistance within the PE segment would be determined by trans-oesophageal air pressure and airflow rate. Grolman et al.⁶⁵ found that, although airflow rate at comfortable TE phonation (167 ml/s) was not significantly different from normal laryngeal phonation, the resistance of the PE segment was significantly higher (198 cm H₂O/l/s). Given the differences in biomechanical characteristics between the larynx and the PE segment, it is possible that achieving an adequate pressure is the most important strategy for TE speakers to produce an acoustic signal. In laryngeal phonation, the relationship between pressure-flow and the interaction between pressure and flow and vocal fold tissue determines vibratory characteristics⁶⁶ in which the phonation threshold pressure is dependent upon the vocal fold geometry and viscoelastic properties². However, in TE phonation, the PE segment as a vibrator may function differently, making the interaction between pressure-flow and PE tissue more complex. The viscoelastic characteristics of the PE segment can change over time as a result of stricture and scarring, influencing the vibration of the PE segment, associating with a degradation of sound quality or a lack of vibration. These can impede vibration or, in the worst scenario, stop the vibration completely. Reducing resistance of this segment may help return the vibration, for example in cases where botulinum injection or PE dilatation are used to restore TE voicing⁶⁷.

Diameters of the PE segment

We found that minimal UOS diameters, as measured by EndoFLIP, were a significant predictor of TE voice. Given the applicability of impedance planimetry to measure cross-sectional areas within the oesophagus⁴⁵, using this technique in biomechanically assessing the PE segment is reasonable. Lohscheller et al. estimated UOS cross-sectional area from HS endoscopy⁶⁸. In comparison to UOS cross-sectional area estimated by HS endoscopy, EndoFLIP can provide a direct, objective and more accurate assessment of the minimal UOS diameter⁶⁹.

The minimal UOS diameter is an important factor for a successful TE phonation because it may determine aerodynamic resistance. In laryngectomy patients phonating with a TE voice prosthesis, the endo-tracheal phonation pressure and aerodynamic resistance are considerably higher than that in normal laryngeal phonation⁶⁵. There is also substantial power loss at the level of the TE voice prosthesis⁶⁵. The constricted PE segment would further increase the resistance. This study showed that endoscopic dilatation increasing the minimal UOS diameter improved TE phonation in a small sample size. More patient data gathered during a clinical trial is necessary to confirm our observations. A small UOS diameter is potentially correctable by endoscopic dilatation, which is a simple and safe therapeutic option, primarily for treating pharyngeal dysphagia with UOS strictures⁷⁰ but not well-recognised as a tool to improve TE phonation. It should also be noted that for patients with adjuvant chemotherapy and UOS narrowing, multiple dilatation sessions (range: 1–12 sessions; mean: 3 sessions) may be required to achieve a response⁷¹.

Large minimal UOS diameter was correlated with high intensity, but not better quality of the TE voice in this study. This implies that diameter of the UOS may not be the only factor to determine the quality of TE voice. The two patients with failed rehabilitation of TE phonation underwent endoscopic dilatation to increase the minimal UOS diameters and to treat dysphagia. The capability of TE phonation was subsequently restored. In one patient with failed TE phonation, EndoFlip measured UOS minimal diameters pre- and post- dilation confirmed that the enlarged minimal diameter was associated with the capacity of using TE phonation.

Voice quality in TE phonation

The voice recordings of laryngectomees in this study had aperiodic signals in which Type 3 and Type 4 signals predominated. In this study, the mean CPP of the TE phonation was 1.9 dB with a range from 0.3 to 4.8 dB. The CPP value in the present study was significantly lower than that in vocally healthy speakers in a study by Madill et al.³⁷ in which CPP of the same vowel (/a/) obtained by SpeechTool is 17.7 dB (SD = 2.19). The CPP values in this study were also considerably lower than mean CPP values of the/a/vowel from participants at the highest age range (50 years old) measured using ADSV (12.145 dB, SD = 2.044 for males and 10.894 dB, SD = 1.751 for females)⁷². In the literature, only one study has measured CPP from TE phonation³⁴ in which the mean CPP measured by SpeechTool from sustained vowel combined with connected speech was 11.45 dB (range = 8.66–15.54 dB). It should be noted that there are factors responsible for the differences in CPP between this study and the literature. It has been found that different software packages produce different CPP results^37,73. This is due to the different processing algorithms used in each analysis program. Even in the same program and algorithm, different CPP values may result using different settings. Phadke⁷³ has pointed out that the output can be different if different versions of the same program (e.g. Praat) stipulate different “default” time and quefrency averaging windows. Additionally, there may be unexplained between-version differences in CPP even when the same settings were used, for example in Praat⁷³. Given these facts, future studies on CPP should clearly specify the settings used to extract the data and cross-studies comparisons would be irrelevant if there are differences as discussed above. The low CPP values as observed reflected the degraded sound quality provided by TE phonation as a result of differences in phonation mechanisms compared with normal laryngeal phonation. The physiologic characteristics of the PE segment may not allow voluntary control over TE phonation. Omori et al.⁶ found two separate bulges in the PE segment in which the upper bulge, which was made of the thyropharyngeal muscle, was found to be the sound source. They maintained that TE phonation stemmed from thyropharyngeal muscle contraction in harmonization with mucosal vibration in the PE segment. However, unlike laryngeal phonation, these muscular structures of the PE segment are not naturally designed for phonation⁷⁴. Consequently, the modulation of the vibrating tissue with the air flow and pressure may not be as effective as that in laryngeal phonation, creating considerable aperiodicity and noise in the signal as presented by very low CPP as mentioned above.

In this study, we also found that CPP was significantly predicted by intensity (Fig. 6C), which agreed with the literature on laryngeal phonation³⁶. Although CPP is considered a robust acoustic measure of voice quality⁵⁹, the findings in this study could not determine whether an increase in CPP resulted from intensity or voice quality of TE phonation. Auditory-perceptual judgements and nonlinear analysis⁷⁵ may be useful in evaluating voice quality independently from intensity.

The negative relationship between CPP and VoiSS scores confirmed the impact of poor voice quality in patients’ satisfaction on TE voice. Our findings are consistent with Robertson et al.⁷⁶ in which the mean VoiSS score (Impairment) for laryngectomy patients was 29.8 (SD = 12.5).

Limitations and future directions

This study had a number of limitations. In acoustic analysis, only a sustained vowel was used. This allowed correlation calculation between acoustic and biomechanical measures and minimized the within-speakers task-specific variations in the acoustic output. However, this may not reflect the actual varying phenomena occurring in connected speech and preclude any generalization to normal speech production. Future studies should use speech tasks for both acoustic and biomechanical measurements.

This study could not assess the degree of aerodynamic power loss and the myoelastic property of the PE tissue. In fact, there may be considerable between-subject variability in these factors, which may affect vocal efficiency, intensity, and voice quality⁶⁵. It is difficult to control for these factors given differences across patients in the power loss due to the prosthesis, the extent of prior surgery and use of different surgical techniques. Therefore, these remain the major challenging factors when studying PE segment and TE phonation. Additionally, this study did not measure aerodynamic parameters e.g. airflow and phonatory pressure. Collecting these would allow comparing biomechanical factors with vocal efficiency of TE phonation, providing more insight regarding what is needed for optimal function of the PE segment in TE phonation.

The sample size in this study was limited and not all patients were included in all analyses. This was due to a range of reasons including patient attrition during the study, difficulties tolerating the procedure and technical difficulties at the time of data collection. This may affect statistical significance level in some of the correlation calculations. A study replicating our protocols using a calculated sample size may clarify the potential correlations in this study e.g. the relationship between UOS diameter and CPP.

Lastly, perceptual analysis of TE phonation was not used in this study. This limited the explanation of acoustic findings. Future studies may combine perceptual and acoustic analyses and biomechanical measurements to provide a better understanding of TE phonation.

Conclusion

This study found significant correlation between pressure gradient and patient’s voice-related satisfaction and a robust acoustic voice measure, CPP. This suggests that sufficient pressure difference across the PE segment and lower pharynx is needed for efficient TE phonation.

A large minimal diameter tended to result in a higher CPP result, but no significance in this trend was found. Large minimal diameters of the PE segment were associated with higher relative intensity production. As such, a large UOS diameter is a predictor for successful TE voice. In unsuccessful TE phonation patients, endoscopic dilatation increasing the UOS minimal diameter may provide a new approach to treat unsuccessful TE phonation.

References

Titze, I. R. et al. Comparison between electroglottography and electromagnetic glottography. J Acoust Soc Am 107, 581–588 (2000).
Article ADS CAS Google Scholar
Titze, I. R. The physics of small-amplitude oscillation of the vocal folds. J Acoust Soc Am 83, 1536–1552 (1988).
Article ADS CAS Google Scholar
Dollinger, M. et al. Vibration parameter extraction from endoscopic image series of the vocal folds. IEEE Trans. Biomed. Eng. 49, 773–781 (2002).
Article Google Scholar
Perry, A., Casey, E. & Cotton, S. Quality of life after total laryngectomy: functioning, psychological well-being and self-efficacy. Int J Lang Commun Disord 50, 467–475 (2015).
Article Google Scholar
Blom, E. D., Pauloski, B. R. & Hamaker, R. C. Functional outcome after surgery for prevention of pharyngospasms in tracheoesophageal speakers. Part I: Speech characteristics. Laryngoscope 105, 1093–1103 (1995).
Article CAS Google Scholar
Omori, K., Kojima, H., Nonomura, M. & Fukushima, H. Mechanism of tracheoesophageal shunt phonation. Arch Otolaryngol Head Neck Surg 120, 648–652 (1994).
Article CAS Google Scholar
Mohri, M., Yoshifuji, M., Kinishi, M. & Amatsu, M. Neoglottic activity in tracheoesophageal phonation. Auris Nasus Larynx 21, 53–58 (1994).
Article CAS Google Scholar
Pindzola, R. H. & Cain, B. H. Duration and frequency characteristics of tracheoesophageal speech. Ann Otol Rhinol Laryngol 98, 960–964 (1989).
Article CAS Google Scholar
Wetmore, S. J. et al. Location of the vibratory segment in tracheoesophageal speakers. Otolaryngol Head Neck Surg 93, 355–361 (1985).
Article CAS Google Scholar
Berke, G. S. & Gerratt, B. R. Laryngeal biomechanics: an overview of mucosal wave mechanics. J Voice 7, 123–128 (1993).
Article CAS Google Scholar
Albirmawy, O. A., Elsheikh, M. N., Silver, C. E., Rinaldo, A. & Ferlito, A. Contemporary review: Impact of primary neopharyngoplasty on acoustic characteristics of alaryngeal tracheoesophageal voice. Laryngoscope 122, 299–306 (2012).
Article Google Scholar
van As-Brooks, C. J., Hilgers, F. J., Koopmans-van Beinum, F. J. & Pols, L. C. Anatomical and functional correlates of voice quality in tracheoesophageal speech. J Voice 19, 360–372 (2005).
Article Google Scholar
Eadie, T. L. & Doyle, P. C. Quality of life in male tracheoesophageal (TE) speakers. J Rehabil Res Dev 42, 115–124 (2005).
PubMed Google Scholar
Van As, C. J., Op de Coul, B. M., Eysholdt, U. & Hilgers, F. J. Value of digital high-speed endoscopy in addition to videofluoroscopic imaging of the neoglottis in tracheoesophageal speech. Acta Otolaryngol 124, 82–89 (2004).
Article Google Scholar
Dworkin, J. P. et al. Videostroboscopy of the pharyngoesophageal segment in total laryngectomees. Laryngoscope 108, 1773–1781 (1998).
Article CAS Google Scholar
Takeshita, T. K. et al. Relation between the dimensions and intraluminal pressure of the pharyngoesophageal segment and tracheoesophageal voice and speech proficiency. Head Neck 35, 500–504 (2013).
Article Google Scholar
van As, C. J., Op de Coul, B. M., van den Hoogen, F. J., Koopmans-van Beinum, F. J. & Hilgers, F. J. Quantitative videofluoroscopy: a new evaluation tool for tracheoesophageal voice production. Arch Otolaryngol Head Neck Surg 127, 161–169 (2001).
Article Google Scholar
Takeshita-Monaretti, T. K., Dantas, R. O., Ricz, H. & Aguiar-Ricz, L. N. Correlation of maximum phonation time and vocal intensity with intraluminal esophageal and pharyngoesophageal pressure in total laryngectomees. Ann Otol Rhinol Laryngol 123, 811–816 (2014).
Article Google Scholar
Wu, P. I. et al. Clinical utility of a functional lumen imaging probe in management of dysphagia following head and neck cancer therapies. Endoscopy 49, 848–854 (2017).
Article Google Scholar
Arenaz Bua, B., Olsson, R., Westin, U. & Rydell, R. The Pharyngoesophageal Segment After Total Laryngectomy. Ann Otol Rhinol Laryngol 126, 138–145 (2017).
Article Google Scholar
Most, T., Tobin, Y. & Mimran, R. C. Acoustic and perceptual characteristics of esophageal and tracheoesophageal speech production. J Commun Disord 33, 165–180; quiz 180–161 (2000).
Lundstrom, E., Hammarberg, B., Munck-Wikland, E. & Edsborg, N. The pharyngoesophageal segment in laryngectomees–videoradiographic, acoustic, and voice quality perceptual data. Logoped Phoniatr Vocol 33, 115–125 (2008).
Article Google Scholar
Aguiar-Ricz, L. et al. Behavior of the cricopharyngeal segment during esophageal phonation in laryngectomized patients. J Voice 21, 248–256 (2007).
Article Google Scholar
Carding, P. N., Wilson, J. A., MacKenzie, K. & Deary, I. J. Measuring voice outcomes: state of the science review. J Laryngol Otol 123, 823–829 (2009).
Article Google Scholar
PentaxMedical. Computerized Speech Lab, https://www.pentaxmedical.com/pentax/en/99/1/Computerized-Speech-Lab-CSL (2018).
Boersma, P. & Weenink, D. Praat: doing phonetics by computer, http://www.fon.hum.uva.nl/praat/ (2018).
Titze, I. R. Workshop on Acoustic Voice Analysis: Summary Statement. (National Center for Voice and Speech, 1995).
Sprecher, A., Olszewski, A., Jiang, J. J. & Zhang, Y. Updating signal typing in voice: addition of type 4 signals. J Acoust Soc Am 127, 3710–3716 (2010).
Article ADS Google Scholar
van As-Brooks, C. J., Koopmans-van Beinum, F. J., Pols, L. C. & Hilgers, F. J. Acoustic signal typing for evaluation of voice quality in tracheoesophageal speech. J Voice 20, 355–368 (2006).
Article Google Scholar
Hillenbrand, J. & Houde, R. A. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res 39, 311–321 (1996).
Article CAS Google Scholar
Hillenbrand, J., Cleveland, R. A. & Erickson, R. L. Acoustic correlates of breathy vocal quality. J Speech Hear Res 37, 769–778 (1994).
Article CAS Google Scholar
Heman-Ackah, Y. D. et al. Cepstral peak prominence: a more reliable measure of dysphonia. The Annals of otology, rhinology, and laryngology 112, 324–333 (2003).
Article Google Scholar
Heman-Ackah, Y. D. et al. Quantifying the cepstral peak prominence, a measure of dysphonia. J Voice 28, 783–788 (2014).
Article Google Scholar
Maryn, Y., Dick, C., Vandenbruaene, C., Vauterin, T. & Jacobs, T. Spectral, cepstral, and multivariate exploration of tracheoesophageal voice quality in continuous speech and sustained vowels. Laryngoscope 119, 2384–2394 (2009).
Article Google Scholar
Watts, C. R. The Effect of CAPE-V Sentences on Cepstral/Spectral Acoustic Measures in Dysphonic Speakers. Folia Phoniatr Logop 67, 15–20 (2015).
Article Google Scholar
Awan, S. N., Giovinco, A. & Owens, J. Effects of vocal intensity and vowel type on cepstral analysis of voice. J Voice 26(670), e615–620 (2012).
Google Scholar
Madill, C., Nguyen, D. D., Eastwood, C., Heard, R. & Warhurst, S. Comparison of cepstral peak prominence measures using the ADSV, SpeechTool and VoiceSauce acoustic analysis programs. Acoustics Australia 46, 215–226 (2018).
Article Google Scholar
Deore, N. et al. Acoustic analysis of tracheo-oesophageal voice in male total laryngectomy patients. Ann. R. Coll. Surg. Engl. 93, 523–527 (2011).
Article CAS Google Scholar
Torrejano, G. & Guimaraes, I. Voice quality after supracricoid laryngectomy and total laryngectomy with insertion of voice prosthesis. J Voice 23, 240–246 (2009).
Article Google Scholar
Schindler, A. et al. Intensity and fundamental frequency control in tracheoesophageal voice. Acta Otorhinolaryngol. Ital. 25, 240–244 (2005).
CAS PubMed PubMed Central Google Scholar
Globlek, D., Stajner-Katusic, S., Musura, M., Horga, D. & Liker, M. Comparison of alaryngeal voice and speech. Logoped Phoniatr Vocol 29, 87–91 (2004).
Article CAS Google Scholar
Hirano, I., Pandolfino, J. E. & Boeckxstaens, G. E. Functional Lumen Imaging Probe for the Management of Esophageal Disorders: Expert Review From the Clinical Practice Updates Committee of the AGA Institute. Clin Gastroenterol Hepatol 15, 325–334 (2017).
Article Google Scholar
Perretta, S., Dallemagne, B., Allemann, P. & Marescaux, J. Multimedia manuscript. Heller myotomy and intraluminal fundoplication: a NOTES technique. Surg Endosc 24, 2903 (2010).
Article Google Scholar
Praduodenal, L. Abstracts of the 2011 Scientific Session of the Society of American Gastrointestinal and Endoscopic Surgeons (SAGES). San Antonio, Texas, USA March 30- April 2, 2011. Surg Endosc 25(Suppl 1), 191–388 (2011).
Google Scholar
Scharitzer, M. et al. Comparison of videofluoroscopy and impedance planimetry for the evaluation of oesophageal stenosis: a retrospective study. Eur Radiol 27, 1760–1767 (2017).
Article Google Scholar
Nathanson, L. K., Brunott, N. & Cavallucci, D. Adult esophagogastric junction distensibility during general anesthesia assessed with an endoscopic functional luminal imaging probe (EndoFLIP(R). Surg Endosc 26, 1051–1055 (2012).
Article Google Scholar
Kwiatek, M. A., Pandolfino, J. E., Hirano, I. & Kahrilas, P. J. Esophagogastric junction distensibility assessed with an endoscopic functional luminal imaging probe (EndoFLIP). Gastrointestinal endoscopy 72, 272–278 (2010).
Article Google Scholar
Szczesniak, M. M. et al. Inter-rater reliability and validity of automated impedance manometry analysis and fluoroscopy in dysphagic patients after head and neck cancer radiotherapy. Neurogastroenterol Motil 27, 1183–1189 (2015).
Article CAS Google Scholar
McMahon, B. P. et al. The functional lumen imaging probe (FLIP) for evaluation of the esophagogastric junction. American journal of physiology. Gastrointestinal and liver physiology 292, G377–384 (2007).
Article CAS Google Scholar
Zhang, T. et al. Biomechanics of Pharyngeal Deglutitive Function following Total Laryngectomy. Otolaryngol Head Neck Surg 155, 295–302 (2016).
Article Google Scholar
Deary, I. J., Wilson, J. A., Carding, P. N. & MacKenzie, K. VoiSS: a patient-derived Voice Symptom Scale. J Psychosom Res 54, 483–489 (2003).
Article Google Scholar
Branski, R. C. et al. Measuring quality of life in dysphonic patients: a systematic review of content development in patient-reported outcomes measures. J Voice 24, 193–198 (2010).
Article Google Scholar
Behlau, M. et al. Efficiency and Cutoff Values of Self-Assessment Instruments on the Impact of a Voice Problem. J Voice 30, 506 e509–506 e518 (2016).
Article Google Scholar
Wilson, J. A. et al. The Voice Symptom Scale (VoiSS) and the Vocal Handicap Index (VHI): a comparison of structure and content. Clin Otolaryngol Allied Sci 29, 169–174 (2004).
Article CAS Google Scholar
Svec, J. G. & Granqvist, S. Guidelines for selecting microphones for human voice production research. Am J Speech Lang Pathol 19, 356–368 (2010).
Article Google Scholar
Boersma, P. & Weenink, D. Praat: Doing phonetics by computer Version 5.1.02, http://www.praat.org (2009).
Skowronski, M. D., Shrivastav, R. & Hunter, E. J. Cepstral Peak Sensitivity: A Theoretic Analysis and Comparison of Several Implementations. J Voice 29, 670–681 (2015).
Article Google Scholar
PentaxMedical. Analysis of Dysphonia in Speech and Voice - ADSV, https://www.pentaxmedical.com/pentax/en/99/1/Analysis-of-Dysphonia-in-Speech-and-Voice-ADSV (2018).
Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P. & Corthals, P. Acoustic measurement of overall voice quality: a meta-analysis. J Acoust Soc Am 126, 2619–2634 (2009).
Article ADS Google Scholar
GraphPad Software, https://www.graphpad.com/scientific-software/prism/ (2018).
Olsson, R., Nilsson, H. & Ekberg, O. Simultaneous videoradiography and computerized pharyngeal manometry–videomanometry. Acta Radiol 35, 30–34 (1994).
Article CAS Google Scholar
Moon, J. B. & Weinberg, B. Aerodynamic and myoelastic contributions to tracheoesophageal voice production. J Speech Hear Res 30, 387–395 (1987).
Article CAS Google Scholar
Lucero, J. C. The minimum lung pressure to sustain vocal fold oscillation. The Journal of the Acoustical Society of America 98, 779–784 (1995).
Article ADS CAS Google Scholar
Baken, R. J. & Orlikoff, R. F. Clinical measurement of speech and voice. Second edition edn, (Singular Publishing Group, 2000).
Grolman, W. et al. Vocal efficiency in tracheoesophageal phonation. Auris Nasus Larynx 35, 83–88 (2008).
Article Google Scholar
Ishizaka, K. & Matsudaira, M. Fluid Mechanical Considerations of Vocal Cord Vibration. (SCRL Monograph Series, 1972).
Chaukar, D. A. et al. Ultrasound-guided botulinum toxin injection: A simple in-office technique to improve tracheoesophageal speech in postlaryngectomy patients. Head Neck 35, E122–125 (2013).
Article Google Scholar
Lohscheller, J. et al. Quantitative investigation of the vibration pattern of the substitute voice generator. IEEE Trans Biomed Eng 51, 1394–1400 (2004).
Article Google Scholar
Jiang, L. et al. Comprehensive swallowing exercises to treat complicated dysphagia caused by esophageal replacement with colon: A case report. Medicine (Baltimore) 96, e5707 (2017).
Article Google Scholar
Harris, R. L., Grundy, A. & Odutoye, T. Radiologically guided balloon dilatation of neopharyngeal strictures following total laryngectomy and pharyngolaryngectomy: 21 years’ experience. J Laryngol Otol 124, 175–179 (2010).
Article CAS Google Scholar
Paramsothy, S., Maclean, J., Szczesniak, M. M. & Cook, I. J. Sa1175 Cricopharyngeal Dilatation for Post Laryngectomy Dysphagia - a Pilot Study of Efficacy and Safety. Gastroenterology 142, S–235 (2012).
Google Scholar
Garrett, R. Cepstral- and spectral-based acoustic measures of normal voices Master of Science thesis, University of Wisconsin-Milwaukee (2013).
Phadke, K. V. Selected topics in laryngeal, perceptual and acoustic assessments of human voice: Videokymographic evaluations of vocal folds and investigations of teachers’ voices. PhD thesis, Palacký University Olomouc, (2018).
Angermeier, C. B. & Weinberg, B. Some aspects of fundamental frequency control by esophageal speakers. J Speech Hear Res 24, 85–91 (1981).
Article CAS Google Scholar
Jiang, J. J., Zhang, Y. & McGilligan, C. Chaos in voice, from modeling to measurement. J Voice 20, 2–17 (2006).
Article Google Scholar
Robertson, S. M., Yeo, J. C., Dunnet, C., Young, D. & Mackenzie, K. Voice, swallowing, and quality of life after total laryngectomy: results of the west of Scotland laryngectomy audit. Head Neck 34, 59–65 (2012).
Article Google Scholar

Download references

Acknowledgements

This study was supported by the Australian Postgraduate Award and the Dr Liang Voice Program at The University of Sydney.

Author information

Authors and Affiliations

Biomechanics Laboratory, Li Ka Shing Faculty of Medicine, Hong Kong, Hong Kong
Teng Zhang
Faculty of Medicine, The University of New South Wales; Department of Gastroenterology, Sydney, Australia
Ian Cook, Michał Szczęśniak & Peter Wu
Cancer Care Centre, St George Hospital, Sydney, Australia
Julia Maclean
Voice Research Laboratory, The University of Sydney, Sydney, Australia
Duong Duy Nguyen & Catherine Madill

Authors

Teng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ian Cook
View author publications
You can also search for this author in PubMed Google Scholar
Michał Szczęśniak
View author publications
You can also search for this author in PubMed Google Scholar
Julia Maclean
View author publications
You can also search for this author in PubMed Google Scholar
Peter Wu
View author publications
You can also search for this author in PubMed Google Scholar
Duong Duy Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Catherine Madill
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Teng Zhang was involved in research question identification, study planning, and protocol preparation. She performed the manometry, voice recording, and analysed the results. She also wrote the manuscript text and prepared all figures. Ian Cook was involved in study planning and protocol preparation. He performed the EndoFlip protocols and analysed the results. Michał Szczęśniak was involved in problem identification and protocol preparation. He performed the manometry protocols and analysed the results. Julia Maclean was involved in problem identification, study planning, and protocol preparation. She recruited the patients and audited the manometry study. Peter Wu was involved in problem identification and study planning. He performed the EndoFlip and analysed the results. Duong Duy Nguyen interpreted findings in the study and wrote and edited the manuscript. He also edited and formatted all figures. Catherine Madill was involved in problem identification, study planning, and protocol preparation. She performed acoustic voice analyses and edited the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Catherine Madill.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Spectrograms of all patients in this study

Supplementary information

Spectrograms of all patients in this study

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, T., Cook, I., Szczęśniak, M. et al. The relationship between biomechanics of pharyngoesophageal segment and tracheoesophageal phonation. Sci Rep 9, 9722 (2019). https://doi.org/10.1038/s41598-019-46223-7

Download citation

Received: 15 November 2018
Accepted: 13 June 2019
Published: 05 July 2019
DOI: https://doi.org/10.1038/s41598-019-46223-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

The clinical efficacy of relocation pharyngoplasty to improve retropalatal circumferential narrowing in obstructive sleep apnea patients

Comparative analysis of high-speed videolaryngoscopy images and sound data simultaneously acquired from rigid and flexible laryngoscope: a pilot study

A study of acoustic characteristics of voluntary expiratory sounds produced before and immediately after swallowing

Introduction

Videofluoroscopy

Manometry

Acoustic voice analysis

Measurement of diameter of the PE segment

Materials and methods

Ethical approval

Participants

Study procedures

Pressure measurement of vibrating PE segment

Diameter measurement of UOS

Self-rating of TE voice impairment

Acoustic analysis of TE voice

Statistical analyses

Results

Pressure characteristics of PE segment

UOS diameters

VoiSS and acoustic findings

Relationship between biomechanics and voice measures

Discussion

Pressure characteristics of the vibrating PE segment

Diameters of the PE segment

Voice quality in TE phonation

Limitations and future directions

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Spectrograms of all patients in this study

Supplementary information

Spectrograms of all patients in this study

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links