Hyperfunctional dysphonia
Hypofunctional dysphonia
Pathological mechanism
Irregularities of vocal fold vibration due to increased glottal resistance
Glottal closure insufficiency due to decreased glottal resistance
Subjective complaints
Vocal discomfort and voice sound alteration with longer voice use
Rapid vocal exhaustion, loss of voice brilliance
Auditory perceptual voice sound evaluation (GRBAS)
Rough, strained
Breathy, asthenic
General appearance
General muscle hypertension, thoracic breathing pattern, tight articulation pattern, cervical tension
General muscle hypotension, tendency towards shallow breathing
Laryngostroboscopy
Supraglottal hypertension, reduced open phase, reduced maximum amplitudes, long closure phase
Insufficient glottal closure even in loud phonation, enlarged maximum amplitude, long open phase, short closure phase
Voice range profile measurement
Loss of piano function
Loss of forte function
The currently used subclassification into hyper- and hypo-functional dysphonias is often made on the basis of stroboscopy. Stroboscopic parameters of clinical interest are usually maximum amplitude, relative mucosal wave, closed phase, phase differences and symmetry of vibrations.
Arndt and Schäfer (1994) introduced the width-length relationship as a quotient between maximum amplitude and total vocal fold length. They could successfully differentiate healthy voices from functional dysphonias by using this parameter, whereas the comparison of hyper- and hypo-functional dysphonias by using stroboscopic parameters revealed no difference (Schneider et al. 2002). Schneider et al. failed to find stroboscopic evidence (correlates) of subtypes of functional dysphonia, and for the diagnostics of functional dysphonias, stroboscopy did not provide high expectations.
Voice range profile measurements can easily differentiate subtypes of functional dysphonias. A hypofunctional dysphonia is usually characterised by a limitation of loud phonation (see Case Study 5.1). These patients can produce very low sound pressure levels in soft speaking and soft singing, but the increase in vocal intensity is reduced, and they mostly reach maximum sound pressure levels of less than 90 dB.
Case Study 5.1
A 46-year-old kindergarten teacher had undergone a viral infection of the upper respiratory tract 1 month before. She took antibiotics and cortisone. Now she has still a weak voice. In her history, there is a period of anorexia that is stable now and symptom-free. During leisure time, she sings in a choir. Video-laryngostroboscopy shows incomplete closure. The vocal range is mildly restricted, voice quality mildly breathy (R0 B1 H1).
In contrast, a hyperfunctional dysphonic patient usually reaches quite loud intensities while loud singing and shouting. With loud phonation, maximum values of sound pressure of more than 90 dB can easily be produced, whereas the ability of soft voice production is disturbed (see Case Study 5.2). Patients can hardly phonate at sound pressure level values lower than 55 dB.
Case Study 5.2
A 27-year-old actor occasionally has problems with his voice after having acted on stage the day before. He is sensitive to cold. His voice range profile shows an elevated curve of the softest tones. He is well able to project his voice. The voice sounds rough (R1 B0 H1).
5.1.7 Management
In general, for therapy of a voice disorder, a clinician usually has to choose between conservative therapy, application of drugs and surgical approaches/phonosurgery. In functional dysphonia phonosurgery is not indicated. The only clinical exception is a severe hyperfunctional dysphonia without response to conservative voice therapy. In this case an indirect laryngeal application of botulinum toxin might be indicated. In extreme muscular hypertension, the application of muscle relaxants or so-called neuromuscular blocking agents can also be discussed. In cases of cervical problems, a pain analgesic medication, orthopaedic therapy or physiotherapy might also be helpful before entering voice therapy.
Functional voice dysphonias are first of all a domain of conservative voice treatment despite the considerable controversial discussion about causal aetiological mechanisms. The literature is replete with evidence that symptomatic voice therapy for functional voice disorders can often result in rapid and dramatic voice improvement (Roy 2003; Roy and Leeper 1993; Ziegler et al. 2014; Moore 2012; Van Houtte et al. 2011; Hazlett et al. 2011; Marszałek et al. 2012; Van Lierde et al. 2010).
Many voice therapies include yawn-sigh, resonant voice therapy, visual and electromyographic biofeedback, progressive muscle relaxation and perilaryngeal massage to reduce or rebalance muscular tension. Manual/digital techniques have been described to determine the presence and degree of laryngeal musculoskeletal tension, as have methods to relieve such tension during the diagnostic assessment and management session.
5.2 Occupational Voice Disorders
5.2.1 Introduction
In recent decades, voice has assumed an important role in occupational activities. According to some studies, about one-third of the workforce relies on the voice to perform the job (Carding 2007). The increasing role of verbal communication in developing society and the heavy vocal demands associated with particular professions induce the growing risk of occupational voice disorders having a serious impact on working ability (Dejonckere 2001; Vilkman 2004; Niebudek-Bogusz and Sliwinska-Kowalska 2013).
5.2.2 Definition
Occupational voice disorders are work-related self-reported symptoms and clinical signs of dysphonia occurring in subjects whose job demands voice use as a critical aspect of the work.
Group I—professions demanding high quality of voice, e.g. singers, actors and other vocal performers
Group II—professions demanding permanent vocal load, e.g. teachers, interpreters, speakers
Group III—professions demanding more than average endurance of vocal apparatus because of some voice tasks performed in noisy environment, e.g. lawyers, physicians, clergy, military men
5.2.3 Epidemiology
Data from the United Kingdom show that more than five million workers are routinely affected by voice impairments at an annual cost of approximately ₤200 million (Carding 2007). It has been suggested that some professional groups, such as teachers, performers and telemarketing staff, are more at risk of developing voice disorders than other voice-demanding professions. Several randomised studies proved that the prevalence of dysphonia is around two to three times higher in teachers than in controls (Roy et al. 2005; Sliwinska-Kowalska et al. 2006). Data from various countries confirm that teaching is an occupation with the greatest risk (de Jong 2010). They indicate that from about 11–38% of the investigated teachers experience current voice problems, but more than 57% of them report at least one occurrence of voice disorders during their life. Additionally, at least one in three teachers claims that teaching has a detrimental effect on his or her voice and they are sometimes even forced to change profession (Roy et al. 2005).
Another emerging group of employees with a high risk of voice disorders is that of call centre industry workers. This group has dramatically grown over the past decade. Approximately 5% of the workforce in the United States works in call centres, compared with an estimated 2% in the EU. The incidence of voice disorders is expected to grow, owing to the highly demanding vocal load in this professional group and work-related stress factors in call centres (Schneider-Stickler et al. 2012). Raising the retirement age of voice professionals is an additional factor that can contribute to the increasing number of cases of occupation-related dysphonia in some EU countries.
Thus occupational dysphonia can not only affect the quality of life and social well-being of employees but also the economic efficiency of companies.
5.2.4 Aetiology and Risk Factors for Occupational Dysphonia
Occupational voice disorders have a multifactorial aetiology. Conditions at work may affect voice, whereas individual-specific factors may influence the employees’ susceptibility to dysphonia.
Voice loading: a crucial factor in the development of professional voice problems (see also Sect. 6.8). The load-related changes in the vocal tract are caused by prolonged speaking: at high intensity (more than 70 dB (A)), at a pitch outside the normal range, with improper intonation and abnormal resonance (Niebudek-Bogusz et al. 2006). The study of Remacle et al. (2012) indicated that 2 h of continuous oral reading with intensity 70–75 dB(A) could be phonotraumatic for normophonic females. It should be stressed that research to establish safety standards concerning the duration and intensity level of loading factors is still in progress (Echternach et al. 2014)
Environmental factors: dust, pollution, poor humidity and exposure to fluctuations of temperature may affect the condition of the mucosa of the upper respiratory tract. Other important ergonomic factors are background noise and improper acoustic conditions with poor reverberation time, which decrease the Speech Transmission Index (STI). STI expresses speech transmission in space. Poor room acoustics frequently lead to an increase in vocal effort. In order to be heard in deficient acoustic conditions, a speaker involuntarily tries to increase voice intensity; estimated subglottal pressure (ESP) and sound pressure level (SPL) have shown it is kept about 10–15 dB above the environmental noise (the Lombard effect)
Psycho-emotional factors, which play an important role in handling the demands of a professional voice: voicing with an adverse psycho-emotional attitude increases the prevalence of voice disorders. Many voice professionals with dysphonia have entered a vicious circle: the psychological factors exacerbate voice pathology, and voice disabilities interfere with job satisfaction, performance and attendance, affecting psychological well-being (de Jong 2010)
Lack of vocal hygiene and incorrect voice production techniques, which may cause pathological compensation of heavy vocal load and in consequence induce muscle tension in the area of the pharynx, larynx and neck, affecting body posture and the breathing pathway. Moreover, the lack of coping strategies in occupational stress enhances muscle strain (Koufman 2003; Marszałek et al. 2012)
Risk factors of occupational voice disorders
Ergonomic factors | Health factors | Personal characteristics |
---|---|---|
Voice loading Background noise Poor acoustics Indoor air quality: Humidity Temperature Irritants Work-related stress | Respiratory tract diseases Allergy LPR (Laryngopharyngeal reflux) Mucosal problems Hormonal disturbances Musculoskeletal abnormalities Deterioration of general physical/mental condition | Gender (female) Family history of voice disorders Incorrect phonation technique Posture Smoking Personality features Physical and mental activity Ability to cope with stress |
According to some cross-sectional studies, lifetime vocal load, incorrect techniques of voice production and psychological predispositions constitute the major risk factors for developing occupation-related voice disorders (Sliwinska-Kowalska et al. 2006; Bermudez de Alvear et al. 2010) (see Table 5.2). The individual-specific factors that can contribute to voice disorders are multiple, including frequent infections of the upper respiratory tract, allergies, laryngopharyngeal reflux and hormonal disturbances. Moreover, some studies report a gender predisposition to occupational dysphonia. The glottal cycle dose is much higher in females than in men owing to the nearly doubled fundamental frequency F0. Moreover, the female larynx seems to be more vulnerable (Dejonckere 2001) because of the following physiological differences: (1) being subjected to lifelong hormone-mediated effects, (2) the curving shape of the vocal folds contributing to insufficient glottal closure and to the hourglass-shaped oscillation pattern (characteristic of vocal nodules) and (3) reduced lubrication of the female laryngeal mucosa causing lower levels of hyaluronic acid in mucus secretions, which interferes with vocal fold vibration and decreases resistance to vocal loading. The relation between the above-mentioned factors is complex, and the relative influence of each factor varies among individuals.
5.2.5 Diagnostic Procedures
The patient’s history
ENT/phoniatric examination
Perceptual assessment of voice quality (e.g. GRBAS)
Self-assessment of voice (e.g. Voice Handicap Index)
Video-laryngostroboscopy—standard in diagnosing voice disorders and imaging of the larynx function
Objective evaluation of the vocal function via acoustic and aerodynamic measurements, in an attempt to quantify the severity of voice pathology
Occupational History (see Table 5.3) should be specific and concern the following: the subject’s current job, duration of employment (for teachers—the type of school and class, the number of class hours per day and per week as well as the number of students per class) and environmental work conditions (ambient temperature, humidity, dust, pollution, etc.). A more detailed enquiry should concern subjective vocal symptoms appearing over the entire working life, including recurrent or permanent hoarseness, vocal fatigue, loss of voice or aphonia, chronic dryness and the sensation of a ‘lump’ in the throat or persistent dry cough. Odynophonia—pain caused by using the voice can be a disturbing symptom. The history should also refer to current and past laryngological or phoniatric care (including voice training) and sick leave due to voice disorders. Further questions should be asked about nonoccupational risk factors, e.g. smoking or other habits, previous or concomitant ENT and other diseases (including sinusitis, pharyngitis and allergies, bronchitis, reflux disease, thyroid diseases, sexual hormonal imbalances, anaemia, cervical spine problems). Careful questioning as to current medications including antihistamines, diuretic, birth control pills and vitamins is needed. A history of previous surgery, particularly performed under anaesthesia, is also important. Laryngeal surgery is a matter of great concern
Occupational history
Job-related factors | Vocal symptoms | Nonoccupational risk factors |
---|---|---|
Kind of job, work load Hours of vocal effort per day/week Work environment Access to sound amplifier Psychosocial conditions Frequent business trips Jet-lag risk | Chronic/recurrent hoarseness Vocal fatigue Loss of voice/aphonia Difficulty in being heard Throat clearing Chronic dryness in the throat ‘Lump’ in the throat Sore throat Odynophonia Persistent dry cough | Smoking, alcohol consumption, drugs Improper diet or daily fluid intake ENT diseases (e.g. sinusitis, pharyngitis) Other diseases (e.g. reflux, allergies, hearing loss) Extra-occupational voice loading Voice demanding hobby Current medication (e.g. antihistamines, diuretics, oral contraceptives) Previous operations, anaesthesia |
Self-Assessment of Voice is a valuable tool of comprehensive voice diagnosis, particularly important among professional voice users, because it enables evaluation of the level of handicap that the subject experiences as a result of voice disorders. In reference to the WHO multidimensional concept of health, the patient’s subjective opinion should not be ignored in assessing the health consequence of disability, for example, the extent of impairment. One of the more common instruments of voice self-assessment is the Voice Handicap Index (VHI)—a 30-item questionnaire evaluating complaints and signs of disability due to voice disorders. This scale has been shown to have validity and reliability in various countries, and thus it can be used as a valuable screening tool (Cheng and Woo 2010; Herbst et al. 2015). The assessment of voice handicap ought to be taken into account, particularly in voice professionals, for whom voice disorders may not only have an impact on their general health condition but also bring about employment-related problems. Other common methods of voice self-assessment include V-RQOL (Voice-Related Quality of Life), VOS (Voice Outcome Survey), VoiSS (Voice Symptom Scale) and VTDS (Vocal Tract Discomfort Scale) (Bermudez de Alvear et al. 2010; Woznicka et al. 2012)
5.2.6 The Assessment of Vocal Load Effects
A crucial issue in diagnosing occupational dysphonia is the evaluation of the influence of vocal load on the voice apparatus (Hunter and Titze 2009). The evaluation of vocal load effects may be performed by means of the following.
Laboratory Tests Including Vocal Loading Tasks
The most popular vocal loading task (see also Sect. 6.8) consists of reading aloud a text for 30 min at the level of background (white) noise of 85 dB SPL (decibels sound pressure level).
Subjective self-ratings: specific surveys including questions concerning symptoms that accompany vocal effort have been developed by using a visual analogue scale (VAS) (Laukkanen et al. 2004). However, the common VHI questionnaire also covers complaints indicating vocal fatigue
Acoustic analysis:
Comparison of the pre- and post-test acoustic parameter values obtained by using the Multidimensional Voice Program. The most frequent changes are observed in the average fundamental frequency and in the parameters of jitter and shimmer groups (Niebudek-Bogusz et al. 2008; Remacle et al. 2012)
Comparison of the pre- and post-test voice range profile (VRP): a decrease in the maximum post-vocal loading VRP is considered to be a voice reduction test (cited after Obrębowski 2008)
Aerodynamic parameters: pre- and post-test values of maximum phonation time (MPT), estimated subglottal pressure (ESP) and sound pressure level (SPL) have been shown to be relevant for the assessment of vocal efficiency (Chang and Karnell 2004; Remacle et al. 2012; Echternach et al. 2014)
The changes of objective measurements may be accompanied by laryngo-videostroboscopic signs after the vocal loading test. The effects of poorly compensated vocal effort may affect dimensions of phonatory functions, including deterioration of the quality of the mucosal wave or incompleteness of phonatory glottal closure. A negative vocal loading test without deterioration after vocal effort has been noted in subjects with a well-functioning voice apparatus and sufficient voice capacity.
Individual Dosimeters
Dosimeters used in the workplace are portable devices worn by subject in order to extract and collect important parameters of vocal behaviour over an entire daily working time. Evaluation by means of dosimeters is particularly useful in assessing individual vocal load in specific job conditions and recovery time in various options of multimodal communications.
Ambulatory Phonation Monitor, which extracts the following data: the voicing percentage (%), the average fundamental frequency (Hz) and amplitude (dBSPL), glottal cycle dose and also distance dose—the length ‘covered’ by the vocal fold mucosa in the whole cycle of daily working activities (m)
Voxlog—a voice accumulator that registers phonation time, the fundamental frequency, voice sound pressure level, the background noise level, cycle and distance dose and energy dissipation and radiated energy dose (www.sonvox.com/voxlog/)
Individual dosimeters or vocal loading tasks seem to be valuable tools for the assessment of ‘voice fitness for duty’ in professional voice users and may enable clinicians to detect even early stages of occupational dysphonia. They can be used not only for predicting dysphonic severity but also for biofeedback and for the monitoring of the results of vocal training (Calcinoni and Niebudek-Bogusz 2014).
5.2.7 Clinical Aspects of Occupational Voice Disorders
Negative voice adaptation under conditions of chronic vocal load may result in the development of vocal fatigue and, as a consequence, reduced vocal endurance, leading to occupational voice disorders.
Occupational dysphonia may produce multiple voice symptoms, including recurrent or chronic hoarseness, sensation of dry throat or a lump in the pharynx, even pain, dry cough, loss of the singing range, voice tiredness or voicelessness (aphonia).
Functional (Non-organic) Dysphonia—the most frequent form of hyperfunctional origin (see also Sect. 5.1); phonation with excessive muscular forces leads to chronic abuse syndrome, inducing vocal fatigue and reduced voice capacity. Phoniatric examination may reveal incorrect type of breathing, excessive neck muscle tension, incorrect resonator activation and ‘hard’ phonation with rough voice, accompanied by abnormal vibration parameters observed by means of videostroboscopy (e.g. diminished amplitude of vocal fold vibration, reduced mucosal wave or glottal sphincter closure with vestibular phonation). Long-lasting functional dysphonia often leads to irreversible laryngeal lesions, frequently of organic type
Organic Dysphonia—the most common organic outcomes of vocal hyperfunction include vocal nodules, hypertrophy of the vocal folds margin, vascular lesions of the glottis, contact ulcers and asthenia of the internal larynx muscles contributing to glottal insufficiency. Vocal nodules are considered to be the most frequent occupation-related pathology. Vocal nodules are the effect of chronic phono-trauma and occur mostly in females (in adult age), which can be explained by the sex differences described above. These organic lesions are frequently preceded by prenodules—reversible symmetric slight oedema of vocal fold edges, which accompanies functional dysphonia, with muscular tension imbalance and an hourglass-shaped vibration pattern. In cases of the chronic vocal injuries, these oedemas may change into permanent hyalinised fibrous nodules or into epithelial calluses occurring around the free edge of the vocal folds, at the anteroposterior midpoint of the membranous folds. The vocal nodules, typically, are whitish, small, generally bilateral and symmetrical (Fig. 5.1)
Case Study 5.3
A 40-year-old teacher of music and sports at a high school had voice problems for the last 3 years. When singing high notes, she feels that there is no closure of the vocal folds. She had studied singing and conducts five choirs besides her teaching activities. At the age of 34, she suffered two heart attacks and was fitted with a bypass. She takes beta-blockers and clopidogrel and additionally thyroxine because of Hashimoto’s struma.
There is characteristic marginal oedema on both vocal folds in the middle of the membranous part. Closure is complete in the chest register but hourglass-shaped at soft phonation. The voice range profile shows that high notes cannot be sung softly; otherwise, it is inconspicuous. The voice at connected speech is normal (R0 B0 H0) as is the singing voice in the low register.
Case Study 5.4
A 26-year-old event manager providing much professional talking had voice problems for years. After vocal load the voice is hoarse up to aphonic and regularly worse in the evening. There are no diseases in her history.
The video shows vocal fold nodules in the characteristic location and an hourglass-shaped closure. Vocal range is restricted. The speaking voice is mildly hoarse (R1 B1 H1) and has a tendency to pressed phonation.
5.2.8 Differential Diagnosis
5.2.9 Management in Occupational Voice Disorders
Occupation-related voice disorders are becoming a global multifocal health problem, involving social, economic and public health aspects. The prevention and treatment of occupational dysphonia call for improving occupational safety regulations and introducing better health arrangements for professional voice users.
5.2.9.1 Treatment
Education on voice hygiene: stressing that the vocal effort should be followed by an appropriate period of voice rest
Vocal training: an important method of voice therapy that can improve vocal capacity and the ability to compensate for vocal load. It can significantly increase “voice fitness for duty” in professional voice users and may enable them to continue working
Coping strategies for dealing with occupational stress and burn-out symptoms: this is a particularly important module in the treatment of functional occupational dysphonia, since psycho-emotional factors constitute a considerable risk of persisting voice problems in some individuals
Management of occupational dysphonia should be holistic and should rely on a close collaboration of phoniatricians, otolaryngologists, speech therapists, psychologists and physiotherapists or osteopaths or other clinicians if needed (Marszałek et al. 2012). Extensive treatment of occupational voice disorders performed in a health spa frequently produces better results than those from treatment in an outpatient clinic.
5.2.9.2 Certification
In many EU countries, the problem of occupational voice disorders is undervalued, perhaps because of the lack of this disease entity in the basic list of occupational diseases specified in the EU Commission Recommendation 2003/670/EC. Only in some countries, for example, in Poland, are occupational voice disorders included in the list of occupational diseases and regarded as a medicolegal issue (Calcinoni and Niebudek-Bogusz 2014). For example, according to Polish law, a certified occupational voice disorder entitles the person to financial compensation in the form of a single financial indemnity (calculated proportionally to the percentage of the loss of health, ranging between 10 and 40%). If partial permanent work disability due to occupational disease is proven, a pension may be provided. The pension is granted frequently for 1–2 years only, and the person is expected to retrain in order to get another job.
High-quality documentation of voice examination from the whole employment period plays a crucial role in the certification procedure. The phoniatric evaluation, including videostroboscopic imaging and the instrumental assessment of work-related vocal load, should be documented and archived as relevant evidence in the diagnostic procedure of occupational voice disorders and in the monitoring of the voice improvement in the course of treatment.
5.2.9.3 Prophylaxis
The important role of the prevention of occupational voice disorders has already been recognised in some studies, but only a few countries have been introducing better health arrangements for voice professionals in the last few decades. For example, in Poland, teachers employed in public schools are examined by laryngologists before they start work, and then the examination is repeated at obligatorily set periods. One of the Polish preventive programmes has resulted in the introduction of an optional new subject into the syllabus of pedagogical colleges: education on vocal hygiene and proper vocal behaviour (Niebudek-Bogusz and Sliwinska-Kowalska 2013). Several other countries, e.g. Scandinavian countries and Austria, have introduced many preventive programmes for professional voice users (Vilkman 2004; Schneider-Stickler et al. 2012).
Prevention should also concern other vocally demanding professions, including the still-growing staff of the call centre companies. Therefore, new technologies of instrumental voice evaluation, including the assessment of vocal loading effects, ought to be developed and implemented. They can facilitate the diagnostics of the risk of occupation-related dysphonia and the detection of changes in its early stages. They can also support therapy (biofeedback). The prophylactic programmes should be developed on the basis of multidisciplinary treatment of occupational dysphonia and by the collaboration of laryngologists, phoniatricians and speech therapists, as well as occupational medicine doctors.
To sum up, in view of the increasing impact of voice disorders on people’s ability to work, more relevant prevention strategies are needed.
5.3 Singing Voice Disorders
5.3.1 Introduction
Human voice production is accomplished through the perfect coordination of breathing, laryngeal movement and articulation. The movement patterns employed in this process are some of the more complex movements of which the human body is capable.
In general, any dysfunction of the human voice is called dysphonia. According to the underlying complex mechanisms of voice production, there are multiple causes of dysphonia. The cardinal symptom of dysphonia is hoarseness, irrespective of its aetiology. In daily clinical work, a rough classification of organ-related and organ-unrelated dysphonias has proved useful.
Organic voice dysfunctions may have an inflammatory or a traumatic origin. Other causes may be benign or malignant lesions of the vocal cords.
Dysphonia-causing dysfunctions of the voice-producing organ itself
Dysfunctions accompanying general diseases (e.g. influenza), in which a more or less severe impairment of the body or vocal capability is intrinsic
In the latter case, the vocal dysfunction is not a disease of the vocal organ but a symptom of the general disease (Wendler and Seidner 2005).
5.3.2 Special Issues in the Evaluation of Singers’ Voices
For the evaluation of a singer’s voice, it is mandatory for the examiner to be aware that a singer’s larynx may show visible abnormalities or deviations that do not cause any voice impairment.
Structural modifications such as asymmetries, unilateral vocal cord swelling, oedema, polyps, varices, etc. are not unusual in a professional singer’s larynx. Systematic studies in professional singers have shown that these ‘anatomical lesions’ do not necessarily affect tone production or the vocal sound; moreover, no impairment of vocal sound production and sound stability may be perceived either by the singers themselves or by the person listening to them (Traser et al. 2012). For the appropriate classification of stroboscopic images with regard to functional aspects of voice production, the sound quality of the voice, as well as the vocal capacity and singing skills, has to be considered before any decision is made about whether a treatment should be initiated and, especially for singers, what the specific treatment procedures should be.
Dysfunctions that have their cause primarily in the singing voice (singing-related disorders)
Dysfunctions that have other causes but that also hamper singing
5.3.3 Singing-Related Disorders
5.3.3.1 Acute Symptoms of Vocal ‘Overuse’
The onset of respiratory infections is often associated with acute symptoms of vocal ‘overuse’. These mostly start with a rhinitic medical condition of viral origin as an ordinary ‘cold’. Young singers in particular lose control of their singing voice when they are affected by a rhinitis. They do not only perceive a change in the sound of vocal consonants, but they also sense a kinaesthetic change.
Such haematomas require special care if acetylsalicylic acid had been taken during the infection, since acetylsalicylic acid, as well as diclofenac, enhances the disposition for bleeding.
When affected by a respiratory infection, those with a high occupational use of the voice—singers and actors—are confronted with the decision whether to sing or speak despite the infection. The answer to this depends on the individual conditions and on the expected vocal strain.
Is the patient hoarse?
Does the patient show any laryngitic symptoms?
How good are the patient’s vocal skills?
Singers, irrespective of their experience, should not put any strain on the voice in the case of hoarseness or any signs of inflammation in the laryngeal area. The voice must not be strained under any circumstances when the acute disease comes with hoarseness. If signs and symptoms of laryngitis are proven by an ENT or phoniatrician during careful stroboscopic examination, singers and actors should not perform at all.
A hoarse voice must also not be used for singing in a choir because the auditory sound control is hampered when singing in an ensemble. This so-called Lombard-effect is an impairment of the auditory feedback system during voice production and is due to the increased surrounding noise volume (Lombard 1911). Individuals without special vocal training or experience automatically increase their speaking or singing volume when speaking or singing in surroundings with a noise volume over 70 decibels, as is often the case in a choir. Moreover, when the singing takes place in a choir, onset of hoarseness is not immediately perceived, and, since the affected singer does not want to lose face in front of his choral associates, the singing is often continued despite increasing voice problems.
Highly experienced singers with fully developed singing skills are able to sing even when suffering from an infection—as long as there is no hoarseness or laryngitis. In general, singers have learned to use their voice without increased strain even in the case of altered auditory or kinaesthetic feedback.
5.3.3.2 Chronic Overuse (Misuse)
As a rule, overuse results from a disproportion of vocal strain to vocal capacity. This diagnosis especially applies to continuous (chronic) vocal overuse.
From the beginning of the nineteenth century, a paradigm change regarding the vocal exigencies can be observed: from lyrical to dramatic singing. Along with the alteration to more dramatic demands in opera, reports of vocal crises and on chronic overuse in opera singers have been increasing. Accordingly, vocal overuse is not a new phenomenon in professional singers; it is also not at all rare. The world-famous Swedish singer Jenny Lind (1820–1887), who had her debut at the age of 17 singing the role of Agathe in Freischütz, is known to have suffered an existential vocal crisis at the age of 21. Another example is Enrico Caruso (1873–1921) who needed two vocal cord surgeries in the course of his career (Springer 2002). Unrelated to these operations, Caruso again paused from performance for 7 months in 1911. His biographer, Springer (2002), attributed this break to Caruso’s general exhaustion. In the middle of her career, Christa Ludwig repeatedly suffered from vocal haematoma resulting from voice overuse (Ludwig 1999).
Some of the composers who were responsible for the growing dramatisation of operas, e.g. Richard Wagner and Giuseppe Verdi, were quite aware of the high—and increased—vocal exigencies inherent to their scores. In 1875, Wagner complained in correspondence with Julius Hey (1832–1909):
[…] since the quite suggesting and commonly used term ‘Sprechgesang (chant)’ only referred to a strongly enhanced speaking of texts that had to combat the unequal battle with the orchestra. No wonder that this senseless proceeding ruined numberless beautiful voices. I am to blame for all this.
Hey (1911)
On the occasion of the planned representation of his innovative ‘dramatic’ opera MACBETH in Naples in 1848, Verdi wrote about the singer Eugenia Tadolini (1809–1851) who was planning to sing the Lady Macbeth part:
Tadolini’s qualities are far too good for that role! This may seem absurd to you!! … […] Tadolini has a stupendous voice, clear, limpid, powerful; and I would like the Lady to have a harsh, stifled, and hollow voice. Tadolini’s voice has an angelic quality; I would like the Lady’s voice to be diabolical. […].
Gossett (2007)
One should not derive from Verdi’s expression that he preferred ‘ugly’ voices; but it becomes obvious that also for him the dramatisation of the singing technique became increasingly important in the course of his creative career.
As a reaction to this growing dramatisation of operas, the writer and music critic George Bernard Shaw (1856–1950) expressed his opinion on Verdi’s compositional style and its health consequences for singers:
The whole secret of a healthy voice writing lies in keeping the normal plane of music, and therefore the bulk of the singers’ work, in the middle of the voice. Unfortunately, the middle of the voice is not the prettiest part of it; and in immature or badly and insufficiently trained voices it is often the weakest part. There is, therefore, a constant temptation for composers to use the upper fifth of the voice almost exclusively; and this is exactly what Verdi did without remorse. He practically treated that upper fifth as the whole voice, and pitched his melodies in the middle of it instead of the middle of the entire compass, the result being a frightful strain on the singer.
In order to avoid vocal overuse, the roles of the singing teacher as well as of the physician are crucial. The singing teacher is, together with the student, responsible for the teaching content and for choosing the appropriate exercises and scores. The singing teacher has a very high responsibility for the prophylactic avoidance of vocal overuse. Here, essential support comes from the phoniatrician who may be addressed whenever the singing teacher or the student have open questions or need help.
It is essential and important that any phoniatrician in charge of professional voice users be very experienced (see Sect. 1.10). The phoniatrician should also have profound knowledge of psychological issues (see Sect. 5.9). Ideally, the phoniatrician should have professional voice training.
With the phoniatrician’s and the singing teacher’s help, young singers should develop awareness that singing should only take place within one’s own voice category (Fach). A prerequisite for this is the teaching of knowledge about voice categories. Furthermore, active advice based on comprehensive experience is required (see Sect. 1.10).
It is very wise for a singer to avoid any voice overuse. A singer has to be aware that the more the voice is strained by difficult scores, the more agility and stretching of the relevant muscles are needed for it to recover. Therefore, the messa di voce (see Sect. 1.8) for the coloratura and others should under no circumstances be removed from the training programme but, on the contrary, be further practised. Dramatic and highly dramatic singing—including some subgroups of jazz and pop-rock singing, as well as belting in general—represent an especially complex challenge in terms of voice physiology and treatment by any voice specialist. In phoniatric practice, the question of the right time for singers to turn to dramatic scores often arises only when signs of overuse become apparent.
Here, three major mistakes—the ‘magic 3 t’s’—can be observed: (1) Singers start too early in their career to sing dramatic scores; (2) the scores chosen are too difficult; (3) the practising of the scores is too long and loud. Consequently, voice specialists should generally recommend a slow and careful start to singing development. Young singers who in particular do not have the pressure of time constraints should not rush to change towards the dramatic Fach: it is observed that this development is usually adopted too early.
Vocal overuse can be treated through competent phoniatric care and adequate involvement of the singer in the treatment procedure. Chronic overuse may lead to a functional voice disorder and, especially in singers, to dysodia (see Sect. 5.2). Various other voice problems may also arise from chronic or acute overuse. These are called benign organic voice disorders and are explained in detail in Sect. 5.4.
5.3.4 Dysfunctions and Diseases Hampering Singing
Organic Voice Disorders
Organic voice disorders may—besides original alterations—result either from functional disorders, which cause a purely mechanical impairment of the vocal fold mobility, or from functional disorders of the muscles and corresponding nerves that are responsible for the tension and mobility of the vocal cords.
Acute Inflammation
Acute respiratory infections are often accompanied by signs of laryngitis of viral or bacterial origin. Regardless of the underlying pathological cause, this laryngitis leads to swelling of the vocal cords and to hoarseness. In these cases singing and speaking should be strictly limited to avoid acute or chronic overuse of the voice.
Chronic Inflammation
There are various causes of chronic inflammation. Inhaled noxious agents such as cigarette smoke, dust and vapours may lead to chronic inflammation (Richter et al. 2002). In this context, of particular note are the chronic inflammatory changes associated with inhalatory steroid therapy used in the treatment of asthma. These changes are often accompanied by fungal colonisation of the larynx. However, continuous overuse of the voice may also lead to chronic inflammatory laryngeal changes. Furthermore, more general medical conditions such as rheumatic diseases may be a cause of chronic inflammatory changes in the larynx (see Sect. 5.13).
Significant Changes in the Vocal Tract
Singing voice disorders are not only restricted to problems of laryngeal origin. Modifications of the vocal tract might lead to or promote dysphonia. Diseases that involve disorders of the functioning of the vocal tract, such as movement disorders of the soft palate, or impairments of the velopharyngeal closure after tonsillectomy, possibly influence the acoustic properties of the vocal tract: impairment of formant clustering and formant tuning as well as influences on the voice source by means of non-linear relationships may be involved (Titze 2008). These variables might explain dysphonia in conditions without impairment of vocal fold function. Therefore, the diagnostics of dysphonia should not be reduced to functional analysis of vocal fold oscillations (Echternach et al. 2011).
Indoor Air Climate
The climatic conditions of an opera stage—especially the air’s humidity—may deviate considerably from those recommended for singers (Richter et al. 2000). This is particularly so when there is no effective air conditioning. From considering these conditions, along with the fact that singers may suffer a substantial dryness of the mouth because of the adrenergic stress reaction of stage fright, the importance of the optimisation of the indoor air climate becomes quite clear and unquestionable.
Reflux
Reflux-related posterior laryngitis is a disease that is not specific to singers, but singers who suffer from reflux (GERD, gastro-oesophageal reflux disease) often develop a posterior laryngitis (Cammarota et al. 2007). In all cases, singers suffering from GERD should undergo thorough medical examination (see Sect. 5.5).
Tumours
To date, there is no literature that provides descriptions of malignant vocal cord tumours related to or caused by vocal strain. Accordingly, these lesions are no more common in singers than in people with no vocal profession. There may even be a lower rate of malignant tumours in singers since they, for the reason of voice hygiene, usually consume less noxious agents (alcohol and nicotine) than average members of the population. Nevertheless, any hoarseness of unknown origin lasting longer than 3 weeks should be examined by an ENT physician or a phoniatrician. Since there are various diseases with hoarseness as a symptom, a thorough diagnosis is required in order to exclude any malign lesions.
Vocal Fold Movement Disorders
So far there have been no descriptions of strain-related vocal cord movement disorders. Accordingly, vocal fold movement disorders are no more common in singers than in non-professional voice users (see Sects. 5.6–5.8). Where a singer requires surgery in the region of the nervus laryngeus recurrens and the nervus laryngeus superior (e.g. thyroid gland operation), a very careful medical procedure in the context of voice hygiene is required, including neuro-monitoring. Given the surgeons’ awareness of this risk, dysfunctions that arise from such surgery (e.g. nerve paralysis) may even be rarer in singers than in the average population.
Case Study 5.5
A 40-year-old pop and soul singer had voice problems with her singing voice. She had no other diseases and takes no medication. The examination with video-laryngostroboscopy revealed vocal fold nodules. The voice range profile shows loss of high notes especially for soft tones. The speaking voice is normal. When singing, she fails to hit the passaggio note C5 (523 Hz), whereas the next one, D5 (587 Hz), is possible. This does not happen regularly. Usually singers avoid such accidents by pushing their voice.
Case Study 5.6
A 39-year-old musical singer and actor had some problems with his high voice. He had a phonosurgical intervention years ago, but it is not known which vocal fold was operated on. During video-laryngostroboscopy, there is a hardly noticeable swelling of the left vocal fold. He is a high baritone, and his voice range profile is adequate to the requirements of a musical singer. The speaking voice is normal as is the singing voice. However, he feels insecure about the performance and does not always trust his voice.
Case Study 5.7
A 43-year-old opera singer (soprano) had trouble with her high notes and needed a long time (more than 48 h) for recovery after performances. Video-laryngostroboscopy shows marginal oedema on both vocal folds in the middle of the membranous part where amplitudes are largest and collision and shearing forces strongest. When singing, she loses control over the high notes that tend to slide up. The speaking voice sounds mildly rough (R1 B0 H0).
Case Study 5.8
This 40-year-old baritone sings dramatic opera roles but wants to be a lyric singer. His voice is a bit heavy after he had sung performances with a cold. The video-laryngostroboscopy shows mucus/phlegm on the vocal folds. The margins are macerated, and on the right side, there is a small swelling. These are typical signs of an inflammation after vocal load. The vocal range is not affected. He has a very strong singer’s formant. His speaking voice sounds a little rough (R1 B0 H0). When singing, he tends to push his voice. After a few days of voice rest he will recover.
5.4 Benign Organic Voice Disorders
5.4.1 Lesions of the Lamina Propria of the Vocal Folds
5.4.1.1 Vocal Fold Nodules
Vocal fold nodules are the most common benign lesions of the vocal folds in both children and adults. The prevalence of nodules in the general population is not known but has been reported as being the cause of hoarseness in up to 23.4% of children, 0.5–1.3% of ENT clinic attendances and 6% of phoniatric clinic attendances. The prevalence of nodules in female teachers was found to be 43% of 218 cases with dysphonia, in a population of 1046 female teachers in a study in Spain. Teachers speak for an average of 102 min per 8 h. Nodules were found in 25% of hoarse singers (Pedersen and McGlashan 2012).
Definition
Vocal fold nodules are mostly symmetrical thickenings of the vocal fold margin due to oedematous swelling of the lamina propria and distension of the overlying epithelium.
Causes
Vocal fold nodules usually result from continuous vocal abuse, misuse or overload. This rather simplistic view should be extended because they may appear on the one hand as localised oedemas and on the other as epithelial thickenings. The main cause is certainly vocal stress, but there are other contributing factors that are not known. In recent years, attention has been focused on another aspect of aetiology, namely, epithelial tissue deviations in the region of the nodules. The development of nodules does not seem to be related to age but rather to functional load when speaking and singing (Martins et al. 2010).
Signs and Symptoms
Nodules are generally bilateral lesions located at the mid-membranous vocal fold. There are two types of nodule that may not necessarily have the same aetiology or prognosis. The first type consists in the beginning of fairly soft and pliable diffuse swellings. The histological abnormality seems to be more concentrated in the superficial lamina propria. They may increase over time owing to chronic vocal load up to the size of Reinke’s oedema. The other kind of nodule is characterised by focal hard and fibrotic epithelial thickenings.
Laryngostroboscopic examination demonstrates an hourglass-closure pattern and normal or minimally reduced mucosal wave vibratory activity.
Treatment
The standard treatment is voice therapy. Many people try to avoid surgical intervention. In terms of lasting improvement, phonosurgery is the better choice when combined with voice therapy.
Prognosis
These lesions may respond to a combination of voice rest or voice ergonomics and voice therapy when the patient is compliant, and the therapy is done in an appropriate fashion. However, they will not cease completely and may recur after renewed vocal load such as vocal recitals. After phonosurgery, the prerequisites for voice therapy and adjustments of singing technique are better. Even then, vocal fold nodules may recur.
5.4.1.2 Vocal Fold Polyps
Definition
Vocal polyps, more often unilateral than bilateral, are typically exophytic or pedunculated lesions of the middle part of the membranous vocal folds.
Causes
Polyps result from vocal hyperfunction, as do vocal nodules. However, unlike vocal nodules, polyps are often precipitated by a single vocal event, such as screaming. Once a small polyp begins, any continued vocal abuse or misuse will irritate the area and lead to its continued growth.
Signs and Symptoms
There are two types of polyp: hyaline or glassy ones of pale colour and haemorrhagic ones that are red. Polyps have a sharply delimited margin to the surrounding epithelium. Stroboscopic evaluation reveals a variable closure pattern according to size and localisation of the polyp. Pedunculated polyps are pressed below the vocal fold level into the inframarginal region, or they topple over during phonation and sit on the superior plane of the vocal fold. Polyps originating from the medial vocal fold edge may lead to an hourglass-closure pattern. The mucosal wave is inhibited at the site of the polyp, whereas other parts of the vocal folds may vibrate in a normal fashion.
Treatment
Voice therapy may help the symptoms of polyps, but generally surgical excision of the lesion is the treatment of choice. The distended epithelium covering the polyp should be resected as a whole, leaving the surrounding strong epithelium intact. Lamina propria material should not be removed below the epithelial level, to provide a matrix for epithelial regeneration. The microflap technique is not advisable.
Prognosis
There is a good prognosis for voice function after polyp excision, provided the intervention is limited to the pathological structures. Polyps usually do not recur.
Case Study 5.9
A 39-year-old construction engineer had suffered from hoarseness for 6 months. The vocal fold polyp may have developed when she used her loud voice on construction sites. She had no other health-related problems. She has lost her head voice and is unable to sing. The speaking voice sounds moderately rough (R2 B1 H2), and it feels as if she needs to clear her throat from phlegm in order to speak normally.
5.4.1.3 Polypoid Corditis (Reinke’s Oedema)
Definition
Reinke’s oedema is a pathological condition of the vocal folds, which involves accumulation of a gelatinous type of fluid under the vocal fold cover in Reinke’s space.
Causes
The most common aetiological factors of Reinke’s oedema involve tobacco use, LPR (laryngopharyngeal reflux) and chronic vocal abuse. Upper airway infection can often be present in patients with Reinke’s oedema, but there is no clinical evidence that supports infection as an aetiological factor.
Signs and Symptoms
Reinke’s oedema can considerably increase the mass and volume of the vocal folds. That typically lowers the pitch of the voice and causes increased vocal effort and instability. The aetiological factors should be addressed before surgical treatment.
Treatment
Phono-microsurgery is indicated once Reinke’s oedemas have reached a significant size. It is not likely that they will reform after cessation of smoking or after treating laryngopharyngeal reflux. The operation may be carried out on both vocal folds at the same time when done properly (see Sect. 8.10). Vocal effort may increase after removal of the oedema when a gap arises between the vocal folds. That is why the thin and fragile epithelium covering the gelatinous masses should never be resected completely. A so-called stripping is definitely malpractice and leads inevitably to scarring and irreparable vocal damage. Such side effects of the operation cause more discomfort than the oedema itself. Voice therapy before surgery is helpful in eliminating the vocal abuse and obtaining a better result after the surgery.
Prognosis
After surgical removal, Reinke’s oedema usually does not recur. In case of removal in an exact fashion, voice can improve significantly. Otherwise, it is nearly impossible to correct resulting scar formation when too much of the epithelium has been removed.
Case Study 5.10
A 54-year-old woman, a smoker with 45 pack-years, has been hoarse for many years. The video shows giant Reinke’s oedema on both vocal folds. The pitch of the speaking voice is in the region of male voices, but the timbre is still female. She is unable to speak up or sing.
5.4.2 Secondary Pathological Changes of the Vocal Folds and the Larynx
5.4.2.1 Vocal Fold Cysts
Definition
Vocal fold cysts are swellings below the epithelium of the vocal fold. They contain mucus or a caseous mass.
Causes
Epidermoid cysts may be secondary to vocal abuse or to remaining epithelium trapped inside the lamina propria. They have a lining of stratified squamous and keratinised epithelium and a duct that ends blindly on the inferior margin of the vocal fold.
Mucous-retention cysts develop from the obstruction of the glandular ducts caused by different conditions, such as voice overuse, laryngitis secondary to gastro-oesophageal reflux or upper-airway infections. The cavity is covered by a ciliated cylindrical epithelium. The cysts are more common in adults but can also be diagnosed in children (Martins et al. 2011).
Signs and Symptoms
Patients complain about increasing hoarseness of the voice. Dynamic and pitch range are restricted. Intra-cordal cysts appear as small spheres on the margins of the vocal folds and sometimes on the superior surface. They are predominantly unilateral. Epidermoid cysts are white, while mucus retention cysts are often yellowish. On stroboscopic evaluation, mucosal wave vibratory activity may be normal to disrupted, depending on the size of the cyst, with an hourglass-closure pattern.
Treatment
Phonosurgery is generally required for symptomatic cysts. The epithelium does not have to be resected but just divided to dissect the subepithelial cyst, and remove it completely.
Prognosis
Epidermoid cysts fill with mucus and grow with time. Sometimes the duct opens and the mucus drains off. The small duct may close and the cyst may fill up again. Ruptured cysts may generate sulcus vocalis or mucosal bridges (Bouchayer et al. 1985). When removed properly, the vocal ligament heals without sequelae. Cysts may recur when parts of the epithelium stay inside the vocal fold.
Case Study 5.11
A 42-year-old lawyer had hoarseness for 4 years. A cyst of the left vocal fold had been operated on 2 years ago but had now recurred. Therapy should be aimed at a complete removal of the encapsulated epithelium within the lamina propria. The voice sounds mildly rough and breathy (R1 B1 H1). Dynamic vocal range is slightly restricted, which is noticed by the patient only after an above-average vocal load.
5.4.2.2 Sulcus Vocalis
Definition
Sulcus vocalis is a narrow, linear depression in the surface of the vocal fold running longitudinally along the vocal folds. It is a deficiency of the lamina propria of the vocal fold and the covering epithelium, which is thin and not pliable.
Causes
The aetiology of sulcus vocalis is uncertain (Ford et al. 1996). Possible aetiological factors are congenital, developmental or traumatic. In the elderly a sulcus vocalis may develop after inflammation such as laryngitis or myositis.
Signs and Symptoms
The longitudinal extent of the furrow is variable, as is its depth (type I superficial, type IIa vergeture involving deeper layers of the lamina propria, type IIb similar to an open cyst (Giovanni et al. 2007)). It frequently disrupts normal vibration of the vocal folds. It is seen as a deepening or a line between the upper and lower edge of the vocal folds on laryngostroboscopic examination.
Treatment
It is extremely difficult to treat a sulcus vocalis successfully, especially after myositis. Several surgical methods have been described. In principle scarred tissue is mobilised, or the thin epithelium is separated from the ligament. Augmentation is used in some cases. As the original structure of the sulcus is not changed by augmentation, this procedure seems to be the least invasive and best function-preserving one.
Prognosis
Surgical results are often unsatisfactory owing to the lack of lamina propria substance, the attachment of epithelium to the vocal ligament and the frequently associated scar (Ford et al. 1996; Pontes and Behlau 1993).
Case Study 5.12
A 33-year-old student of theology had voice problems for more than 10 years. She is allergic to animals, moulds and some foods. Moreover, she has bronchial asthma and takes ciclesonide as an inhaled steroid. A cyst of the left vocal fold had been removed 2 years ago. The video shows scars on both vocal folds in the middle of the membranous part. The left vocal fold is thicker than the right one. There is a phase shift of about 45°, i.e. the left vocal fold moves later to the midline, with an incomplete closure. The voice range profile indicates that her vocal capacity is sufficient for a voice professional. The voice is normal (R0 B0 H0). Note a mild dyslalia in the form of a lisp, an interdental sigmatism.
5.4.2.3 Laryngocele
Definition
A laryngocele is an air containing dilation of Morgagni’s ventricle. It communicates with the ventricle via a narrow stalk. In cases of obliteration, the cavity fills with mucus. It expands either internally or externally or both.
Causes
Congenital laryngoceles are lined with respiratory epithelium and filled with serous liquid. They are located in the paraglottal space lateral to the vocal folds or in the epiglottic vallecula. Acquired laryngoceles develop after closure of the laryngeal ventricle by inflammation, trauma or dispersed epithelium in the ventricular folds. The cystic dilation of the laryngeal saccule is filled by secretion from the encapsulated epithelium.
Signs and Symptoms
Congenital laryngoceles lead to stridor and dyspnea. Acquired epithelial laryngoceles remain symptom-free over decades. The first sign of increase is hoarseness and only rarely dysphagia or dyspnoea. The contents may become infected and accumulate pus (laryngopyocele). This leads quickly to these symptoms, which are accompanied by pain.
The full clinical shape of laryngoceles mostly appears in adults. The ventricular fold, aryepiglottic fold or vocal fold is swollen. Even the epiglottic vallecula or piriform recess may be filled. The ventricular fold may cover the vocal fold and narrow the laryngeal entrance. In cases of expansion from the larynx through the thyrohyoid membrane, the external laryngocele may form a swelling of the neck. Internal laryngoceles are within the larynx itself and do not cross the thyrohyoid membrane. External laryngoceles penetrate the thyrohyoid membrane at the neurovascular bundle. The segment confined by the membrane is of normal size and connects normally with the laryngeal ventricle, whereas the portion outside the membrane is dilated. Mixed laryngoceles are dilated in both segments.
The secretion is serous in the beginning and thickens over time. It acquires a yellowish hue and resembles pus although bacteria are rarely detected. Laryngoceles are diagnosed by video-laryngostroboscopy. In cases of an external laryngocele, a CT scan may support the planning of surgery.
Treatment
Laryngoceles are resected via micro-laryngoscopy with a CO2 laser. During surgery care has to be taken not to injure the vocal folds. External laryngoceles can be pulled into the inner larynx during micro-laryngoscopy and then excised. Open surgery from the neck can be avoided in most cases (Rosen and Simpson 2008).
Prognosis
There is hardly a recurrence of laryngoceles when the lining epithelium is completely resected. Voice and respiration usually normalise and swallowing difficulties disappear.
Case Study 5.13
A 59-year-old pensioner noticed deterioration of her voice half a year ago without remarkable cause. Breathing and swallowing were not impaired. She has slipped discs between C5 and C7, suffers from an anxiety disorder treated by psychotherapy and has problems with her meniscus and nodes of the thyroid gland. None of these problems had anything to do with the laryngocele that inhibits the vibration of the right vocal fold, restricts the voice range and leads to a moderately hoarse voice dominated by roughness (R2 B1 H2). After laser surgery voice function was restored.
5.4.2.4 Contact Ulcer/Contact Granuloma
Definition
A contact ulcer or granuloma is primitive granulomatous tissue at the vocal process of the arytenoid cartilage arising from a defect in the mucosal lining.
Causes
The literature suggests that these two clinical problems are probably the same disorder and seem to result from one of three causes or a combination of these. The first cause is intubation trauma during surgery (intubation granuloma); the second is glottal trauma from abuse-misuse, and the third aetiology, possibly the largest group, is laryngopharyngeal reflux (LPR). The risk of having intubation trauma leading to contact ulcer/granuloma is particularly high in children and women, as they have smaller airways and are thus more often traumatised by large tubes (Whited 1979). Excessive slamming of the arytenoids during production of low-pitched phonation coupled with excessively hard glottal attack and possibly increased loudness with frequent throat clearing and coughing may result in contact injury of the vocal folds (Boone and McFarlane 2000). A far larger group of patients with contact injury seems to be in the class of LPR. Stomach acid is forced up the oesophagus and irritates the area between the arytenoids in reflux patients (Koufman 1991).
Other causes that are not evident in the literature may be conscious or unconscious mental stress and insufficient glottal closure due to atrophy of the vocalis muscle. The latter may cause involuntary increase of adduction forces between the arytenoid cartilages or crossing of the vocal processes of the arytenoids thereby fostering the formation of contact granuloma.
Signs and Symptoms
Patients with contact ulcer or contact granuloma may present with vocal fatigue accompanied with pain in the laryngeal area, severe dysphonia and frequent throat clearing.
Contact ulcers and contact granulomas generally arise from the vocal process of the arytenoid cartilage. They may also be above the glottal level when arytenoid apices contact during phonation. They may be on one side or appear on both and be complementary in shape, such as a hammer and anvil. The size may range from a shallow bowl-shaped area to a large exophytic ball lying on the vocal fold and thereby impeding vocal fold vibrations.
Treatment
In case the diagnosis has not been confirmed by histological examination, the tissue should be removed. This is best performed via micro-laryngoscopy with jet ventilation and the use of a CO2 laser. Voice therapy is helpful in case of habitually pressed voice to regain a normal voice.
Prognosis
Contact granulomas often recur shortly after excision. In this case repeated interventions should be made only when the voice is affected. Once the aetiological trauma and irritants that initiated the contact granuloma/ulcer are removed, they will spontaneously resolve within months.
5.4.2.5 Papilloma
Definition
Papillomas are wartlike growths originating from the epithelium of the vocal folds and the mucosa of the larynx after infection with human papilloma virus (HPV).
Causes
Recurrent respiratory papillomatosis is most commonly caused by and associated with HPV-6 and HPV-11 subtypes. It may be transferred during vaginal birth from mother to child. There is no evidence that it is transmitted through oral sex, and it is not considered a sexually transmitted disease.
Signs and Symptoms
Papillomas involving the larynx frequently occur in young children. They are the most common laryngeal tumour in children. Clinical presentation of laryngeal papilloma is progressive airway obstruction and dysphonia that may even progress to aphonia.
Treatment
Surgical treatment in most cases is laser surgery. In order to protect deeper layers of the lamina propria, papilloma should be vaporised only down to the epithelial level, not below it. In order not to implant epithelial cells into the vocal fold, subepithelial infusion should be applied before laser vaporisation. Some surgeons prefer conventional excision surgery in order to prevent scattering viable papillomatous cells into deeper layers of the vocal folds. The use of a micro-debrider is also possible but it should be handled with great care. Scars can easily form when the epithelium has been removed completely. Repeated surgery can lead to postsurgical webbing of the vocal folds and to vocal fold scar formation. Hence papillomas should only be removed when they interfere with the airway (Boone and McFarlane 2000; Wetmore et al. 1985) or cause loss of voice. Radiation therapy is obsolete.
Adjuvant medical therapies, such as interferon therapy, retinoic acid, indol-3-carbinol and cidofovir (Lundquist et al. 1984; Thompson and Kershner 2007) have been tried, but all failed to cure the disease.
In adult men, laryngeal secretions tend to become negative, and serum antibody titres tend to increase after vaccination with Gardasil. However, there was no significant difference between serum level of antibodies, negative laryngeal secretions and the presence of papillomatosis 1 year later (Hirai et al. 2018; Makiyama et al. 2017).
A promising line of therapy based on successful case reports includes systemic application of bevacizumab as well as cidofovir and epidermal growth factor receptor (EGFR) inhibitors such as gefitinib and erlotinib for emergency treatment when the airway is severely compromised and other therapeutic measures fail (Best et al. 2012; Fernandez-Bussy et al. 2018; Kalanjeri et al. 2017; Mohr et al. 2014; Nagel et al. 2009; Rogers et al. 2013).
Prognosis
Papilloma of the airway in children is serious, because of its recurrent and persistent threat to compromise the small airway. There is no proof that papillomatosis stops recurring about the time of puberty. There is no cure for this benign disease.
Case Study 5.14
A 66-year-old woman had been suffering from hoarseness for 6 months. In her history she had undergone surgery for a meningioma of the falx cerebri 3 years ago and has underactivity of the thyroid gland. The laryngeal examination revealed a papilloma of both vocal folds. Her voice has nearly disappeared. Vocal range is restricted to a mere whisper. After careful removal of the flat papilloma spread along the vocal fold margins, her voice improved towards near normal function.
5.4.2.6 Benign Neoplasia
Definition
Benign neoplasia is an uncontrolled propagation of vascular or neural cells. One may also find extracellular deposition of substances leading to vocal fold bulging and dysphonia.
Causes
The causes of benign neoplasia often cannot be determined.
Signs and Symptoms
Neurofibromas are the most common benign laryngeal tumours. They appear very similar to inclusion cysts and rarely cause symptoms. They are much firmer than inclusion cysts (Anderson 2007). Adult haemangiomas may arise from the glottis or the supraglottis. They tend to form uniform submucosal masses.
Treatment
Biopsy is needed for diagnosis, and they can generally be treated with complete surgical excision. Treatment is determined by the severity of dysphonia or airway obstruction (Hoffman et al. 2003). Asymptomatic laryngeal masses with a benign appearance can be observed without a biopsy in most cases.
Prognosis
Benign laryngeal neoplasms are rare and the course of disease cannot be predicted. Usually there is no recurrence after surgical treatment.
Case Study 5.15
A 46-year-old pastry cook who worked formerly as a kindergarten teacher had been suffering from voice impairment for 1.5 year. She is otherwise healthy and takes no medication. The video shows a markedly bulging left vocal fold. Surgery was planned under the suspicion of a laryngocele. However, it turned out to be a solid mass that was histologically diagnosed as a neurofibroma. The speaking voice is a bit higher than normal, the vocal range restricted. The voice sound is moderately hoarse (R2 B2H2).
Case Study 5.16
A 45-year-old pharmaceutical representative had been suffering from reduced vocal capacity and recurring hoarseness for several years. Laryngeal examination revealed a swelling of the right ventricular fold and a small oedema of the right vocal fold margin. Surgery was planned on the suspicion of a laryngeal cyst. However, during the intervention a solid mass was discovered under the surface which turned out to be bulging cartilage. Vocal range is restricted; the voice is mildly hoarse (R1 B1 H1).
5.5 Gastro-oesopharyngeal Reflux Influences on Larynx and Voice
5.5.1 What Is Gastro-oesophageal Reflux?
Gastro-oesophageal reflux (GER) occurs when stomach contents flow back up into the oesophagus. This reflux is a physiological phenomenon with an average of 50 episodes per day, mainly after meals. Gastro-oesophageal reflux disease (GERD) is a more serious, chronic—or long-lasting—form of GER. The backward flow of stomach acid into the oesophagus causes symptoms or oesophageal mucosal lesions. The diagnosis is based on pH monitoring studies. GERD is defined (Richter 2003) by up to 50 reflux episodes below pH 4.0 per day into the oesophagus or a time below pH 4.0 up to 1.5% of the whole time of the monitoring (24 h). Classical symptoms are heartburn (uncomfortable, upward burning feeling in the mid-chest, behind the breastbone), acid regurgitation and foul-smelling eructations. They may be absent in more than half the patients presenting with extra-digestive manifestations (Jaspersen et al. 2003). Other non-typical manifestations may involve the upper aero-digestive tract (hoarseness, cough, throat clearing, globus pharyngeus, laryngeal hyperirritability, laryngospasm, pharyngitis, otitis media, laryngitis, laryngeal pyogenic granuloma, glottal and subglottal stenosis, laryngeal carcinoma) or lungs (asthma, bronchitis, bronchiectasis, aspiration pneumonia, idiopathic fibrosis) (Galli et al. 2006; Balkissoon and Kenn 2012).
Physiological reflux occurring just after a meal. This is regurgitation of some of the stomach’s digestive juices and food without pain symptoms
Symptomatic reflux giving discrete signs of digestive dysfunction, such as heartburn and dyspepsia, the sensations of more or less acid reflux into the pharynx, which are perfectly tolerated. This is not a real disease because there is no complication
Pathological reflux, meaning that the patient suffers signs of hurting with the GER. These signs can be digestive or not, related to GERD or to laryngopharyngeal reflux (LPR). Proton pump inhibitor (PPI) therapy remains the cornerstone of treatment (Koufman 2002; Richter 2003)
Furthermore, heavy reflux may occur with aspiration of much material (mostly during sleep) leading to laryngospasm and threatening and frightening apnoea.
5.5.2 What Is Laryngopharyngeal Reflux?
Laryngopharyngeal reflux (LPR) refers to the backflow of stomach contents into the laryngopharynx. There are numerous synonyms for LPR in the medical literature; the most accepted of these terms is extra-oesophageal reflux.
Koufman (1991, 2002) demonstrated the existence of pharyngeal reflux by a double-probe pH monitoring (oesophageal and pharyngeal) performed on an ambulatory basis over 24 h. Of the 206 otolaryngology patients with GERD undergoing diagnostic pH monitoring, 62% had abnormal oesophageal pH, and 30% demonstrated reflux into the pharynx. Pharyngeal reflux was present in 58% of patients presenting laryngeal symptoms. Its duration was shorter and it arose more frequently in a vertical position. Lower oesophageal sphincter dysfunction and oesophageal motility disorders were frequently suspected. Other studies also support a role for LPR. For example, in patients with resected benign true vocal fold lesions, the prevalence of laryngopharyngeal reflux is higher than that of pathological gastro-oesophageal reflux (Beltsis et al. 2011). The diagnosis of LPR is often missed because symptoms associated classically with reflux, such as dyspepsia and pyrosis, are frequently absent because patients with LPR either do not develop oesophagitis or do not respond to acid reflux with typical symptoms such as heartburn (Ford 2005).
Despite these arguments, controversy exists because of difficulties in making the diagnosis. An LPR event is evident when pH in the proximal sensor abruptly drops to less than 4 during or immediately after distal acid exposure (exposure near the lower oesophageal sphincter). LPR is confirmed when total acid exposure time (percentage of time during 24-h monitoring when the sensor detects pH levels <4) is more than 1% (Kawamura et al. 2004). However, there are no standardised and normative data to consider pathological versus physiological reflux. That is why the most frequently used tools are the reflux finding score (RFS) and the reflux symptom index (RSI) developed by Belafsky et al. (2001b, 2002b). Currently, the two main diagnostic tools are laryngoscopy and reflux monitoring of LPR. On laryngoscopy, the signs most commonly used to diagnose LPR are erythema and oedema of the larynx; however, these signs are not specific for LPR, may be associated with other causes and may even be found in healthy individuals. In addition, pH testing has low sensitivity in diagnosing gastro-oesophageal reflux disease-related laryngeal findings (Abou-Ismail and Vaezi 2011).
The reflux symptom index (RSI) | The reflux finding score (RFS) | |
---|---|---|
Instructions | Within the past month, how did the following problems affect you? | Assign a grade of intensity to the different laryngoscopic signs |
Ordinal scale: 0–5 (0, no problem; 5, severe problem) | Ordinal scale, specific for each item | |
Items | Hoarseness or other voice problems | Infraglottic oedema (pseudosulcus). Scale: 0–2 0, absent; 2, present |
Clearing throat | Ventricular obliteration. Scale: 0–4 0, none; 2, partial; 4, complete | |
Excess throat mucus or postnasal drip | Erythema/hyperaemia. Scale 0–4 0, none; 2, arytenoids only; 4, diffuse | |
Difficulty swallowing food, liquid or pills | Vocal fold oedema. Scale 0–4 0, none; 1, mild; 2, moderate; 3, severe; 4, polypoid | |
Coughing after eating or after lying down | Diffuse laryngeal oedema. Scale 0–4 0, none; 1, mild; 2, moderate; 3, severe; 4, obstructing | |
Breathing difficulties or choking episodes | Posterior commissure hypertrophy. Scale 0–4 0, none; 1, mild; 2, moderate; 3, severe; 4, obstructing | |
Troublesome or annoying cough | Granuloma/granulation. Scale 0–2 0, absent; 2, present | |
Sensations of something sticking in the throat or lump in throat | Thick endolaryngeal mucus. Scale 0–2 0, absent; 2, present | |
Heartburn, chest pain, indigestion or stomach acid coming up | ||
Interpretation of the scores | An RSI >9 indicates patients symptomatic for LPR | An RFS >7 predicts a 95% certainty of patients with LPR |
By implementation in daily use, most patients may not need examination in the first-line assessment of LPR. These examinations can be reserved for nonresponders, restricting uncontrolled prescription of PPIs (Habermann et al. 2012).
However, disagreement persists regarding optimum diagnostic techniques, criteria of normality and treatment efficacy. Because of a paucity of convincing evidence regarding techniques for establishing a definitive diagnosis and causation in individual patients, and because of a plethora of imperfect studies that have produced conflicting conclusions, LPR diagnosis and management remain controversial (Sataloff et al. 2010; Hawkshaw et al. 2013). For this reason, new approaches are being explored, such as pepsin immunohistochemical staining of the laryngeal mucosal epithelia (Jiang et al. 2011).
5.5.3 Reflux and Larynx
Traditionally, laryngeal abnormalities may be caused by direct injury or by a secondary mechanism. Direct injury is due to contact of acid and pepsin with the laryngeal mucosa, resulting in mucosal damage (Koufman 1991; Delahunty and Cherry 1968; Cherry and Margulies 1968; Lillemoe et al. 1982). Alternatively, irritation of the distal oesophagus by acid may cause a reflex mediated by the vagus nerve , resulting in chronic cough and throat clearing, which may produce traumatic injury to the laryngeal mucosa (Ramet 1994; Sacre and Vandenplas 1989). Recently, some authors have been focusing on the specific molecular aetiopathology of LPR. Researchers have considered that the failure of intrinsic defences in the larynx may cause changes in laryngeal epithelium, such as alterations in carbonic anhydrases and E-cadherin (Wood et al. 2011). Mucin expression also varies according to the severity of reflux. Moreover, they have discovered the presence of H+/K+-ATPase pumps around the submucosal glands of the human larynx. The lack of carbonic anhydrase in the epithelium of the vocal folds in cases of LPR supports the hypothesis that reflux-related damage mainly affects the glottal area. Finally, pepsin is often found on laryngeal epithelial biopsy and in sputum of patients with pH test-proven GERD and symptoms of LPR. Pepsin, at pH 7, in non-acidic refluxes causes damage by becoming reactivated inside the cell. Inhibitors of peptic activity hold promise as a new therapy for reflux (Wassenaar et al. 2011; Johnston et al. 2010). Further research is required to identify a definitive mechanism for mucosal injury.
Posterior laryngitis (Kamel et al. 1994; Garrigues et al. 2003; Ulualp et al. 1999; Ylitalo et al. 2001)
Subglottal stenosis (Little et al. 1985)
Pseudosulcus (see Fig. 5.8a and in comparison sulcus in Fig. 5.8b) and partial effacement or obliteration of the laryngeal ventricle (Belafsky et al. 2002a; Belafsky 2003; Hickson et al. 2001)
Suggestive laryngeal aspects
On the other hand, oedema of the interarytenoid mucosa seen on endoscopy is related to endoscopic-positive oesophagitis (EE) and is an independent predictor of EE.
Unfortunately, laryngoscopy cannot be considered as a tool determining the diagnosis of reflux because of the poor intra- and inter-assessor agreements of the scores developed to establish the clinical laryngoscopic diagnosis (Sataloff et al. 2010; Branski et al. 2002; Welby-Gieusse et al. 2008). The diagnosis reflux laryngeal disease remains too examiner-dependent.
Moreover, the association of several factors may lead to the apparition of specific laryngeal lesion. For example, mechanical laryngeal trauma and reflux are factors provoking granuloma; reflux and inhaled corticosteroids can promote fungal laryngitis.
According to recent advances, it is assumed that the gastro-oesophageal reflux (GER) is a possible co-promoting factor for squamous cell carcinoma development in the upper parts of the gastrointestinal and respiratory systems, considering the higher frequency of acid-dependent lesions in the population suffering GER. Laryngopharyngeal reflux appears to be an important risk factor in the development of laryngeal carcinoma. Some authors have demonstrated that gastric reflux is an independent risk factor for laryngopharyngeal carcinoma (Ozlugedik et al. 2006; Mercante et al. 2003; Langevin et al. 2013; Coca-Pelaz et al. 2013), while the prevalence of Helicobacter pylori is lower than the prevalence of LPR and is not correlated with the risk of cancer (Cekin et al. 2012).
Finally, reflux dysphonia is possible without posterior laryngitis in laryngoscopy.
5.5.4 Reflux and Voice
GER is a vocal risk factor often entangled with other factors such as stress, poisoning from smoking and vocal overload. It is systematically sought in vocal disorders, but the multifactorial aspect of dysphonia makes difficult the precise determination of the role of each factor. Therefore, if the reality of a causal relationship between the GER and laryngeal symptoms is no longer discussed, highlighting it in clinical practice remains difficult.
The problems of voice affect more than 30% of the population at some period of life. The GER is also a very frequent phenomenon. Its prevalence in adults is from 30 to 40% for one episode at least monthly and from 5 to 10% for a daily episode. Prevalence of the extra-digestive symptoms is estimated at 30% (Jaspersen et al. 2003). Recently, Randhawa et al. (2009) highlighted the risk of masking of GER by other factors, such as allergy and food intolerance, that lead to overestimation of the role of the reflux. Allergy, more specifically allergic rhinitis, affects approximately 24% of the population. These authors note the very high frequency of this effect and the difficulties it presents to highlight causal relations between these various other entities.
Thus, in the presence of an ENT manifestation of reflux, the physician has to adopt a holistic approach, analysing the various factors contributing to the emergence of the symptom.
This situation leads to an overdiagnosis of laryngopharyngeal reflux as the cause of hoarseness (Thomas and Zubiaur 2013).
Regarding the treatment, one proposal is to plan empirical anti-reflux treatment according to the reflux index system (RSI) and reflux finding score (RFS). There is no agreement about the modalities of the therapeutic test, but some authors recommend the use of a double dose of a proton pump inhibitor for 3 months.
Proton pump inhibitors (PPIs) are medications that are ubiquitous in a gastroenterologist’s practice.
Their safety among pharmacological agents has been recognised despite the reports of potential adverse effects associated with their use. These potential interactions have ranged from alteration of the absorption of vitamins and minerals, metabolic effects on bone density, alteration of pharmacokinetics/pharmacodynamics and related drug interactions and alterations of intended effect, infection risk to hypersensitivity responses with consequent organ damage (de la Coba et al. 2016).
In the study of Naiboglu et al. (2011), anti-reflux treatment led to significantly reduced laryngopharyngeal symptoms and signs. Longer times of treatment may be needed for complete resolution of symptoms. This method seems to be effective, but the level of proof remains low because of the lack of placebo controls. Therefore, many physicians empirically prescribe reflux medication as a primary therapy, even when symptoms of gastro-oesophageal reflux disease are not present (Ruiz et al. 2014). The current management recommendation for this group of patients is empirical therapy with twice-daily PPIs for 1–3 months. In the majority of those who are unresponsive to such therapy, other causes of laryngeal irritation should be considered. Surgical fundoplication is most effective in those who are responsive to acid-suppressive therapy.
In broad outline, three situations can be met in phoniatric practice with a therapeutic proposal, as communicated here.
First Situation
The GER is only retained as an associated risk factor. The main factor may be dysfunctional, psychological or anatomical. The treatment is based on the hygiene and dietary measures associated with a medical treatment according to the degree of posterior inflammation of the larynx. Taking into consideration the GER in case of laryngeal surgery is important and may lead to a preventive treatment before and after the surgery.
Second Situation
Finally, in a context of intubation, reflux favours complications such as granuloma and laryngeal synechia.
Medical treatment is necessary by inhibiting the proton pump. A tri-therapy can be proposed: antacids, prokinetic and antisecretory. It is always associated with the hygiene and dietary measures and with the management of the various associated vocal risk factors, to avoid relapse upon stopping the treatment.
Third Situation
Another aspect of the question is the physiological interaction of speech or singing with reflux. Vocal load without singing is probably not an important aetiological factor for laryngopharyngeal reflux. However, the increase of intra-abdominal pressure and stress appears to increase GER in singers (Cammarota et al. 2007), and some authors have described that laryngopharyngeal reflux affects choristers more often than teachers or control subjects without vocal load at work (Hočevar-Boltežar et al. 2012). These results suggest that singing as the main professional activity can notably contribute to the development of the reflux.
Case Study 5.17
A 42-year-old driver at an embassy had an operation on the larynx 6 years ago. For about 2 years, he had noticed a ‘flap’, a foreign body, during breathing. A biopsy taken 2 weeks ago revealed an unspecific granulation polyp. The patient has type 2 diabetes and takes metformin. Contact granuloma may be associated with reflux, but not necessarily. The video shows a typical contact granuloma on the left that during adduction extends above and below the right vocal process on the opposite side and inhibits complete closure. Note that there is also a small inframarginal cyst on the right side. Vocal range is markedly reduced; the voice is mildly hoarse (R1 B1 H1).
Case Study 5.18
A 62-year-old housewife has always had a dark voice. After an infection of the upper airways her voice had become worse. She suffers from reflux. Her medication is pantoprazole because of reflux, thyroxine because of a thyroidectomy and clopidogrel because of heart attacks. She stopped smoking a year ago. The video shows chronic laryngitis. The epithelium and the inter-arytenoid region are swollen and keratotic. The vocal range is reduced; the voice quality mildly hoarse (R1 B1 H1).
5.5.5 Conclusion
It can be assumed that reflux that reaches the larynx can damage it. Because the symptoms possibly related to LPR can be linked to other causes, a careful consideration of the patient’s medical history is of utmost importance, followed by mandatory laryngoscopy. Several laryngeal and vocal risk factors are often present in voice disorders, highlighting a frequent multifactorial context. This level of complexity explains the poor level of scientific proof and why the role of GER remains a ‘myth’ fed by practical clinical observations. The search for the different factors involved and their organisation into a hierarchy allow the determination of a strategy of care, by ranking the various therapeutic options.
Diagnostic procedures, such as the trial administration of proton pump inhibitors or long-term pH measurement, should be used selectively. Depending on the individual symptoms, lifestyle changes, breathing or voice therapy or PPI therapy might be useful.
5.6 Central Neurogenic Voice Disorders
5.6.1 Introduction
This section focuses on central neurological issues. For further information on the peripheral neurological system, please see Sects. 1.2 and 5.7.
In the twenty-first century, voice remains the primary means of human communication, in spite of the massive increase in the use of electronic devices. With the advent of Skype and telemedicine communications, it is the authors’ opinion that an increasing number of currently non-verbal teletextual types of electronic communication will incorporate the human voice owing to the richness of the signal that it includes.
Sound signals are used to communicate intellectual content, linguistic and paralinguistic content and emotional content. There is little doubt that the voice and the vocal tract are interlinked to emotions and feelings. It is a common experience that the phone will ring, the listener will hear a sound (that is frequently frequency-limited) and within a matter of moments, even if the speaker has not been heard from in many months or years, the listener will be able to both recognise the listener, the paralinguistic as well as the linguistic message, and respond. This response may well be influenced by a sudden change in the listener’s autonomic system as well as central nervous system. How can this occur?
In an attempt to respond to the question posed above, we need to go back to basics. The brain has approximately 100 billion neurons. These include nerve cells and glial (support) cells. The neurons, in essence, act as ‘on-off’ switches. When ‘on’ they transmit nerve signals-electrical signals/current. Neurons are made up of a cell body (soma), nucleus, dendrites (that can branch multiple times and communicate with other cells) and axons (that can be very long (metres)). Synapses connect the neurons; they are found between somas and dendrites of nerve fibres or axonal terminals. Each neuron has between 1000 and 10,000 synapses. When an action potential reaches a synapse, it triggers the release of tiny amounts (quanta) of neurotransmitter causing depolarisation of the postsynaptic region. There are hundreds of neurotransmitters, both excitatory and inhibitory. Well-known ones include dopamine, serotonin, acetylcholine, noradrenaline and gamma aminobutyric acid (GABA). Whether or not an action potential is generated in the target neuron will depend on a variety of factors, including the integration and summation of inhibitory and excitatory synaptic inputs to that cell. Thus the heart of neural function is part electrical part chemical.
Information tends to reach the brain via neural networks. A typical, rather simple, but powerful neural network is that of the laryngeal adductor reflex. As an example, imagine that the readers were eating crisps at the time they were reading this section and a crisp accidently irritated one side of the epiglottis or lateral pharyngeal wall. This irritation would elicit neurons to fire, via the internal branch of the superior laryngeal nerve, and neural activity will go up to the nodose ganglion and then to the ipsilateral nucleus tractus solitarius. At this point there would be interneural connections to both nucleus ambiguus. This would effect a bilateral motor response to the internal laryngeal muscles with a resultant glottal closure reflex. This reflex arc is very fast, the conduction velocity in the extracranial nerves being approximately 60 m/s, leading to a glottal closure response in less than 60 ms from the stimulating event. Had the stimulating event been at the glottis or subglottal level, the effector would have been via the recurrent rather than the superior laryngeal nerve.
The surface of the brain is grey-brown, the colour of the neuronal cell bodies (grey matter) and the neuropil (dendrites and unmyelinated axons). Grey matter regions are typically involved in muscle control and sensory perception. They can be thought of as the Chief Executive Officers and Trustees, performing executive functions of the brain, processing information. Deep to this lie the axon layers with white sheath (myelinated). This is the white matter and can perhaps be thought of as the personal administrators and middle managers—white matter tends to carry information.
Historically the brain was anatomically divided into separate lobes that were functionally distinct, whereby the frontal lobe is involved in speech planning and reason and contains the primary and supplementary motor cortices; the parietal lobe in sensory perception and praxis, with sensory feedback from the body; the temporal lobe with speech perception, some memory and hearing; the occipital lobe with the visual cortex; the cerebellum with movement and balance; and the brainstem with breathing and swallowing. The brain also has two hemispheres connected by the corpus callosum. In general the right hemisphere is said to deal with visual activities. It is more ‘intuitive’ than the left hemisphere and can be thought of as putting information together, grouping it. For example, if a car is viewed, the right hemisphere activity might be to note ‘I recognise that, it is a car’. The left hemisphere is said to be dominant in 95% of the population. It is more analytical than the right hemisphere, analysing the information collected by the right hemisphere. It also tends to deal more with language, understanding and expression.
5.6.2 Language
In terms of language, the left temporal and frontal areas are dominant in 95% of individuals for language. Please see Figs. 1.47 and 1.48 in Sect. 1.2, demonstrating areas active in language on the brain surface (Fig. 1.47) and functional anatomy of language processing (Fig. 1.48). The right side is somewhat equivalent for environmental noise, spatial skills and prosody (Nasreddine et al. 1999). Wernicke’s area is one of the two parts of the cerebral cortex that have been linked to speech since the nineteenth century. It is located in the posterior section of the superior temporal gyrus in the left (or dominant) cerebral hemisphere and encircles the auditory cortex on the Sylvian fissure. It is neuroanatomically described as being in the posterior part of Brodmann’s area (BA 22). Wernicke’s area is the site that on functional MRI studies is most consistently implicated in understanding of written and spoken language/word recognition (DeWitt and Rauschecker 2012). Broca’s area is the other classically recognised speech-related area. It is a region of the frontal lobe of the left hemisphere usually identified as the pars opercularis and pars triangularis of the inferior frontal gyrus, neuroanatomically as Brodmann’s area 44 and 45 (BA 44, 45). It abuts on and instructs the motor cortex regarding articulation and speech.
Previously it was believed that information passed from Wernicke’s to Broca’s area through the arcuate fasciculus, a band of white matter deep to the supramarginal gyrus connecting both language areas, but this is in dispute. Broca’s area initiates a motor plan that is transmitted to the primary motor cortex (Brodmann’s area 4) to pronounce the words. The motor cortex, in coordination with the supplementary motor area, basal ganglia and cerebellum, sends corticobulbar fibres to implement speech sounds (Nasreddine et al. 1999). More recent functional imaging has supported a larger range of processing areas for speech reception and language processing, and the last word has certainly not been written on this subject (Newman et al. 2010).
According to Jürgens and colleagues, in many mammals there is a visceromotor subcortical network of brain regions dedicated to phonatory species-specific calls. In a sense the peri-acqueductal grey can be considered at the core, regulating neurons of the lower brainstem via the reticular formation from the lateral pontomedullary- and limbic-related regions. There are cortical laryngeal projections from the anterior cingulate gyrus (Jürgens 2002a, b, 2009). In humans, some aspects of emotive voicing exploit this older pathway. However, for linguistic and paralinguistic speech, there are more direct corticobulbar projections to the reticular formation in effect bypassing the limbic system. Temporal lobe activity is critical for auditory self-regulation. The time from a thought to sound is approximately 130 ms.
In the mammalian brainstem the nucleus ambiguus and retroambiguus are of particular importance from an effector standpoint for laryngeal and expiratory muscle control. The midbrain and cerebellum are substantially involved in feedback and timing (Benninger et al. 2006; Jürgens 2002a, b, 2009; Kuypers 1958).
5.6.3 Neurolaryngology History-Taking
History taking in neurological voice disorders is of paramount importance (Woodson 2008). Temporal development of the disorder, symptom description, associated features, handedness, associated intercurrent or past medical disease, past psychiatric history and family history have all been highlighted as integral to the neurological history (Nasreddine et al. 1999). We also feel that emphasis should be placed on aspects relating to the psychosocial history, as this is often the key to successful management and rehabilitation.
Vocal fatigue, vocal weakness, pain when speaking, increased effort relating to speech, glottal tightness, pitch breaks and tremor all point towards a neurological cause of the disorder (Woodson 2008). Changes in normal vocal quality (strain/strangle, harshness), vocal effort (range, breathiness, fatigue), vocal pitch (monopitch, loss of stability of pitch with yodelling or pitch breaks, etc.) and vocal tremor should all be ascertained and may point to problems at the level of the vocal folds. Speech-related problems such as articulatory issues, slurred or unintelligible speech, velopharyngeal insufficiency with hypernasality point towards bulbar or suprabulbar pathology. Woodson (2008) notes that hoarseness in isolation is far less likely to point to a neurological cause than when associated with dysphagia. Again the history is crucial.
5.6.4 The Clinical Examination: General
When evaluating a patient with a suspected neurological voice (or swallow) disorder, the patient will often come in with the diagnosis already substantiated. None the less, the first assessment by the ENT surgeon/phoniatrician should be global rather than ENT-specific; otherwise it is easy to miss a critical clue to diagnosis and management. In particular affect, gait, posture and coordination, movement (spontaneous, tremor, etc.), reflexes and tone all need reviewing, before the more-focused ENT/phoniatric examination. Let us review these.
Affect
Flattening of the intonation could be indicative of depression. It can also be seen in neurological disorders affecting the frontal lobes and subcortical white matter. Parkinsonian movement disorders cause hypomimia with a paucity of facial expression. Hypothyroidism should be considered. A labile affect could be seen in a bipolar disorder, and potentially also in thyrotoxicosis. It can also be seen in a variety of neurological disorders, but it should be remembered that certain psychoactive medications (e.g. dopaminergic agonists) can cause behavioural abnormalities.
Gait, Posture and Coordination
Gait and posture in Parkinsonism are characterised by small steps (marche a petit pas) with successively more rapid steps (festination). Posture is not uncommonly forward-held with the spine bent forward, head down, immobile (akinesia) and rigid. Gait and posture in individuals with cerebellar disease can be broad-based, with a coarse trunk tremor (titubation). Gait is often ataxic with lurching from side to side or staggering. Patients with Huntington’s chorea will have continuous twisting movements of the trunk and limbs that may be worsened by walking. Sufferers with muscular dystrophy often have weak trunk and lower limb muscles. As they rise up from a chair, a characteristic posture is to flex at the hip and push upwards with help from the hands on the upper thighs (Gower’s sign). Owing to pelvic weakness, they often ‘waddle’ when walking forwards with legs spread and shoulders sloping down (Gilman 1998).
Coordination abnormalities are common in individuals with cerebellar pathology, whereby finger-to-nose testing is problematic, sloppy and dysmetric with a side-to-side tremor coming from the shoulder. In Parkinsonism coordination may be slow and hypometric. In corticospinal tract disorders, it may be slow and clumsy (Nutt 1999).
Movement
Spontaneous movement can be seen in individuals with basal ganglia disease (e.g. chorea, dystonia, rest tremor). Patients with Huntington’s chorea may have spontaneous grimacing of the facial muscles. Patients with Tourette’s may have spontaneous facial movements as well as spontaneous vocal tics. Craniocervical dystonia patients may have involuntary movements of the facial or neck musculature. On the other hand, patients with Parkinson’s typically have akinesia, with difficulty in initiating movement.
Tremor
Tremor at rest is consistent with Parkinsonism or extrapyramidal disease (Nasreddine et al. 1999); intention tremor is suggestive of cerebellar disease.
Reflexes and Tone
Hyper-reflexia and spasticity are upper motor neuron signs, consistent with corticospinal tract disorders. Spastic paralysis involving one side of the body in association with spastic type speech points to a lesion of the subcortical white matter and internal capsule. Hypo-reflexia is commonly seen in association with peripheral nerve diseases causing weakness or proprioceptive sensory loss.
As part of the ENT/phoniatric examination all of the cranial nerves should be tested. The ears should be examined to make certain that cerumen impaction or middle ear pathology is not impacting upon the patient’s ability to hear. ‘Glue’ ear may also be indicative of pathology of the Eustachian tube, post nasal space, nose or the soft palate. The mouth and oropharynx are of particular importance as many patients with neurological disorders present with dysarthrophonia. The lips should be evaluated for adequate ability to seal the tongue for fasciculations (often a sign of neurodegenerative pathology such as motor neuron disease) and for fullness of movement. Skull base lesions could lead to hypoglossal nerve (XII cranial nerve) dysfunction. Subtle tongue dysfunction, particularly after repeated movement could ‘unmask’ a neuromuscular junction pathology such as myasthenia gravis. Tongue dysfunction is a not infrequent sequela of stroke. The soft palate should be looked at for evidence of clonus (such as seen in myoclonus). Assessment of voluntary function is also necessary. Loss of such function is not uncommon in stroke or severe head injury and can be associated with velopharyngeal dysfunction as well as speech issues. Soft palate, tongue and facial muscle problems can lead to dysphagia as well as speech disorders.
The key ENT/phoniatric examination in anyone suspected of neurological voice, speech or swallow disturbance is flexible nasendoscopy, preferably in combination with stroboscopy (Woodson et al. 1991). En route to the larynx the nose, post nasal space and soft palate can be comprehensively studied for altered anatomical and physiological function. The degree of closure of the soft palate against Passavant’s ridge can be assessed and the integrity of the velum confirmed.
The larynx should be observed at rest as well as in function. At rest, clonic or other spontaneous activity can be looked for. In function, gentle adduction, sphincteric closure, cough, quiet breathing and vigorous abduction are all reviewed. The larynx is ‘stressed’ by repeated counting and repeated ‘e-sniff’ manoeuvres. This permits the examiner to look for laryngeal muscular fatigue or incompetence or both related to work, a sign of neuromuscular junction pathology. Asymmetry of abduction or adduction over time may be a subtle sign for vocal fold paralysis or paresis. General laryngeal incoordination may be a sign of suprabulbar pathology. The use of stroboscopy allows routine qualitative assessment. Of particular importance from the standpoint of management is identification of bowing or incompetent valving/closure of the glottis. This can be addressed with speech therapy intervention or surgical augmentation.
Laryngeal EMG is of great importance. It can help determine if functional recovery is feasible or if there is ongoing neuronal degeneration. It can also assist in determining which nerves have been injured. In certain forms of myopathy and motor neuron disorders, it can be diagnostic. If botulinum toxin is being considered for a spastic or spasmodic condition, EMG will assist the surgeon in locating the tip of the injecting needle (Sulica et al. 2006; Blitzer et al. 1985; Brin et al. 1992).
5.6.5 Neurological Speech and Voice Disorder Terms
As we begin to discuss specific disorders, certain terms are helpful. Dysphasia or aphasia is a partial or complete impairment of the ability to communicate, resulting from brain injury, usually brought on by damage to the cortex. Aphasia types include, amongst others, expressive aphasia, receptive aphasia, conduction aphasia, anomic aphasia, global aphasia and primary progressive aphasias. In essence expressive (Broca’s) aphasia is associated with the expressive difficulty in word finding and putting words together. Patients with expressive aphasia have lesions to the medial insular cortex (Dronkers et al. 2007; Masdeu 2000). Receptive (Wernicke’s/sensory) aphasia is associated with the inability to comprehend meanings. It is associated with damage to the medial temporal lobe and underlying white matter (Kolb et al. 2003).
Verbal apraxia or dyspraxia is an oro-motor speech disorder characterised by the inability to speak correctly and consistently. It is not due to weakness or paralysis of the speech musculature individually; conversely, it affects the purposeful control of the movements necessary for speech. Verbal dyspraxia may occur in children, and thus it is also known as developmental articulatory dyspraxia.
Ataxia is loss of muscular coordination. Ataxia-phasia is the inability to form connected sentences—only single intelligible words. Dysarthria signifies a disturbance of articulation generally due to paralysis, incoordination or spasticity of the muscles used for creating spoken language. Dysarthrophonia is a speech dysfunction that affects respiration, phonation, articulation and prosody of speech.
Case Study 5.19
This is a case of dysarthrophonia in a 72-year-old woman who had noticed a fading voice for the last 5–6 years. The larynx shows a tremor that cannot be suppressed at will. The voice is shaking during examination with the rigid endoscope. The voice range profile reveals that she is neither able to produce soft sounds nor loud tones.
5.6.6 Clinical Approach
In our Neurological Voice Clinic at the National Hospital for Neurology and Neurosurgery, we find it helpful to consider voice disorders from the standpoint of their effects at the level of the larynx (and speech disorders from the standpoint of their effects on the vocal tract especially the articulators). To that end we use a modified version of the Ramig classification (Smith and Ramig 2006). From a laryngeal standpoint, we look at vocal fold adduction/abduction. In this portion of the section, we shall focus on certain disorders with hypo-adduction, hyper-adduction and mal-adduction.
5.6.6.1 Hypo-adduction
This section is aimed primarily at central causes of neurological voice disorders. Consequently, peripheral disorders of the muscle, neuromuscular junction, peripheral nerve and nucleus, all relatively common causes of hypo-adduction will not be reviewed here; peripheral neurogenic voice disorders can be found in Sect. 5.7.
Parkinsonism
Parkinsonism is a constellation of disorders that have hypo-adduction as a typical laryngeal finding. More than 80% of these cases are caused by Parkinson’s disease, which will be the focus of this discussion. Parkinson’s disease is a disorder of the nigrostriatal region characterised by dopamine deficiency (Hornykiewicz 1966). Typical findings include bradykinesia, rigidity, postural instability and resting tremor of 4–7 Hz (Jankovic and Stacy 1999). Prevalence is 1 per 1000, but in the elderly it is closer to 1 per 100 population. Approximately 1.5 million individuals are known to suffer from Parkinson’s disease in the United States and ten million worldwide, and approximately 90% are said to have voice problems (Logemann et al. 1978; Hartelius and Svensson 1994; Ho et al. 1998, 2008; Sapir et al. 2001). That said, a study in Boston identified some Parkinsonian features in 34% of the elderly (Bennet et al. 1996). Voice symptoms appear early in the disorder and are often progressive (Harel et al. 2004; Rusz et al. 2011; Holmes et al. 2000). From a vocal perspective, the voice is often soft, breathy and lacking in projection. It also lacks pitch variation, sounding monotonous. The laryngeal exam often shows bowing of the vocal folds with incomplete apposition. Articulation is also often poor, as is intelligibility (Smith and Ramig 2006). Rate of speech is high. EMG studies have shown reduction in activation and increase in variability of firing of thyroarytenoid muscles (Baker et al. 1998; Luschei et al. 1991); this is believed to be due to reduced central drive. Management with levodopa treatment has not proven consistently beneficial to voice and speech production; neither has deep brain stimulation (Ho et al. 2008; Skodda 2012). Augmentation of the vocal folds has been tested. Temporary improvement with collagen injection was found. Sewall et al. (2006) and Berke et al. (1999a, b) found similar benefits.
Although traditional speech therapy has proven to be of little benefit, the Lee Silverman technique has now been used for over 20 years with proven efficacy in patients with Parkinson’s disease. Benefits have been demonstrated in sound pressure level and subglottal pressure (Ramig et al. 2002).
Parkinson-plus syndromes are multisystem neurodegenerative disorders that incorporate Parkinsonian features with other neurological symptoms. These syndromes include progressive supranuclear palsy (PSP), multiple systems atrophy (MSA) and corticobasal degeneration (CBGD) and are due to abnormalities in tau- and alpha-synuclein function (Sjoström et al. 2002; Williams and Lees 2009). In PSP, dopaminergic, cholinergic and adrenergic neurotransmitter systems are involved; in MSA the Parkinsonism is poorly responsive to levodopa therapy together with varying degrees of autonomic, cerebellar and pyramidal dysfunction.
CBGD is distinctive owing to the Parkinsonism plus certain specific cortical signs as well as limb apraxia which at times is so dysfunctional as to be called ‘alien limb’ (Doody and Jankovic 1992). Voice and speech symptoms are common and early features, particularly in MSA, consisting of mixed type dysarthria with hypokinetic as well as spastic components (Skodda et al. 2010). Blumin and Berke (2002) found five of seven patients presenting with neurological disorders and with bilateral abductor vocal cord paresis to have MSA.
Case Study 5.20
A patient received deep brain stimulation because of Parkinson disease at the age of 70. His voice became weak in the course of the disease during the last years. In his history there is a myocardial infarction years ago and diabetes type 2. His medication comprises L-DOPA, insulin, acetylsalicylic acid and a beta-blocker. The vocal folds are bowed with atrophy of the vocalis muscle and lamina propria. The closure pattern shows a complete glottal gap. The mucosal wave is reduced. The voice sounds monotonous, breathy and faint. The vocal range is severely restricted.
Case Study 5.21
A 62-year-old former shop assistant had been suffering from hoarseness, articulation disorder, dyspnoea on exertion and swallowing difficulty for 1 year. She reports mild reflux without heartburn, has bronchial hyperreactivity and had overcome an episode of depression. By the time of examination, restricted mobility of the orofacial musculature, the soft palate and the larynx were present. No other neurologic symptoms were found. Examination of pharyngeal and laryngeal structures reveals reduced constriction of the nasopharynx, incomplete velar closure, bowed vocal folds and attenuated abduction of the left vocal fold. These findings are typical for multiple system atrophy of central origin. During phonation, the vocal folds vibrate and the closure pattern shows complete closure. Speech is dominated by slow, slurred articulation and voice quality by roughness (R2 B1 H2). The vocal range is severely restricted.
Closed Head Trauma (Traumatic Brain Injury)
Among a constellation of other signs and symptoms, closed head trauma is a cause of speech disorders. It is a major cause of death and disability, particularly among both children and young adults. The most common areas to have focal lesions in non-penetrating traumatic brain injury are the orbitofrontal cortex and the anterior temporal lobes. Aphasia or dysarthrophonia can occur when somewhat less commonly affected areas such as motor or language areas are, respectively, damaged. In one study of closed head-injured patients requiring admission to a rehabilitation unit, 87% had speech-related issues. Thirty percent were aphasic, 25% had motor speech disorders, and 32% had mixed speech deficits (Menon et al. 1993). In another study, McHenry et al. (1994) found reduction in laryngeal airway resistance and concomitant increase in glottal airflow, likely consistent with hypo-adduction and breathiness.
Motor Neuron Disorders (MND)
Motor neuron disorders are a group of neurodegenerative disorders (motor neuronopathies) that include amyotrophic lateral sclerosis (ALS), progressive muscular atrophy, primary lateral sclerosis and progressive bulbar palsy. These disorders all affect the motor neurons and have generalised and usually progressive voluntary motor disorders, including swallowing, speech and voice. The muscles supporting expiration are also generally very weak. Voluntary muscle atrophy and fasciculations are common. Dysarthria, dysphagia and dyspnoea are frequent. From a voice perspective, patients suffering from MND often have weak breathy voices, particularly in bulbar onset, but they can also present with spasticity and hyperfunction. Once the respiratory muscles become involved, the voice is likely to become progressively breathy. Articulation also becomes more problematic owing to denervation of the muscles. ALS is the most common of the disorders. Other ENT/Phoniatric manifestations often include velopharyngeal insufficiency as well as swallowing problems and difficulty with managing secretions. Roughly one quarter of cases present with bulbar onset ALS. In this group ENT/phoniatric symptoms may predate limb symptoms. A smaller group will present with respiratory onset ALS whereby the intercostal muscles weaken first, leading to breathing issues. There is both upper and lower motor neuronal degeneration. At times tracheotomy is required along with management of secretions. Management is generally supportive (Smith and Ramig 2006; Rowland 2000).
5.6.6.2 Hyper-adduction Disorders
Hyper-adduction disorders can be reviewed from the standpoint of constancy of strain and lack of constancy (Brin et al. 1992). This includes the ‘pseudobulbar’ or supranuclear palsies, the choreas and the dystonias.
‘Pseudobulbar’ or Supranuclear Palsies
Pseudobulbar palsy typically refers to injuries (usually bilateral) of the corticobulbar tracts. It is characterised by spastic dysarthria (also called upper motor neuron dysarthria) (Nasreddine et al. 1999). Speech and swallowing disorders are due to bilateral injury of these pathways. The interruption may occur at any point from the cortex to the brain stem, but most commonly the interruption is due to ischaemic injury. Clinical findings indicative of cerebral lesions producing these pseudobulbar symptoms may include aphasia, sensory changes and corticospinal tract findings such as limb spasticity, increased tendon and jaw jerk reflexes and an extensor plantar response. Patients with pseudobulbar palsy also may have impaired emotional control, with pathological laughter or crying, and emergence of primitive reflexes such as suck and snout (Hermanowicz and Truong 1999).
Choreas
Huntington’s chorea is the example of chorea that we shall use. It is an autosomal dominant disease, caused by an unstable trinucleotide repeat mutation on chromosome 4. Incidence is approximately 5–10 per 100,000 (Huntington’s Disease Collaboration Research Group 1993). Pathological findings in the brain include generalised atrophy with cortical neuronal degeneration. Marked caudate atrophy is the pathological hallmark of the disease (Penney and Young 1998). Almost any part of the body can be involved with jerky involuntary sudden movements and motor impermanence. This can also affect the laryngeal muscles leading to sudden pitch and loudness changes that generally have a strained quality to them. General medical management includes use of dopaminergic depleters such as tetrabenazine. Laryngeal botulinum toxin injections have also been tested in trials with some success.
Other genetic causes of chorea are rare, including some of the spinocerebellar ataxias, Friedrich’s ataxia, brain iron accumulation disorders, etc. Acquired causes include some HIV infections in association with cryptococcal disease, Sydenham’s chorea and a group of other disorders.
Spasmodic Dysphonia
Although relatively uncommon (approximate incidence 1:60,000 to 1:100,000), there is a substantial body of ENT-related information on spasmodic dysphonia (SD), most likely due to our ability to treat symptoms with botulinum toxin injections. Spasmodic dysphonia is a dystonia that affects the laryngeal muscles. Dystonias are a family of movement disorders characterised by abnormal involuntary movements of voluntary muscle groups. Classification depends on age of onset and distribution of the dystonic activity. In some cases these involuntary contractions only occur during certain task-specific activities. In the larynx this tends to manifest as speech-related pitch breaks. Mutations have been identified for several familial forms of dystonias (e.g. DYT1, DYT4, DYT6). These tend to have clinical manifestations in childhood. Cases of focal adult onset SD are rare, and a genetic locus for SD has yet to be found (Blumin and Ludlow 2014; Xiao et al. 2010).
Spasmodic dysphonia (SD) has been subdivided into adductor and abductor types. The pure adductor type is commonest (approximately 85–90%), and a pure abductor type is very unusual, only representing about 2–3%. There is also a mixed adductor/abductor type. In this subsection we will focus on the more common adductor type. Typical pitch breaks with adductor SD occur in association with vowel sounds, particularly the \a\ and \e\ vowels. It is particularly apparent when one word ends with a vowel followed by a word beginning with a vowel such as occurs in the phrase ‘I eat apples every day’ or in words with concurrent or consequential vowel sounds such as ‘eighty’. The voice is said to sound strangled/strained. EMG demonstrates bursts of muscular activity overlying a normal interference pattern (Ludlow 2006; Blitzer et al. 1985).
As far as management is concerned, many approaches have been tested. There is no curative management protocol yet available. Speech therapy may be helpful as an adjunct but is not the primary treatment. Surgical management has included nerve section (Dedo 1976), thyroplasty or laryngeal nerve section with reinnervation (Berke et al. 1999a, b). To date no peripheral surgical manipulation has regularly proven curative. To the authors’ knowledge there are no controlled neuropharmacological studies demonstrating significant symptom relief, and the role of medications in SD is only adjunctive (Blumin and Ludlow 2014). Most commonly, low-dose botulinum toxin is used in peripheral laryngeal muscles, usually the thyroarytenoid (Truong et al. 1991; Lees et al. 1992). Our experience is that low-dose unilateral injections appear as efficacious as bilateral (Upile et al. 2009). Thus as of the time of press, botulinum toxin injection is still the generally preferred treatment (Blumin and Ludlow 2014). That said, the risk of development of antibodies though rare, does exist.
Case Study 5.22
A 69-year-old woman had been suffering from pressed voice for more than 20 years. She underwent multiple injections of botulinum toxin which was applied trans-orally directly into the vocal folds. On a nearly regular basis, she is treated every 3–4 months when speaking becomes strenuous. The voice sound is rough, breathy and groaning. The video shows inconspicuous vocal folds. Dynamic range and pitch range are reduced. The second video of this case shows the trans-oral injection of botulinum toxin under local anaesthesia.
Case Study 5.23
This 47-year-old woman had been hoarse for 2 years. The thyroid gland, allegedly pressed against the larynx, had been removed 2 months ago. The voice did not improve afterwards. Voice therapy was ineffective, too. This case is an example of abductor spasmodic dysphonia. In the video one observes the difficulty in adducting the vocal folds. But this phenomenon is not as characteristic as the voice sound. During speaking the voice is interrupted by aphonic passages at the onset of words as well as in the middle of phonation. The vocal range is markedly reduced.
5.6.6.3 Tremor
Long-term tremor is seen in essential tremor, Parkinson’s disease and cerebellar/dystonic tremor. It is also present in many instances of spasmodic dysphonia. We will use essential tremor as the example of this discussion. Without doubt, essential tremor is the commonest of the hyperkinetic movement disorders. Incidence has been reported to be between 4 and 60 per thousand individuals and increases with increasing age; prevalence is 2–5%. Although the hands and head are more commonly affected, the voice can be affected in up to 20% of individuals (Elble and Koller 1990). An autosomal dominance pattern has been established in approximately 50% (Murray 1981); thus, the term familial tremor is often used. Typical tremor varies from 3 to 7 Hertz. Abnormalities in the olivocerebellar tracts have been identified on PET scanning (Jenkins et al. 1993), and it is likely that peripheral somatosensory feedback loops are involved, leading to disturbances in rhythmicity (Deuschl and Elble 2000). Unlike resting tremors such as are found in Parkinson’s disease, essential tremor is a postural tremor (Jankovic and Stacy 1999). Women are affected more often than men, the tremor generally appearing in middle age.
Management of essential tremor should include a beta-blocker unless contraindicated. That said, beta-blockers have only limited benefit for vocal tremor. At the time of publication, no pharmacologic trials have demonstrated evidence of significant vocal benefit in patients with essential tremor. Deep brain stimulation in the thalamus has been noted to have benefited a few patients with vocal tremor (Yoon et al. 1999), but these are still just case reports. Botulinum toxin has been injected into the thyroarytenoid muscles with some benefit. However, the more widespread the tremor, the less likely the efficacy of the laryngeal botulinum (Blumin and Ludlow 2014). We have substantial experience of botulinum injection in tremor. Our experience is that prolonged breathiness following injection is more likely than in cases of SD. Thus we tend to limit our injections to very low-dose unilateral injections, with some benefit (JR personal observation).
5.6.7 Summary
This section has reviewed central neurological disorders. It introduced the subject by reviewing some basic aspects of brain function and of language. It then presented the neuro-laryngological history and physical examination, emphasising ENT/phoniatric manifestations.
Specific central neurological disorders were then described, using the context of the effect at a voice (laryngeal) level and speech (articulatory) level. Parkinson’s disease, closed head trauma, motor neuron disease, spasmodic dysphonia and tremor were reviewed in some detail.
5.7 Peripheral Neurogenic Voice Disorders
5.7.1 Introduction
Physiological positions of vocal folds are ‘median’ during phonation and ‘lateral’ during respiration. In cases of vocal fold paralysis, the impaired adduction and abduction of vocal folds lead to different positions between ‘median’ and ‘lateral’.
For the majority of textbooks, vocal fold mobility impairments are classified by their ‘paramedian’ or ‘intermediate’ position. Additionally ‘cadaveric’ positions are described with a bowed vocal fold in a most lateral position.
This common classification does not consider further variables (e.g. time course, functional deficits) and does not entirely meet the high variability of laryngoscopic and videostroboscopic findings.
Paralyses have to be identified and characterised in order to indicate necessary diagnostics and to arrange obligatory or facultative therapies. Therefore the knowledge of laryngeal innervation, possible reasons for their dysfunction and options for recreation or rehabilitation are essential.
This section will give the anatomical and physiological background and intends to be a useful guide for clinical tasks.
5.7.2 Brief Historical Review
Because a historical review is not common in textbooks, I would like to offer this to interested readers. There is an ongoing discussion about laryngeal innervation, and the readers’ own clinical experience can be compared with current theories and classifications.
Since the first studies of vocal fold paralyses (Gerhardt 1863), the aim was to conclude the site of nerve lesion from the position of paralysed vocal folds. ‘Semon’s law’ (Clerf and Baltzell 1953) and the ‘Wagner-Grossmann-Theory’ (Grossmann 1897) have to be mentioned first in this context. Early contradictory publications came from Lemére (1933), Clerf and Baltzell (1953) and Tschiassny (1957) who suspected that different affected nerve branches were responsible for vocal fold position. More recent publications from Crumley (1989), Sanders et al. (1994), Sanudo et al. (1999) and Olthoff et al. (2007) have illustrated an ongoing discussion about the laryngeal motoric system.
Trying to incorporate all different kinds of vocal fold impairment under the generic term ‘paralysis of the recurrent nerve’ implies an indissoluble problem: in our human motoric system, competing abductors and adductors (e.g. flexors and tensors in our extremities) are innervated by different nerves. Transferring this fact to the laryngeal motoric system, the only opener of the larynx (the posterior cricoarytenoid muscle) should have an exclusive innervation. Hence, the recurrent laryngeal nerve must be ‘more than one’ and needs further analysis to reveal whether at least separated fibres allow the innervation of antagonistic muscles. The different theories that were intended to show separate innervation of laryngeal adductors and the sole abductor will be briefly highlighted.
The main intent of all authors has been to interpret the cause of ‘paramedian’ positions of paralysed vocal folds. Semon (Clerf and Baltzell 1953) found a separation of the recurrent laryngeal nerve into abductor and adductor fibres and stated that a higher vulnerability of abductor fibres led to ‘paramedian’ vocal fold positions. In case of recurrent laryngeal nerve lesion, Wagner-Grossmann (Grossmann 1897) defined the external branch of the superior laryngeal nerve as the only competing nerve that enables a ‘paramedian’ vocal fold position resulting from the exclusive innervation of the adductory cricothyroid muscle. Even though this theory was rejected by other authors (Woodson 1993; Koufman et al. 1995), it became consensus in the European literature. Lemére (1933) supported Semon’s aspect of fibre separation, and later Miehlke et al. (1973) used this aspect for reinnervation procedures. Lemére (1933) additionally suspected that the only unpaired muscle of the larynx, the interarytenoid muscle, was responsible for ‘paramedian’ findings owing to its adductor competence and bilateral innervation. The unaffected opposite nerve could cause the adduction and paramedian position. To explain bilateral ‘paramedian’ paralyses, Clerf and Baltzell (1953) and Tschiassny (1957) implicated motoric fibres in the internal branch of the superior laryngeal nerve that run to the interarytenoid muscle. In case of recurrent laryngeal nerve lesion, the preserved adductory fibres from the superior laryngeal nerve would lead to bilateral ‘paramedian’ vocal fold positions. This theory was supported later from Sanders et al. (1994) who visualised these fibres with a special staining technique. The existence of motoric fibres in internal branches that run into the interarytenoid muscle could not be proved by histochemical analyses (Olthoff and Ehrlich 2010). Lemére (1933) and Rueger (1972) demonstrated these fibres running through the interarytenoid muscle without motoric but with sensory endings in the laryngeal mucosa. Other studies with retrograde tracers also proved that fibres from the internal branch of the superior laryngeal nerve run to the posterior cricoarytenoid muscle with motoric projection to the rostral ambiguous nucleus. A collateral reinnervation and neuromuscular sprouting of these fibres in the case of recurrent laryngeal nerve lesions were interpreted as evidence for motoric reinnervation (Hydman and Mattsson 2008).
Apart from vagal innervation and all mentioned laryngeal muscles, Réthi (1952) described a ‘stylopharyngeal muscle system’ in order to interpret ventricular phonation and dystonic voice disorders. The involved stylopharyngeal muscle is innervated from the glossopharyngeal nerve, and its insertion into the supraglottal area might enable some supraglottal adduction in the case of vagal nerve paralyses.
In later studies the aspect of spontaneous or surgical reinnervation was emphasised as the main reason for ‘paramedian’ vocal fold positions caused by increased adductor muscle tension (Crumley 1989). The fixation of vocal folds was interpreted by concurrent innervation of antagonistic muscles (‘synkinesis’) that causes an ‘autoparalysis’ that was first described for facial innervation (Stennert 1982). But as the regeneration of nerve fibres needs time (several months), the idea of ‘reinnervation’ could not explain immediate bilateral paramedian vocal fold paralysis after thyroid surgery.
More recent macroscopic studies have revealed new findings of very vulnerable recurrent laryngeal nerve fibres running to the posterior cricoarytenoid muscle. These fibres are highly endangered in, for example, thyroid surgery, and their injury is potentially responsible for ‘paramedian’ paralyses (Olthoff et al. 2007). Additionally, a branch that runs from the external branch of the superior laryngeal nerve into the endolarynx might support a ‘paramedian’ vocal fold position and may redirect the discussion of the ‘Wagner-Grossmann-Theory’ (Wu et al. 1994; Olthoff et al. 2007).
As an ‘up-to-date hypothesis’, we could state that synkinesis will be crucial for paramedian vocal fold positions in paralyses older than 3–6 months.
Paramedian vocal fold positions in earlier stages will be caused by injuries of abductor fibres owing to incomplete lesions of the recurrent trunk fibres. In incomplete lesions the superior number of adductor fibres will cause paramedian positions.
Thus, a paramedian vocal fold position is based on an imbalance between adductor and abductor muscle activities for the reasons above. A more lateral (‘intermediate’) position will consequently indicate an additional lesion of adductor fibres or refer to a more complete lesion of the recurrent trunk.
This knowledge was the basis for the current advantage in laryngeal pacing. The first target of stimulation was the laryngeal abductor muscle (posterior cricoarytenoid muscle) to free patients with bilateral paramedian vocal fold paralysis from dyspnoea (Li et al. 2013). Because the bilateral paramedian vocal folds’ position implies imbalanced adductor muscle activity, the stimulation of the laryngeal opener tends to restore the muscular balance.
5.7.3 Anatomy and Physiology
The efferent laryngeal vagal nerve fibres (cranial nerve X) derive from the nucleus ambiguus and run together with fibres from the spinal nuclei of the accessory nerve (cranial nerve XI) through the jugular foramen. Afferent laryngeal fibres run back to the nuclei of the solitary tract (Sasaki et al. 2001). Because muscle spindles are proven to be in laryngeal muscles (Baken and Noback 1971; Malannino 1974; Tellis et al. 2004), a reflex motoric system can be stated following the general principles of our human motoric system. Afferent proprioceptive fibres will run with the internal branch of the superior laryngeal nerve (Lemére 1933; Sanudo et al. 1999; Sasaki et al. 2001).
5.7.4 Laryngeal Innervation
After passage of the jugular foramen the superior laryngeal nerve separates from the vagus nerve and sends one internal sensoric branch through the hyothyroid membrane into the larynx and one external motoric branch to the cricothyroid muscle. Deriving from the external nerve, a branch runs into the larynx that was first called ‘communicating nerve’ and later ‘ventricular branch’ owing to its course into the ventricular muscle (Wu et al. 1994, Olthoff et al. 2007).
After parting from the vagal nerve, the right recurrent nerve curves around the brachiocephalic trunk, and the left recurrent nerve around the aortic arch, before running upwards to the larynx. The recurrent laryngeal nerves pass close to the trachea and reach the larynx from the rear laterally. Before entering the larynx, the ansa galeni leaves the recurrent nerve and joins the internal branch of the superior laryngeal nerve. Directly before the dorsal entrance into the larynx, very thin and vulnerable fibres leave the recurrent nerve and enter the posterior cricoarytenoid muscle, which is the only opener of the larynx (posticus). These extralaryngeal fibres (Rr. postici) are highly endangered in surgical treatment, such as thyroid surgery, even by just moving the recurrent nerve.
Apart from their vagal innervation, the stylopharyngeal muscles insert bilaterally into the supraglottis and are also innervated from the glossopharyngeal nerve. They are still discussed as supporting the elevation of the larynx during swallowing. Owing to their supraglottal insertion, they might be able to induce some supraglottal adductions.
5.7.5 Laryngeal Motoric System
Except for the posterior cricoarytenoid muscle, every laryngeal muscle tends to narrow the laryngeal lumen. The exclusive opener of the larynx enables an abduction of vocal folds exceeding the cadaveric position that is caused by the elastic laryngeal tissues.
The adduction of vocal and ventricular folds, combined with the approach of the arytenoids against the caudal part of the epiglottis, leads to a complete sphincteric closure of the larynx that is essential for airway protection during swallowing. More differentiated adduction is present in throat clearing, coughing and phonation.
This might explain the presence of various adductors that in concert are able to perform the different adductor tasks. Laryngeal opening is a comparatively simple but essential function. Therefore only one abductor muscle, namely, the posterior cricoarytenoid muscle, proves to be sufficient.
In detail the adductors are the lateral cricoarytenoid muscle, the strongest adductor of the ligaments of the vocal folds, owing to its insertion to the anterior part of the muscular process of the arytenoid cartilage (opposite and antagonistic to the posterior cricoarytenoid muscle, which inserts on the posterior surface). The interarytenoid muscle is responsible for the closure of the cartilaginous part of the glottis by narrowing the arytenoid bodies to the midline. Transverse and oblique fibres work in a synergistic manner (Schiel et al. 2004). The internal and external thyroarytenoid muscles are responsible for the fine-tuning of vocal fold tension. An increased vocal fold tension tends to support adduction. The cricothyroid muscles narrow the cricoid against the thyroid cartilage anteriorly. Owing to the dorsal articulation between both cartilages, this approximation leads to a tension and consequently adductory support of the vocal folds. This effect depends on a sufficient tension of the thyroarytenoid muscles.
The above-mentioned ‘stylopharyngeal muscle system’ (Réthi 1952) might enable some supraglottal (ventricular) adductions owing to its insertion, away from all mentioned laryngeal muscles, into the supraglottis.