The
human voice consists of
sound made by a
human being using the
vocal folds for
talking,
singing,
laughing,
crying,
screaming, etc.
Human voice is specifically that part of human sound production in
which the vocal folds (vocal cords) are the primary sound source.
Generally speaking, the mechanism for generating the human voice
can be subdivided into three parts; the lungs, the vocal folds
within the
larynx, and the articulators. The
lung (the pump) must produce adequate airflow
and air pressure to vibrate vocal folds (this air pressure is the
fuel of the voice). The vocal folds (vocal cords) are a vibrating
valve that chops up the airflow from the lungs into audible pulses
that form the laryngeal sound source. The muscles of the larynx
adjust the length and tension of the vocal folds to ‘fine tune’
pitch and
tone. The articulators (the parts of the
vocal tract above the larynx consisting
of
tongue,
palate,
cheek,
lips, etc.)
articulate and
filter the sound emanating from the larynx
and to some degree can interact with the laryngeal airflow to
strengthen it or weaken it as a sound source.
The vocal folds, in combination with the articulators, are capable
of producing highly intricate arrays of sound. The tone of voice
may be modulated to suggest
emotions such as
anger,
surprise, or
happiness.
Singers
use the
human voice as an
instrument for creating
music.
Voice types and the folds (cords) themselves
Adult men and women have different vocal folds sizes; reflecting
the male-female differences in larynx size. Adult male voices are
usually lower-pitched and have larger folds. The male vocal folds
(which would be measured vertically in the opposite diagram), are
between 17 mm and 25 mm in length. the female vocal folds
are between 12.5 mm and 17.5 mm in length.
As seen in the illustration, the folds are located just above the
vertebrate trachea (the windpipe
which travels from the lungs). Food and drink do not pass through
the cords but instead pass through the
esophagus, an unlinked tube. Both tubes are
separated by the
epiglottis, a "flap"
that covers the opening of the trachea while swallowing.
The folds in both sexes are within the
larynx. They are attached at the back (side nearest
the spinal cord) to the
arytenoids cartilages, and at the
front (side under the chin) to the
thyroid
cartilage. They have no outer edge as they blend into the side of
the breathing tube (the illustration is out of date and does not
show this well) while their inner edges or "margins" are free to
vibrate (the hole). They have a three layer construction of an
epithelium, vocal ligament, then muscle
(
vocalis muscle), which can shorten
and bulge the folds. They are flat triangular bands and are pearly
white in color. Above both sides of the vocal cord is the
vestibular fold or
false vocal
cord, which has a small
sac between its two
folds (not illustrated).
The difference in vocal folds size between men and women means that
they have differently pitched voices. Additionally,
genetics also causes variances amongst the same
sex, with men and women's
singing voices
being categorized into types. For example, among men, there are
bass,
baritone,
tenor and
countertenor (ranging from
E2 to even F6), and among women,
contralto,
mezzo-soprano and
soprano (ranging from F3 to C6). There are
additional categories for
operatic
voices, see
voice type. This is not
the only source of difference between male and female voice. Men,
generally speaking, have a larger vocal tract, which essentially
gives the resultant voice a lower tonal quality. This is mostly
independent of the vocal folds themselves.
Voice modulation in spoken language
Human spoken language makes use of the ability of almost all
persons in a given society to dynamically modulate certain
parameters of the laryngeal voice source in a consistent manner.
The most important communicative, or phonetic, parameters are the
voice pitch (determined by the vibratory frequency of the vocal
folds) and the degree of separation of the vocal folds, referred to
as vocal fold abduction (coming together) or adduction
(separating).
The ability to vary the ab/adduction of the vocal folds quickly has
a strong genetic component, since vocal fold adduction has a
life-preserving function in keeping food from passing into the
lungs, in addition to the covering action of the epiglottis.
Consequently, the muscles that control this action are among the
fastest in the body. Children can learn to use this action
consistently during speech at an early age, as they learn to speak
the difference between utterances such as "apa" (having an
abductory-adductory gesture for the p) as "aba" (having no
abductory-adductory gesture). Surprisingly enough, they can learn
to do this well before the age of two by listening only to the
voices of adults around them who have voices much different than
their own, and even though the laryngeal movements causing these
phonetic differentiations are deep in the throat and not visible to
them.
If an abductory movement or adductory movement is strong enough,
the vibrations of the vocal folds will stop (or not start). If the
gesture is abductory and is part of a speech sound, the sound will
be called
Voiceless. However, voiceless
speech sounds are sometimes better identified as containing an
abductory gesture, even if the gesture was not strong enough to
stop the vocal folds from vibrating. This anomalous feature of
voiceless speech sounds is better understood if it is realized that
it is the change in the spectral qualities of the voice as
abduction proceeds that is the primary acoustic attribute that the
listener attends to when identifying a voiceless speech sound, and
not simply the presence or absence of voice (periodic
energy).
An adductory gesture is also identified by the change in voice
spectral energy it produces. Thus, a speech sound having an
adductory gesture may be referred to as a "glottal stop" even if
the vocal fold vibrations do not entirely stop. for an example
illustrating this, obtained by using the
inverse filtering of oral airflow.]
Other aspects of the voice, such as variations in the regularity of
vibration, are also used for communication, and are important for
the trained voice user to master, but are more rarely used in the
formal phonetic code of a spoken language.
Physiology and vocal timbre
The sound of each individual's voice is entirely unique not only
because of the actual shape and size of an individual's vocal cords
but also due to the size and shape of the rest of that person's
body, especially the vocal tract, and the manner in which the
speech sounds are habitually formed and articulated. (It is this
latter aspect of the sound of the voice that can be mimicked by
skilled performers.) Humans have vocal folds which can loosen,
tighten, or change their thickness, and over which breath can be
transferred at varying pressures. The shape of chest and neck, the
position of the tongue, and the tightness of otherwise unrelated
muscles can be altered. Any one of these actions results in a
change in pitch, volume, timbre, or tone of the sound produced.
Sound also resonates within different parts of the body, and an
individual's size and bone structure can affect somewhat the sound
produced by an individual.
Singers can also learn to project sound in certain ways so that it
resonates better within their vocal tract. This is known as
vocal resonation. Another major
influence on vocal sound and production is the function of the
larynx which people can manipulate in different ways to produce
different sounds. These different kinds of laryngeal function are
described as different kinds of
vocal
registers. The primary method for singers to accomplish this is
through the use of the
Singer's Formant, which has been shown to be a
resonance added to the normal resonances of the vocal tract above
the frequency range of most instruments and so enables the singer's
voice to carry better over musical accompaniment.
Vocal registration
Vocal registration refers to the system of vocal
registers within the human voice. A register in the human voice is
a particular series of tones, produced in the same vibratory
pattern of the
vocal folds, and
possessing the same quality. Registers originate in
laryngeal functioning. They occur because the
vocal folds are capable of producing several different vibratory
patterns. Each of these vibratory patterns appears within a
particular
Vocal range range of
pitches and produces certain characteristic
sounds. the term register can be somewhat confusing as it
encompasses several aspects of the human voice. The term register
can be used to refer to any of the following:
- A particular part of the vocal range
such as the upper, middle, or lower registers.
- A resonance area such as chest voice or head
voice.
- A phonatory process
- A certain vocal timbre
- A region of the voice which is defined or delimited by vocal
breaks.
- A subset of a language used for a
particular purpose or in a particular social setting.
In
linguistics, a
register
language is a language which combines
tone and vowel
phonation into a single
phonology
phonological system.
Within
speech pathology the term
vocal register has three constituent elements: a certain vibratory
pattern of the vocal folds, a certain series of pitches, and a
certain type of sound. Speech pathologists identify four vocal
registers based on the physiology of laryngeal function: the
vocal fry register, the
modal register, and the
falsetto register, and the
whistle register. This view is also adopted
by many vocal pedagogists.
Vocal resonation
Vocal resonation is the process by which the basic
product of phonation is enhanced in timbre and/or intensity by the
air-filled cavities through which it passes on its way to the
outside air. Various terms related to the resonation process
include amplification, enrichment, enlargement, improvement,
intensification, and prolongation; although in strictly scientific
usage acoustic authorities would question most of them. The main
point to be drawn from these terms by a singer or speaker is that
the end result of resonation is, or should be, to make a better
sound.There are seven areas that maybe listed as possible vocal
resonators. In sequence from the lowest within the body to the
highest, these areas are the chest, the tracheal tree, the larynx
itself, the pharynx, the oral cavity, the nasal cavity, and the
sinuses.
Influences of the human voice
The
twelve-tone musical scale, upon
which the majority of the music in the world is based, may have its
roots in the sound of the human voice during the course of
evolution, according to a study
published by the
New
Scientist. Analysis of recorded speech samples found peaks in
acoustic energy that mirrored the distances between notes in the
twelve-tone scale. It is suspected that some of the influence may
also lay in the nature of lower vocal notes being linked to life
expectancy, while higher vocal notes are linked to fertility.
Voice disorders
There are many
disorders which
affect the human voice; these include
speech impediments, and growths and
lesions on the
vocal folds. Talking for improperly long periods
of time causes
vocal loading, which is
stress inflicted on the
speech organs.
When vocal injury is done, often an
ENT specialist may be able
to help, but the best treatment is the prevention of injuries
through good vocal production. Voice therapy is generally delivered
by a
Speech-language
pathologist.
Hoarseness or breathiness that lasts for
more than two weeks is a common symptom of an underlying voice
disorder and should be investigated medically.
See also
Footnotes
- Stevens, K.N.(2000), Acoustic Phonetics, MIT Press, ISBN
0262692503, 9780262692502
- Titze, I.R. (1994). Principles of Voice Production, Prentice
Hall (currently published by NCVS.org), ISBN 978-0137178933.
- Titze, I. R. (2006).The Myoelatic Aerodynamic Theory of
Phonation, Iowa City:National Center for Voice and Speech,
2006.
- Smith BL, Brown BL, Strong WJ, Rencher AC. (1975) Effects of
speech rate on personality perception. Lang Speech. 18(2):145-52
PMID: 1195957
- Williams CE, Stevens KN.(1972). Emotions and speech: some
acoustical correlates. J Acoust Soc Am. 52(4):1238-50 PMID:
4638039
- I. R. Titze, S. Mapes, and B. Story. (1994) Acoustics of the
Tenor High Voice. J.Acoust.Soc.Am. 95 (2):1133-1142. PMID:
8132903
- Thurman, Leon & Welch, ed., Graham (2000), Body mind &
voice: Foundations of voice education (revised ed.), Collegeville,
Minnesota: The Voice Care Network et al., ISBN 0874141230
- Rothenberg, M. The Breath-Stream Dynamics of
Simple-Released Plosive Production, Vol. 6, Bibliotheca
Phonetica, Karger, Basel, 1968.
- Rothenberg, M. The glottal volume velocity waveform during
loose and tight voiced glottal adjustments, Proceedings of the
Seventh International Congress of Phonetic Sciences, 22-28 August
1971 ed. by A. Rigault and R. Charbonneau, published in 1972 by
Mouton, The Hague – Paris.
- Sundberg, Johan, The Acoustics of the Singing Voice, Scientific
American Mar 77, p82
- E. J. Hunter, J. G. Svec, and I. R. Titze. Comparison of the
Produced and Perceived Voice Range Profiles in Untrained and
Trained Classical Singers. J.Voice 2005.
- Musical roots may lie in human voice - 06 August
2003 - New Scientist
Further reading
- Puts, D. A., Gaulin, S. J. C., & Verdolini, K. (2006).
Dominance and the evolution of sexual dimorphism in human voice
pitch. Evolution and Human Behavior, 27: 283-296. Full text
- Titze, I. R. (2008). The human instrument. Sci.Am. 298
(1):94-101. PM 18225701
- Thurman, Leon & Welch, ed., Graham (2000), Bodymind &
voice: Foundations of voice education (revised ed.), Collegeville,
Minnesota: The VoiceCare Network et al., ISBN 0874141230
External links