The Throat Singers of Tuva    

September 20, 1999Testing the limits of vocal ingenuity, throat-singers can create sounds unlike anything in ordinary speech and song–carrying two musical lines simultaneously, say, or harmonizing with a waterfall

 By Theodore C. Levin and Michael E. Edgerton

From atop one of the rocky escarpments that crisscross the south Siberian grasslands and taiga forests of Tuva, one’s first impression is of an unalloyed silence as vast as the land itself. Gradually the ear habituates to the absence of human activity. Silence dissolves into a subtle symphony of buzzing, bleating, burbling, cheeping, whistling–our onomatopoeic shorthand for the sounds of insects, beasts, water, birds, wind. The polyphony unfolds slowly, its colors and rhythms by turns damped and reverberant as they wash over the land’s shifting contours.For the seminomadic herders who call Tuva home, the soundscape inspires a form of music that mingles with these ambient murmurings. Ringed by mountains, far from major trade routes and overwhelmingly rural, Tuva is like a musical Olduvai Gorge–a living record of a protomusical world, where natural and human-made sounds blend. 

Among the many ways the pastoralists interact with and represent their aural environment, one stands out for its sheer ingenuity: a remarkable singing technique in which a single vocalist produces two distinct tones simultaneously. One tone is a low, sustained fundamental pitch, similar to the drone of a bagpipe. The second is a series of flutelike harmonics, which resonate high above the drone and may be musically stylized to represent such sounds as the whistle of a bird, the syncopated rhythms of a mountain stream or the lilt of a cantering horse.

In the local languages, the general term for this singing is kh鲻meior khoomii, from the Mongolian word for “throat.” In English it is commonly referred to as throat-singing. Some contemporary Western musicians also have mastered the practice and call it overtone singing, harmonic singing or harmonic chant. Such music is at once a part of an expressive culture and an artifact of the acoustics of the human voice. Trying to understand both these aspects has been a challenge for Western students of music, and each of us–one a musical ethnographer (Levin), the other a composer with an interest in extended vocal techniques (Edgerton)–has had to traverse the unfamiliar territory of the other.

Sound Mimesis

In Tuva, legends about the origins of throat-singing assert that humankind learned to sing in such a way long ago. The very first throat-singers, it is said, sought to duplicate natural sounds whose timbres, or tonal colors, are rich in harmonics, such as gurgling water and swishing winds. Although the true genesis of throat-singing as practiced today is obscure, Tuvan pastoral music is intimately connected to an ancient tradition of animism, the belief that natural objects and phenomena have souls or are inhabited by spirits.

According to Tuvan animism, the spirituality of mountains and rivers is manifested not only through their physical shape and location but also through the sounds they produce or can be made to produce by human agency. The echo off a cliff, for example, may be imbued with spiritual significance. Animals, too, are said to express spiritual power sonically. Humans can assimilate this power by imitating their sounds.

Among the pastoralists, emulating ambient sounds is as natural as speaking. Throat-singing is not taught formally (as music often is) but rather picked up, like a language. A large percentage of male herders can throat-sing, although not everyone is tuneful. A taboo against female throat-singers, based on a belief that it causes infertility, is gradually receding, and younger women are beginning to practice the technique as well. The popularity of throat-singing among Tuvan herders seems to have arisen from a coincidence of culture and geography: on the one hand, the animistic sensitivity to the subtleties of sound, especially its timbre, and on the other, the ability of reinforced harmonics to project over the broad open landscape of the steppe. In fact, two decades ago concert performances were uncommon because most Tuvans regarded the music as too “down home” to spend money on. But now it leads a parallel public life. Professional ensembles have achieved celebrity status, and the favorite singers are symbols of national cultural identity. 

The most virtuosic practices of throat-singing are concentrated in Tuva (now officially called Tyva), an autonomous republic within Russia on its border with Mongolia, and in the surrounding Altai region, particularly western Mongolia. But vocally reinforced harmonics can also be heard in disparate parts of central Asia. Among the Bashkirs, a Turkic-speaking people from the Ural Mountains, musicians sing melodies with breathy reinforced harmonics in a style called uzliau. Epic singers in Uzbekistan, Karakalpakstan and Kazakhstan introduce hints of reinforced harmonics in oral poetry, and certain forms of Tibetan Buddhist chant feature a single reinforced harmonic sustained over a fundamental pitch. Beyond Asia, the use of vocal overtones in traditional music is rare but not unknown. It turns up, for example, in the singing of Xhosa women in South Africa and, in an unusual case of musical improvisation, in the 1920s cowboy songs of Texan singer Arthur Miles, who substituted overtone singing for the customary yodeling.

The ways in which singers reinforce harmonics and the acoustical properties of these sounds were little documented until a decade ago, when Tuvan and Mongolian music began to reach a worldwide audience. Explaining the process is best done with the aid of a widely used model of the voice, the source-filter model. The source–the vocal folds–provides the raw sonic energy, which the filter–the vocal tract–shapes into vowels, consonants and musical notes.


Hooked on Harmonics

At its most basic, sound is a wave whose propagation changes pressure and related variables–such as the position of molecules in a solid or fluid medium–from moment to moment. In speech and song the wave is set in motion when the vocal folds in the larynx disturb the smoothly flowing airstream out from (or into) the lungs. The folds open and close periodically, causing the air pressure to oscillate at a fundamental frequency, or pitch. Because this vibration is not sinusoidal, it also generates a mixture of pure tones, or harmonics, above the fundamental pitch. Harmonics occur at whole number multiples of the fundamental frequency. The lowest fundamental in operatic repertoire, for example, is a low C note whose conventional frequency is 65.4 hertz; its harmonics are 130.8 hertz, 196.2 hertz and so on. The strength of the harmonics diminishes as their frequencies rise, such that the loudness falls by 12 decibels (a factor of roughly 16 in sonic energy) with each higher octave (a factor of two in pitch

The second component of the source-filter model, the vocal tract, is basically a tube through which the sound travels. Yet the air within the tract is not a passive medium that simply conveys sound to the outside air. It has its own acoustical properties–in particular, a natural tendency to resonate at certain frequencies. Like the whistling sound made by blowing across the top of a glass, these resonances, known as formants, are set in motion by the buzz from the vocal folds. Their effect is to amplify or dampen sound from the folds at distinctive pitches, transforming the rather boring buzz into a meaningful clutch of tones.

The sculpting of sound does not end once it escapes from the mouth. As the wave wafts outward, it loses energy as it spreads over a larger area and sets the freestanding air in motion. This external filtering, known as the radiation characteristic, dampens lower frequencies to a greater extent than it does higher frequencies. When combined, the source, filter and radiation characteristic produce sound whose harmonics decrease in power at the rate of six decibels (dB) per octave–except for peaks around certain frequencies, the formants [see “The Acoustics of the Singing Voice,” by Johan Sundberg; Scientific American, March 1977; and “The Human Voice,” by Robert T. Sataloff; Scientific American,December 1992]. 

In normal speech and song, most of the energy is concentrated at the fundamental frequency, and harmonics are perceived as elements of timbre–the same quality that distinguishes the rich sound of a violin from the purer tones of a flute–rather than as different pitches. In throat-singing, however, a single harmonic gains such strength that it is heard as a distinct, whistlelike pitch. Such harmonics often sound disembodied. Are they resonating in the vocal tract of the singer, in the surrounding physical space or merely in the mind of the listener? Recent research by us and by others has made it clear that the vocally reinforced harmonics are not an artifact of perception but in fact have a physical origin.



The mechanism of this reinforcement is not fully understood. But it seems to involve three interrelated components: tuning a harmonic in the middle of a very narrow and sharply peaked formant; lengthening the closing phase of the opening-and-closing cycle of the vocal folds; and narrowing the range of frequencies over which the formant will affect harmonics. Each of these processes represents a dramatic increase of the coupling between source and filter. Yet despite a widespread misconception, they do not involve any physiology unique to Turco-Mongol peoples; anybody can, given the effort, learn to throat-sing.

To tune a harmonic, the vocalist adjusts the fundamental frequency of the buzzing sound produced by the vocal folds, so as to bring the harmonic into alignment with a formant. This procedure is the sonic equivalent of lifting or lowering a ladder in order to move one of its higher steps to a certain height. Acoustic analysis has verified the precision of the tuning by comparing two different harmonics, the first tuned to the center of a formant peak and the second detuned slightly. The former is much stronger. Singers achieve this tuning through biofeedback: they raise or lower the fundamental pitch until they hear the desired harmonic resonate at maximum amplitude.

Throat-singers tweak not only the rate at which the vocal folds open and close but also the manner in which they do so. Each cycle begins with the folds in contact and the glottis–the space between the folds–closed. As the lungs expel air, pressure builds to push the folds apart until the glottis opens. Elastic and aerodynamic forces pull them shut again, sending a puff of air into the vocal tract. Electroglottographs, which use transducers placed on the neck to track the cycle, show that throat-singers keep the folds open for a smaller fraction of the cycle and shut for longer. The more abrupt closure naturally puts greater energy into the higher harmonics. Moreover, the longer closing phase helps to maintain the resonance in the vocal tract by, in essence, reducing sound leakage back down the windpipe. Both effects lead to a spectrum that falls off less drastically with frequency, which further accentuates the desired harmonics


The third component of harmonic isolation is the assortment of techniques that throat-singers use to increase the amplification and selectivity provided by the vocal tract. By refining the resonant properties normally used to articulate vowels, vocalists reposition, heighten and sharpen the formants [see Forming Formants]. In so doing, they strengthen the harmonics that align with the narrow formant peak, while simultaneously weakening the harmonics that lie outside of this narrow peak. Thus, a single overtone can project above the others. In addition, singers move their jaws forward and protrude, narrow and round their lips. These contortions reduce energy loss and feed the resonances back to the vocal-fold vibration, further enhancing the resonant peak.

In a study of both Tuvan and Western overtone singers conducted at the University of Wisconsin’s hospitals and clinics with support from the National Center for Voice and Speech, video fluoroscopy (motion x-ray) and nasoendoscopy (imaging the vocal folds using a miniature camera) have confirmed that singers manipulate their vocal tracts to shift the frequency of a formant and align it with a harmonic. By reinforcing different harmonics in succession, they can sing a melody. The nine musicians in the study demonstrated at least four specific ways to accomplish the shifting. Other methods may also be possible.


In the first, the tip of the tongue remains behind the upper teeth while the midtongue rises to intone successively higher harmonics. Additionally, vocalists fine-tune the formant by periodically opening their lips slightly. In Tuvan the style of music produced by this means is known as sygyt (“whistle”). In the second method, singers move the tongue forward, an act that in normal speech changes the vowel sound /o/ (“hoe”) to /i/ (“heed”). The lowest formant drops, and the second rises. By precisely controlling how much the formants separate, a Tuvan musician can tune each to a separate harmonic–thereby reinforcing not one but two pitches simultaneously, as sometimes occurs in the kh鲻mei style.


The third approach entails movement in the throat rather than in the mouth. For lower harmonics, vocalists place the base of the tongue near the rear of the throat. For mid-to-high harmonics, they move the base of the tongue forward until a gap appears in the vallecula–the space between the rear of the tongue and the epiglottis (the flap of cartilage that prevents food from entering the lungs). For the highest harmonics, the epiglottis swings forward to close the vallecula.

In the fourth method, vocalists widen the mouth in precise increments. The acoustical effect is to shorten the vocal tract, raising the frequency of the first formant. The uppermost harmonic that can be reinforced is limited primarily by radiation losses, which worsen as the mouth widens. Depending on the pitch of the fundamental, a singer can isolate up to the 12th harmonic. Tuvans combine this technique with a second vocal source to create the kargyraa style, in which one may reinforce harmonics as unbelievably high as the 43rd harmonic.


Two Voices

This additional source is another fascinating aspect of throat-singing. Singers draw on organs other than the vocal folds to generate a second raw sound, typically at what seems like an impossibly low pitch. Many such organs are available throughout the vocal tract. Kargyraa utilizes flexible structures above the vocal folds: the so-called false folds (paired tissues that occur directly above the true folds and are also capable of closing the airstream); arytenoid cartilages (which sit in the rear of the throat and, by rotating side to side and back and forth, help to control phonation); aryepiglottic folds (tissue that connects the arytenoids and the epiglottis); and the epiglottic root (the lower part of the epiglottic cartilage).

A different technique, which produces much the same sound but probably does not figure in kargyraa, combines a normal glottal pitch with the low-frequency, pulselike vibration known as vocal fry.

Because kargyraa resembles the sound of Tibetan Buddhist chant, some researchers have used the term “chant mode” to describe it. It generally, though not always, assumes a 2:1 frequency ratio, with supraglottal closure at every other vocal-fold closure. A typical fundamental pitch would be the C at 130.8 hertz, with the false folds vibrating one octave below at 65.4 hertz. Spectral analysis shows that when a singer switches into chant mode, the number of frequency components doubles, verifying that the second source is periodic and half the normal pitch. Chant mode also affects the resonant properties of the vocal tract. Because use of the false folds shortens the vocal tract by one centimeter (about half an inch), formant frequencies shift higher or lower depending on the location of the constriction on the selected formant.



Image: Theodore C. Levin
SHAMANS in Tuva use a variety of sound makers as tools of spiritual healing. Animism has shaped Tuvan music and has helped to keep throat-singing a vibrant custom.

Another cultural preference is for extended pauses between breaths of throat-singing. (These breaths may last as long as 30 seconds.) To a Western listener, the pauses seem unmusically long, impeding the flow of successive melodic phrases. But Tuvan musicians do not conceive of phrases as constituting a unitary piece of music. Rather each phrase conveys an independent sonic image. The long pauses provide singers with time to listen to the ambient sounds and to formulate a response–as well as, of course, to catch their breath.

The stylistic variations all reflect the core aesthetic idea of sound mimesis. And throat-singing is just one means used by herder-hunters to interact with their natural acoustic environment. Tuvans employ a range of vocalizations to imitate the calls and cries of wild and domestic animals. They play such instruments as the ediski, a single reed designed to mimic a female musk deer; khirlee, a thin piece of wood that is spun like a propeller to emulate the sound of wind; amyrga, a hunting horn used to approximate the mating call of a stag; and chadagan, a zither that sings in the wind when Tuvan herders place it on the roofs of their yurts. Players of the khomus, or jew’s harp, re-create not only natural sounds, like that of moving or dripping water, but also human sounds, including speech itself. Good khomus players can encode texts that an experienced listener can decode.

Yet it is throat-singing that Tuvans recognize as the quintessential achievement of their mimesis, the revered element of an expressive language that begins where verbal language ends. For the herders, it expresses feelings of exultation and independence that words cannot. And as is often a defining feature of traditional art, inner freedom blooms within the strictest of constraints–in this case, the physical limits of the harmonic series.


Further ListeningTUVA: VOICES FROM THE CENTER OF ASIA. Smithsonian Folkways, 1990.

SIXTY HORSES IN MY HERD. Huun-Huur-Tu. Shanachie Records, 1993.

HEARING SOLAR WINDS. David Hykes and the Harmonic Choir. Ocora, 1994. (Distributed in the U.S. by Harmonia Mundi.)


WHERE YOUNG GRASS GROWS. Huun-Huur-Tu. Shanachie Records, 1999.

MUSICAL CLIPS AND FURTHER INFORMATION are on the Scientific American site and on the Friends of Tuva site

Further Information:

ACOUSTICS AND PERCEPTION OF OVERTONE SINGING.Gerrit Bloothooft, Eldrid Bringmann, Marieke van Capellen, Jolanda B. van Luipen and Koen P. Thomassen in Journal of the Acoustical Society of America, Vol. 92, No. 4, Part 1, pages 1827?836; October 1992.

REISE INS ASIATISCHE TUWA. Otto J. M鋘chen-Helfen. Verlag Der Bucherkreis, 1931. Published in English as Journey to Tuva: An Eyewitness Account of Tannu-Tuva in 1929. Translated by Alan Leighton. Ethnographics Press, University of Southern California, 1992.

PRINCIPLES OF VOICE PRODUCTION. Ingo R. Titze. Prentice Hall, 1994.

A TUVAN PERSPECTIVE ON THROAT SINGING. Mark van Tongeren in Oideion: The Performing Arts Worldwide, Vol. 2, pages 293?12. Edited by Wim van Zanten and Marjolijn van Roon. Centre of Non-Western Studies, University of Leiden, 1995.

THE HUNDRED THOUSAND FOOLS OF GOD: MUSICAL TRAVELS IN CENTRAL ASIA (and Queens, New York). Theodore Levin. Indiana University Press.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s