The Bell vowel model: its acoustic weaknesses

1. The single resonance and two resonance theories.

Bell coupled his new vowel model to the then popular single resonance theory, claiming the vowel tone (resonance) depended on the dimensions of the buccal cavity. In Visible Speech (1867:71), he postulated a “point of greatest contraction, or the configurative aperture”, that “may be shifted to any part of the back or front of the palatal arch”, determined by the tongue location along his revolutionary innovation of a front–back continuum (Fig. 1).

Figure 1.
Bell’s acoustic theory: a configurative aperture (0), located anywhere along the hard or soft palates by tongue fronting or backing, is the posterior limit of the buccal cavity, whose resonance determines vowel timbre. The dimension of the aperture (0) is set by tongue height.

Bell’s tongue height dimension replaced the mouth opening (or jaw opening) of the ancient throat-tongue-lip model. Tongue height explicitly adjusted the degree of constriction in the configurative aperture, but, unfortunately, Bell did not explain its spectral effect. Roudet pointed out that both widening and lengthening of the buccal cavity would have the same effect on its volume and consequently on its resonance frequency, implying that tongue backing and tongue height would be mutually compensatory in Bell’s model.

However, the single resonance theory had already been flawed before Bell’s book was even published. In 1863, Helmoltz had reported two resonances for nonrounded vowels (p. 107 in Ellis’s English translation, which was not available before 1885). Bell’s son Graham, following and extending Helmholtz, recognized two resonances in every vowel, assigning them respectively to the anterior cavity and posterior cavity (i.e. the buccal cavity and the pharynx). Graham Bell corresponded with Ellis, who included a footnote in the Helmholtz translation. Lloyd, familiar with both Helmholtz’s and Graham Bell’s work, assigned the two resonances to the “porch” and the “chamber”, which amounted to the same thing. Moreover, he discovered a third resonance that he assigned to a cavity between the lips.

And there the story would have ended but for Henry Sweet. For A. M. Bell’s universal transcription system, that he launched in Visible Speech, was never a success and was soon forgotten. And his new vowel model would have been forgotten along with it, if Sweet hadn’t promoted it, assisted by Sievers in Germany. By the new century, the whole world had adopted it. It offered so many more possible vowel timbres (273 in Bell’s presentation), and promised infinitely more (the slightest tongue shift in any direction was claimed to produce a new timbre), so why be troubled by weak acoustics? Or by weak physiology either (but that’s a separate story).

There was no way of testing Bell’s configurative aperture until probing methods were introduced (Grandgent, Atkinson) and then radiography (Scheier). It turned out not to be easy to find, and by the time of Daniel Jones reference was made to the highest point of the tongue instead.

Among phoneticians in general, the number of resonances remained a controversial issue, many still preferring the single resonance theory. One possible reason is that their basic textbooks, especially Sweet’s and later Jones’s, continued to do so, providing them with what appeared to be a rational tool. This was also a period of severe dispute between the “acoustic” and “organic” schools as to the primacy of sound or articulation for speech communication, adding to any uncertainty. The acoustic school (e.g. Pipping) argued that the irrelevance of articulation for speech sound was proved by playing back speech recordings. Another acoustic school argument (e.g. Roudet) claimed that mutually compensatory manoeuvres (citing lip rounding and tongue retraction) would render articulation unpredictable. Lloyd pleaded that both the organic and acoustic lines are complementary and that phonetic science would benefit from any investigation that contributes to unification. In vain. Pipping wrote a scathing review of Lloyd’s publications.

Paget, a student of Daniel Jones around 1920, also identified two resononances for both unrounded and rounded vowels, initially unaware of earlier work. By then, the two resonances were generally known as the mouth and throat formants, with Bell’s elusive configurative aperture seen as a dividing isthmus separating the two cavities. Paget’s and Lloyd’s studies were reworked and confirmed by Crandall.

After 1945, spectrograms from the Kay Sonagraph showed that the frequencies of F1 and F2 were correlated with judgments of tongue height and backing, and it seemed again that tongue height was determining a throat resonance and F1, and tongue backing a mouth resonance and F2. Alas, they were tumbling into the classic pitfall, confusing correlation and causality. No-one has ever yet demonstrated a consistent causal relationship between true tongue height and F1, or true tongue backing and F2. It just does not work.

The overshadowing difficulty was the higher resonances. A third resonance had been glimpsed ever since the 19th. century, and spectrograms now revealed several more. The problem was not acute for speech perception or specification, just two, occasionally three, formants being deemed adequate. But the higher formants were a reminder that the theory was far from complete and might be hiding surprises. The solution would once again demonstrate that Bell’s tongue backing and tongue height were irrelevant parameters for shaping the vocal tract and tuning it for vowel spectra.

2. Transmission line theory

Suppose, for a moment, a home might be found for F3 in its own unique cavity between the lips, as Lloyd had proposed more than 100 years ago. Yet, additionally, anyone familiar with reading spectrograms will also know that F3 rises to its maximum when the tongue blade is elevated for dental consonants, narrowing the palato-alveolar region (part of the buccal cavity where F2 was said to reside). Evidently, F3 cannot be assigned to some unique cavity, but something else is happening here, something unforeseeable by Bell (Rayleigh’s two volumes on the theory of sound were not published until some ten years after Bell’s Visible Speech).

The Bell model had been extended from the single resonance theory to the two-resonance theory by stretching the imagination a little (by supposing Bell’s configurative aperture really existed, and by pretending this imagined aperture constituted an isthmus between a mouth cavity and a throat cavity). But a third resonance was too much for it. The third resonance was homeless. And we still have not looked at F4, F5 and beyond.

Joos (1948:57-59) pointed out that the mathematical theory does not allow for resonances located in minor side chambers or irregularities. Instead, each resonance is a mode of oscillation of the entire vocal tract, its frequency being sensitive to local expansion or narrowing at its respective nodes or antinodes (bellies). Consequently, a given articulator manoeuvre will modify several resonances, and a given resonance can be modified in several different parts of the vocal tract. Chiba and Kajiyama (1941, available more widely as 1958) were the first to pursue this route, followed by Fant (1960) and Stevens and House (1961). Fant referred to this approach as transmission line theory (modelling the vocal tract with electronic circuits for measurements and calculations).

In (1965), Fant demonstrated once again (i) the irrelevance of tongue positions defined by height and backing ([a]-like vowels need a low pharyngeal constriction), (ii) the affiliation of F2 to both the mouth and the throat regions (refuting theories assigning resonances uniquely to own cavities), and (iii) the futility of searching for individual cavities for the higher resonances (discredited by the standing wave phenomena revealed by transmission line theory).

So, yet again, the acoustic theory behind the Bell vowel model was flawed. At this point it would be easy to join the old acoustic school and claim that articulation is irrelevant. However, there are some simple rules that link vowel articulation with the spectral properties of vowels. These are introduced in the next page in this thread: Interpreting vowel articulation from formant frequencies.

Top

________________________________________________________________