Phonology 101: Vowels

And so we come at last to vowels, the final stop on this journey we call Phonology 101. So far, we’ve talked about the vocal tract (here), phonemes (here), and consonants (here). There’s also a post on the International Phonetic Alphabet (here).

As far as human speech sounds go, I feel like consonants are pretty straightforward. Sure, some are harder to produce than others, but you can point to a relatively clear place of articulation that’s easy to feel when you produce the sound yourself. When you produce [m], your lips are together. When you produce [t], your tongue touches right behind your teeth. Vowels are…squishier.

Let’s start with the easy part: all vowels are, by nature, voiced.1 Like consonants, vowels can be either oral or nasal, depending on whether the velum is raised or lowered (see if you can tell the difference between the vowels in mat /mæt/ and man /mæn/). Vowels can also be rounded or unrounded; this describes the shape of the lips during production.

Now, we need one more thing, one more feature, something similar to the consonants’ place of articulation, to distinguish each vowel from all the others. That magic feature is the position of the tongue within the vowel space. Well, it’s actually two features: height (or closeness, as you’ll see on the IPA chart) and backness.

IPA Chart, http://www.internationalphoneticassociation.org/content/ipa-chart, available under a Creative Commons Attribution-Sharealike 3.0 Unported License. Copyright © 2015 International Phonetic Association.

Vowel height is kind of what it sounds like: how high (close) or low (open) in the mouth the tongue is. Backness refers to which part of the tongue is providing the pertinent interaction with the height variable—the back (closest to the throat), front, or center.

When talking about vowels, it’s good to keep in mind that as far as words meaning “middle” are concerned, “central” refers to the horizontal feature and “mid” refers to the vertical.

So, why do I say vowels are kind of squishy? In consonants, the tongue hits specific points along the roof of the mouth. In vowels, the tongue is interacting with empty air—a much less precise target to hit thousands and millions of times while speaking. In fact, phoneticians call our phonemes target vowels.

But what’s all this talk without some examples?

In cheese /t͡ʃiːz/, we find the high, front, unrounded vowel. The front of the tongue is high in the mouth, and the lips, rather than pursed, are pulled back into something resembling a smile (as many a subject of a painfully posed photograph can attest).

In choose /t͡ʃuːz/, the tongue is again high, but this vowel is back and rounded.

A few more:

The vowel in chess /t͡ʃɛs/ is low-mid front unrounded.
The vowel in Chaz /t͡ʃæz/ is low front unrounded.
The first vowel in Chaucer /t͡ʃɔːsəɹ/ is low-mid back unrounded.

You get the picture.

I’d like to point out a couple funny things you may have noticed in my phonemic transcriptions of these words.

First, I used /i/ to represent what native English speakers commonly think of as the E sound. The symbols used in IPA are a mix of letters and specialized characters. In the case of the vowels, the commonly familiar letters correspond to the so-called Continental vowels, i.e. they represent the sounds they spell in most European languages. English is the misfit here. The spellings we use for vowels are mismatched to the way they’re treated in other languages. There’s a very good reason for this, but that exciting story will have to keep for another day. As for /e/, this is used in IPA for what we English speakers think of as the A sound.

Next, the keen-eyed will have noticed that when I transcribed cheese, I didn’t just use /i/. There’s two funny dots after it that look like a colon. This symbol indicates that the vowel is long. American schoolchildren learn that the difference between Pete and pet is that the first has a long E and the second has a short E. This is not what I mean. For a phonologist, that sense of “long” and “short” is erroneous (and in fact, those two words have completely different vowels). Rather, this symbol means that the vowel is held for a longer length of time; it indicates duration. In some languages, vowel length is phonemic, i.e. it differentiates one word from another. Heck, it used to be phonemic in English way back in the day. You don’t have to worry about whether your vowels are long or short in Modern English. That’s one of those allophonic2 features that just kind of happens.

Last, there’s a second vowel in Chaucer that I haven’t mentioned yet, represented by a flipped-around lowercase E: /ə/. A lot of these symbols I’m using are probably new and unfamiliar, so you may not have noticed it hiding there, but this disoriented little letter is special. It’s name is schwa. It represents a mid central unrounded vowel, which is to say it represents a relaxed, mushy, indiscriminate vowel sound. It’s often found in unstressed syllables, and it can be an allophone of a lot of other vowels when you’re not going out of your way to enunciate every word. It never really satisfies the phonemic requirement of separating one word from another, but it’s an important little sound nonetheless, this lazy vowel. Plus, schwa is fun to say.

There are a few more things to bear in mind about vowels. First, very few of the vowels we produce in English are pure, single sounds. Much like the consonantal affricates, sometimes two vowels get produced in such close sequence they act like a single sound. For vowels, these are called diphthongs. You can hear diphthongs in words like chase /t͡ʃeɪs/ and chai /t͡ʃaɪ/ (or the names of the letters A and I).

Second, vowels tend to vary a lot more from dialect to dialect than consonants do. You can look at all the varieties of English across the world, from Canada to New Zealand, and draft almost identical lists of consonants for all of them. On the other hand, you don’t have to be a linguist to tell that they all use (sometimes drastically) different sets of vowels.

Several times throughout this series, I’ve encouraged you to stop and to try producing some of these sounds. This is especially useful if you’d like to get a handle on vowels. Say some. Just roll sound around in your mouth. Make random, vowel-ish noises. I can give you clinical definitions of consonants. Vowels you’ve got to feel.

This is the end of Phonology 101, but if you’ve fallen in love with the science of speech production and sounds the way I have, don’t you fret. Phonology has a starring role in some of the most exciting stories English has to offer, and you can bet we’ll be bringing them your way. For now, you Scots lovers better hold onto your hats. Lisa’s back next week to help us understand some of the differences between English and Scots.

Notes

1As a point of curiosity, some languages, like Japanese, occasionally have voiceless vowels in specific contexts. But, this is a rarity. Like how hand sanitizer claims to kill 99.9% of germs.
2I know this is a blog, and we’re kind of asking a lot by asking you to remember all this terminology. Your cheat sheet for phonemes and allophones is right this way.

Phonology 101: Phonemes, aka phonology’s little superheroes

Welcome back to Phonology 101! Last time, we talked about the vocal tract and two major features of speech sounds, namely place of articulation and voicing (catch up here). Today, we’re going to stop using the word “sound” in favor of some more precise terms: phone, phoneme, and allophone. You may also find it useful to catch up on IPA (here); I’ll be introducing some important notation, and moving forward, I’ll be using IPA more and more.

Individual speech sounds are called phones. In talking about speech production, we keep distinguishing between phonetics (the concrete/physical) and phonology (the abstract/mental). Phones exist on the phonetic plane. Our mental representations of the sounds in our language are called phonemes. If your brain was a computer running a language program, phonemes would be all the sound files. As mentioned previously, the International Phonetic Alphabet is the standard for transcribing phones and phonemes. When using IPA, brackets put around the symbols indicate whether phonetic or phonological data is being represented. Phones go in square brackets (e.g. [a]), and phonemes go between slashes (/a/). On both levels, this notation can be used for individual sounds, complete words, or even phrases and longer utterances.

More technically, a phoneme is “a minimal contrastive sound unit of a language.”1 What that means is, phonemes cannot be broken down into smaller units, and switching a single phoneme for another can make the difference between one word and another. To determine the phonemes of a language, linguists look for something called minimal pairs, i.e. pairs of words that differ in pronunciation by a single phone. An example of a minimal pair is mine /maɪn/ and kine /kaɪn/,2 which demonstrates that /m/ and /k/ are phonemes in English. As you can guess, minimal pairs have a tendency to rhyme, but the differing sounds don’t need to be at the beginning of words. They can also come at the end, as in mine /maɪn/ and might /maɪt/, or in the middle, as in can /kæn/ and kin /kɪn/.3 (These examples also nicely illustrate the point we keep emphasizing about how separate spelling and pronunciation actually are.)

Now, the thing about phonemes is, the sound we think we’re producing isn’t always technically the sound that comes out. Based on where it is in a word or the sounds surrounding it, a single phoneme might be pronounced in several ways. The physical manifestations of a phoneme are called allophones. Some phonemes might have only one allophone, while others might have several. Some allophones might be associated with more than one phoneme. I’ve had several teachers who explained it like this: Sometimes Superman is Superman, and sometimes Superman is Clark Kent. Sometimes, he’s even Kal-El. How he appears depends on the environment he’s in, but at the end of the day, it’s all the same guy. A linguistic example: In my dialect, I pronounce rot as [ɹɑt], but rotten is [ɹɑ.ʔən]. (Like generic phones, allophones go in brackets.) My “t” sound in rotten is something called a glottal stop. Even though it’s a different phone, my brain says the middle sound in rotten is the phoneme /t/ (an alveolar stop). This comes out when I enunciate it and it sounds more like [ɹɑ.tən]. [ʔ] is an allophone of /t/. (As we haven’t really talked about any specific sounds yet, I’ve tried to pick examples that would be kind of intuitive for most speakers. Even so, it was hard to avoid using some sounds and symbols that may be making you go, “Huh?” Here’s an interactive IPA chart with sound clips if you’d like to hear what I’m talking about.)

Some allophones are pretty obvious to spot, but others are more subtle. In ram [ɹæ̃m] the vowel takes on a nasal quality not present in rat [ɹæt] through the influence of the following nasal consonant /m/. This is how allophones tend to come about. In actual speech, no sound is produced in a vacuum. In the interest of ease of articulation, our mouths carry over features from previous sounds or, as in the case of ram, anticipate upcoming sounds. This process is highly predictable and regular. A phoneme will consistently produce the same allophone in a given environment.

Okay, so what? This is all well and good for linguists who just want to Know, but does it have any impact on your life? Well actually, yes. Every language variety has its own phoneme inventory. Phonemes in one language may only be allophones in another (or may be missing entirely). When we pick up a language as children, our boxed set of language features includes a list of sounds we consider phonemes. As we grow up, this internal phoneme inventory solidifies and becomes very hard to alter. This explains why people have an accent when they learn a second language—they’re unconsciously using their native phonemes.

Our phoneme inventories don’t just influence the way we pronounce words. They also filter how we process what we hear. In my dialect, I pronounce merry, marry, and Mary the same. In some dialects here in the United States, these words contain three different vowels and therefore have three distinct pronunciations. I’ve studied phonology long enough that I can carefully reproduce these pronunciations (when I remember which vowel goes with which word), but I still can’t hear the different when a native speaker of this dialect produces these words. For me, all these vowels get lumped into the same phoneme, so that’s how I hear them.

This isn’t a big deal for a situation like mine, where the phoneme difference is slight and it will be easy for me to figure out what’s being said. The differences between phoneme inventories can get more pronounced4, and interfere more with comprehension, when you’re speaking a second language. For example, in Arabic, [b] and [p] are not distinct phonemes, so it’s hard for a native Arabic speaker to distinguish between English barking and parking.

And this is why we’re wading through all this terminology and technical swampland: Language is a real world phenomenon with real world consequences. As in the Arabic example, phonology can have an effect on actual communication, as can morphology, syntax, semantics, and the other basic disciplines of linguistics. Once we’re done laying the groundwork, we can get to the fun stuff (like, what’s up with English spelling? And how do you pronounce GIF?).

There will be two more posts in the Phonology 101 series (for now). In two weeks, we’ll get into the nitty-gritty of consonants, and then in March, we’ll go over vowels. Before we get to that, though, Sabina will be back next week to talk about something called the Sapir-Whorf hypothesis and ask the big questions: Does our language influence how we think?

Notes

1Giegerich, Heinz J. 1992. English phonology: An introduction. Cambridge: Cambridge University Press.
2An archaic plural of cow. I just really wanted to use it.
3Another somewhat archaic word for family and relatives. (But maybe you knew this one.)
4Lol, phonology puns.