To boldly split what no one should split: The infinitive.

Lies your English teacher told you: “Never split an infinitive!”

To start off this series of lies in the English classroom, Rebekah told us last week about a common misconception regarding vowel length. With this week’s post, I want to show you that similar misconceptions also apply to the level of something as fundamental as word order.

The title paraphrases what is probably one of the most recognisable examples of prescriptive ungrammaticality – taken from the title sequence of the original Star Trek series, the original sentence is: To boldly go where no man has gone before. In this sentence, to is the infinitive marker which “belongs to” the verb go. But lo! Alas! The intimacy of the infinitive marker and verb is boldly hindered by an intervening adverb: boldly! This, dear readers, is thus a clear example of a split infinitive.

Or rather, “To go boldly”1

Usually an infinitive is split with an adverb, as in to boldly go. This is one of the more recognisable prescriptive rules we learn in the classroom, but the fact is that in natural speech, and in writing, we split our infinitives all the time! There are even chapters in syntax textbooks dedicated to explaining how this works in English (it’s not straightforward though, so we’ll stay away from it for now).

In fact, sometimes not splitting the infinitive leads to serious changes in meaning. Consider the examples below, where the infinitive marker is underlined, the verb it belongs to is in bold and the adverb is in italics:

(a) Mary told John calmly to leave the room

(b) Mary told John to leave the room(,) calmly

(c) Mary told John to calmly leave the room

Say I want to construct a sentence which expresses a meaning where Mary, in any manner, calm or aggressive, tells John to leave the room but to do so in a calm manner. My two options to do this without splitting the infinitive is (a) and (b). However, (a) expresses more strongly that Mary was doing the telling in a calm way. (b) is ambiguous in writing, even if we add a comma (although a little less ambiguous without the comma, or what do you think?). The only example which completely unambiguously gives us the meaning of Mary asking John to do the leaving in a calm manner is (c), i.e. the example with the split infinitive.

This confusion in meaning, caused by not splitting infinitives, becomes even more apparent depending on what adverbs we use; negation is notorious for altering meaning depending on where we place it. Consider this article title: How not to raise a rapist2. Does the article describe bad methods in raising rapists? If we split the infinitive we get How to not raise a rapist and the meaning is much clearer – we do not want to raise rapists at all, not even using good rapist-raising methods. Based on the contents of the article, I think a split infinitive in the title would have been more appropriate.

So you see, splitting the infinitive is not only commonly done in the English language, but also sometimes actually necessary to truly get our meaning across. Although, even when it’s not necessary for the meaning, as in to boldly go, we do it anyway. Thus, the persistence of anti-infinitive-splitting smells like prescriptivism to me. In fact, this particular classroom lie seems like it’s being slowly accepted for what it is (a lie), and current English language grammars don’t generally object to it. The biggest problem today seems to be that some people feel very strongly about it. The Economist’s style guide phrases the problem eloquently3:

“Happy the man who has never been told that it is wrong to split an infinitive: the ban is pointless. Unfortunately, to see it broken is so annoying to so many people that you should observe it.”

We will continue this little series of classroom lies in two weeks. Until then, start to slowly notice split infinitives around you until you start to actually go mad.

Footnotes

I’ve desperately searched the internet for an original source for this comic but, unfortunately, I was unsuccessful. If anyone knows it, do let me know and I will reference appropriately.

This very appropriate example came to my attention through the lecture slides presented by Prof. Nik Gisborne for the course LEL1A at the University of Edinburgh.

This quote is frequently cited in relation to the split infinitive, you can read more about their stance in the matter in this amusing post: https://www.economist.com/johnson/2012/03/30/gotta-split

“A language is a dialect with an army and a navy”

Hello HLC readers! I’m Lisa, I’m a Swede (this kind, not this kind, and hopefully never this kind) but I live in Scotland, and I’m here to talk to you about the differences between languages and dialects. Now, the title of this post, “A language is a dialect with an army and navy”, should have made everything clear, so that will be my contribution for today.

Joking!

I’m so not done. The title quote was made popular by the sociolinguist, and Yiddish scholar, Max Weinreich (in Yiddish, with Roman letters: a shprakh iz a dialekt mit an armey un flot)1. This particular quote has been passed down to me on average once per each course I’ve taken in my four years of studying linguistics, which either tells you 1. Linguists are in serious need of new content, or 2. This is probably important for budding linguists to discuss. Both might be true in some cases, but most of the time 2 is the correct answer. We will need to tread carefully, and I don’t intend to make any political statements, but simply to shine some light on the complexity of the matter which, in fact, is often highly political. One final disclaimer: This is a really difficult topic to summarise. Bear with me.

For some of you reading, the question of what is and isn’t a language is probably something you haven’t thought about a lot. Some of you may think that the distinction is clear-cut; a language is distinct, it’s not similar to or dependent on anything else, and a dialect isn’t. You may even say that dialects are clearly sub-languages, because of the very way we phrase “dialects of a language” to imply that dialects belong to a language and not vice versa. Further, dialects are mutually intelligible (i.e. speakers of different dialects of one language can understand each other), which is not the case with languages. This is not exactly wrong, it’s just overly simplified.

First of all, if mutual intelligibility is a dialect criterion then my native Swedish could arguably be a Scandinavian dialect rather than a proper language – I, like most Swedes, understand Norwegian very well, and to some extent Danish, if spoken slowly (I’m currently working on my spoken Danish comprehension by watching both the Bridge and the Killing… My crime vocabulary is looking pretty solid by now). However, a lot of Swedes would not be thrilled to be told that their language is a dialect, and it does feel counter-intuitive to call it one.

On the other hand, there are agreed-upon dialects that are not mutually intelligible. Why are the dialects of, for example, Italian still called dialects, despite speakers of, for example, Emilian and Sicilian not being able to understand each other2 , while Norwegian and Swedish are officially agreed upon to be different languages? Also, what makes people call Catalan a dialect of Spanish (Don’t shoot the messenger!), or Cantonese a dialect of Chinese? Can you see a pattern forming? I’ll spell it out: The term language is most often, but not always, awarded to those “dialects” that have, or have had, official language status in a country, i.e. the dialect of those in power. The term dialect, or lect, is sometimes used neutrally in linguistics to cover both official languages and dialects, but there is  another term which is also used that I like more: variety. Variety is less socio-politically charged, and I use it all the time to avoid having to make a language/dialect distinction when I talk about linguistics.

There are, however, exceptions to the ‘official language’-criterion. If we go back to Spain, for example, no one would argue that Basque is a dialect of Spanish because Basque looks and sounds nothing like Spanish at all (or maybe some would argue this, but could we all agree that this is an unusual opinion?). So, there must be an element of likeness, or similarity, involved. Preferably the variety in question would be a part of the same language family3  – this could be why no one argues the language status of indigenous varieties, like Sami varieties in northern Scandinavia or the various native American varieties like Navajo and Cree.

My take on the issue is this: What people choose to call a language is largely based on four criteria:

  1. Is this variety an official language of a country?
  2. Is the variety distinct in terms of likeness to the official language of that region? Recall what was said above about indigenous languages.
  3. Is this variety considered an example of how that variety should be spoken, i.e. a standard variety, that also has sub-varieties (dialects) that diverge from that standard? An example: British English has a standard, sometimes called BBC English, or RP, but also a plethora of quirky dialects like Geordie, Scouse, Scottish English, Brummie, etc., all still considered to be English.
  4. Does it have an army and a navy?
  5. I jest.
  6. The real number 4: Is the variety standardised? Can we study it with the help of grammars and lexicons? Is it taught in schools? (Language standardisation is a whole topic of its own, which we will come back to in a later post.)

We can see that the term language is strongly connected to the status a variety has in a nation, it is a term that is awarded or given. When we attempt linguistic distinctions between languages and dialects, things get confusing really quickly. Is differing syntax, for example word order differences, more distinguishing than differing vocabulary? Norwegian and Danish have largely similar vocabularies, but very distinct pronunciations, so how does that factor in when we determine whether they are distinct languages or dialects of one variety? How much is the mutual intelligibility due to close contact, rather than actual similarities4 – do I understand Norwegian well because I grew up a couple of hours from the border to Norway, or because Norwegian and Swedish are so similar?

It is also relevant to talk about the historical perspective (after all this is is the Historical Linguist Channel). To throwback to Rebekah’s post last week, we know that English has changed a lot since the Anglo-Saxon times. We all tend to agree that Latin is one language distinct from Spanish, French, Italian, Portuguese and Romanian, but we also know that these languages all originate from Latin. What about English then? Old English and Present Day English look different enough that we could happily call them distinct languages, but what about Early Modern English? When do we say a variety has diverged enough from its parent language to be considered a language in its own right? Is my grandmother’s sister, my great-aunt, a part of my immediate or extended family? Well, that often depends on my relationship to my great-aunt, which brings us back to the subjectivity of the question.

The point I’m trying to make with these confused ramblings is that the term language cannot be defined linguistically, but is a wholly social and political term. The people of Montenegro generally refuse to recognise their variety’s similarity to Serbian, despite the varieties being largely indistinguishable – they speak Montenegrin. Knowing the history of the region though, we might be able to see where the Montenegrians are coming from, why it feels important for them to distinguish themselves as a people through their language5 . When we discuss what a language is, it’s important to keep in mind what the term means for the people who use it. Our language is tightly connected to our sense of identity; this is one reason why we’re so reluctant to see it changing or being used in a way we perceive as wrong (throwback to Sabina’s and Riccardo’s posts). The term dialect is somehow seen as inferior to language, and thus the terminology becomes a much larger issue than any linguistic definitions we can make.

Related to this issue are topics like standardisation (mentioned above), minority languages, and the idea of debased English. The latter two are also upcoming topics. In future posts, I will be addressing a variety that is my special interest, Scots 6, which is particularly affected by the issues discussed here. Scots is a Germanic variety spoken in Scotland, which is closely related to English but is still distinct from English (much like Swedish and Norwegian). First, however, I will be back next week to outline the main disciplines that fall under the umbrella of linguistics.

Footnotes

1He didn’t utter the quote first though, but an auditor in one of his lectures said it to him. I recommend reading about the situation on Wikipedia.

2Ask Riccardo about this issue and your evening entertainment is sorted.

3“Language family” is the name given to a group of languages which share an ancestor. We will dedicate more time to this topic at a later point. Meanwhile, you may admire this beautiful Indo-European and Uralic family tree.

4These and other questions are addressed by linguistic typologists, who try to map the languages of the world, categorise them and determine their relatedness.

5This fact was brought to my attention by a student from Montenegro during the course Scots and Scottish English, taught by Dr. Warren Maguire at the University of Edinburgh. A lot of the discussions we had in that course have provided background for the arguments and questions presented here.

6The Angus Macintosh Centre for Historical Linguistics have made brilliant videos explaining the history of Scots, in both Scots and English. I strongly recommend watching these!

Old English ain’t Shakespeare (feat. Dinosaurs)

Yes, hello. Rebekah, 26, American. I can hardly contain myself, so let’s just get straight to it:

When I was a teenager, one of my favorite things was the part of the dictionary where it tells you the history of the word. “And Latin bos begat Old French boef, and Old French boef begat English beef.”1 (Okay, that’s not how they phrase it. Also, this area of study is called etymology.) Then, my senior year in high school, while I was applying to colleges, I learned you could actually major in that. Somehow, I had never heard of linguistics before.

Of course, there’s a whole lot more to linguistics than just where words come from. There’s how the words fit together to form sentences, and there’s the 7,000+ languages in the world and how they’re alike and how they’re not, and there’s all these crazy sounds our mouths can make to combine in a billion different ways and become human speech.

I was taking a class on the history of English when I had my eyes-meeting-across-a-crowded-room, have-we-met-before, do-you-think-this-is-destiny moment. I was doing the assigned reading on Old English, and it was all about Saxons and the Danelaw and Alfred the Great and scops, and something about it all reverberated in the marrow of my bones. It was like hearing a song I’d forgotten a long time ago. A thousand-odd years of history collapsed in on itself, and I could feel the blood of my Anglo-Saxon forebears humming through me. (Too much? Too much. Moving on.)

It was only when I went to share this indescribable feeling with everyone I met that I realized I had a problem. The conversation went like this:

Me: I love Old English! *heart eyes, preparing to gush*
Them: Oh, that’s cool. So you like Shakespeare?
Me: *wilting and dying inside*

Don’t get me wrong, I do love Shakespeare. But here’s a super cool linguistic fun fact: Shakespeare’s language, and the language of the King James Bible, and the language of all those other historic sources inspiring your friendly local Renaissance festival players, that’s a little something we linguists like to call “Early Modern English.”

The periods of English

Let’s talk about dinosaurs. Everybody loves dinosaurs, right? Between the chicken nuggets, the tee shirts, and movies like The Land Before Time and Jurassic Park, most people know the names of at least two or three, and they probably have a favorite. (Mine’s triceratops, if you’re wondering.)

Dinosaurs lived during the Mesozoic Era, a 186-million-year period of geological time further subdivided into the Triassic, Jurassic, and Cretaceous periods.2 I’m about to painfully rewrite your childhood, so sorry in advance. Littlefoot, lovable hero of The Land Before Time, was either a brontosaurus or an apatosaurus. These titanic, long-necked herbivores lived in the Late Jurassic. Cera, Littlefoot’s triceratops best friend, would have lived during the Late Cretaceous—some 77 million years later. As long-distance, time-traveling romances go, it’s arguably a little more problematic than The Lake House. Not least because dinosaurs didn’t have mailboxes.

I know what you’re thinking: “Great, Rebekah. That’s just great. Friendship over. Before I delete your number, what does this have to do with linguistics? Are you trying to tell me dinosaurs spoke English?”

As appealing as it is to imagine all our favorite dinosaurs living together as one big happy family, 186 million years is a long time for everything to stay the same. Likewise, as easy as it is to think that English is English, always has been and always will be, languages grow and evolve, too. (Sabina talked about this a little last week.) No matter how different they became, though, from the time they emerged in the Late Triassic until they disappeared at the end of the Cretaceous, dinosaurs were still dinosaurs. It’s kind of the same with languages.

A lot of the dinosaur species people are most familiar with—triceratops, hadrosaurs, velociraptors, and Tyrannosaurus rex, to name a few—lived during the last period, the Cretaceous (yep, Jurassic Park is a bit of a misnomer). This was the period of greatest dinosaur diversity. The latest period of English is called Modern English, and it’s the one you’re probably most familiar with. It started in roughly the late 1400s and runs up to the present. This, too, is a period of impressive diversity, with distinct varieties of English spoken around the world, from Australia to Canada, from India to England, and everywhere in between. As far as literature goes, a lot of the famous English-language works considered part of the Western canon were written during this time, including the works of William Shakespeare, Charles Dickens, Mark Twain, and many others. There are also contemporary works like those of Stephen King, Nicholas Sparks, and Dr. Seuss—all those books, magazines, and newspapers filling up your local library (if you happen to live in an English-speaking country).

Of course, no matter how awesome it would be to see a rap battle between Shakespeare and Dr. Seuss, even the casual reader will flag their writing as seeming like not quite the same language. As mentioned earlier, Modern English can be separated into Early and Late, with the divide being marked at about 1800. Period distinctions like this are the result of shifts in grammar, pronunciation, and word stock throughout the language, though the specific dates often coincide with historical events that had a widespread impact on culture. (Like the mass extinction events that separate the different periods of the Mesozoic Era. But somewhat less catastrophic.) In the case of Modern English, the starting point is often cited as 1476, the year William Caxton introduced the printing press to England. The ability to mass produce written materials would have a profound effect on literacy and the dissemination of linguistic features. In 1776, the American colonies declared independence from England. Some consider the American Revolution the start of Late Modern English and a period of globalization for the language, as over the following decades the British continued to spread their language, colonizing places like Australia, South Africa, New Zealand, and India.

As useful as dates like these can be for roughly marking linguistic time, languages unfortunately don’t work like that. The line between one stage of English and another isn’t as clear cut as turning over a page of your Gregorian calendar on January 1st and magically finding yourself in a new year. Linguistic shockwaves and subtle nudges take time to spread. A great example of this is Middle English.

On our timeline, Middle English is our Jurassic period. During the Jurassic, dinosaurs began to flourish. They hadn’t yet reached the height of diversity of the Cretaceous, but there are still some Jurassic species everybody recognizes, like the stegosaurus or aforementioned sauropods like the brontosaurus. There’s at least one big Middle English name you’ll recognize, too: Geoffrey Chaucer. If you’ve read just one work that predates the Modern English period, I’d bet good money it was some portion of Chaucer’s seminal Canterbury Tales. See? You knew there was English older than Shakespeare’s, even if you didn’t know you knew it. The Canterbury Tales begins:

WHAN that Aprille with his shoures soote
The droghte of Marche hath perced to the roote,
And bathed every veyne in swich licour,
Of which vertu engendred is the flour;3

It might look a little odd and incomprehensible, but with just a little elbow grease, most people can puzzle Chaucer out. (It helps to read it out loud.)

Chaucer died in 1400, and his language was that of the latter end of Middle English. Works from Early Middle English are rare, but one very important one is the Peterborough Chronicle, a historical record periodically updated with the important events of each year up to 1154. It only takes a little squinting to recognize Chaucer’s language as an earlier form of English, but the Peterborough Chronicle starts to look like it was written in a different language entirely. If Chaucer was writing in a kind of pre-Shakespeare, the Peterborough Chronicle was written in a kind of post-Anglo-Saxon, two ends of a transitionary continuum. Due to the nature of the Peterborough Chronicle itself, we can watch the language gradually change in the time between entries.

And so, we come at last to true Old English. The Triassic period, I guess? (Look, I can only push this metaphor so far.) The transition from Old to Middle English is traditionally marked by the Norman Conquest of England in 1066. William the Conqueror became William I, and he repopulated the court and the clergy with French-speaking Normans. The sovereignty of French men, French culture, and the French language had a profound effect on English, explaining the rather Romance sound of the language today. Strip that influence away, go back to England between AD 500 and AD 1000, and you’ll find the very Germanic origins of the language we call English. The most famous of all the surviving Old English works is the epic poem Beowulf. It begins like this:

Hwæt we Gar-Dene     in geardagum,
þeodcyninga     þrym gefrunon,
hu þa æþelingas     ellen fremedon.4

It reads something along the lines of:

Lo, we of the Spear-Danes in days of yore,
learned by inquiry of the kings of the people,
how those princes did valor.

This was the language of the Germanic tribes who migrated to Britain and displaced the Celts, the peoples who would become the Anglo-Saxons. The Beowulf poem began as part of an oral tradition and was later written down. In style and content, it’s somewhat like the Norse Eddas, which perhaps isn’t surprising considering the Anglo-Saxons shared a Germanic heritage with the Vikings and continued to have contact with them after settling Britain (both friendly and not so friendly). Old English manuscripts show a people transitioning from paganism to Christianity, a warlike people with an awful lot of synonyms for “sword” and “kill,” but also a cultured people with a sophisticated poetic meter and a penchant for alliteration. Shakespeare was a long way down the road.

Back to the future

The story of English is far from over. It’s still being written all around us. As I said, language is in constant flux, and it can be hard to know when to say, “Hang on a second. I think we’ve stumbled into a new stage of English!” Linguists today are even starting to distinguish the most current English, the one we’re speaking right now (and tweeting at each other and scribbling down on post-it notes and dropping in beats in epic rap battles), with the appellation Present Day English, leaving Shakespeare and Dickens and all the rest a little farther in the past.

Don’t think this phenomenon is unique to English. Other languages have gone through some incredible changes, too. Old French boef eventually became French boeuf, and really, French is just grown up Latin, just like Spanish, Italian, Portuguese, and all the other Romance languages. (Language families are a subject for another day.) And language, all language, is going to go right on changing as our cultures and our communication needs go right on changing. To paraphrase Jurassic Park, “Language finds a way.”

Next week with Lisa: As hard as it is to say when a language has entered a new stage of its evolution, one of the most complicated questions facing linguists is the problem of where to draw the distinction between a language and a dialect. What makes something a separate language rather than just a variety of another? When do we say a dialect has diverged enough from its parent language to be considered a language in its own right?

Bibliography

There are many wonderful works covering the history of English. For specific dates and details mentioned here, I referenced:
Algeo, John & Thomas Pyles. 2005. The origins and development of the English language, 5th edn. Boston: Thomson Wadsworth.

1Oxford English Dictionary Online

2General information about dinosaurs was found on Wikipedia & from the article “Learn about the different dinosaur periods” at ThoughtCo.

3The prologue to The Canterbury Tales at Bartleby.com

4Mitchell, Bruce & Fred C. Robinson. 2012. A guide to Old English, 8th edn. Chichester, West Sussex: Wiley-Blackwell.