What is language revitalization about? Some insights from Provence

by James Costa (Sorbonne Nouvelle / UMR LACITO (CNRS), Paris)

Should you find yourself in Provence this summer, you might wonder why some villages have bilingual signs at the entrance. Your surprise would be forgiven, since you are unlikely to have heard anything but French in most places, and likely a lot of English as you approach the Mediterranean. But if you listen more closely, observe more closely, you might come across a world that is fast vanishing, but that is still present. You might stumble upon a concert in a language that you cannot identify, or wonder why some street names don’t sound French. You might even hear people speak Occitan—for this is what it is, a language also known as Provençal, one which many locals will refer to as “Patois” (a derogatory term in France to refer to anything other than French traditionally spoken in the country).

Bilingual sign (French, Provençal)

This sort of experience might happen to you in Provence, but not only. Across the European Union, several million people speak a language that is not the official language of the state they live in. Across Europe, there are language advocates who defend and promote the right to speak one’s language. This struggle for language rights also extends to Latin America, North America, Australia, and many other places. This, many scholars assert, is a consequence of globalization—a backlash against uniformity if you like. A way of being oneself, of finding meaning locally in a world that seems to be getting smaller. In my recent book, Revitalising Language in Provence: A Critical Approach, I argue otherwise. Those movements are not a reaction to globalization—they are, on the contrary, a way of taking part in this process, on the very terms defined by those who define what globalization is (and not on their own terms, as Leena Huss [2008, 133] asserts).

But let’s start from the beginning. This book focuses on Provence, home to what is perhaps the earliest language reclamation movement, or at least one of the earliest. Poets had already started writing texts in defense of Gascon, Provençal or Languedocien (all dialects of what most scholars of Romance linguistics view as Occitan) back in the 16th and 17th centuries. This is perhaps a consequence of an increasingly aggressive move to promote French in all administrative domains at the expense of Latin and Occitan, which had been in use for official usage for centuries in what is now Southern France. But it was after the French Revolution Terror government (after 1793) sought to eradicate the “patois” that a genuine interest was born in various parts of France, resulting in the south in a rediscovery of the poetry of Medieval Troubadours and in a scholarly interest in the history of Provence and Languedoc before their annexation to France. It wasn’t, however, before the 1850s that an organized language-based movement was formed, under the aegis of poets such as Frederic Mistral or Joseph Roumanille.

The Felibrige was the name they gave to their movement, a name whose origin remains mysterious. The Felibres sought to revive the Provençal or Occitan language (which was still almost universally spoken in all of Southern France) through poetry and literature. And indeed, Mistral published a series of long, epic poems that were hailed across Europe as monuments of literature. Mirèio is probably his most well known poem, a love story set in the Crau region of Provence and an allegory of the language revival movement. Mirèio was acclaimed in Paris as a chef d’æuvre, and was prefaced by Lamartine.

I recount parts of the history of the movement in the book but for this blog post, suffice it to say that while successful on a literary level, it never succeeded in political terms. Provençal was long banned in education, and despite a strong Occitan movement throughout the 20th century, the use of Provençal continued (and continues) to decline. But the story I tell in this book isn’t the story of the language movement. Instead, following a two-year ethnographic study in Provence, I ask why the movement was based on language at all, like so many others afterwards—but, crucially, none before, or at least none before the 1840s.  Continue reading “What is language revitalization about? Some insights from Provence”

Big and small data in ancient languages

by Nicholas Zair (University of Cambridge)

Back in November I gave a talk at the Society’s round table on ‘Sources of evidence for linguistic analysis’ on ‘Big and small data in ancient languages’. Here I’m going to focus on one of the case studies I considered under the heading of ‘small data’, which is based on an article that I and Katherine McDonald and I have written (more details below) about a particular document from ancient Italy known as the Tabula Bantina.


It comes from Bantia, modern day Banzi in Basilicata and is written in Oscan, a language which was spoken in Southern Italy in the second half of the first millennium BC, including in Pompeii prior to a switch to speaking Latin towards the end of that period. Since Oscan did not survive as a spoken language, we know it almost entirely from inscriptions written on non-perishable materials such as stone, metal and clay. There aren’t very many of these inscriptions: perhaps a few hundred, depending on definitions (for instance, do you include control marks consisting of a single letter?). We are lucky that Oscan is an Indo-European language, and, along with a number of other languages from ancient Italy, quite closely related to Latin, so we can make good headway with it. Nonetheless, our knowledge of Oscan and its speakers is fairly limited: it is certainly a language that comes under the heading of ‘small data’.



One of the ways scholars have addressed the problem of so-called corpus languages like Oscan, and even better-attested but still limited ones like Latin has been to combine as many relevant sources of information, from ancient historians to the insights of modern sociolinguistic theory as a way of squeezing as much information from what we have – and trying to fill in the blanks where information is lacking. This has been a huge success, but this approach can also be dangerous, especially when it comes to studying language death. Given that we know a language will die out in the end, it is very tempting to see every piece of evidence as a staging post in the process, and try to fit it into our narrative of language death. Often this provides very plausible histories, but we must remember that, while in hindsight history can look teleological, things are rarely so clear at the time.

The Tabula Bantina is a bronze tablet with a Latin law on one side and an Oscan law on the other side. It is generally agreed that the Latin text was written before the Oscan one, but the Oscan is not a translation of the Latin: the writer of the Oscan text simply used the conveniently blank side of the tablet to write the new material on. The striking things about the Oscan text are that it is written in the Latin alphabet, and there are lots of mistakes. It also strongly resembles Latin legal language. The date of this side is probably between about 100-90 BC, just before Rome’s ‘allies’, which is to say conquered peoples and cities in Italy, rose up against it in a rebellion generally known as the Social War. Continue reading “Big and small data in ancient languages”

The Morphological-to-Analytic Causative Continuum in Hausa: New Insights and Analyses in a Typological Perspective

by Philip J. Jaggar (School of Oriental and African Studies, University of London)

Over the last few decades, linguists have devoted considerable attention to both homogeneity and variation in the expression of causal events across languages. However, most studies, whether typological or language-specific, have focused on the category of morphologically overt (e.g., ‘lie/lay X down’) causatives, to the relative neglect of complex periphrastic (e.g., ‘get X to lie down’) formations.

The present study addresses this imbalance by elucidating a wide spectrum of causative expressions in Hausa (Chadic/Afroasiatic), supported by a strong cross-linguistic perspective. In line with contemporary approaches located within a general typology of causation, the analysis invokes the widely-accepted dichotomy between direct and indirect causative constructions. Direct causation associates with morphological causatives, indirect causation with periphrastic expressions—compare morphological ‘I lay X down’ (direct, with no intermediary) with periphrastic ‘I got X to lie down’ (indirect, where X also functions as an intervening actor/cause).

Hausa uses an indirect periphrastic causative usually formed with sâa ‘cause’ (lit. ‘put’) as the higher causal verb, e.g., nâs taa sâa yaaròn yaa kwântaa ‘the nurse got the boy to lie down’ (= intransitive kwântaa ‘lie down’). Direct morphological causatives, in contrast, associate with a specific derivational formation, known as “Grade 5” (Parsons 1960/61), e.g., nâs zaa tà kwantar̃ dà yaaròn ‘the nurse will lay the boy down’.

The monograph systematically explores, for the first time in an African language to our knowledge, the key design-features that distinguish the two mechanisms, in addition to demonstrating that Hausa periphrastic causatives can also differ from each other, e.g., in implicational strength, depending on the modal (TAM) properties of the lower clause. In so doing, it provides a rare account of how the two types are used to describe pragmatically different causal events and participant roles.

Jaggar, Philip J. (2017) The Morphological-to-Analytic Causative Continuum in Hausa: New Insights and Analyses in a Typological Perspective. (Abhandlungen für die Kunde des Morgenlandes, Band 109). Wiesbaden: Harrassowitz.
Available from 1 June 2017.

Latin in Medieval Britain

by Richard K. Ashdowne (University of Oxford; Honorary Membership Secretary, PhilSoc)

Of the many languages in use in Britain in the middle ages, Latin is arguably the best attested and yet most overlooked. Not the native language of any of its users and employed especially—though certainly not exclusively—in written functions, Latin has tended to be the elephant in the room despite its indisputable importance for its users and their societies.

After the departure of the Roman legions from Britain, Latin’s continued use was by no means assured, but there is a continuous train of use down to the time of the Tudors and beyond. Over more than a thousand years British medieval Latin was employed for all manner of functions from accountancy to zoology.

In this new collection of papers, arising from the conference held to celebrate the completion in print of the Dictionary of Medieval Latin from British Sources, the place of Latin in medieval Britain is examined from a variety of historical, cultural and linguistic perspectives and in relation to some of its many different contexts.

In the first part, David Howlett, Neil Wright, Wendy Childs and Robert Swanson look successively at the start of the Anglo-Latin tradition, the twelfth-century renaissance, the use of Latin in historiography and record-keeping in the fourteenth century, and the continued use of Latin in the medieval tradition into the fifteenth and sixteenth centuries. The vitality of the language over the ages and its users’ constant reinvention of its role emerge as central themes.

In the second part, attention is directed to particular fields, namely law (Paul Brand), musical theory (Leofranc Holford-Strevens), the church (Carolinne White) and science (Charles Burnett), as examples of how the Latin language was used and adapted to its roles. That it was being employed in historical, social, cultural and linguistic settings quite different from its ancient ancestor had important consequences. It meant that, for instance, Latin was frequently in need of new terminology for the contemporary world, especially in some of these more technical areas. Borrowing, calquing and native word-formation processes were all ways of meeting this need, reflecting the inherent contact between Latin and its users’ native vernacular languages.

In the third and final part, these linguistic contacts become the central focus in chapters examining the relationship between Welsh and Latin (Paul Russell), the relationship between Latin and English (Richard Sharpe), the development of a mixed-language code (Laura Wright), the relationship of Germanic, Anglo-Norman French and Latin (David Trotter), and the relationship between English and Latin (Philip Durkin and Samantha Schad). The final chapter, by David Howlett, ties in with some of the lexicographical questions raised by Sharpe, Trotter, and Durkin and Schad, and looks back at the process of preparing the Dictionary of Medieval Latin from British Sources.

Latin in Medieval Britain is edited by Richard Ashdowne and Carolinne White and  published by the British Academy in association with OUP. Many of the contributors are members of the Society and current or former members of Council.

Further information, including abstracts of all the chapters, can be found on the DMLBS blog and the book can be obtained directly from OUP and all good booksellers.

One Language, Two Grammars: the ‘Plight’ of Classical Armenian

by Robin Meyer (University of Oxford; Hon. Secretary for Student Associate Members)

Armenian is one of those Indo-European languages that very rarely gets much attention from students of historical linguistics or comparative philology; most frequently, it crops up only in discussions of the augment, laryngeals, and the Glottalic Theory. This, alas, is unlikely to change.
Yet, Armenian can serve as an interesting case study for a number of fields within linguistics, not least language contact and corpus linguistics. With these two topics in mind, allow me to introduce you to Armenian – albeit in extreme brevity –, and to illustrate one of its more curious traits: its two grammars.

Map of Armenia in the 2nd and 1st centuries BCE
An exceedingly short introduction: Iranian, Greek, and the Armenian language(s)

Armenian, attested in its Classical form (called գրաբար |grabar|) since the 5th century CE, is a language with a couple of twists. Until a ground-breaking paper by Heinrich Hübschmann (1875), Armenian was thought to belong to the Iranian language family. In fact, Armenian is most closely related to Greek – and even that not all that closely (Clackson 1994). For the most part, this relationship is not immediately obvious at the surface, particularly if compared to the similarities between, for instance, Vedic and Old Avestan, or Latin and Oscan.
The reason for its historical allocation to the Iranian family lies in the inordinate amount of Iranian loan words and calques, both lexical and phraseological, in Armenian. These are mostly taken from Parthian (North West Middle Iranian; Meillet 1911–12, Schmitt 1983). Less obviously, even certain Iranian syntactic structures and patterns have been replicated (Meyer 2013, 2016). These borrowings are, without doubt, owed to long-lasting contact between Armenian and Parthian speakers. Since the 5th century BCE, Armenia was under Iranian rule in one form or another: Achaemenid, Artaxiad, Arsacid Parthian, and later Sasanian Persian. For the most part, an Armenian king of Iranian origin ruled as primus inter pares among other Armenian and Iranian noble families. The history and ethnic composition of Armenia is, of course, far more complex than can be described in one sentence; excellent summaries can be found in Hovannisian (1997).

So far, so good. Continue reading “One Language, Two Grammars: the ‘Plight’ of Classical Armenian”

Transitive nouns and adjectives: evidence from Early Indo-Aryan

by John J. Lowe (University of Oxford)

LoweTransNomsTransitivity is typically thought of as a property of verbs, and perhaps of adpositions, but it is not a typical property of nouns or adjectives. In the influential cross-classification of syntactic categories developed by Chomsky (e.g. 1981: 48), nouns and adjectives are actually defined in opposition to adpositions and verbs by their inability to govern objects, that is by their inability to be transitive. A few authors have discussed exceptions to this generalization, but they tend to be rare and non-productive; for example in English there may be only a single transitive adjective, near, which is a historically explicable exception to an otherwise consistent synchronic rule that nouns and adjectives cannot govern ‘bare’ noun phrase complements (Maling, 1983). As a second example, in early Latin there are a few nouns and adjectives which may govern accusative case objects, but the process is not productive and is entirely eliminated by Classical Latin.

gnaruris                          vos                    volo                esse        hanc                 rem
acquainted.ACC.PL     you.ACC.PL    wish.1PL       be.INF   this.ACC          matter.ACC
‘I wish you to be acquainted with this matter.’ (Latin: Plautus Most. 100)

In the early Indo-Aryan languages, however, there is a relative wealth of transitive noun and adjective categories. In my forthcoming monograph Transitive Nouns and Adjectives: evidence from early Indo-Aryan (OUP, July 2017), I investigate the evidence from four periods of early Indo-Aryan, discussing the synchronic and diachronic explanation for this unusual phenomenon.

The majority of transitive noun/adjective categories in early Indo-Aryan fall under the traditional heading of ‘agent noun’ (including agentive adjectives, used in the same way); these are the categories whose transitivity is most clear, and most common. For example, in the sentence below the ‘agent adjective’ kāmin- ‘desirous, desiring’ governs an accusative object ‘drink’.

kāmī                                   hi       vīraḥ                            sadam    asya        pītim
desirous.NOM.SG.M   for      hero.NOM.SG.M     always    it.GEN    drink.ACC
‘For the hero (is) always desirous (of) a drink of it.’ (Sanskrit: RV 2.14.1c)

Superficially, kāmī here looks similar to a participle, i.e. to a word category which, as a non-finite verbal category, could unproblematically govern an object. However, I show that the majority of transitive nouns and adjectives attested in early Indo-Aryan cannot be analysed as non-finite verb forms, but must be acknowledged as part of a distinct constructional type in early Indo-Aryan.

Other transitive nouns fall under the traditional heading of ‘action nouns’; I show that for the most part action nouns are transitive only when used as infinitives, and hence their transitivity can be explained as the unexceptional transitivity of non-finite verb forms. There are also nouns and adjectives whose transitivity is adpositional, rather than verbal.

Crucially, I show that there is a statistical correlation between transitivity of nouns and adjectives and the syntactic context of predication: nouns and adjectives which are used as the primary predicate in a (perhaps null) copular construction are statistically more likely to be transitive than those which are used in other ways. This correlation is unique to transitive nouns and adjectives and securely distinguishes this formation from transitivity with non-finite verb categories.

The book provides a detailed introduction to transitivity (verbal and adpositional), to the categories of agent and action noun, and to early Indo-Aryan. The four periods of early Indo-Aryan selected for study are: Rigvedic Sanskrit, the earliest Indo-Aryan; Vedic Prose, a slightly later form of Sanskrit; Epic Sanskrit, a form of Sanskrit close to the standardized ‘Classical’ Sanskrit; and Pali, the early Middle Indo-Aryan language of the Buddhist scriptures. I show that while each linguistic stage is different, there are shared features of transitive nouns and adjectives which apply throughout the history of early Indo-Aryan.

The data is set in the wider historical context, from Proto-Indo-European to Modern Indo-Aryan, and a formal linguistic analysis of transitive nouns and adjectives is provided in the framework of Lexical-Functional Grammar.


Chomsky, Noam (1981), Lectures on Government and Binding: The Pisa Lectures, Dordrecht: Foris.

Lowe, John J. (2017), Transitive Nouns and Adjectives: Evidence from Early Indo-Aryan, volume 25 in the series Oxford Studies in Diachronic and Historical Linguistics. Oxford: Oxford University Press. c. 400 pp. ISBN: 978-0-19-879357-1.

Maling, Joan (1983), ‘Transitive adjectives: a case of categorial reanalysis’, in Frank Heny & Barry Richard (eds.), Linguistic Categories: Auxiliaries and Related Puzzles, volume 1. Dordrecht: Reidel. 253–289.

‘The Word Detective’ serialised on BBC Radio 4

by John Simpson (Chief Editor, Oxford English Dictionary, 1993–2017)

John Simpson
(© Bloomington Photography)

A generation ago, my colleagues and I at the OED were starting to become increasingly aware that the dictionary was in danger of drifting away from its audience. Or, to put it more accurately, the dictionary was standing still while its audience moved into the twentieth and then the twenty-first centuries.

Historical lexicography is demanding. There are few short cuts; standards are exacting. The editors of the First Edition of the OED had laboured for many years to capture the history of our language, and its format reflected nineteenth-century expectations about how knowledge should be presented. Nowadays the level of scholarship at the OED is the same – it has to be. But a wider audience expects to be able to access and understand the dictionary in radically new ways.  One of the challenges of the last few decades has been how to present the content of the OED to a new readership in the digital age.

Picture2I wrote The Word Detective to give readers an informal, behind-the-scenes look at the OED and the extraordinary things it has set out to achieve over the last forty years. In addition, I wanted to convey to readers the excitement of researching and defining the language – because that’s what we all felt as editors.

The Word Detective will be broadcast at 7.45 p.m. this Monday to Friday (13–17 March), on BBC Radio 4. See if I achieved it!



John Simpson’s ‘The Word Detective’ is published by Little Brown in the UK, and Basic Books in the USA.