New! Sign up for our email newsletter on Substack.

Lending an ear to rare languages

A unique collection of sound archives of rare and endangered languages is now accessible to all on the Pangloss website. Several thousand tales and stories in more than 170 languages, collected and documented by dedicated linguists, are now available in audio.

Do you know Ubykh, a Caucasian language that slowly declined and finally disappeared due to Russian expansion into the region from the late 19th century? Have you ever heard a song in Na, a Sino-Tibetan tongue still spoken in the mountains of Sichuan, a region of China located to the east of Tibet where Mandarin is gradually establishing itself as the sole means of communication? Together with 170 others, these little-known parlances are now available to all on the Pangloss website, an exceptional body of sound archives of rare and endangered languages collected thanks to the painstaking fieldwork of linguists.

โ€œOf the 6,000 tongues that exist in the world today, several thousand are insufficiently documented or not documented at all,” says Alexis Michaud, a linguist at the LACITO,1ย the laboratory that initiated the Pangloss platform. โ€œThese are non-written and have neither a dictionary nor a grammatical corpus.โ€ Dedicated linguists and ethnologists are attempting to fill this gap by recording tales and stories from the oral tradition, so that they can be transcribed phonetically and reveal their secrets. โ€œDocumenting a language usually represents a lifetimeโ€™s work,โ€ Michaud points out.

โ€œ25 languages become extinct every yearโ€

Until the advent of digital technology, research by specialists usually resulted in the publication of grammar books and dictionaries as well as, less frequently, translated stories for the general public, while the reels of tape bearing vocal recordings ended up gathering dust on shelves or were eventually lost forever. The Pangloss collection, which was launched over twenty years ago, aims to make up for these shortcomings by digitising that goldmine and making it accessible to as many people as possible, including linguists and researchers from across the world โ€“ for whom the bilingual French-English website provides dedicated pages and resources โ€“ but also, for the first time, interested members of the public.

 

Fieldwork to record shuhi (xumi) speakers, a Tibeto-Burmese tongue of oral tradition spoken by 1800 people.
Fieldwork to record shuhi (xumi) speakers, a Tibeto-Burmese tongue of oral tradition spoken by 1800 people.
โ€œHistorically, the greatest linguistic diversity is found in isolated areas, such as mountains and dense forests, like the Amazon and Papua New Guinea โ€“ the latter boasting more than 800 different languages!โ€ Michaud explains. โ€œWith rising temperatures, there is a risk that these regions may become uninhabitable. This is already happening in certain parts of the Himalayas, where melting glaciers are forcing communities to migrate to the lowlands and cities, where they gradually lose the use of their mother tongue.โ€ย ย 

3,500 audio and video documents

To save this endangered resource, 3,500 audio and video documents recorded by more than 50 linguists are already available on the Pangloss website, which is hoping to further expand its content over the coming months and years. โ€œResearchers should get into the habit of putting their sound archives online as they work, rather than waiting until the end of their career to get round to it,โ€ the specialist insists. โ€œNever mind if they donโ€™t all have a written transcription yet.โ€ To address this problem, the Pangloss platform is to provide automated language processing software in 2021, which should make the expertsโ€™ task much easier. โ€œUntil now, it took around at least a hundred hours of recording to train artificial intelligence to make transcriptions of a new language. With the interface we are preparing for the website, based on the latest available technology, one hour will be sufficient. This will be a real revolution,โ€ Michaud enthuses.

 

Written transcription, by Georges Dumรฉzil, of the Ubykh story โ€œEating fish makes you cleverโ€.

Developed on the open science principle, meaning that the documents put online are under a creative commons license and that anyone can access them freely, Pangloss also aims to be โ€œcautiously collaborativeโ€. โ€œIt wonโ€™t be possible to modify the documents directly, as on Wikipedia, but we will welcome any suggestions and offers of help, wherever they come from. Especially for translations of documents that donโ€™t yet have one.โ€ Among the many little gems to be found on the site, a story collected in the 1960s from the last Ubykh speaker (โ€œEating fish makes you cleverโ€) by the linguist and member of the Acadรฉmie Franรงaise Georges Dumรฉzil was, somewhat surprisingly, transcribed and translated thanks to the unexpected assistance of a student from California (US), who had learnt this extinct language all by himself using the dictionary and texts published by the very same Dumรฉzil.

Footnotes

  • 1.Langues et civilisation ร  tradition orale (CNRS / Universitรฉ Sorbonne Nouvelle / Inalco).
There's no paywall here

If our reporting has informed or inspired you, please consider making a donation. Every contribution, no matter the size, empowers us to continue delivering accurate, engaging, and trustworthy science and medical news. Independent journalism requires time, effort, and resourcesโ€”your support ensures we can keep uncovering the stories that matter most to you.

Join us in making knowledge accessible and impactful. Thank you for standing with us!