The Boarnsterhim Corpus (BHC) is a spoken language corpus containing bilingual Frisian-Dutch data of four generations of speakers (born between 1898 and 2000). The corpus contains two main parts: one recorded between 1984 and 1987, collected by Tony Feitsma; and one recorded between 2017 and 2019, collected by Marjoleine Sloos. Almost 30 speakers have been recorded at both moments, which make the data suitable for comparable panel and trend studies.
The corpus is still under construction and will be transcribed in Standard Frisian and Standard Dutch. It will also be POS-tagged. Eventually, the corpus will be part of the Clarin infrastructure, hosted by The Dutch Language Union.
The data are suitable for research in sociolinguistic variation and change, phonological change, variation and change in bilingualism studies, historiographic description, and also anthropology.
Students who are interested in an internship on one of the following (or related) topics are welcome to contact dr. Marjoleine Sloos firstname.lastname@example.org: bilingualism, phonetics, sociolinguistics, phonology, reading skills, language attitudes, corpus linguistics.
Interns and volunteers who are interested in the construction of the corpus (recordings, orthographic transcriptions in Dutch and/or Frisian, phonetic transcriptions, POS tagging) are also advised to contact dr. Marjoleine Sloos email@example.com