Corpus of Serbian Forms of Address


The corpus consists of transcripts of audio-recorded biographical interviews with 19 participants. The interviews are about forms of address that speakers use in colloquial and formal settings, and about their attitudes and evaluations concerning particular forms of address. It has been transcribed manually with GAT conventions and automatically aligned with the respective audio segments.

This repository contains corpus transcripts in HTML format. Corpus annotations can be shown by selecting "Annotations". Users can add and export their annotations by selecting "Annotate".

The annotated corpus (including morphosyntactic information, lemmas, and normalisations) is available at CLARIN.SI.

The corpus can be searched at NoSketchEngine and KonText.

Corpus creation


Transcripts

Interviewee Sex Age Origin Residency Date of recording
F1 female 28 Belgrade Belgrade 17.09.2008
F2 female 27 Belgrade Zurich 03.07.2008
F3 female 27 Niš Niš, Kotor 20.08.2008
F4 female 44 Lazarevo Belgrade 22.01.2009
F5 female 58 Belgrade Belgrade 16.09.2008
F6 female 55 Niš Niš 15.01.2009
F7 female 55 Skoplje Niš 14.01.2009
F8 female 64 Leskovac Niš 12.01.2009
F9 female 60 Pirot Niš 16.01.2009
M1 male 28 Niš Niš 19.08.2008
M2 male 27 Niš Niš, Kotor 18.08.2008
M3 male 29 Niš Niš 19.08.2008
M4 male 27 Užice Belgrade 13.09.2008
M5 male 33 Belgrade Belgrade 14.09.2008
M6 male 27 Belgrade Belgrade 20.01.2009
M7 male 38 Belgrade Belgrade 22.01.2009
M8 male 44 Belgrade Belgrade 26.01.2009
M9 male 54 Niš Niš 09.01.2009
M10 male 61 Belgrade Belgrade 12.09.2008

References

Lemmenmeier-Batinić, Dolores (2021): Converting raw transcripts into an annotated and turn-aligned TEI-XML corpus: the example of the Corpus of Serbian Forms of Address. Slovenščina 2.0: Empirical, Applied and Interdisciplinary Research, 9(1), 123–144. DOI Link

Lemmenmeier-Batinić, Dolores; Ljubešić, Nikola; Samardžić, Tanja (2020): XML-Encoding of a spoken Serbian corpus targeting forms of address. In: Conference on Language Technologies & Digital Humanities, Ljubljana, 24 September 2020 - 25 September 2020, 127-130. Link

Ulrich, Sonja (2018): Anredeformen im Serbischen. Slavistische Beiträge (508), Wiesbaden. Link