The corpus consists of transcripts of audio-recorded biographical interviews with 19 participants. The interviews are about forms of address that speakers use in colloquial and formal settings, and about their attitudes and evaluations concerning particular forms of address. It has been transcribed manually with GAT conventions and automatically aligned with the respective audio segments.
This repository contains corpus transcripts in HTML format. Corpus annotations can be shown by selecting "Annotations". Users can add and export their annotations by selecting "Annotate".
The annotated corpus (including morphosyntactic information, lemmas, and normalisations) is available at CLARIN.SI.
The corpus can be searched at NoSketchEngine and KonText.
Interviewee | Sex | Age | Origin | Residency | Date of recording |
---|---|---|---|---|---|
F1 | female | 28 | Belgrade | Belgrade | 17.09.2008 |
F2 | female | 27 | Belgrade | Zurich | 03.07.2008 |
F3 | female | 27 | Niš | Niš, Kotor | 20.08.2008 |
F4 | female | 44 | Lazarevo | Belgrade | 22.01.2009 |
F5 | female | 58 | Belgrade | Belgrade | 16.09.2008 |
F6 | female | 55 | Niš | Niš | 15.01.2009 |
F7 | female | 55 | Skoplje | Niš | 14.01.2009 |
F8 | female | 64 | Leskovac | Niš | 12.01.2009 |
F9 | female | 60 | Pirot | Niš | 16.01.2009 |
M1 | male | 28 | Niš | Niš | 19.08.2008 |
M2 | male | 27 | Niš | Niš, Kotor | 18.08.2008 |
M3 | male | 29 | Niš | Niš | 19.08.2008 |
M4 | male | 27 | Užice | Belgrade | 13.09.2008 |
M5 | male | 33 | Belgrade | Belgrade | 14.09.2008 |
M6 | male | 27 | Belgrade | Belgrade | 20.01.2009 |
M7 | male | 38 | Belgrade | Belgrade | 22.01.2009 |
M8 | male | 44 | Belgrade | Belgrade | 26.01.2009 |
M9 | male | 54 | Niš | Niš | 09.01.2009 |
M10 | male | 61 | Belgrade | Belgrade | 12.09.2008 |
Lemmenmeier-Batinić, Dolores (2021): Converting raw transcripts into an annotated and turn-aligned TEI-XML corpus: the example of the Corpus of Serbian Forms of Address. Slovenščina 2.0: Empirical, Applied and Interdisciplinary Research, 9(1), 123–144. DOI Link
Lemmenmeier-Batinić, Dolores; Ljubešić, Nikola; Samardžić, Tanja (2020): XML-Encoding of a spoken Serbian corpus targeting forms of address. In: Conference on Language Technologies & Digital Humanities, Ljubljana, 24 September 2020 - 25 September 2020, 127-130. Link
Ulrich, Sonja (2018): Anredeformen im Serbischen. Slavistische Beiträge (508), Wiesbaden. Link