LINGUISTICALLY ANNOTATED SPOKEN NGANASAN CORPUS
The paper discusses key issues in the annotation method used in the project “Linguistically annotated spoken Nganasan corpus”. The data are processed and stored in the EXMARaLDA format. Annotation of the database comprises grammatical and part-of-speech tagging (done in Toolbox or FLEx) and translation into Russian and English. The present paper, however, focuses on the annotation of syntactic roles and information structure. For this purpose we use a format designed by other researchers, which we have adapted to Nganasan. We describe the annotation system (tags, terms, and their explanations) and illustrate it with numerous Nganasan examples.
Keywords: Nganasan, annotation, corpus, endangered language, language documentation
References:
Allerton, D. J. 1978. The notion of ‘givenness’ and its relation to presupposition and theme. Lingua 44. 133–168.
Baumann, Stephan. 2006. Information Structure and Prosody: Linguistic Categories for Spoken Language Annotation. In Sudhoff, Stefan, Denisa Lenertová, Roland Meyer, Sandra Pappert, Petra Augurzky, Ina Mleinek, Nicole Richter & Johannes Schließer (eds.). Methods in Empirical Prosody Research. Berlin, New York: De Gruyter (= Language, Context, and Cognition 3). 153–180.
Butt, Miriam. 2005. Theories of Case. Cambridge: Cambridge University Press.
Büring, Daniel. 2005. Intonation und Informationsstruktur. Available online at http://semanticsarchive.net/Archive/jI0OTk3O/buring.ids2005.pdf (accessed 17.03.2015).
Dowty, David. 1989. On the Semantic Content of the Notion “Thematic Role”. In Properties, Types, and Meanings. Vol. 2. Ed. by G. Chierchia, B. H. Partee, and R. Turner. Dordrecht: Kluwer. 69–130.
Dowty, David. 1991. Thematic Proto-Roles and Argument Selection. Language 67. 547–619.
Fillmore, Charles. 1968. The Case for Case. In Universals in Linguistic Theory. Ed. by Emmon Bach and Robert T. Harms. New York: Holt, Rinehart & Winston.
Gawron, Jean Mark. 2007. Aspect, Roles, and Lexical Semantics. Available online at http://www-rohan.sdsu.edu/~gawron/semantics/course_core/lectures/roles.pdf (accessed 17.03.2015).
Götze, Michael et al. 2007. Information structure. In Dipper, S., Götze, M. and S. Skopeteas (eds.): Information Structure in Cross-Linguistic Corpora. Interdisciplinary Studies on Information Structure 07 (2007): 147–187. Available online at http://edoc.hu-berlin.de/oa/reports/reQ5PntJcwYs/PDF/23TFAo8H6FW2.pdf (accessed 05.02.2013).
Grimshaw, Jane. 1990. Argument Structure. MIT Press, Cambridge, Massachusetts.
Gundel, Jeanette. 1978. Stress, pronominalisation and the given-new distinction. University of Hawaii Working Papers in Linguistics 10/2. 1–13.
Haig, Geoffrey and Stefan Schnell. 2011. Annotations using GRAID (Grammatical relations and animacy in discourse). Introduction and guidelines for annotators. Version 6.0. Available online at http://www.linguistik.uni-kiel.de/GRAID_manual6.0_08sept.pdf (accessed 05.02.2013).
Haig, Geoffrey and Stefan Schnell. 2014. Annotations using GRAID. Manual Version 7.0.
Halliday, M.A.K. 1967. Notes on transitivity and theme in English, Part 2. Journal of Linguistics 3. 199–244.
Von Heusinger, Klaus. 1999. Intonation and Information Structure. Habilitationsschrift. Available online at http://gerlin.phil-fak.unikoeln.de/kvh/pub/pub99–98/Heusinger1999_Intonation.pdf (accessed 25.02.2014).
Von Heusinger, Klaus. 2002. Information structure and the partition of sentence meaning. In Hajičová, Eva, Petr Sgall, Jiří Hana and Tomáš Hoskovec (eds.), Prague Linguistic Circle Papers: Travaux du cercle linguistique de Prague nouvelle série. Vol. 4. Amsterdam: Benjamins. 275–305.
Kittilä, Seppo. 2005. Recipient prominence vs. beneficiary prominence. Linguistic Typology 9. 269–297.
Malchukov, Andrej, Martin Haspelmath and Bernard Comrie, eds. 2010. Studies in ditransitive constructions: a comparative handbook. Berlin: De Gruyter Mouton.
Schmidt, Thomas and Kai Wörner. 2005. Erstellen und Analysieren von Gesprächskorpora mit EXMARaLDA. Gesprächsforschung 6. 171–195.
Wörner, Kai. 2010. Werkzeuge zur flachen Annotation von Transkriptionen gesprochener Sprache. PhD Thesis, Universität Bielefeld. Available online at http://bieson.ub.uni-bielefeld.de/volltexte/2010/1669/ (accessed 05.02.2013).
Zerbian, Sabine. 2006. Expression of information structure in the Bantu language Northern Sotho. ZAS Papers in Linguistics 45. Berlin: ZAS.
Issue: 2, 2015
Series of issue: Issue 2
Rubric: LINGUISTICS
Pages: 25–34