Mergen corpus

posted on 28.01.2021, 10:55 by Aidan Winberry
People involved
This corpus was created by Aidan Winberry in 2020 from a recording of Raisa Alekseevna Beldy reading
the text of a Nanai fairy-tale “Mergen ningman”. The recording was made by Vasily Kharitonov.
Annotation scheme
Information about Nanai phonemes is taken from Ko & Yurn (2011).
Coding scheme
The sounds and phonemes are represented by their IPA symbols in Unicode.
The text of the fairy-tale is provided in Cyrillic Nanai orthography. The Nanai writing system is nearly phonemic, so a Latin transcription layer would simply duplicate the phonemic tier in IPA.
In several segments the sound is corrupted by background music; in these cases the annotation on the phonemic and phonetic levels is omitted.
Annotation quality
Annotations were made without consulting dictionaries. All phonemes and allophonic variants
were identified by ear.
Diphthongs are not thoroughly marked.
The “Words” tier follows the text of the fairy-tale, while the “Phonemes” and “Sounds” tiers represent
what is actually said in the recording. Several utterances end with an ellipsis, which marks a slip of the
tongue that the speaker then corrected.
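As a rough illustration of how the tiers relate, the Python sketch below flags word intervals that have no phonemic annotation (as happens in the segments with background music) and labels that end with an ellipsis. It assumes the tiers have already been loaded into lists of (start, end, label) intervals. The Interval type, the function names and the sample labels are hypothetical and are not taken from the corpus files; which tier carries the ellipsis is also an assumption to check against the data.

```python
from typing import List, NamedTuple


class Interval(NamedTuple):
    """One annotated stretch of a tier (hypothetical in-memory form)."""
    start: float  # seconds
    end: float    # seconds
    label: str


def ends_with_ellipsis(label: str) -> bool:
    """True if the label ends with '...' or the single character U+2026."""
    text = label.rstrip()
    return text.endswith("...") or text.endswith("\u2026")


def unannotated_words(words: List[Interval],
                      phonemes: List[Interval]) -> List[Interval]:
    """Return non-empty word intervals that overlap no non-empty phoneme interval."""
    missing = []
    for w in words:
        if not w.label.strip():
            continue  # skip pauses / empty word intervals
        covered = any(
            p.label.strip() and p.start < w.end and p.end > w.start
            for p in phonemes
        )
        if not covered:
            missing.append(w)
    return missing


# Placeholder data for illustration only (not real corpus content).
sample_words = [
    Interval(0.0, 0.5, "w1"),
    Interval(0.5, 1.0, "w2"),
    Interval(1.0, 1.5, ""),
]
sample_phonemes = [
    Interval(0.0, 0.2, "a"),
    Interval(0.2, 0.5, "b"),
    Interval(0.5, 1.0, ""),  # annotation omitted for this stretch
]

flagged = unannotated_words(sample_words, sample_phonemes)
```

Here flagged contains only the interval labelled "w2", because its time span has no phoneme labels. A check like this can help separate deliberately omitted annotation from places where the “Phonemes” tier simply differs from the written text.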



