Options
Compiling a Baba Malay corpus and word list for language revitalisation
Author
Thompson, Amelyn Anne
Supervisor
Renandya, Willy A.
Abstract
This study aims to support language revitalisation efforts in Baba Malay (BM), which is an endangered language in Singapore. In pursuit of this goal, a corpus of natural BM use was compiled. To fulfil the dual aims of language documentation and revitalisation, both conversational and narrative data were collected. Pedagogical applications of the corpus were explored, through applying the methodology of corpus linguistics to create a general service word list targeted at beginner level learners. The word list consisted of single words, as well as a separate list of multiword units attached to key content words. Objective criteria such as frequency, range, and dispersion were applied in tandem with subjective criteria, namely eliciting judgements of multiword units from proficient BM speakers. Ultimately, the BM word list consisted of 570 single words and 154 multiword units, and achieved a coverage of 82% of the corpus. These items present a systematic way to introduce vocabulary to new learners, as they account for a large majority of texts that one would normally encounter in BM. These texts are both spoken and written in nature, and the items on the BM word list aim to help learners both productively and receptively.
Date Issued
2020
Call Number
PM7875.B33 Tho
Date Submitted
2020