Options
A multilevel description of textbook linguistic complexity across disciplines: Leveraging NLP to support disciplinary literacy
Citation
Green, C. (2019). A multilevel description of textbook linguistic complexity across disciplines: Leveraging NLP to support disciplinary literacy. Linguistics and Education, 53, Article 100748. https://doi.org/10.1016/j.linged.2019.100748
Author
Green, Clarence
Abstract
Disciplinary Literacy as a pedagogy and research program aims to understand and teach the linguistic differences amongst disciplines. Limited studies have been done, however, on the differing linguistic complexity demands of disciplines. In this study, linguistic complexity is measured at the clause, phrase and lexical levels, across eight disciplines represented by a corpus of secondary school textbooks. Innovative natural language processing systems extract an unprecedented number of complexity measures, and discriminant function analysis describes features that best differentiate disciplinary writing. Results indicate disciplines vary along different clines of complexity. The first tends to discriminate humanities from science subjects, along features such as academic phraseology, possessive noun phrases, auxiliary verbs and clause dependents. Other clines show that subjects such as history and physics can be similarly complex in features such as their prepositional expansion of noun phrases. The paper concludes with detailed and specific pedagogical takeaways for teachers.
Date Issued
2019
DOI
10.1016/j.linged.2019.100748
Dataset
https://doi.org/10.25340/R4/ZDS5H5
Description
This is the final draft, after peer-review, of a manuscript published in Linguistics and Education. The published version is available online at https://doi.org/10.1016/j.linged.2019.100748