data and code.zip (30.34 kB)
Data and code to run the DCA and GLMM analyses in the study described in the article "Language change in multidimensional space: New methods for modelling linguistic coherence", by Xia Hua, Felicity Meakins, Cassandra Algy and Lindell Bromham, published in Language Dynamics and Change (2021).
dataset
posted on 2021-08-02, 15:18 authored by Xia Hua, Felicity Meakins, Cassandra Algy, Lindell Bromham
These are the supplementary materials for an article published in Language Dynamics and Change, entitled 'Language change in multidimensional space: New methods for modelling linguistic coherence', by Xia Hua, Felicity Meakins, Cassandra Algy and Lindell Bromham, with DOI: 10.1163/22105832-bja10015.
Linguistic coherence – the co-variation of language variants within speaker repertoires – has been proposed as a key process driving the divergence of language dialects. Previous studies on coherence have often been limited by dataset sizes and analyses. We analyze the use of 185 variables across 78 speakers from the Gurindji community in Australia. We use two multivariate statistical approaches to test whether clusters of variables co-vary with generation, family, household, exposure to Gurindji language speakers and education. Using Discriminant Correspondence Analysis, we find generation is the strongest grouping factor of speakers and co-varies with clusters of variants. Using the Generalized Linear Mixed Model, we find these clusters of variants not only represent a gradual loss of Gurindji language use across generations, but also contribute to distinct patterns of language usage in the different generations. Our study demonstrates the use of multivariate analyses on big datasets to identify sociolects, an important step in linking the ‘micro-level’ processes to the ‘macro-level’ outcomes.
These datasets contain the input data and code to run the DCA and GLMM analyses in this study.
Funding
Australian Research Council Future Fellowship awarded to Felicity Meakins (FT170100042)
Australian Research Council Centre of Excellence for the Dynamics of Language (Project ID: CE140100041).
Categories
- Applied linguistics and educational linguistics
- Comparative language studies
- Computational linguistics
- Historical, comparative and typological linguistics
- Linguistics not elsewhere classified
- Linguistic structures (incl. phonology, morphology and syntax)
- Aboriginal and Torres Strait Islander linguistics and languages
Keywords
- linguistic coherence
- language contact
- Gurindji
- Gurindji Kriol
- multivariate analyses
- Applied Linguistics and Educational Linguistics
- Comparative Language Studies
- Computational Linguistics
- Language in Time and Space (incl. Historical Linguistics, Dialectology)
- Linguistics not elsewhere classified
- Linguistic Structures (incl. Grammar, Phonology, Lexicon, Semantics)
- Aboriginal and Torres Strait Islander Languages