datasetposted on 28.07.2020 by Michael Dunn, Marc Tang
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
These are the supplementary materials for an article published in Language Dynamics in Change entitled 'The evolutionary trends of grammatical gender in Indo-Aryan languages', by Marc Allassonnière-Tang and Michael Dunn, with DOI: 10.1163/22105832-bja10011. This paper infers the processes of development and change of grammatical gender in Indo-Aryan languages using phylogenetic comparative methods. 48 Indo-Aryan languages are coded based on 44 presence-absence features relating to gender marking on the verbs, adjectives, personal pronouns, demonstrative pronouns, and possessive pronouns. A Bayesian Reverse Jump Hyper Prior analysis, which infers the evolutionary dynamics of changes between feature values, gives results that are consistent with historical linguistic and typological studies on gender systems in Indo-Aryan languages and predicts the evolutionary trends of the features included in the dataset. This dataset consists of the raw data used in this study.