by Poppy Siahaan (University of Cologne) & Gede Primahadi W. Rajeg (Udayana University & University of Oxford)
"Supplementary data for Cognitive Linguistics chapter" by Poppy Siahaan & Gede Primahadi W. Rajeg is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
This repository provides curated data on the languages studied in articles published in the journal Cognitive Linguistics over the last ten years, from 2015 (volume 26, issue 1) to 2024 (volume 35, issue 3).
The data was collected by Poppy Siahaan, who also identified and categorised the studied languages. Gede Primahadi W. Rajeg converted the raw data into a tidier format and computed the descriptive statistics on the distribution of the studied languages and language families.
The distributional data of these languages is used in a chapter on Cognitive Linguistics (Rajeg & Siahaan, under review) for the second edition of The Routledge Handbook of Linguistics (edited by Howard Manns, Alice Gaby, and Anna Margetts). In particular, we attempt to highlight the distribution of under-represented languages in the field of Cognitive Linguistics, as reflected in the studies in the flagship journal Cognitive Linguistics.
- The data is available as an MS Excel file (the original) and as a tab-separated (.tsv) file in the directory studied_languages_data.
- The R code file that produces the descriptive statistics is cogling-studies-code.R.
- The file data-preparation.sh contains the command-line Bash code used to prepare the data as a GitHub repository.
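
For readers who do not use R, the kind of descriptive count described above can be approximated in a few lines of Python. This is only a minimal sketch and not the authors' script (the repository's own analysis is in cogling-studies-code.R). The function names are made up for illustration, and both the path to the .tsv file and the column name are parameters, because the actual file and column names are not stated here.

```python
import csv
from collections import Counter


def count_column_values(tsv_path, column):
    """Count how often each value occurs in one column of a TSV file.

    `tsv_path` and `column` are supplied by the caller; this sketch makes
    no assumption about the real file or column names in the repository.
    """
    counts = Counter()
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        for row in reader:
            value = (row.get(column) or "").strip()
            if value:  # skip empty or missing cells
                counts[value] += 1
    return counts


def relative_frequencies(counts):
    """Convert raw counts into proportions of the total (empty dict if no data)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {value: n / total for value, n in counts.items()}
```

Calling `count_column_values(path, column).most_common(10)` with the path to a .tsv file from studied_languages_data and the name of its language (or language-family) column would list the ten most frequent values in that column.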