Main content

Home

Menu

Loading wiki pages...

View
Wiki Version:
This OSF project is associated with the following article: - Sönning, Lukas. (2023). Evaluation of keyness metrics: Performance and reliability. Corpus Linguistics and Linguistic Theory. https://doi.org/10.1515/cllt-2022-0116 Here is the **abstract**: - *The methodological debates surrounding keyword analysis have given rise to a wide range of keyness metrics. The present paper delineates four dimensions of keyness, which distinguish between frequency- and dispersion-related perspectives. Existing measures are then organized according to these dimensions and evaluated with regard to their performance on a specific keyword analysis task: The identification of key verbs in academic writing. To this end, the rankings produced by 32 different metrics are evaluated against an established academic word list. Further, the reliability of measures is assessed, to determine whether they produce stable rankings across repeated studies on the same pair of text varieties. We observe notable differences among metrics with regard to these criteria. Our findings provide further support for the superiority of the Wilcoxon rank sum test and text-dispersion–based measures, and allow us to identify, within each dimension of keyness, metrics that may be given preference in applied work.* An earlier version of the manuscript, which has been substantially revised, however, was made available as a **preprint** on PsyArXiv (https://psyarxiv.com/eb2n9/). For the documentation of the analyses in the paper, we tried to follow the **TIER protocol 4.0** (https://www.projecttier.org/tier-protocol/). The file **00ReadMe.pdf** gives instructions for reproducing the analyses. Note that all **R scripts** (see folder "scripts") are commented in detail, and available both as a Quarto (RMarkdown) file and as an html file. The html files need to be downloaded and then opened in a web browser. **Data** used in the study have been published on TROLLing: - Sönning, Lukas. 2023. Key verbs in academic writing: Dataset for "Evaluation of keyness metrics: Performance and reliability"", DataverseNO, V1. https://doi.org/10.18710/EUXSMW **Images** created for this study can be found in the folder "output/figures". They are published under a Creative Commons Attribution 4.0 licence (**CC BY 4.0**), which means that the licence terms for their use are quite generous (see http://creativecommons.org/licenses/by/4.0).
OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
Accept
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.
Accept
×

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.