With the current folder structure, scripts can be run to generate relevant data files assuming an initiated `.Rproj` file in the top folder `signing_rate/` and that the `data/` subfolders are populated with the ELAN (`.eaf`) annotation files for each of the corpora, and the `metadata/` subfolders contain the corresponding CMDI (`.cmdi`) metadata files for the Corpus NGT (CNGT) and Swedish Sign Language Corpus (SSLC). The metadata for the BSL Corpus (BSLC) is contained within the file names of the annotation files.
The BSLC files can be found at https://bslcorpusproject.org/cava/ after approval from the maintainer.
The CNGT files can be found at https://hdl.handle.net/1839/8e5a77a3-8d1a-492a-bc86-9a3398b0809c after registration.
The SSLC files can be found at https://hdl.handle.net/1839/b9b9c88a-f8df-4fa5-8eb0-53622108764d after registration.
A full list of the annotation files included in the analyses for the paper can be found in this repository: `./metadata/annotation_files.csv`.