This project makes available data published in an article "A perceptual study of language chunking in Estonian" in Open Linguistics (Ots & Taremaa, 2022). The data is structured into four folders:
1) **/data** - dataframes with processed data
2) **/scripts** - Praat (.psc) and R scripts (.R) for processing and evaluating data
3) **/stimuli** - subset of data that provides information about the auditive and written stimuli
4) **/results** - raw data files with results of the perception experiments
Please note, for the reasons of data protection, we do not make audio files available. If you should need them, please contact the corresponding author Nele Ots.
It should be possible to run all scripts of data processing and evaluation, if the user preserves the same structure of folders and changes the paths to files and folders into the local ones after downloading. Also, the user needs to download the statistics on collocations availables at https://datadoi.ee/handle/33/41 and give appropriate credits when using it. The data processing proceeds followingly:
1) **getAcousticData.psc** - a Praat script that collects durational data from TextGrids (/stimuli/TextGrids) and pitch data from pitch objects (/stimuli/PitchObjects). The pitch objects were extracted with the help of Praat in a way it is described in the article.
2) **getData.R** - a very long script that collects data and combines these into two final dataframes containing all variables needed for running the analysis.
3) **evaluateData.R** - a script that contains the evaluation procedures, as reported in the paper.