Code for 'Conditioning: How background variables can influence PISA scores'

doi:None

Title	Authors

Home

**General** In this repository, all information for reproducing the self-computed plausible values in Chapter 2 'Conditioning: How background variables can influence PISA scores' of my thesis 'Essays on the psychometric and statistical properties of the Programme for International Student Assessment' can be found. This code is made to share the exact steps and properties of the computation in the paper. Due to high computational effort, parts of the computations were run on a high performance computing cluster. The published code does not include these parts (commands for automation on HPC cluster), but is simplified in the way that it only covers one country as an example (the code and computations are the same). The code is written with the following folder structure in mind: - 0_Raw-Data - 1_Data - 2_R-Syntax - 3_Models - 4_Results All R code of this repository is supposed to be put in the folder "2_R-Syntax". **0_Raw-Data** All data that is used in this paper is publicly available. As a result, no data sets are hosted here. Please download and prepare the following files from https://www.oecd.org/pisa/data/pisa2012database-downloadabledata.htm (last accessed 14 July 2021). - PISA cognitive scored data: Download the file "Scored cognitive item response data file " and the belonging sps-file. Read in the data in SPSS and save as .sav-file with the same name ("INT_COG12_S_DEC03.sav"). - PISA cogintive scored data digital domains: Download the file "Scored cognitive item response data file" and the belonging sps-file from the subsection PISA CBA 2012 dataset download page. Read in the data with SPSS and save as .sav-file with the same name ("CBA_COG12_S_MAR31.sav"). - PISA student questionnaire: Download the file "Student questionnaire data file" and the belongign sps-file. Read in the data in SPSS and save as .sav-file with the same name ("INT_STU12_DEC03.sav"). - PISA parent questionnaire: Download the file "Parent questionnaire data file" and the belonging sps-file. Read in the data in SPSS and save as .sav-file with the same name ("INT_PAQ12_DEC03.sav"). The technical report containing the item difficulties and details on the conditioning variables can be downloaded from https://www.oecd.org/pisa/data/pisa2012technicalreport.htm (last accessed 14 July 2021). Please save it under the name "PISA_2012_TR.pdf" in "0_Raw-Data". **1_Data** Some information is already prepared/gathered from the technical report or as general information. Please download the files in "Additional-data" and put it in the folder "1_Data" on your device. The data which prepared throughout the computations will be saved in this folder. The following things are provided: - Conditioning_var_information.xlsx: The information from the pages 421-431 of the technical report, which describe the preparation of the conditioning variables, are prepared here. The columns are preapred in style for functions used in the code. - PA12_Digital_Participation.RData: Table describing which countries participate in which domain (PS = Problem Solving and DRM = Digital Reading and Mathematics) and if they participate with whole sample or just a subset (Subset). - OECD_membership_date.RData: Table containing the ISO 3166 numeric and alpha numeric codes as well as the entry data for all OECD countries. - Booklet_Effects.xlsx: The booklet effects which are applied later on to the plausible. They are not self-computed but taken from the technical report page 242. **2_R-Syntax** All code in the folder "R" should be downloaded and put in "2_R-Syntax". The R code is numbered for a reason. The R code should be computed in that order. Files beginning with same number belong to the same step can be computed in any order. Files with an "H" after the number are helper-functions, which do not need run by the user, but other R code will call/source it. Generally, the R code numbers belong to the following steps: - 0: Data preparation and checks on the cognitive items (and weights) - 1: Read-in item parameter from the technical report - 2: Prepare conditioning data - 3: Run IRT models - 4: Compute latent regression and plausible values - 5: Transform plausible values onto PISA scale - Afterwards the plausible can be analysed, we used the plausible values in combination with "PA12_weights.RData" and the package intsvy to account for the complex sampling. **3_Models** The models which are used and reused during the computations are stored here (automatically by the code - no need to do anything). **4_Results** Final data and results are stored here (automatically by the code - no need to do anything). **Session Info** The code runs successfully on - R 4.0.4 - fastDummies_1.6.3 - stringr_1.4.0 - tabulizer_0.2.2 - dplyr_1.0.5 - foreign_0.8-81 - TAM_3.5-19

OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.

This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.

Create an Account Learn More Hide this message

Main content

Home

Menu

Start managing your projects on the OSF today.

Main content

Links to this project

Home

Menu

Add new wiki page

Page permissions have changed

Wiki page deleted

Connected to the collaborative wiki

Connecting to the collaborative wiki

Collaborative wiki is unavailable

Browser unsupported

Start managing your projects on the OSF today.