Main content

Home

Menu

Loading wiki pages...

View
Wiki Version:
SNPs could potentially be used to identify individuals, and so we are not able to make data open that links SNP values to phenotypes. However, we have 1. Used synthpop to generate a dataset with the same characteristics. 2. Deposited covariance matrices and means for SNP and phenotype data for both SCT cases and one twin sample. 3. Deposited original data for phenotype variables. **Synthpop_sct.csv** contains data simulated using the R package synthpop. **synthpop_twina.csv** has analogous data simulated for twinA sample. Synthpop takes as its input a raw data file and creates synthetic data that have similar distributions and covariances. This allows for other users to explore the files without compromising confidentiality. The Markdown file **Double hit results.rmd** includes a chunk of code that details how synthpop files are created. The files **realcovs_sct.csv** and **realcovs_twina.csv** contain the covariance matrices for SNP and phenotype data, for SCT cases and twinA respectively, with column means in the last row. **osf_pheno_sct_twin.csv** has original data for phenotype variables, with any identifying information removed. The variables are described in **data_dictionary_phenotype_data.xls**
OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
Accept
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.
Accept
×

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.