# A Practical Guide to Sparse K-Means Clustering for Studying Molecular Development of the Human Brain

Justin L. Balsor<sup>1</sup>, Keon Arbabi<sup>1</sup>, Desmond Singh<sup>2</sup>, Rachel Kwan<sup>2</sup>, Jonathan Zaslavsky<sup>2</sup>, Ewalina Jeyanesan<sup>1</sup>, Kathryn M. Murphy<sup>1,2</sup>

<sup>1</sup> McMaster Neuroscience Graduate Program, McMaster University, Hamilton, ON, L8S 4K1, Canada

<sup>2</sup> Department of Psychology, Neuroscience & Behavior, McMaster University, Hamilton, ON, L8S 4K1, Canada

Correspondence should be addressed to Kathryn M. Murphy; firstname.lastname@example.org

This project contains the material needed to reproduce each figure in the sparse clustering paper: the R Markdowns and the knitted PDFs of those markdowns.

## Data Folder

The Data folder contains all of the data files used in the paper. Each markdown should run once the relevant data files have been downloaded to your local device and the file paths in the markdown point to them.

The full list of data files is:

- EJ_GeneList.csv
- GSE25219 processed.csv
- Human.PCAData.csv
- immune_proteins_exp.csv
- our_proteins_to_genes.csv
- protein_identifiers.csv
- synaptic_proteins_exp.csv
- syngo_genes.csv
- tSNE.Prot.Perplexity25.1millionIters.csv

## R Markdowns Folder

The R Markdowns folder contains two subfolders: Code and PDFs. Within each of these subfolders are additional folders, one for each figure in the paper. Figure 1 is not included because it was not produced with code.
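
Before knitting a markdown, it can help to confirm that the files from the Data folder are where the code expects them. The snippet below is a minimal sketch, not part of the original markdowns: the `data_dir` path and the helper `missing_data_files()` are hypothetical names chosen for illustration, and the file names are taken from the Data folder list. It reports any files that are not found and, if everything is present, reads one of them with base R's `read.csv()`.

```r
# File names as listed for the Data folder.
data_files <- c(
  "EJ_GeneList.csv",
  "GSE25219 processed.csv",
  "Human.PCAData.csv",
  "immune_proteins_exp.csv",
  "our_proteins_to_genes.csv",
  "protein_identifiers.csv",
  "synaptic_proteins_exp.csv",
  "syngo_genes.csv",
  "tSNE.Prot.Perplexity25.1millionIters.csv"
)

# Hypothetical helper: return the names of expected files
# that are not present in data_dir.
missing_data_files <- function(data_dir, files = data_files) {
  files[!file.exists(file.path(data_dir, files))]
}

# Assumption: the Data folder was downloaded next to the markdown.
# Change this path to wherever the files live on your machine.
data_dir <- "Data"

missing <- missing_data_files(data_dir)
if (length(missing) > 0) {
  message("Not found in '", data_dir, "': ", paste(missing, collapse = ", "))
} else {
  # Read one file as an example; the other CSVs can be read the same way.
  gene_list <- read.csv(file.path(data_dir, "EJ_GeneList.csv"),
                        stringsAsFactors = FALSE)
  str(gene_list)
}
```

Building paths with `file.path()` rather than pasting strings keeps the check portable across operating systems, and it handles the space in `GSE25219 processed.csv` without extra quoting.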