This project examines the quantity and quality of text in a sample of Spanish-English codeswitching picture books (N=45) available to parents in the U.S. reported in the manuscript by the same name. Included in this repository are the transcription/coding protocol provided to coders for transcribing the picture books in CLAN using CHAT, the CLAN codes used to derive our key measures, the raw data file for each key measure for each picture book, and analyses scripts (in R) reported in the manuscript.