Extracting_VS_from_childes.phy is the code used to extract variation sets and calculate proportion of words and utterances in them, as well as other measures (e.g., number of words, MLU).
English_VS_data.csv contains all linguistic measures, including variation sets measures, calculated for the English corpora.
pos_counts_Howe_data.csv contains data extracted using the childes_db packcage (Sanchez et al., 2018) to calculate the average proportion of open-class words spoken to children.
analyze_english_VS_data.r contains the code to make statistical analyses and figures for the English data.
Hebrew_VS_data.csv contains all linguistic measures, including variation sets measures, calculated for the Hebrew corpora.
analyze_hebrew_VS_data.r contains the code to make statistical analyses and figures for the Hebrew data.