SUBTLEX_US word frequency database
Category: Project
Description: This file contains the SUBTLEX_US word frequencies, which are based on film subtitles (Brysbaert & New, 2009). The file also includes part-of-speech (PoS) information (Brysbaert, New, & Keuleers, 2012) and the Zipf scale of word frequency (van Heuven, Mandera, Keuleers, & Brysbaert, 2014).

Be careful with the frequencies of the words "don" and "haven". Their frequencies are far too high because they are mostly based on the word forms "don't" and "haven't". It is better not to use these two words in your research.

References

Brysbaert, M., & New, B. (2009). Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behavior Research Methods, 41, 977-990.

Brysbaert, M., New, B., & Keuleers, E. (2012). Adding Part-of-Speech information to the SUBTLEX-US word frequencies. Behavior Research Methods, 44, 991-997.

Van Heuven, W.J.B., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). SUBTLEX-UK: A new and improved word frequency database for British English. Quarterly Journal of Experimental Psychology, 67, 1176-1190.

Further information: http://crr.ugent.be/archives/1352
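
The caution about "don" and "haven" can be applied when preparing a stimulus list. The following is a minimal Python sketch, not part of the database itself. The field names `word` and `per_million` are hypothetical and do not correspond to the file's actual column headers, and the sample values are made up for illustration. The Zipf conversion assumes the commonly cited definition (log10 of the frequency per million words, plus 3); please check it against van Heuven et al. (2014) before relying on it.

```python
import math

# Words the description advises against using: their counts are inflated
# by the word forms "don't" and "haven't".
EXCLUDED_WORDS = {"don", "haven"}


def zipf_from_per_million(freq_per_million: float) -> float:
    """Convert a frequency per million words to a Zipf value.

    Assumed definition: log10(frequency per million words) + 3,
    which equals log10 of the frequency per billion words.
    """
    if freq_per_million <= 0:
        raise ValueError("frequency per million must be positive")
    return math.log10(freq_per_million) + 3.0


def usable_words(rows):
    """Drop the entries flagged as unreliable (case-insensitive match)."""
    return [row for row in rows if row["word"].lower() not in EXCLUDED_WORDS]


# Made-up rows with hypothetical field names, for illustration only.
sample = [
    {"word": "example", "per_million": 100.0},
    {"word": "don", "per_million": 1000.0},
    {"word": "Haven", "per_million": 1000.0},
]

kept = usable_words(sample)
for row in kept:
    print(row["word"], zipf_from_per_million(row["per_million"]))
```

When working with the real file, the hypothetical field names would need to be replaced with the column headers actually present in the download.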