These files contain the Dutch SUBTLEX-NL word frequencies. They include part-of-speech (PoS) information and Zipf values, which were not included in the original publication.

Zipf frequency is the best word frequency measure. It is based on the equation Zipf = log10((frequency + 1) / 44.106) + 3. For words not present in the database (i.e., with zero frequency), the Zipf value is Zipf = log10(1 / 44.106) + 3 = 1.3555.

There are two files: one with all letter strings and one with only the letter strings observed in at least 2 films. The latter contains far less noise and, being smaller, is easier to handle for most searches.

More information can be found in:

Keuleers, E., Brysbaert, M., & New, B. (2010). SUBTLEX-NL: A new frequency measure for Dutch words based on film subtitles. Behavior Research Methods, 42, 643-650.

Van Heuven, W.J.B., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). SUBTLEX-UK: A new and improved word frequency database for British English. Quarterly Journal of Experimental Psychology, 67, 1176-1190. (for information about Zipf values)

Brysbaert, M., Mandera, P., & Keuleers, E. (2018). The word frequency effect in word processing: An updated review. Current Directions in Psychological Science, 27, 45-50. (more information about the Zipf measure)
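As an illustration, the Zipf equation given above can be sketched in Python. This is a minimal sketch, not code shipped with the files: it assumes that `frequency` is the raw occurrence count of a word in the corpus, and the names `zipf` and `ZIPF_DENOMINATOR` are hypothetical, chosen here for the example.

```python
import math

# Denominator from the SUBTLEX-NL Zipf equation quoted above.
ZIPF_DENOMINATOR = 44.106


def zipf(frequency: float) -> float:
    """Return the Zipf value for a frequency count (assumed to be a raw count)."""
    if frequency < 0:
        raise ValueError("frequency must be non-negative")
    return math.log10((frequency + 1) / ZIPF_DENOMINATOR) + 3


# A word absent from the database has frequency 0.
print(round(zipf(0), 4))  # 1.3555
```

Because of the +1 in the numerator, the function is defined for a count of zero, which is how the value 1.3555 for unseen words arises.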