These are files with the Dutch SUBTLEX-NL frequencies. They contain PoS information and Zipf values, which were not included in the original publication.
Zipf frequency is the best word frequency measure. It is based on the equation Zipf=LOG10((frequency+1)/44.106)+3.
For words not present in the database (i.e., with zero frequency), the Zipf value is Zipf=LOG10(1/44.106)+3 = 1.3555.
There are two files: one with all letter strings and one with the letter strings observed in at least 2 films (the latter contains much less noise and so will be less heavy for most searches).
More information can be found in:
Keuleers, E., Brysbaert, M., & New, B. (2010). SUBTLEX-NL: A new frequency measure for Dutch words based on film subtitles. Behavior Research Methods, 42, 643-650.
Van Heuven, W.J.B., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). Subtlex-UK: A new and improved word frequency database for British English. Quarterly Journal of Experimental Psychology, 67, 1176-1190. (for information about Zipf values)
Brysbaert, M., Mandera, P., & Keuleers, E. (2018). The word frequency effect in word processing: An updated review. Current Directions in Psychological Science, 27, 45-50. (more information about the Zipf measure)