**Contents of the datasets**
========================

*For a general description, see the [main Wiki pages][1]*

Unique types
========================

fr2020_uni
----------

**Word form information**

*word* - bare word form in lowercase, unique in this file (may be used as the primary key in a database)

**Frequency information:**

*fq_c_sum* - the overall frequency (including all uppercase/lowercase variants and POS tags)

**Spelling**

*fq_c_lc ; fq_c_uc ; fq_c_lc_rate* - absolute frequencies of the form with lowercase and uppercase spelling, and the rate of the lowercase spelling variant. The uppercase/lowercase spelling was determined from the first letter of the word form only, in order to identify potential proper nouns (i.e. *Roma* and *ROMA* are counted as uppercase, while *roma* and *rOMA* are counted as lowercase).

**POS tags (by Google n-grams)**

*fq_c_tag_noun ; fq_c_tag_noun_rate [...]* - absolute frequencies (and rates) of the form with the original Google n-gram POS tags. Note that, according to our analysis, only about 50% of the original data was POS-tagged.

**Rounded frequency counts**

*fq_c_lc_rate_rounded ; fq_c_tag_notag_rate_rounded*

fr2020_unidet
-------------

*to be completed*

fr2020_bi
---------

*to be completed*

Diachronic data
========================

**fr2020_uni_dia_grouped** - word; year; fq; volumes; fq_rel (*word* matches *word* from fr2020_uni; *fq_rel* is the relative frequency in i.p.100m, i.e. items per 100 million)

**fr2020_uni_dia10_grouped** - word; decade; fq; vol; fq_rel (*decade* = the first 3 digits of the decade, i.e. 147 = 1470-1479; *fq_rel* in i.p.100m)

Other data
========================

*to be completed*

Custom data
========================

The folder contains specific data extracted on demand for different researchers.

**ft2020_valentina_pro_1890_1909** - List of words beginning with "pro" with their frequencies for the decade 1900-1909

[1]: https://osf.io/46qcd/wiki/home/
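
The conventions described above (first-letter case classification, the three-digit *decade* code, and the i.p.100m unit) can be illustrated with a minimal Python sketch. This is not the code used to build the datasets. All function names are hypothetical, and two formulas are assumptions rather than documented facts: that *fq_c_lc_rate* equals *fq_c_lc* divided by the sum of *fq_c_lc* and *fq_c_uc*, and that *fq_rel* is computed against the total token count of the given year or decade.

```python
def is_uppercase_form(word_form: str) -> bool:
    """Classify a word form by its first letter only, as described under Spelling.

    'Roma' and 'ROMA' count as uppercase; 'roma' and 'rOMA' count as lowercase.
    """
    return word_form[:1].isupper()


def lowercase_rate(fq_c_lc: int, fq_c_uc: int) -> float:
    """Share of lowercase spellings among all occurrences.

    ASSUMPTION: fq_c_lc_rate = fq_c_lc / (fq_c_lc + fq_c_uc).
    Returning 0.0 for an empty total is an arbitrary choice for this sketch.
    """
    total = fq_c_lc + fq_c_uc
    if total == 0:
        return 0.0
    return fq_c_lc / total


def decade_range(decade: int) -> tuple[int, int]:
    """Expand a three-digit decade code into its first and last year.

    Example from the wiki: 147 stands for 1470-1479.
    """
    first_year = decade * 10
    return first_year, first_year + 9


def relative_frequency_ip100m(fq: int, total_tokens: int) -> float:
    """Relative frequency in items per 100 million (i.p.100m).

    ASSUMPTION: total_tokens is the corpus size for the same year or decade;
    the wiki does not state which denominator was used.
    """
    return fq * 100_000_000 / total_tokens


if __name__ == "__main__":
    for form in ("Roma", "ROMA", "roma", "rOMA"):
        print(form, "uppercase" if is_uppercase_form(form) else "lowercase")
    print(decade_range(147))
    print(lowercase_rate(3, 1))
    print(relative_frequency_ip100m(5, 1_000_000))
```

Checking only the first letter keeps all-caps forms such as *ROMA* in the same bucket as *Roma*, which matches the stated goal of identifying potential proper nouns.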