Main content

Contributors:

Date created: | Last Updated:

: DOI | ARK

Creating DOI. Please wait...

Create DOI

Category: Project

Description: There are three main types of number used in modern, industrialised societies. Cardinals count sets (e.g., people, objects) and quantify elements of conventional scales (e.g., money, distance), ordinals index positions in ordered sequences (e.g., years, pages), and nominals serve as unique identifiers (e.g., telephone/player numbers). Many studies that have cited number frequencies in support of claims about numerical cognition and mathematical cognition hinge on the assumption that most numbers analysed are cardinal. This paper is the first to investigate the relative frequencies of different number types, presenting a corpus analysis of morphologically unmarked numbers (not, e.g., ‘eighth’ or ‘21st') in which we manually annotated 3,600 concordances in the Corpus of Contemporary American English. Overall, cardinals are dominant—both pure cardinals (sets) and measurements (scales)—except in the range 1,000–10,000, which is dominated by ordinal years, like 1996 and 2004. Ordinals occur less often overall, and nominals even less so. Only for cardinals do round numbers, associated with approximation, dominate overall and increase with magnitude. In comparison with other registers, academic writing contains a lower proportion of measurements, as well as a higher proportion of ordinals and, to some extent, nominals. In writing, pure cardinals and measurements are usually represented as number words, but measurements—especially larger, unround ones—are more likely to be numerals. Ordinals and nominals are mostly represented as numerals. Altogether, this paper reveals how numbers are used in American English, establishing an initial baseline for any analyses of number frequencies, and shedding new light on the cognitive and psychological study of number.

Wiki

Operating System: macOS Ventura 13.6

Libraries required

  • Python:
    • Re
    • Time
    • OS
    • IterTools
    • Pandas
    • NLTK
    • Word2Number
    • Num2Words
  • R:
    • tidyverse
    • ggpubr

Files

README

  • README.md (current document)

Manuscript

Codebook for number annotation

  • codebook.pdf: Codebook to explain the columns a…

Files

Files can now be accessed and managed under the Files tab.

Citation

Recent Activity

Unable to retrieve logs at this time. Please refresh the page or contact support@osf.io if the problem persists.

OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
Accept
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.
Accept
×

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.