Main content

Home

Menu

Loading wiki pages...

View
Wiki Version:
The materials here represent data and metadata associated with the World_morphosyntax database, as used in Kalyan and Donohue (2023). It is curated by Mark Donohue (mhdonohue@gmail.com), and has been developed since 2010. The original database was based on the most robustly coded languages and features from the *World Atlas of Language Structures* (https://wals.info), but substantially recoded to make it more computationally tractable, filled in to reduce the number of missing values, and extended by the addition of new features. Feedback is welcome on any aspect of the database and its contents, via email: mhdonohue@gmail.com. A visualisation of the results of the analysis of morphosyntactic variables, in the form of an interactive globe, can be found at: https://skalyan91.github.io/d3-language-maps/globe.html?data=ms-points-2023-noPC-Autotyp (best viewed in Google Chrome, for fullest functionality) High-resolution downloadable versions of the map shown on this globe in the form of a flat map that can be zoomed into are available in the set of files available for download at this site. The code used to perform the analyses (in R) presented in the paper cited below is also available from this site, as is a list of errata relating to the article listed below. If you use materials from this site, please cite them as: Kalyan, Siva, and Mark Donohue. 2023. The Dimensions of Morphosyntactic Variation: Whorf, Greenberg and Nichols were right. *Linguistic Typology at the Crossroads* 3 (2): 132-190. Available online at https://typologyatcrossroads.unibo.it/article/view/17482. Data downloaded from https://osf.io/u9qbe/ (Date downloaded). The article has been highlighted on the website of the Living Tongues Institute for Endangered Languages: Linguistic diversity mapped and visualised (https://livingtongues.org/kalyan-donohue-paper-2024/) (18 January 2024). The database available for download here represents the database that was used in the analysis for the article mentioned above. Updated versions of the database, showing back-filling of existing languages, the addition of new languages, or the addition of new features, will be added and noted as such as they become available. For example, the database used in the article contained 945,000 coded features, while that same set of languages totals 959,000 features as of February 2024, and the database with an additional 300 languages added contains 1,045,000 features (as of February 2024).
OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
Accept
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.
Accept
×

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.