## SCOTS Corpus ##

The Scottish Corpus Of Texts and Speech [(SCOTS)][1] is a corpus containing over 1300 written and spoken texts, of which 77% are written and 23% are spoken. The spoken portion was aligned as part of the SPADE project using MFA, with a pronunciation dictionary derived from the Edinburgh accent of [UNISYN][2]. The spoken portion consists largely of spontaneous speech, with some read speech (e.g. poetry reading).

**Number of Speakers:** 226 (139 female), in 179 recordings \
**Hours of Speech:** about 107 \
**Year Recorded:** 1973, 1993, 1998, 2000, 2002-2011 \
**Speaker Dimensions:** Gender, decade of birth, place of birth (birthplace), occupation.

### Corpus Reference ###

Anderson, J., Beavan, D., & Kay, C. (2007). Scots: Scottish corpus of texts and speech. In: Creating and digitizing language corpora. Springer, 17–34.

[1]: http://www.scottishcorpus.ac.uk
[2]: http://www.cstr.ed.ac.uk/projects/unisyn/