The Corpus of Turkish Youth Language (CoTY) was designed to encompass various modes and mediums of youth interaction and expand over the years. The current version includes naturally occurring spontaneous multi-party talk of contemporary spoken Turkish.
The CoTY comprises 168,748 tokens of 24,736 word types within the single domain of informal conversation exclusively among friends. The corpus has 123 unique speakers (62 females and 61 males) between the ages 14 to 18 and consists of 49 conversations which correspond to 26 hours 11 minutes of multi-party spoken interaction.
The corpus construction software [EXMARaLDA][1] and its tools Partitur-Editor, COMA and EXAKT were used to transcribe, annotate, and construct the corpus.
[1]: https://exmaralda.org/en/