PARADISE
- Hannah Seemann
- Sara Shahmohammadi
- Manfred Stede
- Tatjana Scheffler
Category: Project
Description: A German PARAllel DIScoursE annotated multi-media corpus. This repository contains 69 blog posts and 69 podcast transcripts from the 'business' and 'science & culture' domains, annotated for parallel parts and labelled for the type of parallelity (details are provided in the documentation). We also provide the full version of the corpus, which additionally contains the non-parallel parts. This repository will be updated continuously, and the full documentation will be added soon. When using the corpus, please cite: Seemann, Hannah J., Shahmohammadi, Sara, Stede, Manfred, Scheffler, Tatjana (2024): Spoken vs. Written Computer-Mediated Communication. In: Céline Poudat, Mathilde Guernut (eds.): Proceedings of the 11th International Conference on CMC and Social Media Corpora for the Humanities 2024 (CMC-2024). Nice, France, pp. 70-74.