Main content
UgChDial: A Uyghur Chat-based Dialogue Corpus for Response Space Classification
- Zulipiye Yusupujiang
- Jonathan Ginzburg
Date created: | Last Updated:
: DOI | ARK
Creating DOI. Please wait...
Category: Data
Description: We introduce a carefully designed and collected language resource: UgChDial -- a Uyghur dialogue corpus based on a chatroom environment. The Uyghur Chat-based Dialogue Corpus (UgChDial) is divided into two parts: (1). Two-party dialogues and (2). Multi-party dialogues. We ran a series of 25, 120-minutes each, two-party chat sessions, totaling 7323 turns and 1581 question-response pairs. We created 16 different scenarios and topics to gather these two-party conversations. The multi-party conversations were compiled from chitchats in general channels as well as free chats in topic-oriented public channels, yielding 5588 unique turns and 838 question-response pairs. The initial purpose of this corpus is to study query-response pairs in Uyghur, building on an existing fine-grained response space taxonomy for English. We provide here initial annotation results on the Uyghur response space classification task using UgChDial.
Add important information, links, or images here to describe your project.