The Gab Hate Corpus

Brendan Kennedy; Mohammad Atari; Aida Mostafazadeh Davani; Leigh Yeh; Ali Omrani; Yehsong Kim; Kris Coombs; Gwenyth Portillo-Wightman; Shreya Havaldar; Elaine Gonzalez

doi:10.17605/OSF.IO/EDUA3

Category: Project

Description: The growing prominence of online hate speech is a threat to a safe and just society. This endangering phenomenon requires collaboration across the sciences in order to generate evidence-based knowledge of, and policies for, the dissemination of hatred in online spaces. To foster such collaborations, here we present the Gab Hate Corpus (GHC), consisting of 27,665 posts from the social network service gab.ai, each annotated by a minimum of three trained annotators. Annotators were trained to label posts according to a coding typology derived from a synthesis of hate speech definitions across legal, computational, psychological, and sociological research. We detail the development of the corpus, describe the resulting distributions of hate-based rhetoric, target group, and rhetorical framing labels, and establish baseline classification performance for each using standard natural language processing methods. The GHC, which is the largest theoretically justified, annotated corpus of hate speech to date, provides opportunities for training and evaluating hate speech classifiers and for scientific inquiries into the linguistic and network components of hate speech.
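Because each post carries labels from at least three annotators, anyone training a classifier on the corpus first needs one label per post. The sketch below shows one common way to do that, a strict majority vote. It is an illustration only: the row layout, the sample values, the helper name majority_vote, and the tie-breaking rule are all assumptions made here, not the corpus's actual file format or the authors' documented aggregation procedure.

```python
from collections import defaultdict

# Hypothetical per-annotator rows: (post_id, annotator_id, label), where
# label is 1 if the annotator marked the post as hate-based rhetoric, else 0.
# These rows are made up for illustration; they are not GHC data.
annotations = [
    (101, "a1", 1), (101, "a2", 1), (101, "a3", 0),
    (102, "a1", 0), (102, "a2", 0), (102, "a3", 0),
    (103, "a1", 1), (103, "a2", 0), (103, "a3", 0), (103, "a4", 1),
]


def majority_vote(rows, min_annotators=3):
    """Return {post_id: 0 or 1}, skipping posts with too few annotators.

    A post is labeled 1 only when strictly more than half of its
    annotators chose 1, so ties resolve to 0 (an assumption of this sketch).
    """
    votes = defaultdict(list)
    for post_id, _annotator, label in rows:
        votes[post_id].append(label)
    return {
        post_id: int(sum(post_labels) * 2 > len(post_labels))
        for post_id, post_labels in votes.items()
        if len(post_labels) >= min_annotators
    }


labels = majority_vote(annotations)
print(labels)
```

Resolving ties to 0 is a conservative choice for a rare positive class; a project that prefers recall over precision might instead flag tied posts for review.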

License: CC-By Attribution 4.0 International

Has supplemental materials for "Introducing the Gab Hate Corpus: Defining and applying hate-based rhetoric to social media posts at scale" on PsyArXiv