A Benchmark Dataset of Check-Worthy Factual Claims

07/06/2020

Fatma Arslan, Naeemul Hassan, Chengkai Li, Mark Tremayne

Keywords: building, claims, communities, elections, humans, sources, traditional

Abstract: In this paper, we present the ClaimBuster dataset of 23,533 statements extracted from all U.S. general election presidential debates and annotated by human coders. The ClaimBuster dataset can be leveraged in building computational methods to identify claims worth fact-checking from the myriad sources of digital or traditional media. The ClaimBuster dataset is publicly available to the research community at http://doi.org/10.5281/zenodo.3609356.

ICWSM

The talk and the corresponding paper were published at the ICWSM 2020 virtual conference.
