BABE Dataset (2022)

Bias Annotations By Experts (Media Bias Group)

4.12k records | 2023 | Media Bias Group

Labels are human-annotated, and all annotators must agree on each one. The BABE paper reports strong results with BERT for sequence classification of news articles. Although BABE is smaller than some other datasets, its annotations are very reliable, which makes it highly recommended as an external dataset for model evaluation.
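
As a rough illustration of using BABE as a held-out evaluation set, the sketch below scores binary predictions against gold labels. It assumes that 1 marks the biased class (worth confirming against the dataset itself). The helper `binary_f1` and the sample lists are hypothetical and are not taken from BABE or its paper.

```python
def binary_f1(gold, pred, positive=1):
    """Return (precision, recall, f1) for the positive class.

    Assumption: `positive=1` denotes the biased class.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for g, p in zip(gold, pred) if g == positive and p == positive)
    fp = sum(1 for g, p in zip(gold, pred) if g != positive and p == positive)
    fn = sum(1 for g, p in zip(gold, pred) if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Invented labels, for illustration only (not BABE data).
gold = [1, 0, 1, 1, 0, 0]
pred = [1, 0, 0, 1, 1, 0]
precision, recall, f1 = binary_f1(gold, pred)
```

In practice `gold` would come from the dataset's label field and `pred` from the model under evaluation. A library such as scikit-learn offers equivalent metrics.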

🤗HuggingFace Dataset

📑 Contents

Fields | Description
text | The text fragment (a few sentences or fewer).
outlet | The source of the text fragment.
label | Binary label (0 or 1) indicating whether the fragment is biased or unbiased.
topic | The subject of the text fragment.
news_link | URL to the original source.
biased_words | List of the full words that contribute to bias.
type | Political sentiment (if applicable).
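
To show how these fields might be used together, here is a minimal sketch that computes the share of biased fragments per outlet. The records are invented placeholders that only mirror the field names. The function `biased_share_by_outlet` is hypothetical, and treating 1 as the biased label is an assumption to check against the dataset.

```python
from collections import defaultdict

# Invented placeholder rows; only the field names follow the dataset.
records = [
    {"text": "Example sentence one.", "outlet": "outlet_a", "label": 1,
     "topic": "topic_x", "news_link": "https://example.com/1",
     "biased_words": ["word1", "word2"], "type": "left"},
    {"text": "Example sentence two.", "outlet": "outlet_a", "label": 0,
     "topic": "topic_x", "news_link": "https://example.com/2",
     "biased_words": [], "type": "center"},
    {"text": "Example sentence three.", "outlet": "outlet_b", "label": 1,
     "topic": "topic_y", "news_link": "https://example.com/3",
     "biased_words": ["word3"], "type": "right"},
]


def biased_share_by_outlet(rows, biased_label=1):
    """Map each outlet to the fraction of its rows carrying the biased label.

    Assumption: `biased_label=1` denotes the biased class.
    """
    totals = defaultdict(int)
    biased = defaultdict(int)
    for row in rows:
        totals[row["outlet"]] += 1
        if row["label"] == biased_label:
            biased[row["outlet"]] += 1
    return {outlet: biased[outlet] / totals[outlet] for outlet in totals}


shares = biased_share_by_outlet(records)
```

Rows loaded with the Hugging Face `datasets` library can be iterated as dictionaries in the same way. The dataset identifier is not given here, so the loading call is left out.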

📄 Research Paper
