BABE Dataset (2022)

Bias Annotations By Experts (Media Bias Group)



4.12k records | 2023

Human-annotated, and all annotators must agree on each label. In the BABE paper, BERT achieved strong results on sequence classification of news articles. Although BABE is smaller than some other datasets, its annotations are very reliable, so it is highly recommended as an external dataset for model evaluation.
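When BABE is used as an external evaluation set, the comparison comes down to scoring a model's predictions against the `label` column. The sketch below is one minimal way to do that in plain Python. The function name `binary_eval` is hypothetical, and treating label `1` as the positive ("biased") class is an assumption to confirm on the dataset card.

```python
from typing import Dict, Sequence


def binary_eval(gold: Sequence[int], pred: Sequence[int], positive: int = 1) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for binary labels.

    `positive` is the label value treated as the "biased" class. The default
    of 1 is an assumption, not something this page confirms.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if len(gold) == 0:
        raise ValueError("need at least one example")

    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    correct = sum(1 for g, p in pairs if g == p)

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0

    return {
        "accuracy": correct / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

For example, `binary_eval(gold_labels, model_predictions)` takes two equal-length lists of 0/1 values, where `gold_labels` would come from the dataset's `label` field and `model_predictions` from the model under test.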

🤗 HuggingFace Dataset

📑 Contents

| Fields | Description |
| --- | --- |
| text | The text fragment (a few sentences or less). |
| outlet | The source of the text fragment. |
| label | 0 or 1 (biased or unbiased). |
| topic | The subject of the text fragment. |
| news_link | URL to the original source. |
| biased_words | Full words contributing to bias, in a list. |
| type | Political sentiment (if applicable). |
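The fields listed above can be checked programmatically once the data is loaded. The following is a rough sketch, not an official loader. The helper names (`missing_fields`, `label_counts`, `load_babe`) are hypothetical. The repository id `mediabiasgroup/BABE` is taken from the dataset link on this page, and the split name `"train"` is an assumption.

```python
from collections import Counter
from typing import Any, Iterable, List, Mapping

# Column names as documented in the Contents section of this page.
BABE_FIELDS = ("text", "outlet", "label", "topic", "news_link", "biased_words", "type")


def missing_fields(record: Mapping[str, Any]) -> List[str]:
    """Return the documented fields that are absent from a record."""
    return [name for name in BABE_FIELDS if name not in record]


def label_counts(records: Iterable[Mapping[str, Any]]) -> Counter:
    """Count how many records carry each value of `label`."""
    return Counter(record["label"] for record in records)


def load_babe(split: str = "train"):
    """Download BABE from the Hugging Face Hub (requires `pip install datasets`).

    The split name is an assumption; check the dataset card for the
    splits that are actually published.
    """
    from datasets import load_dataset  # imported lazily, only needed for the download

    return load_dataset("mediabiasgroup/BABE", split=split)
```

With network access, `babe = load_babe()` followed by `label_counts(babe)` would report the class balance, and `missing_fields(babe[0])` would flag any column name that differs from the list documented here.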

📄 Research Paper

Media Bias Group
mediabiasgroup/BABE · Datasets at Hugging Face (huggingface)
Neural Media Bias Detection Using Distant Supervision With BABE --... (arXiv.org)