UPDATE: Zenodo migration postponed to Oct 13 from 06:00-08:00 UTC. Read the announcement.
There is a newer version of this record available.

Dataset Open Access

SemEval 2019 Task 4 - Hyperpartisan News Detection

Johannes Kiesel; Martin Potthast; Maria Mestre; Rishabh Shukla; Benno Stein; David Corney; Emmanuel Vincent; Payam Adineh


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.1406208</identifier>
  <creators>
    <creator>
      <creatorName>Johannes Kiesel</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-1617-6508</nameIdentifier>
      <affiliation>Bauhaus-Universität Weimar</affiliation>
    </creator>
    <creator>
      <creatorName>Martin Potthast</creatorName>
      <affiliation>Leipzig University</affiliation>
    </creator>
    <creator>
      <creatorName>Maria Mestre</creatorName>
      <affiliation>Factmata Ltd.</affiliation>
    </creator>
    <creator>
      <creatorName>Rishabh Shukla</creatorName>
      <affiliation>Factmata Ltd.</affiliation>
    </creator>
    <creator>
      <creatorName>Benno Stein</creatorName>
      <affiliation>Bauhaus-Universität Weimar</affiliation>
    </creator>
    <creator>
      <creatorName>David Corney</creatorName>
    </creator>
    <creator>
      <creatorName>Emmanuel Vincent</creatorName>
      <affiliation>Factmata Ltd.</affiliation>
    </creator>
    <creator>
      <creatorName>Payam Adineh</creatorName>
      <affiliation>Bauhaus-Universität Weimar</affiliation>
    </creator>
  </creators>
  <titles>
    <title>SemEval 2019 Task 4 - Hyperpartisan News Detection</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2018</publicationYear>
  <subjects>
    <subject>Hyperpartisan news</subject>
    <subject>SemEval</subject>
    <subject>SemEval 2019</subject>
    <subject>SemEval 2019 Task 4</subject>
    <subject>Biased news</subject>
    <subject>News bias</subject>
    <subject>Hyperpartisan</subject>
    <subject>Hyperpartisanship</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2018-09-03</date>
  </dates>
  <language>en</language>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/1406208</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsReferencedBy">https://pan.webis.de/semeval19/semeval19-web/</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.1310145</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/pan</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/webis</relatedIdentifier>
  </relatedIdentifiers>
  <version>Trial v3</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;Third trial dataset for the SemEval 2019 Task 4: Hyperpartisan News Detection.&lt;/p&gt;

&lt;p&gt;The dataset contains 1 million articles. It is split in training (200,000 left, 400,000 least, 200,000 right) and validation (50,000 left, 100,000 least, 50,000 right), where &lt;strong&gt;no&lt;/strong&gt; publisher that occurs in the training set also occurs in the validation set. All articles are labeled by the overall bias of the publisher as provided by BuzzFeed journalists or MediaBiasFactCheck.com.&lt;/p&gt;

&lt;p&gt;The trial data is not fully cleaned. Due to some encoding error, some characters are replaced by question marks. However, all files are already fully compatible with the XML schema files.&lt;/p&gt;</description>
  </descriptions>
</resource>
21,135
24,031
views
downloads
All versions This version
Views 21,1358,932
Downloads 24,0311,952
Data volume 10.6 TB982.7 GB
Unique views 17,1688,432
Unique downloads 5,745550

Share

Cite as