Fun with Community Notes data

several types of data related to X/Twitter’s Community Notes feature are available for downloadAmong the features of X/Twitter’s crowdsourced fact-checking system, Community Notes (formerly known as Birdwatch), is the ability to download and study the underlying data. Four types of data are available: the full set of notes (both displayed and otherwise), the ratings these notes have received from users, the current and historical status of each note, and an anonymized list of participating users. (An important caveat to the anonymity of Community Notes authors: it is possible to determine which notes were written by the same author, which in some cases may render the author identifiable.) This article focuses on two of those datasets, specifically the note and status data.Community Notes data is stored in .TSV (tab-separated values) files, which are plain text files that can be read by a variety of programming language libraries as well as most spreadsheet software. The examples in this article are in Python, but equivalent code can be conjured up in pretty much any programming language. First, let’s load the note and status files into data frames and plot a basic bar chart showing the daily volume of notes colored by the current note status (helpful, not helpful, or needs more ratings).import bokeh.plotting as bk import bokeh.palettes as pal import bs4 import pandas as pd import requests import time # load the notes and their current status notes = pd.read_csv (“notes-00000.tsv”, sep=”t”) notes[“t”] = pd.to_datetime (notes[“createdAtMillis”], unit=”ms”) print (“total notes: ” + str…Fun with Community Notes data

Leave a Reply

Your email address will not be published. Required fields are marked *