Political Twitter Discourse Corpus (PTDC)


The Political Twitter Discourse Corpus (PTDC) has been created for use as a discourse reference corpus tool in the focused analysis of political discourse occurring on Twitter - particularly for comparative keyword analyses. It is comprised of the most recent original tweets (i.e. not inclusive of retweets, up to a maximum of 3000 per user) of all serving US state governors, members of congress and senators. The PTDC consists of 205,303 individual tweets and 4,659,381 words.