Trending topics on cable news: the newsflash package

[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Want to know what's capturing the attention of the producers at the 24-hour cable news stations? There's no equivalent of Twitter's trending topics for the likes of CNN or BBC News, but the newsflash package for R by Bob Rudis can extract the latest trending topics from the TV news stations.

The newsflash package is an interface to the GDELT Project's Television Explorer, which provides access to the closed-captioning transcripts from seven major cable-news stations, with archives available for the past 6 years. In particular, it provides access to the top trending “entities” (in the sense of the Stanford Names Entity Recognizer), ranked by the number of sentences in which they are mentioned during the last 24 hours. You can see R code extracting the rankings here.

The newsflash package is still in alpha-test mode and only available on Github (and not yet on CRAN). Also, it seems that the GDELT API can be a little unreliable and sometimes fails to return results. Nonetheless, it looks to be a useful resource for exploring what the TV news networks are reporting.

rud.is: Teasing Out Top Daily Topics with GDELT’s Television Explorer

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)