Dataset: Tweets from the Chinese Protests #cn220

February 20, 2011
By

(This article was first published on Michael Bommarito » r, and kindly contributed to R-bloggers)

  Earlier this week, I posted a ~100k tweet dataset on the #25bahman protests in Iran.  The corresponding figure of frequencies showed a strong presence on Twitter, with over 500 tweets per 5 minute period at peak.  You can download the dataset or check out the figure in that post.

  I decided to take a quick snapshot of the corresponding #cn220 protests in China.  There are clearly a larger number of sampling issues in this case than in the previous cases such as Tunisia and Egypt.  First, while China has many microblogging users, the market is significantly more fragmented.  Second, China has a large infrastructure dedicated to "managing" content and flow of content online.  Therefore, this sample should be viewed as a portrait, not a perfect characterization of online coordination regarding #cn220.  You can download the tweets here, view the code for plotting below the break, and enjoy the simple 5-minute frequency plot below.

Email Google Buzz Facebook Twitter Share

To leave a comment for the author, please follow the link and comment on his blog: Michael Bommarito » r.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Tags: , , , , , ,

Comments are closed.