A map of worldwide email traffic, created with R

March 12, 2013

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

The Washing Post reports that by analyzing more than 10 million emails sent through the Yahoo! Mail service in 2012, a team of researchers used the R language to create a map of countries whose citizens email each other most frequently:

Email map

The chart above shows the top 1000 country-country pairs by email frequency, arranged in a clustered network using the R igraph package. While it's not surprising that countries that share proximity, language and culture might tend to email each other more often, it's interesting that the clusters tend to resemble those of Huntington's Clash of Civilizations, whose "major civilizations" are used to color-code the map. (A formal test of similarity was done using R's SNA package, with country-specific factors corrected for using a linear mixed-effects model.) On this point, the authors write:

In this respect we cautiously assign a level of validity to Huntington’s contentions, with a few caveats. The first issue was already mentioned – overlap between civilizations and other factors contributing to countries’ level of association. Huntington’s thesis is clearly reflected in the graph [above], but some of these civilizational clusters are found to be explained by other factors … The second limitation concerns the fact that we investigated a communication network. There is no necessary “clash” between countries that do not communicate, and Huntington’s thesis was concerned primarily with ethnic conflict.

Indeed, the validity of Huntington’s ideas with respect to ethnic conflict has come into controversy, and we limit ourselves to showing the validity — at least partial — of this division for communication networks.

The Washington Post article (linked below) has more discussion about the research, and you can find details of the methodology in the arXiv paper.

Washington Post: An incredible map of which countries e-mail each other, and why (via reader GF, with thanks)

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.


Mango solutions

RStudio homepage

Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training




CRC R books series

Six Sigma Online Training

Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)