6174 search results for "git"

Notes from the Kölner R meeting, 12 September 2014

September 16, 2014
By
Notes from the Kölner R meeting, 12 September 2014

Last Friday we had guests from Belgium and the Netherlands joining us in Cologne. Maarten-Jan Kallen from BeDataDriven came from The Hague to introduce us to Renjin, and the guys from DataCamp in Leuven, namely Jonathan, Martijn and Dieter, gave an overview of their new online interactive training platform.RenjinMaarten-Jan gave a fascinating introduction to Renjin,...

Read more »

how to provide a variance calculation on your public-use survey data file without disclosing sampling clusters or violating respondent confidentiality

September 16, 2014
By

this post and accompanying syntax would not have been possible without dan oberski.  read more, find out why.  thanks dan.dear survey administrator: someone sent you this link because you work for an organization or a government agency that c...

Read more »

Using Reddit’s JSON API to analyze post popularity

September 15, 2014
By
Using Reddit’s JSON API to analyze post popularity

Graduate student Clay McLeod decided to find out what makes a post on the social-sharing site Reddit popular. These are the questions he seeks to answer: What’s in a post? Reddit pulls in around 115 million unique visitors each month, amassing a staggering 5 billion page views per month. For a long time, I’ve wondered what factors draw people...

Read more »

Creating a map showing land covered by rising sea levels

September 15, 2014
By

I joined the Geekli.st climate Hackathon this weekend at the Hub Westminster (my favorite venue for Hackathons). While the organizers had lots of enthusiasm they had very little in the way of data for us to work on. No problem, ever since the Flood-relief hackathon I have wanted to use the SRTM ‘whole Earth’ elevation

Read more »

Mapping every IPv4 address

September 15, 2014
By
Mapping every IPv4 address

During July I was working with a commercial data source that provides extra data around IP addresses and it dawned on me: rather than pinging billions of IP addresses and creating map, I could create a map from all the geolocation data I had at my finger tips. At a high level I could answer “Where are all the IPv4 addresses worldwide?” But in...

Read more »

PCA / EOF for data with missing values – a comparison of accuracy

September 15, 2014
By
PCA / EOF for data with missing values – a comparison of accuracy

Not all Principal Component Analysis (PCA) (also called Empirical Orthogonal Function analysis, EOF) approaches are equal when it comes to dealing with a data field that contain missing values (i.e. "gappy"). The following post compares several methods by assessing the accuracy of the derived PCs to reconstruct the "true" data set, as was similarly...

Read more »

How do you say π^π^π?

September 15, 2014
By
How do you say π^π^π?

Well, not that you really probably want to know how to say such an absurdly large number. However for those of you who are interested (allowing for rounding) it is:one quintillion, three hundred forty quadrillion, one hundred sixty-four trillion, one h...

Read more »

Departure of 2014 US Open Tennis Players

September 14, 2014
By
Departure of 2014 US Open Tennis Players

This was generated using R and ggplot2. The code and details are coming soon. Departure of 2014 US Open Tennis Players was originally published by Vivek Patil at Adventures in Analytics and Visualization on September 15, 2014.

Read more »

Google uses R to calculate ROI on advertising campaigns

September 12, 2014
By
Google uses R to calculate ROI on advertising campaigns

Google has just released a new package for R: CausalImpact. Amongst many other things, this package allows Google to resolve the classical conundrum: how can we asses the impact of an intervention (for example, the effect of an advertising campaign on website clicks) when we can't know what would have happened if we hadn't run the campaign? For a...

Read more »

R: k-Means Clustering on an Image

September 12, 2014
By
R: k-Means Clustering on an Image

Enough with the theory we recently published, let's take a break and have fun on the application of Statistics used in Data Mining and Machine Learning, the k-Means Clustering.k-means clustering is a method of vector quantization, originally from signa...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)