Identifying Position Change Groupings in Rank Ordered Lists

December 9, 2014
By
Identifying Position Change Groupings in Rank Ordered Lists

The title says it all, doesn’t it?! Take the following example – it happens to show race positions by driver for each lap of a particular F1 grand prix, but it could be the evolution over time of any rank-based population. The question I had in mind was – how can I identify positions that

Read more »

Next Kölner R User Meeting: Friday, 12 December 2014

December 9, 2014
By
Next Kölner R User Meeting: Friday, 12 December 2014

The next Cologne R user group meeting is scheduled for this Friday, 12 December 2014.We have an exciting agenda with two talks on Julia and Dynamic Linear Models:Introduction to Julia for R Users Hans Werner BorchersJulia is a high-performance dynamic programming language for scientific computing, with a syntax that is familiar to users of other technical...

Read more »

maps and the art of survey-weighted maintenance

December 9, 2014
By
maps and the art of survey-weighted maintenance

introducing swmap: cartography with complex survey data. this collection of publicly-available r syntax implements granular geographic variation visualizations similar to those seen in the popular press without compromising on survey methodology.thanks to david rae, paul regular, joseph larmarange, djalma pessoa, hadley wickham, thomas lumley, john baumgartner, viviane quintaes,

Read more »

Going Bananas #1: Superman Was Born In Kentucky!

December 8, 2014
By
Going Bananas #1: Superman Was Born In Kentucky!

Who knows? Could be the tropic heat or something that I eat, that makes me gonzo (I’m Going Bananas, Madonna) This is the first post of a new category called “Going Bananas” in which I will post totally crazy experiments. My aim is to try some techniques without taking care of making serious things. These are … Continue reading...

Read more »

Am I a data scientist?

December 8, 2014
By
Am I a data scientist?

Last night I gave a very short talk (less than 5 minutes) at the Melbourne Analytics Charity Christmas Gala, a combined event of the Statistical Society of Australia, Data Science Melbourne, Big Data Analytics and Melbourne Users of R Network. This is (roughly) what I said. Statisticians seem to go through regular periods of existential crisis as

Read more »

Two New Cumulative Sum Functions

December 8, 2014
By
Two New Cumulative Sum Functions

In this document on RPubs, I describe two functions in FSA for computing cumulative sums.  The first is pcumsum() for computing the cumulative sum NOT including the current value and the second is rcumsum() for computing the “reverse” cumulative sum.Filed … Continue reading →

Read more »

amazonish thanks (& repeated warning)

December 8, 2014
By
amazonish thanks (& repeated warning)

As in previous years, at about this time, I want to (re)warn unaware ‘Og readers that all links to Amazon.com and more rarely to Amazon.fr found on this blog are actually susceptible to earn me an advertising percentage if a purchase is made by the reader in the 24 hours following the entry on Amazon

Read more »

Switch from SAS, SPSS or STATA to R with our latest course

December 8, 2014
By
Switch from SAS, SPSS or STATA to R with our latest course

If you already know SAS, SPSS or Stata, you don’t need to spend time learning how to analyze data. You need a course that focuses on translating your knowledge into R. A course that facilitates switching from SAS, SPSS or STATA to R. That’s why DataCamp’s latest interactive course focuses on statisticians, data analysts, academic institutions, The post

Read more »

Importing Illumina BeadArray data into R

December 8, 2014
By

A colleague needed some help getting Illumina BeadArray gene expression data loaded into R for data analysis with limma. Hopefully whoever ran your arrays can export the data as text files formatted as described in the code below. If so, you can import...

Read more »

Thursday Dec 11: Webinar on sports analytics with R and Storm

December 8, 2014
By

A quick heads-up that this Thursday (December 11), Allen Day from MapR and Bill Jacobs from Revolution Analytics will be live presenting a new webinar, Batter Up! Advanced Sports Analytics with R and Storm. The analysis will be of baseball data, but the webinar will be of interest to anyone interested in doing large-scale statistical analysis with R of...

Read more »

Revolution R Enterprise tutorial: Free 8h interactive tutorial on Big Data Analytics

December 8, 2014
By
Revolution R Enterprise tutorial: Free 8h interactive tutorial on Big Data Analytics

In need for better ways to handle large data sets? Interested in manipulating, visualizing, and analysing large datasets with RevoScaleR? Then make sure to have a look at this free hands-on Revolution R Enterprise tutorial on Big Data Analytics by Revolution Analytics and DataCamp. Everything takes place in the online interactive learning interface of DataCamp, so no The post

Read more »

Bonds are boring…read this

December 8, 2014
By

If you would have invested in 1992 in the DAX ETF - provided it would have been around, of course - you would have earned a decent amount of money.That's the story of the passive guys and in my previous post I'm borrowing a few arguments of this guys t...

Read more »

tidyr 0.2.0 (and reshape2 1.4.1)

December 8, 2014
By
tidyr 0.2.0 (and reshape2 1.4.1)

tidyr 0.2.0 is now available on CRAN. tidyr makes it easy to “tidy” your data, storing it in a consistent form so that it’s easy to manipulate, visualise and model. Tidy data has variables in columns and observations in rows, and is described in more detail in the tidy data vignette. Install tidyr with: install.packages("tidyr")

Read more »

Comparing the Bootstrap and Cross-Validation

December 8, 2014
By
Comparing the Bootstrap and Cross-Validation

This is the second of two posts about the performance characteristics of resampling methods. The first post focused on the cross-validation techniques and this post mostly concerns the bootstrap. Recall from the last post: we have some simulations to evaluate the precision and bias of these methods. I simulated some regression data (so that I know the real...

Read more »

Risk as a “Survival Variable”

December 8, 2014
By
Risk as a “Survival Variable”

I come across a lot of strategies on the blogosphere some are interesting some are a complete waste of time but most share a common feature: people developing those strategies do their homework in term of analyzing the return but much less attention is paid to the risk side its random nature. I’ve seen comment like “a

Read more »

Mucking around with maps, schools and ethnicity in NZ

December 7, 2014
By
Mucking around with maps, schools and ethnicity in NZ

I’ve been having a conversation for a while with @kamal_hothi and @aschiff on maps, schools, census, making NZ data available, etc. This post documents some basic steps I used for creating a map on ethnic diversity in schools at the census-area-unit level. This “el quicko” version requires 3 ingredients: Census area units shape files (available

Read more »

Snowdoop, Part II

December 7, 2014
By
Snowdoop, Part II

In my last post, I questioned whether the fancy Big Data processing tools such as Hadoop and Spark are really necessary for us R users.  My argument was that (a) these tools tend to be difficult to install and configure, especially for non-geeks; (b) the tools require learning new computation paradigms and function calls; and … Continue reading...

Read more »

Names in the U.S., from James Smith to Jose Rodriguez

December 7, 2014
By
Names in the U.S., from James Smith to Jose Rodriguez

Two weeks ago, @mona published an interesting post on her blog, about a difficult question, What’s The Most Common Name In America? There were stats about first names, in the U.S., and last names, too. Those informations are - somehow - easy to get. But usually, it is more complicated to get the first and the last name together....

Read more »

Fibonacci Sequence in R with Memoization

December 7, 2014
By
2014-12-08 16_36_20-Clipboard

This is in response to Andrew Z’s post on R-Bloggers Friday about using recursion to calculate numbers in the Fibonacci sequence. http://heuristicandrew.blogspot.com/2014/12/fibonacci-sequence-in-r-and-sas.html I’ve re-written the author’s Fibonacci function here. The only really change is that this one is extended to … Continue reading →

Read more »

auto-complete in ESS

December 7, 2014
By

To support auto complete in Emacs, we need to install the auto-complete extension. To activate auto complete, we need to load it at Emacs startup. Read More: 440 Words Totally

Read more »

SAS PROC MCMC in R: Nonlinear Poisson Regression Models

December 6, 2014
By
SAS PROC MCMC in R: Nonlinear Poisson Regression Models

In exercise 61.1 the problem is that the model has bad mixing. In the SAS manual the mixing is demonstrated after which a modified distribution is used to fix the model.In this post the same problem is tackled in R; MCMCpack, RJags, RStan and LaplaceDemon. MCMCpack has quite some mixing problems, RStan seems to do best.DataTo quote the SAS...

Read more »

Archetypal Analysis: Similarity Defined by Distances from Contrasting Ideals

December 5, 2014
By
Archetypal Analysis: Similarity Defined by Distances from Contrasting Ideals

Carl Jung was at least partially correct. We do tend to think in terms of the extremes as shown in this archetypal wheel with rulers versus outlaws and heroes versus caregivers at different ends of bipolar dimensions. Happily, we are not required to ac...

Read more »

Cindy Brewer: helping you choose better color scales for maps

December 5, 2014
By
Cindy Brewer: helping you choose better color scales for maps

The choice of colors you use in a statistical graphic isn't just about making your chart look good: the colors you choose are often critical to interpretation. For example, you wouldn't want to use a scale like this to represent, say, average income on a map: That palette would be suitable for qualitative data without implicit ordering (say, political...

Read more »

Winning a Marathon

December 5, 2014
By
Winning a Marathon

The proliferation and participation in the marathon has increased substantially in recent years.  No longer is the distance an event reserved for the super-athletic, but at least in the US one can from many vantage points on highways or streets se...

Read more »

Visualize Prospect Theory

December 5, 2014
By
Visualize Prospect Theory

Cool Prospect Theory Visualizer draws the curves for you. The post Visualize Prospect Theory appeared first on Decision Science News.

Read more »

Fibonacci sequence in R and SAS

December 5, 2014
By

Because the Fibonacci sequence is simply defined by recursion, it makes for an elegant programming exercise. Here is one way to do it in SAS, and another way to do it in R. I've also included unit testing code to check that it works. Fibonacci sequence in SAS using a recursive macro: %macro fib(n);%if &n = 1 %then 1;...

Read more »

Learn ggvis with RStudio and DataCamp

December 5, 2014
By
Learn ggvis with RStudio and DataCamp

We’ve teamed up with DataCamp to make a self-paced online course that teaches ggvis, the newest data visualization package by Hadley Wickham and Winston Chang. The ggvis course pairs challenging exercises, interactive feedback, and “to the point” videos to let you learn ggvis in a guided way. In the course, you will learn how to

Read more »

Data point locator function

December 5, 2014
By
Data point locator function

Here's a little function to select data points in an open graphical device (ptlocator()). The function does a scaling of the x and y axes in order to give them equal weighting and remove the influence of differing units or ranges. The function then cal...

Read more »

New features in jsonlite 0.9.14

December 4, 2014
By
New features in jsonlite 0.9.14

The jsonlite package implements a robust, high performance JSON parser and generator for R, optimized for statistical data and the web. This week version 0.9.14 appeared on CRAN which adds some handy new features. Significant Digits By defaul...

Read more »