Monthly Archives: December 2013

24 Days of R: Day 23

December 23, 2013

Penultimate post: I'm going to take a quick look at the Gini indicator for wealth inequality. The data comes from the World Bank. I've downloaded the zipped file, decompressed it and given it a different name. I'm going to … This will give us a decent set of data. How does this look when we plot it?


Apache Spark for Big Analytics

December 23, 2013

by Thomas Dinsmore, Director of Product Management at Revolution Analytics

The emergence of Apache Spark is a key development for Big Analytics in 2013. Spark, an Apache incubator project, is an open source distributed computing framework for advanced analytics in Hadoop. Originally developed as a research project at UC Berkeley's AMPLab, the project achieved incubator status in Apache in...


New book release: Data Mining Applications with R

December 23, 2013

Book title: Data Mining Applications with R
Editors: Yanchang Zhao, Yonghua Cen
Publisher: Elsevier
Publish date: December 2013
ISBN: 978-0-12-411511-8
Length: 514 pages
URL: http://www.rdatamining.com/books/dmar

An edited book titled Data Mining Applications with R was released in December 2013, which …


Further adventures with higher moments

December 23, 2013

Additional views of the stability of skewness and kurtosis of equity portfolios.

Previously

A post called “Four moments of portfolios” introduced the idea of looking at the stability of the mean, variance, skewness and kurtosis of portfolios through time. That post gave birth to a presentation at the London Quant Group. That talk gave birth …


Calculating Customer Lifetime Value with Recency, Frequency, and Monetary (RFM)

December 23, 2013

Introducing Customer Lifetime Value (CLV)

Customer Lifetime Value is “the present value of the future cash flows attributed to the customer during his/her entire relationship with the company.”[1] There are different kinds of formulas, from simplified to advanced, for calculating CLV, but the following one is probably the most commonly used: Where, t


24 Days of R: Day 22

December 22, 2013

I like to use Goodreads to keep track of which books I'm reading (and not reading). They very helpfully sent me an e-mail to inform me how many books I've read so far in 2013. The number is 19. Hardly an impressive number, but between job, family and trying to develop my R skills, I'm


Pharmacology data from Open PHACTS in R

December 22, 2013

Yesterday I reported the initial steps of a client R package for Open PHACTS; I can now report the first pharmacology data import:


The Relationship between Vectorized and Devectorized Code

December 22, 2013

Introduction

Some people have come to believe that Julia’s vectorized code is unusably slow. To correct this misconception, I outline a naive benchmark below that suggests that Julia’s vectorized code is, in fact, noticeably faster than R’s vectorized code. When experienced Julia programmers suggest that newcomers should consider devectorizing code, we’re not trying to beat


Google Maps and ggmap

December 22, 2013

The ggmap package can be used to access maps from the Google Maps API, and there are a number of examples on various statistics-related blogs. The ggmap package has a function get_map that can download maps from various sources including Google Maps.

    require(ggmap)

The first example specifies the


A new Oce coastline

December 22, 2013

Part 1: coastal applications

The oce package has a world coastline file that is visibly crude on a scale suitable for continental-shelf work. At the cost of about 1 Mbyte of storage, a candidate for a replacement is a 1:10 million scale version downloaded from naturalearthdata.com (full link below). As illustrated below with plots near two oceanographic centres, this candidate provides...

