3744 search results for "gis"

Comparing the #code2013 results with the current TIOBE rankings

January 2, 2014
By
Comparing the #code2013 results with the current TIOBE rankings

The TIOBE language rankings have always been controversial but in the absence of more meaningful metrics tends to be viewed as holy writ. Over the last few days of 2013 a hashtag was started by Twitter user @deadprogram called #code2013. The ...

Read more »

Quickly Create Dummy Variables in a Data Frame

January 2, 2014
By
Quickly Create Dummy Variables in a Data Frame

On Quora, a question was asked about how to fix the error of the randomForest package in R not being able to handle more than 32 levels in a categorical variable. Seeing as how I’ve seen this question asked on Kaggle forums, StackOverflow and elsewhere, here’s the answer: code your own dummy variables instead of Quickly Create...

Read more »

Generalized linear models for predicting rates

January 1, 2014
By
Generalized linear models for predicting rates

I often need to build a predictive model that estimates rates. The example of our age is: ad click through rates (how often a viewer clicks on an ad estimated as a function of the features of the ad and the viewer). Another timely example is estimating default rates of mortgages or credit cards. You Related posts:

Read more »

Subsetting in readOGR

December 31, 2013
By

The function readOGR in the rgdal package is used to bring vector spatial data sources into R. readOGR() relies upon OGR (part of the GDAL/OGR library) for format conversion. Unfortunately, while OGR supports the ability to subset columns (with the -select switch) or rows (with the -where switch), or even to request a layer using

Read more »

Summarising multivariate palaeoenvironmental data

December 28, 2013
By
Summarising multivariate palaeoenvironmental data

Ordination methods that yield orthogonal axes of variation are often used to summarise the multivariate data obtained from sediment cores. Usually the first or, less often, the first few ordination axes are taken as directions of change or the main patterns of variance in the multivariate data. There is an oft-overlooked issue with this approach that has the potential...

Read more »

Top Songs by Artist on CD102.5 in 2013

December 27, 2013
By
Top Songs by Artist on CD102.5 in 2013

In a previous post, I showed you how to scrape playlist data from Columbus, OH alternative rock station CD102.5. Since it's the end of the year and best-of lists are all the fad, I thought I would share the most popular songs and artists of the year, a...

Read more »

Points, Polygons and Power Outages

December 27, 2013
By
Points, Polygons and Power Outages

Most of my free coding time has been spent tweaking a D3-based live power outage tracker for Central Maine Power customers (there’s also a woefully less-featured Shiny app for it, too). There is some R associated with the D3 vis, but it’s limited to a cron job that’s makes the CSV files for the sparklines

Read more »

Top Songs by Artist on CD102.5 in 2013

December 26, 2013
By
Top Songs by Artist on CD102.5 in 2013

In a previous post, I showed you how to scrape playlist data from Columbus, OH alternative rock station CD102.5. Since it's the end of the year and best-of lists are all the fad, I thought I would share the most popular songs and artists of the year, ...

Read more »

Apache Spark for Big Analytics

December 23, 2013
By
Apache Spark for Big Analytics

by Thomas Dinsmore, Director of Product Management at Revolution Analytics The emergence of Apache Spark is a key development for Big Analytics in 2013. Spark, an Apache incubator project, is an open source distributed computing framework for advanced analytics in Hadoop. Originally developed as a research project at UC Berkeley's AMPLab, the project achieved incubator status in Apache in...

Read more »

Calculating Customer Lifetime Value with Recency, Frequency, and Monetary (RFM)

December 23, 2013
By
Calculating Customer Lifetime Value with Recency, Frequency, and Monetary (RFM)

Introducing Customer Lifetime Value (CLV) Customer Lifetime Value is “the present value of the future cash flows attributed to the customer during his/her entire relationship with the company.”1 There are different kinds of formulas, from simplified to advanced, to calculate CLV.  But the following one might be the one being used most commonly:- Where, t

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)