654 search results for "sql"

Introducing dplyr

January 20, 2014
By
Introducing dplyr

dplyr is a new package which provides a set of tools for efficiently manipulating datasets in R. dplyr is the next iteration of plyr, focussing on only data frames. dplyr is faster, has a more consistent API and should be easier to use. There are three key ideas that underlie dplyr: Your time is important,

Read more »

In data scientist survey, R is the most-used tool (other than databases)

January 15, 2014
By
In data scientist survey, R is the most-used tool (other than databases)

O'Reilly has just published the results of the Data Scientist Salary Survey, based on data collected from attendees of the O'Reilly Strata conferences in 2012 and 2013. There were some interesting results from the salary portion of the survey: data scientists at early-stage startups earned a median salary of US$130,000 data scientists at public companies earned a higher median...

Read more »

Load PostGIS geometries in R without rgdal

January 14, 2014
By

As I said in my last post, rgdal lacks some of the features of GDAL, including the ability to subset columns and rows the source layer, and I demonstrated a workaround. The workaround relied upon the RPostgreSQL package, and this raises a question: Is it possible to transfer geographic data from PostGIS to R just

Read more »

Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis

January 13, 2014
By

Many articles have been written on why R is better than Excel for data analysis.  In this post, I will summarize the reasons why R is advantageous in most data The post Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis appeared first on Fantasy Football Analytics.

Read more »

Pivoting Data in R Excel-style

January 2, 2014
By
Pivoting Data in R Excel-style

(This article is referring to an initial proof-of-concept version of r-big-pivot) I have to admit that I very much enjoy pivoting through data using Excel. Its pivoting tool is great for getting a quick insight into a data set’s structure … Continue reading → The post Pivoting Data in R Excel-style appeared first on joy...

Read more »

Subsetting in readOGR

December 31, 2013
By

The function readOGR in the rgdal package is used to bring vector spatial data sources into R. readOGR() relies upon OGR (part of the GDAL/OGR library) for format conversion. Unfortunately, while OGR supports the ability to subset columns (with the -select switch) or rows (with the -where switch), or even to request a layer using

Read more »

Blog recap of 2013

December 31, 2013
By

Posts by page views Interview with a forced convert to R from Matlab A first step towards R from spreadsheets Plot ranges of data in R A statistical review of ‘Thinking, Fast and Slow’ by Daniel Kahneman The 3 dots construct in R Translating between R and SQL: the basics An R debugging example R The post Blog...

Read more »

Hadoop for R’s Data scientist

December 29, 2013
By
Hadoop for R’s Data scientist

I don’t exactly know where to start. But, after a real pleasant discussion with one of my ex colleague, it seems that there are many thongs around Hadoop ecosystem and R for analyst that should be said by a data scientist, means that, someone who don’t know much more about big data architecture, but who should know the essentials...

Read more »

Top Songs by Artist on CD102.5 in 2013

December 27, 2013
By
Top Songs by Artist on CD102.5 in 2013

In a previous post, I showed you how to scrape playlist data from Columbus, OH alternative rock station CD102.5. Since it's the end of the year and best-of lists are all the fad, I thought I would share the most popular songs and artists of the year, a...

Read more »

Points, Polygons and Power Outages

December 27, 2013
By
Points, Polygons and Power Outages

Most of my free coding time has been spent tweaking a D3-based live power outage tracker for Central Maine Power customers (there’s also a woefully less-featured Shiny app for it, too). There is some R associated with the D3 vis, but it’s limited to a cron job that’s makes the CSV files for the sparklines

Read more »