670 search results for "SQL"

Caching API calls offline

February 2, 2014

I recently came across the idea of "offline first", especially via Hood.ie. We of course don't do web development, but primarily build R interfaces to data on the web. Internet availability is increasingly ubiquitous, but there are still times and places where you don't have internet but need to get work done. In the R packages we write there...

Read more »

“Show me the way to the next whiskey bar” (The Doors – Alabama Song) – Interactive Location Recommendation using Tableau

February 2, 2014

Since I started using Tableau I’ve been quite fascinated by the capabilities of this piece of software. Before Christmas I was looking at how I could build an interactive visualization that helps me explore the relationships between different objects, in a form that shows which objects are very close to each other according to some similarity measure, or vice versa....

Read more »

Introducing the ecoengine package

January 30, 2014

Natural history museums have long been valuable repositories of data on species diversity. These data have been critical for fostering and shaping the development of fields such as biogeography and systematics. These data repositories are becoming increasingly important, especially in the context of climate change, where a strong understanding of how species responded to past...

Read more »

Data corruption in R 3.0.2 when using read.csv

January 29, 2014

Introduction: It may be old news to some, but I just recently discovered that the automatic type inference system that R uses when parsing CSV files assumes that data sets will never contain 64-bit integer values. Specifically, if an integer value read from a CSV file is too large to fit in a 32-bit integer

Read more »

BLATting the internet: the most frequent gene?

January 23, 2014

I enjoyed this story from the OpenHelix blog today, describing a Microsoft Research project to mine DNA sequences from web pages and map them to UCSC genome builds. Laura DeMare asks: what was the most-hit gene? "Most hit gene? APOE? MT @GenomeBrowser We BLATed the Internet! DNA sequences from 40 billion webpages mapped to hg19"

Read more »

Database Reflection using dplyr

January 22, 2014

At work I write a ton of SQL, and I do most of my querying using R. The workflow goes: (1) create a string with the SQL in R; (2) plug the string into fetchQuery (see my previous post). This solution works relatively well, but I’m a bit unhappy writing strings rather than using function calls. I

Read more »

Introducing dplyr

January 20, 2014

dplyr is a new package which provides a set of tools for efficiently manipulating datasets in R. dplyr is the next iteration of plyr, focussing on only data frames. dplyr is faster, has a more consistent API and should be easier to use. There are three key ideas that underlie dplyr: Your time is important,

Read more »

In data scientist survey, R is the most-used tool (other than databases)

January 15, 2014

O'Reilly has just published the results of the Data Scientist Salary Survey, based on data collected from attendees of the O'Reilly Strata conferences in 2012 and 2013. There were some interesting results from the salary portion of the survey: data scientists at early-stage startups earned a median salary of US$130,000; data scientists at public companies earned a higher median...

Read more »

Load PostGIS geometries in R without rgdal

January 14, 2014

As I said in my last post, rgdal lacks some of the features of GDAL, including the ability to subset columns and rows of the source layer, and I demonstrated a workaround. The workaround relied upon the RPostgreSQL package, and this raises a question: Is it possible to transfer geographic data from PostGIS to R just

Read more »

Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis

January 13, 2014

Many articles have been written on why R is better than Excel for data analysis. In this post, I will summarize the reasons why R is advantageous in most data... The post Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis appeared first on Fantasy Football Analytics.

Read more »