883 search results for "sql"

Big RAM is eating big data – Size of datasets used for analytics

November 18, 2015

With so much hype about “big data” and the industry pushing for “big data” analytical...

Geocoding 18 million addresses with PostGIS Tiger Geocoder

November 17, 2015

Summary: This post discusses the background, approaches, and Windows and Linux environment setup for my geocoding task. See the next post for more details about the script and workflow.

Analyzing 1.1 Billion NYC Taxi and Uber Trips, with a Vengeance

November 17, 2015

The New York City Taxi & Limousine Commission has released a staggeringly detailed historical dataset covering over 1.1 billion individual taxi trips in the city from January 2009 through June 2015. Taken as a whole, the detailed trip-level data is more than just a vast list of taxi pickup and drop off coordinates: it’s a story of New York....

Interactive Data Science with R in Apache Zeppelin Notebook

November 16, 2015

Introduction: The objective of this blog post is to help you get started with Apache Zeppelin notebook for your R data science requirements. Zeppelin is a web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with Scala (with Apache Spark), Python (with Apache Spark), SparkSQL, Hive, Markdown, Shell and more.…

Using htmlwidgets with knitr and Jekyll

November 15, 2015

A few weeks ago I gave a talk at BARUG (and wrote a post) about blogging with the excellent knitr-jekyll repo. Yihui’s system is fantastic, but it does have one drawback: None of those fancy new htmlwidgets packages seem to work… A few people have run into this. I recently figured out...

In case you missed it: October 2015 roundup

November 13, 2015

In case you missed them, here are some articles from October of particular interest to R users. A video from the PASS 2015 conference in Seattle shows R running within SQL Server 2016. The preview for SQL Server 2016 includes Revolution R Enterprise (as SQL Server R Services). A way of dealing with confounding variables in experiments: instrumental variable...

Let’s meet on SatRdays: the link between RUGs and conferences

November 13, 2015

I am always very happy to attend local R meetups and international R conferences, as these are great opportunities to meet other R users, developers, rock stars and friends from all around the world/GH/SO/Twitter etc., listen to inspiring presentati...

Using MonetDB[Lite] with real-world CSV files

November 11, 2015

MonetDBLite (for R) was announced/released today and, while the examples they provide are compelling, there’s a “gotcha” for potential new folks using SQL in general and SQL + MonetDB + R together. The toy example on the site shows dumping mtcars with dbWriteTable and then doing things. Real-world CSV files have headers and commas (MonetDB

Red Cross Smoke Alarm Project

November 11, 2015

Summary: This is a write-up of my volunteer data science project for the American Red Cross. The project used public data to help the Red Cross direct limited resources to homes that are more vulnerable to fire risk and loss. My work in the project: Discovered the hidden information in the NFIRS dataset, obtained and analyzed 10G NFIRS data. Major contribution on model design,...

Launch Apache Spark on AWS EC2 and Initialize SparkR Using RStudio

November 10, 2015

This post was first published on SparkIQ Labs’ blog and re-posted on my personal blog. Introduction: In this blog post, we shall learn how to launch a Spark standalone cluster on Amazon Web Services (AWS) Elastic Compute Cloud (EC2) for analysis of Big Data. This is a continuation of our previous blog post, which showed us how to
