925 search results for "SQL"

R in Business Intelligence

January 18, 2015

Business Intelligence (BI) can be simply described as extracting useful information from data. This is quite a broad process, as the structure (and quality) of the source data can vary, as can the structure of the useful information. More technically, this transformation can be described as ETL (extract, transform, load), plus presentation of the useful information. The...

Read more »

How to Predict Where Will Next Disaster Strike?

January 18, 2015

It is an amusing coincidence that another MOOC that I took this week (Geospatial Intelligence & the Geospatial revolution) mentioned disasters. About the other course, see my recent Disasters: Myth or the Reality post. In Geospatial Intelligence they gave a weird assignment: one needs to mark the location on the world map where...

Read more »

REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

January 16, 2015

When I was looking for Data Science certificate programs back in 2013, there were only a few available and most had only graduated one or two cohorts. Even worse, I could not find a single review for any of them. So, this is my review of the University of Washington Data Science certificate. Background: ...

Read more »

The echo of a tragedy in Social Media – “The Making of”

January 14, 2015

In this post I’m going to describe how I pulled together the visualization of The echo of a tragedy in Social Media. I used 3 different technologies to build it: PHP for data...

Read more »

Using the microbenchmark package to compare the execution time of R expressions

January 14, 2015

I recently learned about the microbenchmark package while browsing through Hadley’s advanced R programming book. I’ve done some quick benchmarking using system.time() in a for loop and taking the average, but the microbenchmark function in the microbenchmark package makes this much easier. Hadley gives the example of taking the square root of a vector using the built-in...

Read more »

Sessionizing Log Data Using dplyr [Follow-up]

January 13, 2015

Last week, I wrote a blog post showing how to sessionize log data using standard SQL. The main idea of that post is that if your analytics platform supports window functions (like Postgres and Hive do), you can make quick work out of sessionizing logs. Here’s the winning query: One nested sub-query and two window functions are

Read more »

Calculating Churn in Seasonal Leagues

January 9, 2015

One of the things I wanted to explore in the production of the Wrangling F1 Data With R book was the extent to which I could draw on published academic papers for inspiration in exploring the various results and timing datasets. In a chapter published earlier this week, I explored the notion of churn,

Read more »

dplyr 0.4.0

January 9, 2015

I’m very pleased to announce that dplyr 0.4.0 is now available from CRAN. Get the latest version by running: install.packages("dplyr") dplyr 0.4.0 includes over 80 minor improvements and bug fixes, which are described in detail in the release notes. Here I wanted to draw your attention to two areas that have particularly improved since dplyr

Read more »

Top 77 R posts for 2014 (+R jobs)

January 7, 2015

R-bloggers.com is 5 years old this month! In celebration, this post shares links to the top 77 most read R posts of 2014 (+ stats on R-bloggers, + top R jobs for the beginning of 2015)

Read more »

New York Times Article Search API to MongoDB

January 5, 2015

Contents: Motivation, Accessing NYT API, Extracting and parsing the article body text, Writing to MongoDB, Pipeline, Results.

Motivation: I’ve learned a little about a lot of different corners of the text mining and NLP world over the last few years… which sometimes makes me feel like I know nothing for certain....

Read more »
