Blog Archives

More data scientists prefer R: survey

July 22, 2016
By
More data scientists prefer R: survey

by Joseph Rickert Last week in a webinar, Burtch Works, an Illinois based executive recruiting firm that specializes in finding analytic talent, released the results of their third annual survey of "quantitative business professionals". Other than having attended this webinar, I have no knowledge of Burtch Works, but I am willing to take their survey at face value as...

Read more »

Sustainability through Energy Load Shaping for Buildings using R

July 19, 2016
By
Sustainability through Energy Load Shaping for Buildings using R

by Mike Wise, Data Scientist / Solution Architect – MCS Incubation Services Background Building energy consumption is a huge component of energy consumption. In the USA for example, around 40 percent of the energy consumed is in buildings, and as the USA consumes something like 25 percent of the worlds energy, it is clear we are talking about a...

Read more »

Some New R Packages

July 14, 2016
By
Some New R Packages

by Joseph Rickert New R packages keep rolling into CRAN at a prodigious rate: 184 in May, 195 in June and July looks like it will continue the trend. I spent some time sorting through them and have picked out a few that that are interesting from a data science point of view. ANLP provides functions for building text...

Read more »

useR! 2016 Tutorials: Part 2

July 7, 2016
By
useR! 2016 Tutorials: Part 2

by Joseph Rickert Last week, I mentioned a few of the useR tutorials that I had the opportunity to attend. Here are the links to the slides and code for all but two of the tutorials: Regression Modeling Strategies and the rms Package - Frank Harrell Using Git and GitHub with R, RStudio, and R Markdown - Jennifer Bryan...

Read more »

Using Microsoft R Server and dplyrxdf to Predict Flight Arrival Delays

July 5, 2016
By
Using Microsoft R Server and dplyrxdf  to Predict Flight Arrival Delays

by Konstantin Golyaev, Data Scientist at Microsoft I recently participated in an internal one-day Microsoft R Server (MRS) hackathon. For an experienced base R user but a complete MRS novice, this turned out to be an interesting challenge. R has fantastic and unparalleled set of tools for exploratory data analysis, as long as your data set is small enough...

Read more »

The useR! 2016 Tutorials

June 30, 2016
By

by Joseph Rickert Over the years I have seen several excellent tutorials at useR!conferences that were not only very satisfying "you had to be there" experiences but were also backed up with meticulously prepared materials of lasting value. This year, quite a few useR!20i6 tutorials measure up to this level of quality. My take on why things turned out...

Read more »

R, Stan and Bayesian Statistics

June 23, 2016
By
R, Stan and Bayesian Statistics

by Joseph Rickert Just about two and a half years ago I wrote about some resources for doing Bayesian statistics in R. Motivated by the tutorial Modern Bayesian Tools for Time Series Analysis by Harte and Weylandt that I attended at R/Finance last month, and the upcoming tutorial An Introduction to Bayesian Inference using R Interfaces to Stan that...

Read more »

Exploring Global Internet Performance Data Using R

June 21, 2016
By
Exploring Global Internet Performance Data Using R

by Lourdes O. Montenegro Lourdes O. Montenegro is a PhD candidate at the Lee Kuan Yew School of Public Policy, National University of Singapore. Her research interests cover the intersection of applied data science, technology, economics and public policy. Many of us now find it hard to live without a good quality internet connection. As a result, there is...

Read more »

The R Packages of UseR! 2016

June 16, 2016
By
The R Packages of UseR! 2016

by Joseph Rickert It is always a delight to discover a new and useful R package, and it is especially nice when the discovery comes with at context and testimonial to its effectiveness. It is also satisfying to be able to check in once in awhile and get an idea of what people think is hot, or current or...

Read more »

Using Microsoft R Server on a single machine for experiments with 600 million taxi rides.

June 14, 2016
By
Using Microsoft R Server on a single machine for experiments with 600 million taxi rides.

by Dmitry Pechyoni, Microsoft Data Scientist The New York City taxi dataset is one of the largest publicly available datasets. It has about 1.1 billion taxi rides in New York City. Previously this dataset was explored and visualized in a number of blog posts, where the authors used various technologies (e.g., PostgreSQL and Apache Elastic Search). Moreoever, in a...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)