Set up RStudio in the cloud to work with GitHub

January 17, 2016
By
Set up RStudio in the cloud to work with GitHub

I love GitHub for version control and collaboration, though I'm no master of it. And the tools for integrating git and GitHub with RStudio are just amazing boons to productivity. Unfortunately, my University-supplied computer does not play well with GitHub. Various directories are locked down, and I can't push or pull to GitHub directly from RStudio. ...

Read more »

Automating R exercises and exams using the exams package

January 17, 2016
By

It's a pain to design statistics exercises each semester, and because students from previous share old exercises with the new incoming students, it's hard to design simple exercises that students haven't already seen the answers to. On top of that, som...

Read more »

Nina Zumel and John Mount part of R Day at Strata + Hadoop World in San Jose 2016

January 17, 2016
By

Nina Zumel and I are honored to have been invited to be part of Strata + Hadoop World in San Jose 2016 R Day organized by RStudio and O’Reilly. We have written a lot on the topic of model validation in R and we are very excited to distill it down to an exciting tutorial. … Continue reading...

Read more »

A simple ANOVA

January 17, 2016
By
A simple ANOVA

I was browsing Davies Design and Analysis of Industrial Experiments (second edition, 1967). Published by for ICI in times when industry did that kind of thing. It is quite an applied book. On page 107 there is an example where the variance of a process is estimated.DataData is from nine batches from which three samples were selected (A, B and...

Read more »

First step on GIS with R

January 17, 2016
By
First step on GIS with R

The PM 2.5 checker written by R has been working nicely for me.  I put a shortcut icon of this small script on my desktop PC, to check the air … Continue reading →

Read more »

Creating Calendars for Future’s Expiration

January 17, 2016
By
Creating Calendars for Future’s Expiration

Lately I have been doing calendar analysis of various markets (future contracts). Not an overly complicated task, but has a few interesting angles and since I haven’t seen anything similar on the Net – here we go. The world of futures is not friendly – pretty much every contract has its own definition for expiration. The post

Read more »

ggtern 2.0 now available

January 16, 2016
By
ggtern 2.0 now available

Recently ggplot2 received a severe makeover by releasing version 2.0, and in the spirit of improvement, I thought ggtern should also get an overhaul, so after a few-hundred hours of code review, here is what has changed: Theme elements: Previously, the nomenclature scheme for the new theme elements was a bit all over the shop, The post

Read more »

Two Y-Axes

January 16, 2016
By
Two Y-Axes

A few days ago, Matt Yglesias shared this tweet from Liz Ann Sonders, Chief Investment Strategist with Charles Schwab, Inc: DailyShot: Here is a comparison of the monetary base with the S&P500 ... Coincidence? pic.twitter.com/QsdNhJdbRP— Liz...

Read more »

NBA Stat in a Shiny App

January 16, 2016
By

IntroductionShiny app is a new way to present data interactively. Unlike methods like D3, shiny performs complex calculation in real time. As a result, shiny app is more powerful and versatile. However, this also means one cannot simply embed an interactive shiny app in an html document. It needs to be hosted on a shiny server. I rented one...

Read more »

S-shaped data: Smoothing with quasibinomial distribution

January 16, 2016
By
S-shaped data: Smoothing with quasibinomial distribution

Figure 1: Synthetic data and fitted curves.S-shaped distributed data can be found in many applications. Such data can be approximated with logistic distribution function .  Cumulative distribution function of logistic distribution function is a...

Read more »

10 D3.js and WebGL weather, maps, and geography charts made in Python and R

January 16, 2016
By
10 D3.js and WebGL weather, maps, and geography charts made in Python and R

Below are 10 charts made in R or Python by Plotly users on weather, maps and geography. 1. Doctor Who? – Timelords weather forecast for different cities Chart Link: https://plot.ly/~wthrmn/30/timelords-weather-forecast-1152016/Chart Author: @ wthrmnTouched up in: Plotly online editorCode: Python code R Code 2. Its raining ! – US Precipitation (June 2015) Chart Link: https://plot.ly/~RPlotBot/334/us-precipitation-06-30-2015-source-noaa/Chart Author:

Read more »

Turkopticon: Defender of Amazon’s Anonymous Workforce

January 16, 2016
By
Turkopticon: Defender of Amazon’s Anonymous Workforce

Labor crowdsourcing is the system by which large crowds or workers contribute to a project allowing for complex and tedious tasks to be rapidly and efficiently completed. The largest labor crowdsourcing platform in the world, Amazon Mechancial TURK (Mt...

Read more »

McElreath’s Statistical Rethinking: A Bayesian Course with Examples in R and Stan

January 15, 2016
By
McElreath’s Statistical Rethinking: A Bayesian Course with Examples in R and Stan

We’re not even halfway through with January, but the new year’s already rung in a new book with lots of Stan content: Richard McElreath (2016) Statistical Rethinking: A Bayesian Course with Examples in R and Stan. Chapman & Hall/CRC Press. This one got a thumbs up from the Stan team members who’ve read it, and The post

Read more »

Using Excel versus using R

January 15, 2016
By

Here is a video I made showing how R should not be considered “scarier” than Excel to analysts. One of the takeaway points: it is easier to email R procedures than Excel procedures. Win-Vector’s John Mount shows a simple analysis both in Excel and in R.

Read more »

Artificial Intelligence: Solving the Chinese Room Argument

January 15, 2016
By
Artificial Intelligence: Solving the Chinese Room Argument

Yesterday, the very best AI (artificial intelligence) had trouble beating a novice human chess player. Today, the very best human player has enormous difficulty beating the best AI. Tomorrow, the very best human player will never beat any AI. However, ...

Read more »

RcppParallel: Getting R and C++ to work (some more) in parallel

January 15, 2016
By
RcppParallel: Getting R and C++ to work (some more) in parallel

(Post by Dirk Eddelbuettel and JJ Allaire) A common theme over the last few decades was that we could afford to simply sit back and let computer (hardware) engineers take care of increases in computing speed thanks to Moore’s law. That same line of thought now frequently points out that we are getting closer and closer

Read more »

Kaggle: Walmart Trip Type Classification

January 15, 2016
By
Kaggle: Walmart Trip Type Classification

Walmart Trip Type Classification was my first real foray into the world of Kaggle and I'm hooked. I previously dabbled in What's Cooking but that was as part of a team and the team didn't work out particularly well. As a learning experience the competition was second to none. My final entry put me at The post

Read more »

Update to plot.qcc using ggplot2 and grid

January 15, 2016
By
Update to plot.qcc using ggplot2 and grid

plot.qcc updated for compatibility with ggplot2 v2.0. Use grid with qcc!

Read more »

In case you missed it: December 2015 roundup

January 15, 2016
By

In case you missed them, here are some articles from December of particular interest to R users. A look back at accomplishments of the R Project and community in 2015. Segmented regression with the "segmented" package, applied to long-distance running records. Creating multi-tab reports in R with knitr and jQuery UI. New version 2.0 update to ggplot2 adds extensibility...

Read more »

Speeding “Bayesian Power Analysis t-test” up with Snowfall

January 15, 2016
By
Speeding “Bayesian Power Analysis t-test” up with Snowfall

This is a direct (though minor) answer to Daniel’s blogpost Power Analysis for default Bayesian t-tests, which I found very interesting, as I have been trying to get my head around Bayesian statistics for quite a while now. However, one thing that bugs me, is the time needed for the simulation. On my machine it

Read more »

Data Science Radar – Communicator Profile

January 15, 2016
By
Data Science Radar – Communicator Profile

                by Andy Nichols @andyofsmeg 1. Tell us a bit about your background in Data Science. In short I am a statistician.  That’s what I qualified anyway.  Following my degree I was employed … Continue reading →

Read more »

ggtree supports phylip tree format

January 14, 2016
By
ggtree supports phylip tree format

Phylip is also a widely used tree format, which contains taxa sequences with Newick tree text. In ggtree, we can use read.phylip() function to parse the file and use ggtree() to visualize the tree. library(ggtree) phyfile <- system.file("extdata", "sample.phy", package="ggtree") phylip <- read.phylip(phyfile) phylip ## 'phylip' S4 object that stored information of ## '/Users/guangchuangyu/Library/R/3.2/library/ggtree/extdata/sample.phy'. ## ## ...@ tree: ## Phylogenetic tree with 15 tips and...

Read more »

New Tutorial Launched: Introduction to Machine Learning

January 14, 2016
By
New Tutorial Launched: Introduction to Machine Learning

In DataCamp’s brand new course, Introduction to Machine Learning with R, you’ll learn all about the most widely used machine learning techniques such as classification, regression and clustering.  Whether you want to build your personal recommendation engine, become better at picking stocks, or develop your own self-driving car, machine learning is essential. It nicely interweaves statistics with computer...

Read more »

Learn R interactively with our new Introduction to R tutorial

January 14, 2016
By

At DataCamp we are obsessed with how we can make our R courses and learning technology better. As a result, we’re continuously innovating and nothing stays the same for very long.Today, we’re proud to announce we’ve made some huge updates to our free Introduction to R tutorial. It’s our most popular course, taken by over 90,000 R...

Read more »

Microsoft Launches Its First Free Online R Course on edX

January 14, 2016
By

Today, Microsoft and DataCamp launched an exciting new course on edX.org covering the basics of the statistical programming language R. This four week course is free for everyone, and no prior knowledge in programming or data science is required.Make sure to watch the course promotion video: What sets this Introduction to R course apart from...

Read more »

DataCamp talks @user2015aalborg

January 14, 2016
By
DataCamp talks @user2015aalborg

The team behind DataCamp is getting ready for its third useR! attendance in as many years. This year we will contribute two talks. One on teaching R in class, and another on our testwhat package. Talk One: Teaching R in (an Online) ClassToday, over 145,000 people have started a course on DataCamp....

Read more »

Case Study: DataCamp, dplyr, and blended learning

January 14, 2016
By
Case Study: DataCamp, dplyr, and blended learning

Editorial Note: This is a guest blog post by Professor Matthew J. Salganik (Princeton University) in which he describes his experiences using the DataCamp interactive learning platform for blended learning. The article was first published on Wheels on the Bus. Want to use DataCamp in your class as well? Contact us via [email protected], dplyr, and blended learningAs I’ve written...

Read more »

Free Kaggle Machine Learning Tutorial for R

January 14, 2016
By

Always wanted to compete in a Kaggle competition, but not sure you have the right skill set? We created a free interactive Machine Learning tutorial to help you out!Together with the team behind Kaggle, we have developed a free interactive tutorial on how to apply Machine Learning Techniques that can be used in your Kaggle...

Read more »

New online R tutorial by DataCamp: Intermediate R programming

January 14, 2016
By

Today the course creation team at DataCamp released a new online R tutorial called Intermediate R. It is the sequel to our infamous Introduction to R tutorial that has been taken by over 60,000 R enthusiasts. This new tutorial combines short videos with in-browser coding exercises to increase your R knowledge even more.Start the new...

Read more »

Sponsors