Analysis of International T20 matches with yorkr templates

February 25, 2017

Introduction: In this post I create yorkr templates for International T20 matches that are available on Cricsheet. With these templates you can convert all T20 data, which is in YAML format, to R data frames. Further, I create data and the necessary templates for analysis. All of these templates can be accessed from GitHub at yorkrT20Template. The …

Logistic Regression Regularized with Optimization

February 25, 2017

Logistic regression predicts the probability of the outcome being true. In this exercise, we will implement a logistic regression and apply it to two different data sets. The file...

Success rates of appeals to the Supreme Court by Circuit

February 25, 2017

In the chaos of the last month or so of United States of America governance, one item that grabbed my attention was the claim by President Trump that 80%...

Gender gap in Swedish mortality

February 24, 2017

Why Sweden?

Prophet: How Facebook operationalizes time series forecasting at scale

February 24, 2017

Facebook is a famously data-driven organization, and an important goal in any data science activity is forecasting. Now, Facebook has released Prophet, an open-source package for R and Python...

H-1B Visa Petitions Exploratory Data Analysis

February 24, 2017

Contributed by Sharan Naribole. He is currently undertaking the part-time online bootcamp organized by NYC Data Science Academy (Dec 2016 - April 2017). This blog is based...

More January Package Picks

February 24, 2017

by Joseph Rickert. In a recent post, I highlighted several new packages that arrived on CRAN in January that provided R users with access to data. In this post,...

When Size Matters: Weighted Effect Coding

February 24, 2017

Categorical variables in regression models are often included by dummy variables. In R, this is done with factor variables with treatment coding. Typically, the difference and significance of each...

Make Power Fun (Again?)

February 24, 2017

Make Power Fun (Again?), by Brandon LeBeau, University of Iowa. Overview: (G)LMMs; Power; simglm package; Shiny Demo - Broken! Linear Mixed Model (LMM) Power: Power is the ability...

a riddle at the end of its tether

February 23, 2017

A simply worded riddle this week on The Riddler, about four ropes having non-uniform and unknown burning rates, the only constraint being they all burn completely in one hour....

Factor Analysis with the Principal Factor Method and R

February 23, 2017

As discussed in a previous post on the principal component method of factor analysis, the $\hat{\Psi}$ term in the estimated covariance matrix $S$, $S = \hat{\Lambda}\hat{\Lambda}' + \hat{\Psi}$, was excluded and we proceeded directly...

Preview: R Tools for Visual Studio 1.0

February 23, 2017

After more than a year in preview, R Tools for Visual Studio, the open-source extension to the Visual Studio IDE for R programming, is nearing its official release. RTVS...

On Watering Holes, Trust, Defensible Systems and Data Science Community Security

February 23, 2017

I’ve been threatening to do a series on “data science community security” for a while and had cause to issue this inaugural post today. It all started with this:...

Gartner’s 2017 Take on Data Science Software

February 23, 2017

In my ongoing quest to track The Popularity of Data Analysis Software, I’ve finally decided to change the title to use the newer term “data science”. The 2017 version...

Reporting in a Repeatable, Parameterised, Transparent Way

February 23, 2017

Earlier this week, I spent a day chatting to folk from the House of Commons Library as a part of a bit of temporary day-a-week-or-so bit of work I’m...

Mapping Biodiversity data on smaller than one degree scale

February 22, 2017

Guest post by Enjie (Jane) LI. I have been using the bdvis package (version 0.2.9) to visualize the iNaturalist records of the RAScals project (http://www.inaturalist.org/projects/rascals). Initially, the mapgrid function in the bdvis...

Announcing ggraph: A grammar of graphics for relational data

February 22, 2017

I am absolutely thrilled to announce that ggraph has finally been released on CRAN. ggraph is my most ambitious package to date and its very early genesis has been described...

Euler Problem 13: Large Sum of 1000 Digits

February 22, 2017

Euler Problem 13 asks to add one hundred numbers with fifty digits. This seems like a simple problem, were it not that most computers are not designed to deal with...

The difference between R and Excel

February 22, 2017

If you're an Excel user (or any other spreadsheet, really), adapting to learn R can be hard. As this blog post by Gordon Shotwell explains, one of the reasons...

#AskNASA: What’s the Optimal Time for Aliens to Invade Earth?

February 22, 2017

This post was originally published on SmartCat, 22 Feb 2017. My inaugural blog as a Data Science Consultant for SmartCat. The code that accompanies the analyses presented here...

Finding Radiohead’s most depressing song, with R

February 22, 2017

Radiohead is known for having some fairly maudlin songs, but of all of their tracks, which is the most depressing? Data scientist and R enthusiast Charlie Thompson ranked all...

How to Teach R: Common mistakes

February 22, 2017

by Garrett Grolemund. Would you like to teach people to use R? If so, I would like to jump-start your efforts. I’m one half of RStudio’s education team, and...

Quick tip: knitr Python Windows setup checklist

February 22, 2017

One of the nifty things about using R is that you can use it for many different purposes and even other languages! If you want to use Python in...

leaflet 1.1.0

February 22, 2017

Leaflet 1.1.0 is now available on CRAN! The Leaflet package is a tidy wrapper for the Leaflet.js mapping library, and makes it incredibly easy to generate interactive maps based on...

Data Transformation in R: The #Tidyverse-Approach of Organizing Data #rstats

February 22, 2017

Yesterday, I had the pleasure to give a talk at the 8th Hamburg R User-Group meeting. I talked about data wrangling and data transformation, and how the…

Part 3: Spatial analysis of geotagged data

February 21, 2017

See the other parts in this series of blog posts. In parts 1 and 2 we extracted spatial coordinates from our photos and then made an...

Raccoon | Ch 2.5 – Unbalanced and Nested Anova

February 21, 2017

Raccoon is a free web-book about Statistical Models with R. This chapter tackles two Anova special cases: Unbalanced Anova and Nested Anova.

The Zero Bug

February 21, 2017

I am going to write about an insidious statistical, data analysis, and presentation fallacy I call “the zero bug” and the habits you need to cultivate to avoid it....

Free DataCamp for your Classroom

February 21, 2017

Announcing: DataCamp for the classroom, a new free plan for Academics. We want to support every student that wants to learn Data Science. That is why, as of today, professors/teachers/TA’s/…...
