Exploring World Gender Statistics with Shiny

January 28, 2017
By
Exploring World Gender Statistics with Shiny

This week I explored the World Gender Statistics dataset. You can look at 160 measurements over 56 years with my Shiny app here. I prepared the data as follows: Data.csv Country.Name: the name of the country Country.Code: the country’s co...

Read more »

The Rt of naming your blog

January 28, 2017
By
The Rt of naming your blog

In this post, I’m sharing a brand-new analysis! The reason for this is my blog being added to R-bloggers by Tal Galili after I filled this form. R-bloggers is a collection of blogs about R, whose new posts get added to the website via the magic of RS...

Read more »

Inauguration speeches

January 27, 2017
By
Inauguration speeches

Acquiring inauguration speeches Though not about Greenville especially, it might be interesting to quantitatively analyze inauguration speeches. This analysis will be done using two paradigms: the tm package and the tidytext package. We will read the speeches in such a way that we use the tidytext package; later on we will use some tools from that package to make analyses...

Read more »

The “Ten Simple Rules for Reproducible Computational Research” are easy to reach for R users

The “Ten Simple Rules for Reproducible Computational Research” are easy to reach for R users

“Ten Simple Rules for Reproducible Computational Research” is a freely available paper on PLOS computational biology. As I’m currently very interested on the subject of reproducible data analysis, I will these ten rules and the possible implem...

Read more »

CRAN now has 10,000 R packages. Here’s how to find the ones you need.

January 27, 2017
By
CRAN now has 10,000 R packages. Here’s how to find the ones you need.

CRAN, the global repository of open-source packages that extend the capabiltiies of R, reached a milestone today. There are now more than 10,000 R packages available for download*. (Incidentally, that count doesn't even include all the R packages out there. There are also another 1294 packages for genomic analysis in the BioConductor repository, hundreds of R packages published only...

Read more »

January ’17 Tips and Tricks

January 27, 2017
By

by Sean Lopp This month’s collection of Tips and Tricks comes from an excellent talk given at the 2017 RStudio::Conf in Orlando by RStudio Software Engineer Kevin Ushey. The slides from his talk are embedded below and cover features from autocompletion to R Markdown shortcuts. Use the left and right arrow keys to change slides.

Read more »

Visualization of MRI data in R

January 27, 2017
By
Visualization of MRI data in R

Lately I was getting a little bored with genomic data (and then TCGA2STAT started to give me a segfault on my university’s high performance computing facility too :stuck_out_tongue:). So I decided to analyze some brain imaging data that I had lying a...

Read more »

Visualization of MRI data in R

January 27, 2017
By
Visualization of MRI data in R

Lately I was getting a little bored with genomic data (and then TCGA2STAT started to give me a segfault on my university’s high performance computing facility too :stuck_out_tongue:). So I decided to analyze some brain imaging data that I had lying a...

Read more »

7 Interactive Plots from the Pharmaceutical Industry

January 27, 2017
By

Introduction In a recent blog post we introduced 7 Interactive Bioinformatics Plots Made in Python and R. Here I introduced 7 Interactive Plots from the Pharmaceutical Industry using the plotly R package. These plots are essential for any survival analysis study, where there is interest in time-to-events as often seen in the Pharmaceutical industry. For

Read more »

Simulating from a specified seasonal ARIMA model

January 26, 2017
By

From my email today You use an illustration of a seasonal arima model: ARIMA(1,1,1)(1,1,1)4 I would like to simulate data from this process then fit a model… but I am unable to find any information as to how this can be conducted… if I set phi1, Phi1, theta1, and Theta1 it would be reassuring that

Read more »

Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

January 26, 2017
By
Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

As I wrote in the previous post, I will continue in describing regression methods, which are suitable for double seasonal (or multi-seasonal) time series. In the previous post about Multiple Linear Regression, I showed how to use “simple” OLS regression method to model double seasonal time series of electricity consumption and use it for accurate forecasting. Interactions...

Read more »

Cards on the table

January 26, 2017
By
Cards on the table

After the last post building on feedback from readers, the blog is back to the regular program of recycling old Github repos. Today’s project was waiting for its turn here and will involve a Catan card game. Nearly a year ago, I played Catan with my ...

Read more »

BiclustGUI 1.1.0

January 26, 2017
By
BiclustGUI 1.1.0

BiBitR enters the fray! Ewoud De Troyer, University of Hasselt (CenStat) Introduction The latest patch of the BiclustGUI includes the implementation of BiBitR, a simple R wrapper which directly calls the original Java code for applying the BiBit algo...

Read more »

Kung Fu R

January 26, 2017
By
Kung Fu R

A great way to hone your skills as a data scientist is to pick a topic you're passionate about, find some data related to it, and analyze the heck out of it. Jim Vallandingham is clearly passionate about old Kung Fu movies — particularly those from the Shaw Brothers Studio — and has used R to analyze data the...

Read more »

Image Compression with Principal Component Analysis

January 26, 2017
By
Image Compression with Principal Component Analysis

Image compression with principal component analysis is a frequently occurring application of the dimension reduction technique. Recall from a previous post that employed singular value decomposition to compress an image, that an image is a matrix of pixels represented by RGB color values. Thus, principal component analysis can be used... The post Image Compression with Principal Component Analysis...

Read more »

New Zealand bank replaces SAS server with R Server

January 26, 2017
By

Heartland Bank, a rapidly growing bank in New Zealand, has adopted a data-driven approach to analyzing risk, evaluating credit lines, and understanding cash flows. But they found their legacy SAS system to be labor-intensive and time consuming when it came to updating financial models, and it was expensive to boot. (Being licensed on a per-user basis, it was available...

Read more »

Multiple Regression (Part 2) – Diagnostics

January 26, 2017
By
Multiple Regression (Part 2) – Diagnostics

Multiple Regression is one of the most widely used methods in statistical modelling. However, despite its many benefits, it is oftentimes used without checking the underlying assumptions. This can lead to results which can be misleading or even completely wrong. Therefore, applying diagnostics to detect any strong violations of the assumptions is important. In the

Read more »

Using CPLEX in R: Installing cplexAPI in Windows 10

January 26, 2017
By
Using CPLEX in R: Installing cplexAPI in Windows 10

I have a very large mixed integer problem to solve.   COIN‘s solver accessed through Rsymphony did a decent job, but had trouble finding feasible solutions.  I had been told that the IBM cplex  solver was the best on the market for mixed integer programming. Installing cplex is a relatively easy task.  The difficulties began when I … Continue...

Read more »

Upcoming Win-Vector LLC public speaking engagements

January 26, 2017
By

I am happy to announce a couple of exciting upcoming Win-Vector LLC public speaking engagements. BARUG Meetup Tuesday, Tuesday February 7, 2017 ~7:50pm, Intuit, Building 20, 2600 Marine Way, Mountain View, CA. Win-Vector LLC’s John Mount will be giving a “lightning talk” (15 minutes) on R calling conventions (standard versus non-standard) and showing how to … Continue...

Read more »

Upgrading to macOS Sierra (nee OSX) for R users

January 26, 2017
By
Upgrading to macOS Sierra (nee OSX) for R users

A good fraction of R users use Apple computers. Apple machines historically have sat at a sweet spot of convenience, power, and utility: Convenience: Apple machines are available at retail stores, come with purchasable support, and can run a lot of common commercial software. Power: R packages such as parallel and Rcpp work better on … Continue...

Read more »

Predicting the Willingen 2017 men’s ski jumping competition

January 26, 2017
By
Predicting the Willingen 2017 men’s ski jumping competition

In an earlier post of mine, I carried out an analysis on ski jumping data for Zakopane, Poland and attempted to predict which athletes would end up on the podium. I also created a classification tree and tested it on the 2017 competition data with good results. For this side project of mine, I hope… Continue reading...

Read more »

One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)

January 26, 2017
By
One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)

Dear Leader has made good on his campaign promise to “crack down” on immigration from “dangerous” countries. I wanted to both see one side of the impact of that decree — how many potential immigrants per year might this be impacting — and show toss up some code that shows how to free data from... Continue reading...

Read more »

Metro Systems Over Time: Part 2

January 26, 2017
By
Metro Systems Over Time: Part 2

Note, at the time of this writing using the packages ggplot2 and ggmap from CRAN will result in an error. To avoid the error be sure to install both packages from GitHub with the package devtools and restart R if the problem persists. devtools::install_github("dkahle/ggmap") devtools::install_github("hadley/ggplot2") Introduction In Part 1 of this series we collected geodata Related Post

Read more »

digest 0.6.12

January 26, 2017
By

A new release, now at version 0.6.12, of the digest package is now on CRAN and in Debian. The digest creates hash digests of arbitrary R objects (using the 'md5', 'sha-1', 'sha-256', 'crc32', 'xxhash' and 'murmurhash' algorithms) permitting easy comp...

Read more »

What is new in the intrval R package?

January 26, 2017
By

An update (v 0.1-1) of the intrval package was recently published on CRAN. The package simplifies interval related logical operations (read more about the motivation in this post). So what is new in this version? Some of the inconsistencies in the 1st CRAN release have been cleaned up, and I have been pushed hard (see GitHub issue to implement all...

Read more »

More water, a bit more about saints

January 25, 2017
By
More water, a bit more about saints

I was lucky enough to get some nice and interesting feedback on my last post. One comment was really useful and pretty embarrassing: I had written “see” instead of “sea” in the whole post… Thanks Steve Dempsey for the correction! I also got s...

Read more »

R Markdown for the Enterprise

January 25, 2017
By
R Markdown for the Enterprise

by Edgar Ruiz In the corporate world, spreadsheets and PowerPoint presentations still dominate as the tools used for analyzing and sharing information. So, it is not at all surprising that even when business analysts use R for the analytical heavy lifting, they frequently revert to using spreadsheets and slide decks to share their results. This

Read more »

Data Driven Cheatsheets

January 25, 2017
By
Data Driven Cheatsheets

Guest post by Jonathan Sidi Cheatsheets are currently built and used exclusivley as a teaching tool. We want to try and change this and produce a cheat sheet that gives a roadmap to build a known product, but also is built as a function so users can input data into it to make the cheatsheet … Continue...

Read more »

DataChats: An interview with Mine Çetinkaya-Rundel

January 25, 2017
By
DataChats: An interview with Mine Çetinkaya-Rundel

Hey R fans! We've just released a new episode of our DataChats video series. In this episode, we interview Mine Çetinkaya-Rundel, Ph.D.. Mine is Director of Undergraduate Studies and an Associate Professor of the Practice in the Department of St...

Read more »

Sponsors

Mango solutions











Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.