August 2015

Normality tests for continuous data

August 21, 2015 | 0 Comments

We use normality tests when we want to understand whether a given sample set of continuous (variable) data could have come from the Gaussian distribution (also called the normal distribution). Normality tests are a form of hypothesis test, which is used to make an inference about the population from which ... [Read more...]

Doh! I Could Have Had Just Used V8!

August 21, 2015 | 0 Comments

An R user recently had the need to split a “full, human name” into component parts to retrieve first & last names. The full names could be anything from something simple like “David Regan” to more complex & diverse such as “John Smith Jr.”, “Izaque Iuzuru Nagata” or “Christian Schmit de la ... [Read more...]

functional enrichment analysis with NGS data

August 20, 2015 | 0 Comments

I found that there is a Bioconductor package, seq2pathway, that can apply functional analysis to NGS data. It consists of two components, seq2gene and gene2pathway. seq2gene converts genomic coordinates to genes while gene2pathway performs functional analysis at gene level.
[Read more...]

Is Bayesian A/B Testing Immune to Peeking? Not Exactly

August 20, 2015 | 0 Comments

Since I joined Stack Exchange as a Data Scientist in June, one of my first projects has been reconsidering the A/B testing system used to evaluate new features and changes to the site. Our current approach relies on computing a p-value to measure our confidence in a new feature. ... [Read more...]

Deploying a car price model using R and AzureML

August 20, 2015 | 0 Comments

Recently Microsoft released the AzureML R package, which allows R users to publish their R models (or any R function) as a web service on the Microsoft Azure Machine Learning platform. Of course, I wanted to test the new package, …
[Read more...]

Data Manipulation with dplyr

August 20, 2015 | 0 Comments

dplyr is a package for data manipulation, written and maintained by Hadley Wickham. It provides some great, easy-to-use functions that are very handy when performing exploratory data analysis and manipulation. Here, I will provide a basic overview of some of the most useful functions contained in the package. For this ...
[Read more...]

Knitting patterns categories

August 20, 2015 | 0 Comments

Let’s have a look at the distribution of Ravelry patterns among the various categories. The categories have a tree structure. The number of patterns in each category is not accessible via the API, but it is visible on the website’s main page. The code below is not the most ...
[Read more...]

Track Hurricane Danny (Interactively) with R + leaflet

August 20, 2015 | 0 Comments

Danny became the first hurricane of the 2015 Season, so it’s a good time to revisit how one might be able to track them with R. We’ll pull track data from Unisys and just look at Danny, but it should be easy to extrapolate from the code. For this ...
[Read more...]

5 New R Packages for Data Scientists

August 20, 2015 | 0 Comments

by Joseph Rickert

One great beauty of the R ecosystem, and perhaps the primary reason for R’s phenomenal growth, is the system for contributing new packages. This, coupled with the rock solid stability of CRAN, R’s primary package repository, gives R a great advantage. However, anyone with enough ... [Read more...]

How to Remove State Abbreviations from a Choroplethr Map

August 20, 2015 | 0 Comments

Today I will demonstrate how to remove state abbreviations from a choroplethr map. I will also demonstrate how to combine multiple maps into a single image. This is what the final image looks like: In this case the abbreviations obscure information for the northeast part of the country, so it’...
[Read more...]

Visualizing Phoenix data using Shiny

August 20, 2015 | 0 Comments

A while back, I wrote a blog post about event data and specifically EL:DIABLO. While EL:DIABLO provides the infrastructure for you to get and process event data yourself, you can also download the daily updates that are generated by the authors’ own ... [Read more...]

Open Source Data Science at SXSW

August 20, 2015 | 0 Comments

I have pitched a panel for next year’s SXSW in Austin, Texas, along with Karthik Ram from rOpenSci. The panel is called “Open Source Data Science”. SXSW has a panel picker process where you submit ideas and then other people can vote for them. The popular ones get selected. So ... [Read more...]

Kickin’ it with elastic net regression

August 19, 2015 | 0 Comments

With the kind of data that I usually work with, overfitting regression models can be a huge problem if I'm not careful. Ridge regression is a really effective technique for thwarting overfitting. It does this by penalizing the L2 norm… [Read more...]

The World We Live In #5: Calories And Kilograms

August 19, 2015 | 0 Comments

I enjoy doing new tunes; it gives me a little bit to perk up, to pay a little bit more attention (Earl Scruggs, American musician)

I recently finished reading The Signal and the Noise, a book by Nate Silver, creator of the also famous FiveThirtyEight blog. The book is a ...
[Read more...]

Spaghetti plots with ggplot2 and ggvis

August 19, 2015 | 0 Comments

This post was motivated by this article that discusses the graphics and statistical analysis for a two treatment, two period, two sequence (2x2x2) crossover drug interaction study of a new drug versus the standard. I wanted to write about implementing those graphics and the statistical analysis in R. This ...
[Read more...]

Good advice for security with R

August 19, 2015 | 0 Comments

Everyone needs to be vigilant about security on the Web today. One particular threat — the man-in-the-middle attack — is a risk any time you are communicating over the Internet and an attacker has access to the network between the two endpoints. This is a possibility whenever you are using the Web over ... [Read more...]

Searching Twitter with ArcGIS Pro Using R

August 18, 2015 | 0 Comments

I committed to testing this a long time ago; however, a number of other projects intervened, so I have only just got around to writing up this short tutorial. One of the exciting things from the ESRI Developers Conference this year was the launch of the R-ArcGIS bridge. In simple ... [Read more...]