Part 3a: Plotting with ggplot2

March 4, 2015

We will start off this first section of Part 3 with a brief introduction to the plotting system ggplot2. Then, with the attention focused mainly on the syntax, we will create a few graphs based on the weather data we prepared previously. Next, in Part 3b, where we will be doing actual EDA, specific visualisations...

Color extraction with R

March 4, 2015

Given all the attention the internet has given to the colors of this dress, I thought it would be interesting to look at the capabilities for extracting colors in...

haven 0.1.0

March 4, 2015

I’m pleased to announce that the new haven package is now available on CRAN. Haven makes it easy to read data from SAS, SPSS and Stata. Haven has the...

ASA DataFest 2015 at UCLA

What is DataFest? DataFest is a team competition, where a team of up to 5...

How to Speak Data Science

March 4, 2015

Data Science has its own language. So, if you want to have at least a slight chance of surviving in the enterprise world of tomorrow, with its obsessive focus...

A modular Rmarkdown workbook in action

March 4, 2015

I’m now using a properly modular Rmarkdown workbook, and really quite chuffed with it! I’ve seen discussion of electronic...

Creating an Analytics Ecosystem by integrating ModSpace and RStudio Server Professional

March 4, 2015

By Richard Pugh – Commercial Director, (UK). As the importance of using analytics to drive decision making continues to grow at pace, so too does the need to make...

Extracting the original data from a heatmap image

March 4, 2015

The paper "Analysis of the Linux Kernel Evolution Using Code Clone Coverage" analysed 136 versions of Linux (from 1.0 to 2.6.18.3) and calculated the amount of source code that...

JASP

March 3, 2015

JASP is an interesting project. It is based on R with additional facilities and functions such as automatic table generation. There appears to be an R package that provide...

A Linear Congruential Generator (LCG) in R

March 3, 2015

In my simulation classes, we talk about how to generate random numbers. One of the techniques we talk about is the Linear Congruential Generator (LCG). Starting with a seed,...

Plotly Graphs with Domino’s New R Notebook

March 3, 2015

By Matt Sundquist, co-founder of Plotly. Domino's new R Notebook and Plotly's R API let you code, make interactive R and ggplot2 graphs, and collaborate entirely online. Here is...

Google Summer of Code 2015

March 3, 2015

The R Project has once again been selected as a mentoring organization for this year's Google Summer of Code (GSoC). If you're not familiar with...

Mapping Paris bikes stands

March 3, 2015

A Sharp Sight Labs reader (and now student), Jason P., recently started learning data science. He has a background in data analysis (primarily with Excel and related tools in...

Next Kölner R User Meeting: Friday, 6 March 2015

March 3, 2015

The next Cologne R user group meeting is scheduled for this Friday, 6 March 2015, and we have an exciting agenda with two talks, followed...

Supervised Classification, Logistic and Multinomial

March 2, 2015

In our Data Science course, we will start to discuss classification techniques (in the context of supervised models). Consider the following case, with 10 points, and two classes (red...

scheduleR receives big update

March 2, 2015

For newcomers: scheduleR is a framework to deploy and schedule R tasks, reports and Shiny apps. The tool has an integrated logging and notification system to ease the maintenance...

ComputerWorld’s R for Beginners Hands-On Guide

March 2, 2015

Computerworld's Sharon Machlis has done a great service for the R community, and especially R novices, by creating the on-line Beginner's Guide to R. You can read...

At the APS Observer: a profile of JASP

March 2, 2015

The APS Observer has just published a profile of JASP, a graphical user interface designed to make statistics easier. It includes Bayesian procedures by means of the R and...

Experiments in Time Series Clustering

March 2, 2015

Last night I spotted this tweet about the R package TSclust: "Thank you Pablo and Jose for #TSclust - time series clustering package in #rstats! http://t.co/GBQtQnQ8Lr", from Pasha...

So What Can Text Analysis Do for You?

March 2, 2015

Despite believing we can treat anything we can represent in digital form as “data”, I’m still pretty flakey on understanding what sorts of analysis we can easily do with...

Electric Power System simulations using R

March 2, 2015

This is a guest post by Ben Ubah. The field of electric power systems engineering relies heavily on computer simulations for analysis because of its nature. These computer simulations aid...

Silhouettes

March 2, 2015

Romeo, Juliet, balcony in silhouette, makin o’s with her cigarette, it’s juliet (Flapper Girl, The Lumineers) Two weeks ago I published this post, for which I designed two different visualizations. At the end,...

R Markdown Tutorial by RStudio and DataCamp

March 1, 2015

In collaboration with Garrett Grolemund, RStudio’s teaching specialist, DataCamp has developed a new interactive course to facilitate reproducible reporting of your R analyses. R Markdown enables you to generate...

Using Tables for Statistics on Large Vectors

March 1, 2015

This is the first post I’ve written in a while. I have been somewhat radio silent on social media, but I’m jumping back in. Now, I work with brain...

drat 0.0.2: Improved Support for Lightweight R Repositories

March 1, 2015

A few weeks ago we introduced the drat package. Its name stands for drat R Archive Template, and it helps with easy-to-create and easy-to-use repositories for R packages....

Should I use premium Diesel? Setup

March 1, 2015

Since I drive quite a lot, I have some interest in getting the most km out of every Euro spent on fuel. One thing to change is the fuel. The...

DOSE: an R/Bioconductor package for Disease Ontology Semantic and Enrichment analysis

February 28, 2015

My R/Bioconductor package, DOSE, has been published in Bioinformatics. Summary: Disease ontology (DO) annotates human genes in the context of disease. DO is an important annotation in translating molecular findings from high-throughput data...

Book Review: Mastering Scientific Computing with R

February 28, 2015

The PACKT marketing team contacted me again to review their new book, Mastering Scientific Computing with R. The 432-page book (including covers) consists of 10 chapters which...

One weird trick to compile multipartite dynamic documents with Rmarkdown

February 28, 2015

This afternoon I stumbled across this one weird trick: an undocumented part of the YAML headers that gets processed when you click the ‘knit’ button in...
