When to Use Stacked Barcharts?

October 11, 2014
By
When to Use Stacked Barcharts?

Yesterday a few of us on Facebook’s Data Science Team released a blogpost showing how candidates are campaigning on Facebook in the 2014 U.S. midterm elections. It was picked up in the Washington Post, in which Reid Wilson calls us “data … Continue reading →

Read more »

2014 Metabolomic Data Analysis and Visualization Workshop and Tutorials

October 11, 2014
By
2014 Metabolomic Data Analysis and Visualization Workshop and Tutorials

Recently I had the pleasure of teaching statistical and multivariate data analysis and visualization at the annual Summer Sessions in Metabolomics 2014, organized by the NIH West Coast Metabolomics Center. Similar to last year, I’ve posted all the content (lectures, labs and software) for any one to follow along with at their own pace. I also

Read more »

RGLUEANN package available on GitHub

October 11, 2014
By
RGLUEANN package available on GitHub

The RGLUEANN package is now available on GitHub at http://github.com/rogiersbart/RGLUEANN. The package provides an R implementation of the coupling between general likelihood uncertainty estimation (GLUE) and artificial neural networks (ANNs)...

Read more »

My visits and RDataMining talks in North America

October 11, 2014
By
My visits and RDataMining talks in North America

  I visited Mexico recently and is now travelling in US. In the 1st week of October, I delivered a keynote talk at the CONAIS 2014 conference in Mexico, as well as a one-day workshop on data mining with R … Continue reading →

Read more »

DataFrame manipulation in R from basics to dplyr

October 11, 2014
By
DataFrame manipulation in R from basics to dplyr

  In my surroundings at work I see quite a few people managing their data in spreadsheet software like Excel or Calc, these software will do the work but I usually tend to do as little data manipulation in them as possible and to turn as soon as possible my spreadsheets into csv files and

Read more »

RPushbullet 0.1.0 with a lot more awesome

October 10, 2014
By

A new release 0.1.0 of the RPushbullet package (interfacing the neat Pushbullet service) landed on CRAN today. It brings a number of goodies relative to the first release 0.0.2 of a few months ago: pushing of files is now supported thanks to a nice ...

Read more »

14 Reasons Why R is better than Excel

October 10, 2014
By

The Fantasy Football Analytics blog shares these 14 reasons why R is better than Excel for data analysis: More powerful data manipulation capabilities Easier automation Faster computation It reads any type of data Easier project organization It supports larger data sets Reproducibility (important for detecting errors) Easier to find and fix errors It's free It's open source Advanced Statistics...

Read more »

SVG + Javascript Ekholm Decomposition in RStudio Browser

October 10, 2014
By

Our topics this week seem unrelated, but in an effort to bridge the two another random project – make website in R for these SVGs of Portland Vector Bridges result: Portland Bridges in SVGcode: R to make simple siteEkholm decomposition SelectionShare & TimingShare | Masterfully Written by Delightfully Responsive Author Popular Mutual Funds Decomposed With Ekholm (2014) Responsive...

Read more »

“R for Developers” free web-book: leave your opinion

October 10, 2014
By
“R for Developers” free web-book: leave your opinion

A week ago, my boss at Quantide, Andrea Spanò, publishe

Read more »

Waterfall and 3D plotting exploration

October 9, 2014
By
Waterfall and 3D plotting exploration

Taking the very 'waterfall graph' code posted by Robert Grant I have added some features (resistance Overall I find the graphs produced from this code to be beautiful and fascinating though I am not sure if I would really use them as a form of data exp...

Read more »

SVG + a little extra (d3.js) in RStudio Browser | No Pipes This Time

October 9, 2014
By

I’m guessing here, but yesterday’s post Responsive SVG in Your RStudio Browser might have inspired some “but,…)”s, “yes plus I need”s, “what the %>>% with the pipe”s, etc.  I’ll attempt to address a couple of these in this quick post. First, if you don’t like pipes, here is the non-piped version of the code.  I also made one change,...

Read more »

A Note on Tweedie

October 9, 2014
By
A Note on Tweedie

by Joseph Rickert In a recent post I talked about the information that can be developed by fitting a Tweedie GLM to a 143 million record version of the airlines data set. Since I started working with them about a year or so ago, I now see Tweedie models everywhere. Basically, any time I come across a histogram that...

Read more »

In case you missed it: September 2014 Roundup

October 8, 2014
By

In case you missed them, here are some articles from September of particular interest to R users. Norm Matloff argues that T-tests shouldn't be part of the Statistics curriculum and questions the "star system" for p-values in R. A nice video introduction to the dplyr package and the %>% operator, presented by Kevin Markham. An animation of police militarization...

Read more »

Data analysis the data.table way: introducing DataCamp’s newest course

October 8, 2014
By
Data analysis the data.table way: introducing DataCamp’s newest course

Together with the key people behind the data.table package, Matt Dowle and Arun Srinivasan,  DataCamp developed a brand new interactive course to bring your data analysis skillset up to date with the essentials of the powerful data.table package. Learn more…  The popularity of the data.table package is increasing and with good reason. Not only is the number

Read more »

Structural “Arbitrage”: a Working Long-History Backtest

October 8, 2014
By
Structural “Arbitrage”: a Working Long-History Backtest

For this post, I would like to give my sincere thanks to Mr. Helmuth Vollmeier, for providing the long history … Continue reading →

Read more »

Responsive SVG in Your RStudio Browser

October 8, 2014
By

For those readers who are unaware, SVG is absolutely amazing, and if you need some convincing see this 2009 paper/talk from David Dailey Why is SVG Going to Be REALLY BIG?  Most R users should be very well acquainted with graphics and plots magically ...

Read more »

Slice bivariate densities, or the Joy Division “waterfall plot”

October 8, 2014
By
Slice bivariate densities, or the Joy Division “waterfall plot”

This has been on my to-do list for a long old time. Lining up slices through a bivariate density seems a much more intuitive way of depicting it than contour plots or some ghastly rotating 3-D thing (urgh). Of course, … Continue reading →

Read more »

Julia style string literal interpolation in R

October 8, 2014
By
Julia style string literal interpolation in R

I feel like a sculptor who has been using the same metal tools for the last four years and happened to have looked at my comrades and found them sporting new, sleek electric tools. Suddenly all of the hard work put into maintaining and adapting my meta...

Read more »

Plot Me Like a Hurricane (a.k.a. animating historical North Atlantic basin tropical storm tracks)

October 7, 2014
By

Markus Gessman (@MarkusGesmann) did a beautiful job Visualising the seasonality of Atlantic windstorms using small multiples, which was inspired by both a post by Arthur Charpentier (@freakonometrics) on using Markov spatial processes to “generate” hurricanes—which was tweaked a bit by Robert Grant (@robertstats)—and Gaston Sanchez‘s Visualizing Hurricane Trajectories RPub. I have some history with hurricane

Read more »

Fitting Lasso with Julia

October 7, 2014
By
Fitting Lasso with Julia

Julia Code R Code

Read more »

Predicting Monthly Car Sales: The Residuals are the Story

October 7, 2014
By
Predicting Monthly Car Sales: The Residuals are the Story

I'll produce predictions for US car sales by manufacture every month. There are already several blogs that describe the industry and sales that do a great job. Autoblog by the Numbers and Counting Cars are some to mention. Unli...

Read more »

Efficiently Adding Gabelhouse Lengths and Relative Weights to a data.frame (using dplyr)

October 7, 2014
By
Efficiently Adding Gabelhouse Lengths and Relative Weights to a data.frame (using dplyr)

In this post on RPubs, I demonstrate how to use new functions (psdAdd() and wrAdd()) in the FSA package, along with functions in the dplyr package, to efficiently add Gabelhouse length category and relative weight variables for all species in … Continue reading →

Read more »

The Generalized Lambda Distribution and GLDEX Package: Fitting Financial Return Data

October 7, 2014
By
The Generalized Lambda Distribution and GLDEX Package: Fitting Financial Return Data

by Daniel Hanson, with contributions by Steve Su (author of the GLDEX package). Part 1 of a series. Introduction As most readers are well aware, market return data tends to have heavier tails than that which can be captured by a normal distribution; furthermore, skewness will not be captured either. For this reason, a four parameter distribution such as...

Read more »

Lot of reports with a single click!

October 7, 2014
By
Lot of reports with a single click!

Suppose you want to create a huge number of pdf files t

Read more »

Part 2 of Who We Are: Society for Judgment and Decision Making (SJDM)

October 7, 2014
By
Part 2 of Who We Are: Society for Judgment and Decision Making (SJDM)

An analysis of where the SJDM members are from in the world. The post Part 2 of Who We Are: Society for Judgment and Decision Making (SJDM) appeared first on Decision Science News.

Read more »

randomness in coin tosses and last digits of prime numbers

October 7, 2014
By
randomness in coin tosses and last digits of prime numbers

A rather intriguing note that was arXived last week: it is essentially one page long and it compares the power law of the frequency range for the Bernoulli experiment with the power law of the frequency range for the distribution of the last digits of the first 10,000 prime numbers to conclude that the power

Read more »

Visualising the seasonality of Atlantic windstorms

October 7, 2014
By
Visualising the seasonality of  Atlantic windstorms

Last week Arthur Charpentier sketched out a Markov spatial process to generate hurricane trajectories. Here, I would like to take another look at the data Arthur used, but focus on its time component. According to the Insurance Information Institute, a normal season, based on averages from 1980 to 2010, has 12 named storms, six hurricanes and...

Read more »

Popular Mutual Funds Decomposed With Ekholm (2014)

October 6, 2014
By

While we have a foundation and momentum from the last post “SelectionShare & TimingShare | Masterfully Written by Delightfully Responsive Author” , we can run the Ekholm calculations on some popular funds to see how they have evolved since the early 1980s.  Remember these are my opinions and not investment advice.  I chose these four funds for ...

Read more »

The World We Live In #1: Obesity And Cells

October 6, 2014
By
The World We Live In #1: Obesity And Cells

Lesson learned, and the wheels keep turning (The Killers – The world we live in) I discovered this site with a huge amount of data waiting to be analyzed. The first thing I’ve done is this simple graph, where you can see relationship between cellular subscribers and obese people. Bubbles are countries and its size

Read more »