Monthly Archives: October 2014

Exploration of Letter Make Up of English Words

October 31, 2014
By

This blog post is a quick exploration of the grapheme makeup of words in the English language. Specifically, we will use R and the qdap package to answer 3 questions: What is the distribution of word lengths (number of … Continue reading →

Read more »

Combining FAA and Stepwise Correlation

October 31, 2014
By

Since I debuted the stepwise correlation algorithm, I suppose the punchline that people want to see is: does it actually … Continue reading →

Read more »

Is data mining more about fitting data well? – Exercise Results

October 31, 2014
By

Today, I am going to share the results of an exercise that I carried out recently for a start-up. The intention of the study was to extract the major attributes that generally drive less experienced, inexperienced, or re-skilled data miners towards the given objective, and to understand where they are falling back. The twist here is that the majority of them have given the same...

Read more »

R 3.1.2 released with over 6,000 CRAN packages available

October 31, 2014
By

The latest update to the world's most popular statistical data analysis software is now available. R 3.1.2 (codename: "Pumpkin Helmet") makes a number of minor improvements and bug fixes to the R language engine. You can see the complete list of changes here, which include improvements for the log-Normal distribution function, improved axis controls for histograms, a fix to...

Read more »

Our Latest Round: Looking to our Past and Future with Containers

October 31, 2014
By

Several months ago, I was looking for a new opportunity, one that would be part of a fundamental shift in cloud technology. I could see that Docker had elevated containers from an operational detail to a hot new developer tool, but what did this mean for the future of infrastructure? It didn't take…

Read more »

New dplyr course by RStudio and DataCamp! Learn data manipulation interactively

October 31, 2014
By

DataCamp just launched its latest interactive course: dplyr. This new course was developed in close collaboration with Garrett Grolemund, RStudio’s master instructor. In this course, you will be challenged one step at a time to master the essentials of transforming data sets quickly and intuitively with the dplyr package. Start the course here. The

Read more »

Sequence of shopping carts analysis with R – Sankey diagram

October 31, 2014
By

In the previous post, we studied how to visualize the structure of a shopping cart. Although you can find a great deal of material on how to analyze combinations of products in the shopping cart (e.g. via association rules), there is a lack of sources on how to analyze the sequences of shopping carts....

Read more »

Wrangling F1 Data With R – F1DataJunkie Book

October 30, 2014
By

Earlier this year I started trying to pull some of my #f1datajunkie R-related ramblings together in book form. The project stalled, but to try to reboot it I’ve started publishing it as a living book over on Leanpub. Several of the chapters are incomplete – with TO DO items sketched in, others are

Read more »

pander 0.5.0: the next generation of markdown tables in R

October 30, 2014
By

A new version of pander was just released on CRAN with 200+ commits of new features, major performance updates and some minor fixes. One of the minor technical changes, which might be major good news for knitr users, is that there is no longer any need to specify asis in knitr chunks when calling pander – please...

Read more »

My Commonly Done ggplot2 graphs

October 30, 2014
By

In my last post, I discussed how ggplot2 is not always the answer to the question “How should I plot this?” and that base graphics were still very useful. Why do I use ggplot2, then? The overall question still remains: why (do I) use ggplot2? ggplot2 vs lattice: For one, ggplot2 replaced the lattice package

Read more »
