342 search results for "PCA"

Using htmlwidgets with knitr and Jekyll

November 15, 2015

A few weeks ago I gave a talk at BARUG (and wrote a post) about blogging with the excellent knitr-jekyll repo. Yihui’s system is fantastic, but it does have one drawback: none of those fancy new htmlwidgets packages seem to work… A few people have run into this. I recently figured out...

H2O World 2015

November 12, 2015

By Joseph Rickert. The second annual H2O World conference finished up yesterday. More than 700 people from all over the US attended the three-day event, held at the Computer History Museum in Mountain View, California, a venue that sits well within the blast radius of ground zero for data science in Silicon Valley. This...

Emoticons decoder for social media sentiment analysis in R

October 16, 2015

If you have ever retrieved data from Twitter, Facebook or Instagram with R, you might have noticed a strange phenomenon. While R seems to be able to display some emoticons properly, many other times it doesn’t, making any further analysis impossible unless you get rid of them. With a little hack, I decoded these emoticons and put them all...

A New Method for Statistical Disclosure Limitation, I

October 15, 2015

The Statistical Disclosure Limitation (SDL) problem involves modifying a data set in such a manner that statistical analysis on the modified data is reasonably close to that performed on the original data, while preserving the privacy of individuals in the data set. For instance, we might have a medical data set on which we want …

User-friendly scaling

October 8, 2015

Back in the mists of time, whilst programming early versions of Canoco, Cajo ter Braak decided to allow users to specify how species and site ordination scores were scaled relative to one another via a simple numeric coding system. This was fine for the DOS-based software that Canoco was at the time; you entered 2 when prompted and you...

London Vs Boston

October 2, 2015

By Paulin Shek. Following the great success of the EARL conference in London earlier this month, our attention now turns to EARL Boston, which will take place on 2-4 November just across the pond; the competition is already on to …

Rebuilding Map Example With Apply Functions

September 30, 2015

Yesterday Hadley’s functional programming package purrr was published to CRAN. It is designed to bring convenient functional programming paradigms and add another data manipulation framework to R. “Where dplyr focusses on data frames, purrr focusses on vectors” (Hadley Wickham, in a blog post). The core of the package consists of map functions, which operate similarly to...

Six lines to install and start SparkR on Mac OS X Yosemite

September 20, 2015

I know there are many R users who would like to try out SparkR without all the configuration hassle. With just these six lines you can start SparkR from both RStudio and the command line. One line for Spark and SparkR: Apache Spark is a fast and general-purpose cluster computing system; SparkR is an R package that...

Predicting Titanic deaths on Kaggle VI: Stan

September 19, 2015

It is a bit of a contradiction. Kaggle provides competitions on data science, while Stan is clearly part of (Bayesian) statistics. Yet after using random forests, boosting and bagging, I also think this problem has a suitable size for Stan, which I un...

New R Software/Methodology for Handling Missing Data

September 16, 2015

I’ve added some missing-data software to my regtools package on GitHub. In this post, I’ll give an overview of missing-data methodology, and explain what the software does. For details, see my JSM paper, jointly authored with my student Xiao (Max) Gu. There is a long history of development of techniques for handling missing data. See the …
