Blog Archives

Half off Win-Vector data science books and video training!

April 8, 2016

We are pleased to announce that our book Practical Data Science with R (Nina Zumel, John Mount, Manning 2014) is part of Manning’s “Deal of the Day” for April 9th, 2016. This one-day-only offer gets you half off the physical book (with free e-copy) or the paid e-copy (e-copy simultaneous pdf + ePub + kindle, … Continue reading...

A bit on the F1 score floor

April 2, 2016

At Strata+Hadoop World “R Day” Tutorial, Tuesday, March 29 2016, San Jose, California we spent some time on classifier measures derived from the so-called “confusion matrix.” We repeated our usual admonition to not use “accuracy” as a project goal (business people tend to ask for it as it is the word they are most familiar … Continue reading...

WVPlots: example plots in R using ggplot2

April 1, 2016

Nina Zumel and I have been working on packaging our favorite graphing techniques in a more reusable way that emphasizes the analysis task at hand over the steps needed to produce a good visualization. The idea is: we sacrifice some of the flexibility and composability inherent to ggplot2 in R for a menu of prescribed … Continue reading...

For loops in R can lose class information

March 24, 2016

Did you know R’s for() loop control structure drops class annotations from vectors? Consider the following R code demonstrating three uses of a for-loop that one would expect to behave very similarly.
dates <- c(as.Date('2015-01-01'),as.Date('2015-01-02'))
for(ii in seq_along(dates)) {
  di <- dates[ii]
  print(di)
}
## [1] "2015-01-01"
## [1] "2015-01-02"
for(di in as.list(dates)) { … Continue reading...

Upcoming Win-Vector LLC appearances

March 23, 2016

Win-Vector LLC will be presenting on statistically validating models using R and data science at: Strata+Hadoop World “R Day” Tutorial, 9:00am–5:00pm Tuesday, March 29, 2016, San Jose, California; and ODSC San Francisco Meetup, 6:30pm–9:00pm Thursday, March 31, 2016, San Francisco, California. We will share code and examples. Registration required (and Strata is a paid conference). Please … Continue reading...

sample(): “Monkey’s Paw” style programming in R

March 22, 2016

The R functions base::sample and base::sample.int include extra “conveniences” that seem to have no purpose beyond encouraging grave errors. In this note we will outline the problem and a suggested workaround. Obviously the R developers are highly skilled people with good intent, and likely have no choice in these matters (due … Continue reading...

More on preparing data

March 18, 2016

The Microsoft Data Science User Group just sponsored Nina Zumel’s presentation “Preparing Data for Analysis Using R”. Microsoft saw Win-Vector LLC’s ODSC West 2015 presentation “Prepping Data for Analysis using R” and generously offered to sponsor improving it and disseminating it to a wider audience. We feel Nina really hit the ball out of the … Continue reading...

Bend or break: strings in R

March 10, 2016

A common complaint from new users of R is: the string processing notation is ugly. Using paste(,,sep='') to concatenate strings seems clumsy. You are never sure which regular expression dialect grep()/gsub() are really using. Remembering the difference between length() and nchar() is initially difficult. As always things can be improved by using additional libraries (for … Continue reading...

Win-Vector video courses: price/status changes

March 2, 2016

Win-Vector LLC has been offering a couple of online video courses on the topics of data science and A/B testing (both using R). These are high-quality courses, well worth the money and time needed to work through them closely (with all materials distributed on GitHub). Our current distributor is Udemy, which has just … Continue reading...

More Shiny user showcase demonstrations

February 24, 2016

We at Win-Vector LLC are very proud to announce that RStudio just inducted two more of our demonstration Shiny applications into their Shiny User Showcase gallery. Check out the gallery to see our demonstrations of: Finding the k in k-means; A/B test interactive design and analysis tool; The geometry of classifiers. RStudio (the authors of Shiny) … Continue reading...
