386 search results for "evaluation"

My aversion to pipes

June 3, 2015

At the risk of coming across as even more of a curmudgeonly old fart than people already think I am, I really do dislike the current vogue in R that is the pipe family of binary operators; e.g. %>%. Introduced by Hadley Wickham and popularised and advanced via the magrittr package by Stefan Milton Bache, the basic idea...

Statistical Models with a Point of View: First vs. Third Person

June 2, 2015

Marketing data can be collected in the first or third person, and we require different statistical models for each point of view. Netflix encourages you to adopt a third-person perspective when it surveys your taste preferences by asking how often you w...

Roman dataviz and inference in complex systems

May 29, 2015

I’m in Rome at the International Workshop on Computational Economics and Econometrics. I gave a seminar on Monday on the ever-popular subject of data visualization. Slides are here. In a few minutes, I’ll be speaking on Inference in Complex Systems, …

Review of ‘Advanced R’ by Hadley Wickham

May 24, 2015

Executive summary: Surprisingly good. And it’s not like my expectations were especially low. Structure: There are 20 chapters. I mostly like the chapters and their order. Hadley breaks the 20 chapters into 4 parts. He’s wrong. Figure 1 illustrates the correct way to formulate parts. Figure 1: Chapters and Parts of Advanced R. Introductory R: There...

streaming machine learning with RMOA: stream_in > train > predict

We will be showcasing our RMOA package at the next R User conference in Aalborg. For R users who are unfamiliar with streaming modelling and want to be ahead of the Gartner hype cycle, or who want to evaluate existing streaming machine learning models, RMOA lets you build, run and evaluate streaming classification models which are built in

Data Science in HR

May 5, 2015

by Joseph Rickert. Last year, in a post on interesting R topics presented at the JSM, I described how data scientists in Google's human resources department were using R and predictive analytics to better understand the characteristics of its workforce. Google may very well have done the pioneering work, but predictive analytics for HR applications is going mainstream. In...

How large vectors in R might be stored compactly

April 30, 2015

Vectors in R can currently have elements of two sizes: 8-byte double-precision floating-point elements for ‘numeric’ vectors, or 4-byte elements for ‘integer’ or ‘logical’ vectors. You can also have vectors whose elements are 1-byte ‘raw’ values, but these raw vectors don’t support negative numbers or NA values, so they aren’t suitable for general use. It seems that lots of

AusDM 2015 submission deadline extended to Thursday 30 April

April 20, 2015

The 13th Australasian Data Mining Conference (AusDM 2015), Sydney, Australia, 8–9 August 2015 (co-located with SIGKDD’15). URL: http://ausdm15.ausdm.org/ The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. It …

R 3.2.0 is released (+ using the installr package to upgrade in Windows OS)

April 17, 2015

R 3.2.0 (codename “Full of Ingredients”) was released yesterday. You can get the latest binary version from here (or the .tar.gz source code from here). The full list of new features and bug fixes is provided below. Upgrading to R 3.2.0 on Windows: If you are using Windows, you can easily upgrade to the latest version of R using the installr …

Part 4a: Modelling – predicting the amount of rain

April 6, 2015

In the fourth and last part of this series, we will build several predictive models and evaluate their accuracy. In Part 4a, our dependent variable will be continuous, and we will be predicting the daily amount of rain. Then, in Part 4b, we will deal with the case of a binary outcome, which means we will assign probabilities to...
