# Monthly Archives: May 2013

## xkcd: Visualized

May 6, 2013
By

IntroductionIt's been said that the ideal job is one you love enough to do for free but are good enough at that people will pay you for it. That if you do what you love no matter what others may say, and if you work at it hard enough, and long enough, eventually people will recognize it and...

## Explaining real-time predictive analytics with big data (video)

May 6, 2013
By

In my presentation to the Strata Santa Clara 2013 conference earlier this year, my goal was to give a succinct (under 20 minutes!) explanation of three terms that are two often used as mere buzzwords: predictive analytics, real time, and big data. You can download the slides for my presentation, Real-time Big Data Analytics: From Deployment to Production, from...

## Veterinary Epidemiologic Research: Count and Rate Data – Zero Counts

May 6, 2013
By
$Veterinary Epidemiologic Research: Count and Rate Data – Zero Counts$

Continuing on the examples from the book Veterinary Epidemiologic Research, we look today at modelling count when the count of zeros may be higher or lower than expected from a Poisson or negative binomial distribution. When there’s an excess of zero counts, you can fit either a zero-inflated model or a hurdle model. If zero

## When the “reorder” function just isn’t good enough…

May 6, 2013
By

The reorder function, in R 3.0.0, is behaving strangely (or I’m really not understanding something).  Take the following simple data frame: df = data.frame(a1 = c(4,1,1,3,2,4,2), a2 = c(“h”,”j”,”j”,”e”,”c”,”h”,”c”)) I expect that if I call the reorder function on the … Continue reading →

## Oracle R Distribution for R 2.15.2 available on public-yum

May 6, 2013
By

Oracle R Distribution (ORD) for R 2.15.2 on Linux is now available for download from Oracle's public-yum repository.  R 2.15.2 is a maintenance update that includes improved performance and reduced memory usage for some commonly-used functions, increased memory available for data on 64-bit systems, enhanced localization for Polish language users, and a number of bug fixes.  Detailed updates...

## Bayesian and Frequentist Approaches: Ask the Right Question

May 6, 2013
By

It occurred to us recently that we don’t have any articles about Bayesian approaches to statistics here. I’m not going to get into the “Bayesian versus Frequentist” war; in my opinion, which style of approach to use is less about philosophy, and more about figuring out the best way to answer a question. Once you Related posts:

## Incomplete Data by Design: Bringing Machine Learning to Marketing Research

May 6, 2013
By

Survey research deals with the problem of question wording by always asking the same question.  Thus, the Gallup Daily Tracking is filled with examples of moving averages for the exact same question asked precisely the same way every day. &nb...

## New fixed.angle() Function

Hello morphometricians,Below you can find a new fixed angle function addressing the problem discovered by Fabio Machado in the morphmet mail archive. We will include this function in our next schedule update to geomorph. Cheers, Erik CODE: ...

## Mixed Model Example — Wagner et al. (2006)

May 6, 2013
By

I am preparing for a workshop on mixed models and looked at the paper “Accounting for multilevel data structures in fisheries data using mixed models” by Wagner et al. (2006) (PDF available here).  Wagner et al. (2006) used two examples, with the … Continue reading →

## Monitoring des médias 2

May 6, 2013
By

(This article was first published on Learning Data Science , and kindly contributed to R-bloggers) Petit monitoring de notre observatoire des médias sur Twitter. Chez Mediapart : Le Monde Le Figaro Le parisien Vue globale Le code pour réaliser ce post : To leave a comment for the author, please follow the link and comment on their blog: Learning...