April 2016

R Conferences: Europe 2016

April 28, 2016 | Joseph Rickert

Answering email queries from friends and acquaintances from around the world wanting to attend useR! 2016 has been painful. It is amazing that the conference sold out a full two months before its start, but upon reflection, not unbelievable. From its inception useR! has been an "academic" conference ... [Read more...]

It’s Not Rocket Science, Just Pasta Carbonara

April 28, 2016 | Salvino

In early April 2016, the release of a video recipe for pasta carbonara on YouTube triggered a tsunami of reactions. Following all the negative and sarcastic comments, the original video, sponsored by Barilla, was removed by the author (the French food blog demotivateur.fr) but is still available online. Hostile ... [Read more...]

yorkr ranks ODI batsmen and bowlers

April 28, 2016 | Tinniam V Ganesh

This is the final post in which yorkr ranks ODI batsmen and bowlers. The rankings are based on match data from Cricsheet and use average runs and average strike rate for batsmen, and average wickets and average economy rate for bowlers. This post has also been ... [Read more...]

Call for Papers: eRum 2016 (European R users meeting)

April 28, 2016 | smarterpoland

The European R users meeting (eRum) is an international conference that aims to bring together users of the R language. eRum 2016 will be held on October 13 and 14, 2016, in Poznan, Poland at the Poznan University of Economics and Business. We have already confirmed the following invited speakers: Rasmus Bååth, Romain Francois, Ulrike ... [Read more...]

RcppRedis 0.1.7

April 27, 2016 | Thinking inside the box

A new release of RcppRedis arrived on CRAN today. And just like for the previous release, Russell Pierce contributed a lot of changes via several pull requests which make for more robust operations. In addition, we have started to add support for Mes... [Read more...]

Le Monde puzzle [#960]

April 27, 2016 | xi'an

An arithmetic Le Monde mathematical puzzle: Given an integer k>1, consider the sequence defined by F(1)=1+1 mod k, F²(1)=F(1)+2 mod k, F³(1)=F²(1)+3 mod k, etc. [With this notation, F is not necessarily a function.] For which value of k is the sequence the entire {0,1,…,k-1} set? This leads ...
[Read more...]

A segmented model of CRAN package growth

April 27, 2016 | Andrie de Vries

A few weeks ago I wrote about the growth of CRAN packages, where I demonstrated how to scrape CRAN archives to get an estimate of the number of packages over time. In this post I briefly mentioned that the Ecdat package contains a dataset, CRANpackages, with ... [Read more...]

CRAN CHECK NOTE sub-directories of 1Mb or more: libs

April 27, 2016 | bhejbiostat

I just released a new package on CRAN. It’s called NPflow and it performs Dirichlet process mixture modeling of multivariate normal, skew-normal or skew t-distributions; you should check it out. I was a little worried because the check from Travis CI was returning a NOTE. And even though the NOTEs ... [Read more...]

Solving Inequality (the math kind)

April 27, 2016 | Jonathan Carroll

This neat approach showed up recently as an answer to a FiveThirtyEight puzzle and of course I couldn’t help but throw it at dplyr as soon as I could. Turns out that’s not a terrible idea. The question posed is ...
[Read more...]

Your strongly correlated data is probably nonsense

April 27, 2016 | biomickwatson

Use of the Pearson correlation coefficient is common in genomics and bioinformatics, which is OK as far as it goes (I have used it extensively myself), but it has some major drawbacks, the biggest being that Pearson can produce large coefficients in the presence of very large measurements. This is best ... [Read more...]

Explicit semantic analysis with R

April 26, 2016 | Francesco Bailo

Explicit semantic analysis (ESA) was proposed by Gabrilovich and Markovitch (2007) to compute a document's position in a high-dimensional concept space. At its core, the technique compares the terms of the input document with the terms of documents describing the concepts, estimating the relatedness of the document to each concept. In ... [Read more...]

Complex Tables – Exercises

April 26, 2016 | John Akwei

The ftable() function combines cross-tabulation with the ability to format, or “flatten”, contingency tables of 3 or more dimensions. The resulting tables contain the combined counts of the categorical variables (factor variables in R), arranged as a matrix whose rows and columns correspond to the original data’... [Read more...]

A Data Scientist’s Perspective on Microsoft R

April 26, 2016 | Joseph Rickert

By Lixun Zhang, Data Scientist at Microsoft. As a data scientist, I have experience with R. Naturally, when I was first exposed to Microsoft R Open (MRO, formerly Revolution R Open) and Microsoft R Server (MRS, formerly Revolution R Enterprise), I wanted to know the answers to three questions: What do ... [Read more...]

On Nested Models

April 26, 2016 | John Mount

We have recently been working on, and presenting on, nested modeling issues. These are situations where the output of one trained machine learning model is part of the input of a later model or procedure. I am now of the opinion that correct treatment of nested models is one of ...
[Read more...]