August 2014

Vtreat: designing a package for variable treatment

August 7, 2014 | Nina Zumel

When you apply machine learning algorithms on a regular basis, on a wide variety of data sets, you find that certain data issues come up again and again: Missing values (NA or blanks) Problematic numerical values (Inf, NaN, sentinel values like 999999999 or -1) Valid categorical levels that don’t appear ... [Read more...]

Difference between magrittr and pipeR

August 7, 2014 | Kun Ren

(This post is rewritten to adapt to the latest release of pipeR) Pipeline is receiving increasing attention in R community these days. It is hard to tell when it begins but more people start to use it since the easy-and-fast dplyr package imports the magic operator %__% from magrittr, the pioneer ... [Read more...]

Incidental R

August 7, 2014 | Joseph Rickert

by Joseph Rickert Last week, I posted a list of sessions at the Joint Statistical Meetings related to R. As it turned out, that list was only the tip of the iceberg. In some areas of statistics, such as graphics, simulation and computational statistics the use of R is so ... [Read more...]

why clusterProfiler fails

August 6, 2014 | R on Guangchuang Yu

Recently, there are some comments said that sometimes clusterProfiler failed in KEGG enrichment analysis. kaji331 compared cluserProfiler with GeneAnswers and found that clusterProfiler gives larger p values. The result forces me to test it. Read More: 251 Words Totally [Read more...]

A quiz on magrittr: How many scores can you get?

August 6, 2014 | Kun Ren

Here is a quiz on magrittr and you may check if you are really good at using it. Since the CRAN version currently does not support nested ., it won't be interesting to make a quiz on that version. All the following examples are using the latest development version on GitHub. ... [Read more...]

A New Use for Pipes in R: Forkbombs

August 6, 2014 | wrathematics

Almost 3 years ago, I wrote about how to forkbomb with R. A quick recap is that a forkbomb is a low-tier, malicious misuse of a system; sort of a "baby's first denial of service". The idea is that you write a program that will start an entirely new copy of ... [Read more...]

In case you missed it: July 2014 Roundup

August 6, 2014 | David Smith

In case you missed them, here are some articles from June of particular interest to R users: The deadline for our contest to visualize the location of R user groups has been extended to August 16. Previews of R-related sessions at this year's JSM conference in Boston. Coding errors in R ... [Read more...]

NCEAS Codefest

August 6, 2014 | rOpenSci Blog - R

We're delighted to be sponsoring the upcoming Open Science Codefest in Santa Barbara, California, alongside RENCI, NCEAS, NSF, DataONE, and Mozilla Science Lab. The Open Science Codefest's goal is to gather researchers from across ecology, biodiversity science, and other earth and environmental sciences with programmer types to collaborate on coding ... [Read more...]

Results of the Readers’ Survey

August 5, 2014 | Cory Lesmeister

 First of all, let me say “Thank You” to all of the 357 people who completed the survey. I was hoping for 100, so needless to say the response blew away my expectations. This endeavor seems like a worthwhile effort to do once a year. Next year I will refine the questionnaire ... [Read more...]

New freqparcoord Example

August 5, 2014 | matloff

In my JSM talk this morning, I spoke about work done by Yingkang Xie and myself, on a novel approach to the parallel coordinates method of visualization.  I’ve made several posts to this blog in the past on freqparcoord, our implemention of our method. My talk this morning used ... [Read more...]

Introducing rlist 0.3

August 5, 2014 | Kun Ren

rlist 0.3 is released! This package now provides a wide range of functions for dealing with list objects. It can be especially useful when they are used to store non-tabular data. Two notable features are added in this version. First, list.search and equal() are added in support of fuzzy filtering ... [Read more...]

When life gives you coloured cells, make categories

August 5, 2014 | nsaunders

Let’s start by making one thing clear. Using coloured cells in Excel to encode different categories of data is wrong. Next time colleagues explain excitedly how “green equals normal and red = tumour”, you must explain that (1) they have sinned and (2) what they meant to do was add a column ... [Read more...]
1 7 8 9 10 11

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)