Querying, parsimony and golden hammers

December 20, 2012

I love it when things are easy. I love it so much that I’ll spend a great deal of time and effort to keep things simple. At the same time, though, I think there’s some value in expending effort in pursuit of something. If you want to understand a thing, you have to spend time


Visualising Tourism Data using R with googleVis package

December 20, 2012

Inspired by Mages’s post on Accessing and plotting World Bank data with R (using the googleVis package), I created one visualising tourism receipts and international tourist arrivals of various countries since 1995. The data used are from the World Bank’s country indicators. To see the motion chart, double-click a picture below.


moRe

December 20, 2012

Hopefully my first R post whetted your appetite for open source data software. I’m gearing up for more R posts regardless. I thought I’d do a quick post about a couple of useful commands, ‘View’ and ‘fix’. When you first break the shackles of Excel, one of the toughest things is not being able to


R Journal Volume 4, Issue 2

December 20, 2012

The latest issue of the bi-annual, peer-reviewed journal about R, the R Journal, is now available for download. This issue includes three articles on graphics from R-core member and R Graphics author Paul Murrell. He writes about accessing individual elements of an R chart by the component names, drawing complex symbols with the polypath function (useful for map icons,...


Stealing from the internet: Part 1

December 20, 2012

Well, not stealing but rather some handy tools for data mining… About a year ago I came across the package XML as I was struggling to get some data from various web pages. The purpose of this blog is to describe how this package can be used to quickly gather data from the internet. I’ll


Shiny/R Conversion of Another One of My Favorite Mike Bostock d3 Examples

December 20, 2012

Mike Bostock has revolutionized visualization with his d3 and his seemingly infinite examples.  In another adaptation of his amazing work, I will adapt one of my favorite examples to supplement the interactive scatterplot with data supplied by R t...


Shiny SVG no d3–New and Improved

December 20, 2012

The fine author Joe Cheng of RStudio Shiny suggested in this Google Groups message to use htmlOutput rather than the ugly hack in my last post R Shiny svg with no d3.  As I should have known, it works great and eliminates all the useless javascrip...


Turnovers are poison

December 20, 2012

This is probably a slightly useless post, but a bit of fun all the same. If nothing else, it allows me to take a stab at learning a bit more about logistic regression. I’m still trying to unravel the mystery of why the Bears lost to the Vikings two weeks ago. This mystery is compounded


Generation of E-Learning Exams in R for Moodle, OLAT, etc.

December 20, 2012

(Guest post by Achim Zeileis) Development of the R package exams for automatic generation of (statistical) exams in R started in 2006 and version 1 was published in JSS by Grün and Zeileis (2009). It was based on standalone Sweave exercises that can be combined …


Influence.ME: Tools for Detecting Influential Data in Multilevel Regression Models

December 20, 2012

Despite the increasing popularity of multilevel regression models, the development of diagnostic tools lagged behind. Typically, in the social sciences multilevel regression models are used to account for the nesting structure of the data, such as students in classes, migrants ...


Geocoding location data with dismo

December 20, 2012

Today’s Gist could actually end up being very useful to a number of you. It’s something of a trumped-up example, but it illustrates in very simple code how to do three interesting things: Gather Tweets by search term (which we’ve done before), and look up user info for each of the users returned by that search. Convert textual user...


Presentations of the first Italian bioR Day at PTP

December 20, 2012

Reproducible Research in High-Throughput Biology: A Case Study
Paolo Sonego, Bioinformatician at CBM Scrl (view code and case study on github)

Metabolomics: an interpreting tool to understand kidney graft recipients grouping and their recovery trajectory
Marco Calderisi, Chemometrician at Kode …


Removing Records by Duplicate Values

December 20, 2012

Removing records from a data table based on duplicate values in one or more columns is a common and important data cleaning technique. The post gives examples of how to accomplish this task in SAS, R, and Python, respectively.
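
As a rough illustration of the R side of this task, here is a minimal base-R sketch using duplicated(). The data frame and its column names (id, group, value) are made up for illustration and are not taken from the original post, which may well use a different approach.

```r
# Toy data: hypothetical columns, chosen only to show the technique
df <- data.frame(
  id    = c(1, 2, 2, 3, 3, 3),
  group = c("a", "b", "b", "c", "c", "d"),
  value = c(10, 20, 25, 30, 35, 40),
  stringsAsFactors = FALSE
)

# Keep the first record seen for each value of a single column
dedup_one <- df[!duplicated(df$id), ]

# Keep the first record seen for each combination of several columns
dedup_two <- df[!duplicated(df[, c("id", "group")]), ]

print(dedup_one)
print(dedup_two)
```

duplicated() flags every repeat after the first occurrence, so negating it keeps the first record per key. To keep the last record instead, pass fromLast = TRUE.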


Generating polygon boundaries for plotting simple time series data with missing data

December 19, 2012

Every so often I want to plot some data with pretty upper and lower error bounds, such as temperature data through time, perhaps with the maximum and minimum temperature range or standard error bounds for averaged data. The polygon( ) function can make those sorts of pretty plots. However, I’ll often have chunks of missing


Coursera’s Data Analysis with R course starts Jan 22

December 19, 2012

Following on from Coursera's popular course introducing the R language, a new course on data analysis with R starts on January 22. The simply-titled Data Analysis course will provide practically-oriented instruction on how to plan, carry out, and communicate analyses of real data sets with R. Here's the course trailer from instructor Jeff Leek: Jeff Leek is the co-author...


Italian elections (1)

December 19, 2012

You'd think that the last week before the holidays would be very quiet and not much would be going on. Well, if you did, you'd be wrong, I guess, as the last few days have been quite busy (for many reasons). Anyway, I managed to track down some be...


Basic Simulation of Soil Profile Data in R via AQP

December 19, 2012

Something fun to play with before the new year: experimental code in aqp for simulating soil profile data from a single "template" profile. The basic idea: simulate horizon thickness data using a family of Gaussian functions with mean defined by horizo...


Finding Numeric Values of Strings using strsplit()

December 19, 2012

Given a random list of words, can you find which has the lowest or highest numerical value when we apply a basic number:letter cipher?  A while back I asked David how he would solve this problem:  http://projecteuler.net/problem=42 Today’s post shows how to take a vector of words, parse them into each of the individual letters comprising the word, and...
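
A minimal sketch of the idea, assuming the cipher is A=1, B=2, and so on up to Z=26, as in Project Euler problem 42. The helper name word_value and the sample words are hypothetical and not taken from the original post.

```r
# Sum the alphabet positions of the letters in one word (A=1, ..., Z=26).
# Characters other than A-Z yield NA from match(), so the sum would be NA.
word_value <- function(word) {
  letters_in_word <- strsplit(toupper(word), "")[[1]]
  sum(match(letters_in_word, LETTERS))
}

# Hypothetical sample words
words  <- c("sky", "data", "zebra")
values <- vapply(words, word_value, integer(1))

print(values)
print(names(which.min(values)))  # word with the lowest value
print(names(which.max(values)))  # word with the highest value
```

strsplit() with an empty split string breaks a word into single characters, and match() against the built-in LETTERS vector turns each character into its position in the alphabet.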


rencontres R à Lyon, 27-28 juin 2013

December 19, 2012

Simon Barthelmé just sent me the announcement that the next French R conference (Rencontres R) will be in Lyon, on June 27-28, 2013, and that the call for propositions is now open. (No possible confusion with MCMSki IV: the latter is not in Lyon but Chamonix, not in June but January, and not only about


R in Insurance Conference, London, 15 July 2013

December 19, 2012

The first conference on R in Insurance will be held on Monday 15 July 2013 at Cass Business School in London, UK. The intended audience of the conference includes both academics and practitioners who are active or interested in the applications of R in insurance. This one-day conference will focus on applications in insurance...


Maps in R: Introduction – Drawing the map of Europe

December 18, 2012

This post is a brief follow-up to a question that appeared some time ago in the “The R Project for Statistical Computing” LinkedIn group, which I’m reporting here: How can I draw a map of MODERN Europe? Hi, I'm trying …


Making Friends with Multicollinearity

December 18, 2012

Not every system of independent variables can be decomposed into separate components, each with its own unique contribution.  Sometimes our individual variables behave “as a unit” and thus become so entangled that we cannot say where the effect of one variable begins and the effect of another variable ends.  In such cases, it might be best to ignore the ...


PDQ 6.0.1 is Released

December 18, 2012

As already described previously, the main purpose of Release 6.0.1 Build 121512 is improved compatibility and stability between PDQ and the R statistical environment. For example, many of the PDQ models, previously found in the ../examples/ directory, can now also be accessed via the demo() command in the R-console. Testing was carried out using R version...


Generalized Boosted Regression with A Monotonic Marginal Effect for Each Predictor

December 18, 2012

In the practice of risk modeling, it is sometimes mandatory to maintain a monotonic relationship between the response and each predictor. Below is a demonstration showing how to develop a generalized boosted regression with a monotonic marginal effect for each predictor.

[Figures: Plot of Variable Importance; Plot of Monotonic Marginal Effects]


Four new Certificate Programs for R users at Statistics.com

December 18, 2012

Statistics.com Launches 4 Certificate Programs

The Institute for Statistics Education at Statistics.com (http://www.statistics.com), a pioneer in online learning since 2002, announced today the launch of four online certificate programs plus an interactive planning tool that helps prospective students plan and schedule courses. The Institute offers 100+ courses in statistics and analytics, with a format tailored to working professionals. Most...


Making prettier network graphs with sna and igraph

December 18, 2012

We’ve had some requests for ideas about how to make prettier network graphs, so here is one example, using the sna package for plotting, and the igraph package to calculate PageRank. The help file for gplot is pretty self-explanatory, but Melissa Clarkson has produced the most thorough and impressive guide for any R...


R Shiny svg with no d3

December 18, 2012

Paul Murrell’s Technical Report reminded me that svg in the browser does not require d3.  With his package gridSVG, we can do all sorts of wonderful things without leaving R.  I think I prefer the power of d3.js, but here is an example (doe...


A Quick Note in Weighting with nlme

December 18, 2012

I’ve been doing a lot of meta-analytic things lately. More on that anon. But one quick thing that came up was variance weighting with mixed models in R, and after a few web searches, I wanted to post this, more …

