Peter Norvig’s Spell Checker in Two Lines of Base R

December 16, 2014
By
Peter Norvig’s Spell Checker in Two Lines of Base R

Peter Norvig, the director of research at Google, wrote a nice essay on How to Write a Spelling Corrector a couple of years ago. That essay explains and implements a simple but effective spelling correction function in just 21 lines of Python. Highly recommended reading! I was wondering how many lines it would take to write something similar...

Read more »

Introducing V8: An Embedded JavaScript Engine for R

December 16, 2014
By
Introducing V8: An Embedded JavaScript Engine for R

JavaScript is an fantastic language for building applications. It runs on browsers, servers and databases, making it possible to design an entire web stack in a single language. The OpenCPU JavaScript client already allows for calling R functions from JavaScript (see jsfiddles and apps). With the new V8...

Read more »

Yikes…It’s Been Awile

December 16, 2014
By

Apparently our last blog post was in August. Dang. Where did five months go? Blog guilt would be killing me, but I swear it was just yesterday that Mine posted. I will give a bit of review of some of … Continue reading →

Read more »

10 new R jobs (for December 16th 2014) – from Intel to University of Iowa and more

December 16, 2014
By
10 new R jobs (for December 16th 2014) – from Intel to University of Iowa and more

This is the bimonthly R Jobs post (for 2014-12-16), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seekers, please follow the links below to learn more and apply for your job of interest (or visit previous...

Read more »

The Mandelbrot Set in R

December 16, 2014
By
The Mandelbrot Set in R

IntroductionI was having a conversation with someone I know about weather forecasts the other day and it went something like this:"Yes, their forecasts are really not very good. They really need to work on improving them.""Well, yes, I think they're ok...

Read more »

Interactive 2D & 3D Plots with Plotly and ggplot2

December 16, 2014
By
Interactive 2D & 3D Plots with Plotly and ggplot2

by Matt Sundquist co-founder of Plotly R, Plotly, and ggplot2 let you make, share, and collaborate on beautiful, interactive plots online. Let's see what we can do with the topographic data from Auckland's Maunga Whau Volcano that comes with R. Copy and paste this R code to make your first plot. The basic idea is: use ggplot2 code, add...

Read more »

Extending methylKit: Christmas Offer for Methylome Researchers

December 16, 2014
By

Next week is going to be a vacation and most of us are going on vacation. Me too. However, I thought that this Christmas I would OFFER fellow researchers a GOOD SERVICE. Many of the biologists are learning to deal with their high-throughput sequen...

Read more »

Simulating Intricate Branching Patterns with DLA

December 16, 2014
By
Simulating Intricate Branching Patterns with DLA

Manfred Schroeder's book Fractals, Chaos, Power Laws is a fruitful source of interesting topics and projects. He gives a thorough description of Diffusion-Limited Aggregation (DLA) as a technique for simulating physical processes which produce intricate branching structures. Examples, as illustrated below, include Lichtenberg Figures, dielectric breakdown, electrodeposition and Hele-Shaw flow. Diffusion-Limited Aggregation DLA is conceptually

Read more »

Notes from the Kölner R meeting, 12 December 2014

December 16, 2014
By
Notes from the Kölner R meeting, 12 December 2014

Last week's Cologne R user group meeting was the best attended so far, and it was a remarkable event - I believe not a single line of R code was shown. Still, it was an R user group meeting with two excellent talks, and you will understand shortly why not much R code needed to be displayed.Introduction to Julia...

Read more »

The Awesome Parrondo’s Paradox

December 15, 2014
By
The Awesome Parrondo’s Paradox

A technique succeeds in mathematical physics, not by a clever trick, or a happy accident, but because it expresses some aspect of physical truth (O. G. Sutton) Imagine three unbalanced coins: Coin 1: Probability of head=0.495 and probability of tail=0.505 Coin 2: Probability of head=0.745 and probability of tail=0.255 Coin 3: Probability of head=0.095 and … Continue reading...

Read more »

New Package: partools

December 15, 2014
By
New Package:  partools

I mentioned last week that I would be putting together a package, based in part on my posts on Snowdoop.  I’ve now done so, in a package partools., with the name alluding to the fact that they are intended for use with the cluster-based part of R’s parallel package.  The main ingredients are: Various code … Continue reading...

Read more »

Cartography with complex survey data

December 15, 2014
By
Cartography with complex survey data

Visualizing complex survey data is something of an art. If the data has been collected and aggregated to geographic units (say, counties or states), a choropleth is one option. But if the data aren't so neatly arranged, making visual sense often requires some form of smoothing to represent it on a map. R, of course, has a number of...

Read more »

Learning R: Live Webinar, Interactive Self-Paced, or Site Visit?

December 15, 2014
By
Learning R: Live Webinar, Interactive Self-Paced, or Site Visit?

My recent blog post, Why R is Hard to Learn, must have hit a nerve as it was read by over 6,000 people in its first two days online.  If you’re using R to augment your work in SAS, SPSS … Continue reading →

Read more »

Hassle-free data from HTML tables with the htmltable package

December 15, 2014
By

HTML tables are a standard way to display tabular information online. Getting HTML table data into R is fairly straightforward with the readHTMLTable() function of the XML package. But tables on the web are primarily designed for displaying and consuming data, not for analytical purposes. Peculiar design choices for HTML tables are therefore frequently made which tend to produce...

Read more »

Updated quantmod on CRAN

December 15, 2014
By

An updated version of quantmod has just been released on CRAN.  This is my first submission as the new maintainer.  The major change was removing the dependency on the now-archived Defaults package.  End-users shouldn't notice ...

Read more »

Review: Data Science at the Command Line

December 15, 2014
By
Review: Data Science at the Command Line

Admission: I didn’t really know how computers worked until around 2012. For the majority of my career, I’ve worked for large companies with centralized IT functions. Like many statisticians, I fell into a comfortable position of learning SAS in a Windows environment, had Ops people to fix any Unix problems I’d run into and DBAs to

Read more »

QQ-plots in R vs. SPSS – A look at the differences

December 15, 2014
By
QQ-plots in R vs. SPSS – A look at the differences

We teach two software packages, R and SPSS, in Quantitative Methods 101 for psychology freshman at Bremen University (Germany). Sometimes confusion arises, when the software packages produce different results. This may be due to specifics in the implemention of a method or, as in most cases, to different default settings. One of these situations occurs

Read more »

Decide which frequent flyer program is best for your city

December 14, 2014
By
Decide which frequent flyer program is best for your city

Use our interactive tool to find out which airline has the most departures from your city The post Decide which frequent flyer program is best for your city appeared first on Decision Science News.

Read more »

A time series classification contest

December 14, 2014
By
A time series classification contest

Amongst today’s email was one from someone running a private competition to classify time series. Here are the essential details. The data are measurements from a medical diagnostic machine which takes 1 measurement every second, and after 32–1000 seconds, the time series must be classified into one of two classes. Some pre-classified training data is

Read more »

stringi 0.4-1 released – fast, portable, consistent character string processing

December 14, 2014
By

A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). # install.packages("stringi") or update.packages() library("stringi") Here’s a list of changes in version 0.4-1. In the current release,…Read more ›

Read more »

httr 0.6.0

December 14, 2014
By
httr 0.6.0

httr 0.6.0 is now available on CRAN. The httr packages makes it easy to talk to web APIs from R. Learn more in the quick start vignette. This release is mostly bug fixes and minor improvements. The most important are: handle_reset(), which allows you to reset the default handle if you get the error “easy

Read more »

Hierarchical Clustering with R (feat. D3.js and Shiny)

December 14, 2014
By
Hierarchical Clustering with R (feat. D3.js and Shiny)

Agglomerative hierarchical clustering is a simple, intuitive and well-understood method for clustering data points. I used it with good results in a project to estimate the true geographical position of objects based on measured estimates. With this tutorial I would … Continue reading →

Read more »

Monthly Weather in Netherlands

December 14, 2014
By
Monthly Weather in Netherlands

When I downloaded the KNMI meteorological data, the intention was to do something which takes more than just the computers memory. While it is clearly not big data, at the very least 100 years of daily data is not small either. So I took along a load o...

Read more »

Parallelism via “parSapply”

December 13, 2014
By

In an earlier post, I used mclapply to kick off parallel R processes and to demonstrate inter-process synchronization via the flock package. Although I have been using this approach to parallelism for a few years now, I admit, it has certain important disadvantages. It works only on a single machine, and also, it doesn’t work

Read more »

rfoaas 0.0.4.20141212

December 13, 2014
By

A new version of rfoaas is now on CRAN. The rfoaas package provides an interface for R to the most excellent FOAAS service -- which provides a modern, scalable and RESTful web service for the frequent need to tell someone to eff off. The FOAAS backen...

Read more »

Notes on shrinkage & prediction in hierarchical models

December 13, 2014
By
Notes on shrinkage & prediction in hierarchical models

Ecologists increasingly use mixed effects models, where some intercepts or slopes are fixed, and others are random (or varying). Often, confusion exists around whether and when to use fixed vs. random intercepts/slopes, which is understandable given their multiple definitions. In an attempt to help clarify the utility of varying intercept models (and more generally, hierarchical modeling), specifically in...

Read more »

UEFA Champions League Round of 16 draw

December 13, 2014
By
UEFA Champions League Round of 16 draw

Each year after the group stage, there is the much awaited drawing of the eighth-final, which essentially defines a team’s fate. So far the thing is not too complicated, as there are 16 teams out of which we need to generate 8 games – no problem if it would be possible to draw the teams without

Read more »

Meetup: DataVis with Plotly on December 16th

December 12, 2014
By

Plotly is a  web-based platform for making graphs and analyzing data. Plotly’s APIs and web app...

Read more »

The ZOMMA Warthog Index

December 12, 2014
By
The ZOMMA Warthog Index

Harry Long posted another article on SeekingAlpha. As usual, it’s another “looks amazing at first glance, and winds up being … Continue reading →

Read more »