## Using R and Shiny to create “Art”

July 19, 2013

One big strength of packages like shiny is the ability to easily vary parameters and view the results, especially in plots. So here’s a small shiny app that I created to learn about reactivity, while also having fun. The idea is simple: vary many aspects of geom_segments in ggplot, and see what emerges. The things...

## Oracle R Connector for Hadoop 2.2.0 released

July 19, 2013

Oracle R Connector for Hadoop 2.2.0 is now available for download. The Oracle R Connector for Hadoop 2.x series has introduced numerous enhancements, which are highlighted in this article and summarized as follows: ORCH 2.0.0, ORCH 2.1.0, ORCH...

## Create SQL Rules from rpart model

July 19, 2013

Mapping the output of an rpart tree to SQL statements is not easy. With the rpart package you have to print out the rules and then manually write a SQL CASE statement. Fortunately, we can write a new function to do this job. To test the function, I will use dataset german...
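As a rough illustration of the idea, here is a minimal sketch of such a function. It is not the code from the original post: the helper name `rpart_to_sql` is hypothetical, the example uses the `kyphosis` data that ships with rpart rather than the dataset mentioned above, and it assumes a classification tree with numeric predictors only (factor splits would need extra translation into SQL `IN (...)` clauses).

```r
library(rpart)  # recommended package, included with standard R installations

# Hypothetical helper (an assumption, not the post's function): turn each
# leaf of a classification tree into one WHEN branch of a SQL CASE expression.
rpart_to_sql <- function(fit) {
  frame <- fit$frame
  is_leaf <- frame$var == "<leaf>"
  leaves <- as.numeric(rownames(frame)[is_leaf])          # leaf node numbers
  classes <- attr(fit, "ylevels")[frame$yval[is_leaf]]    # predicted class per leaf
  # path.rpart() returns, for each node, the split labels from the root down
  paths <- path.rpart(fit, nodes = leaves, print.it = FALSE)
  whens <- mapply(function(path, cls) {
    conds <- path[-1]                       # drop the leading "root" entry
    if (length(conds) == 0) conds <- "1 = 1"  # tree with no splits at all
    sprintf("  WHEN %s THEN '%s'", paste(conds, collapse = " AND "), cls)
  }, paths, classes)
  paste(c("CASE", whens, "END"), collapse = "\n")
}

fit <- rpart(Kyphosis ~ Age + Number + Start, data = kyphosis)
sql <- rpart_to_sql(fit)
cat(sql, "\n")
```

The split labels are used verbatim as SQL conditions, so this only works if the table's column names match the model's variable names.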

July 19, 2013

Following up on my previous posts regarding the results of the Comrades Marathon, I was planning on putting together a set of models which would predict likelihood to finish and probable finishing time. Along the way I got distracted by something else that is just as interesting and which produces results which readily yield to qualitative

## Using the power of sound to figure out which Simpsons character is speaking

July 18, 2013

In a previous post, I looked at transcripts of Simpsons episodes and tried to figure out which character was speaking which line. This worked decently, but it wasn’t great. It gave us memorable scenes like this one: Homer : D'oh! A deer! A female deer. Marge : Son, you're okay! Bart : Dad, I can't let...

## Stochastic Oscillator

July 18, 2013

I came across the link to the John Ehlers paper, Predictive Indicators for Effective Trading Strategies, while reading the Dekalog Blog. John Ehlers offers a different way to smooth prices and incorporate the new filter into the oscillator construction. Fortunately, the EasyLanguage code was also provided and I was able to translate it into R.
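For context, the textbook stochastic oscillator (%K) places the latest close within the high-low range of the last n bars. The sketch below shows only that classic, unfiltered form. It is not Ehlers' version and not the translation mentioned above; the function name `stochastic_k` and the default `n = 14` are illustrative assumptions.

```r
# Classic stochastic %K: 100 * (close - lowest low) / (highest high - lowest low)
# computed over a rolling window of n bars. Illustrative sketch only.
stochastic_k <- function(high, low, close, n = 14) {
  stopifnot(length(high) == length(low), length(low) == length(close))
  k <- rep(NA_real_, length(close))
  for (i in seq_along(close)) {
    if (i < n) next                      # not enough history yet
    window <- (i - n + 1):i
    hh <- max(high[window])              # highest high in the window
    ll <- min(low[window])               # lowest low in the window
    # a flat window has no range, so leave %K undefined there
    k[i] <- if (hh == ll) NA_real_ else 100 * (close[i] - ll) / (hh - ll)
  }
  k
}

# Tiny made-up price series, just to exercise the function
high  <- c(10, 12, 11)
low   <- c(8, 9, 9)
close <- c(9, 11, 10)
k <- stochastic_k(high, low, close, n = 3)
print(k)
```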

## R spells for data wizards

July 18, 2013

Thomas Levine has put together a handy list of "data spells" for R: all of those cool, slightly hacky ways of getting things done with R that you might not come across in the standard documentation. Check out the link below for handy hints o...

## Connect R with Myrrix – Mahout & Cloudera’s real-time, scalable recommender system

(This article was first published on BNOSAC - Belgium Network of Open Source Analytical Consultants, and kindly contributed to R-bloggers) Myrrix is probably better known to Java developers and users of Mahout than to R users. This is because most of the time Java and R developers live in different communities. If you go to the website of Myrrix...

## Fair weather fans? (An R scatter plot matrix)

July 18, 2013

The Victoria HarbourCats are roughly halfway through their inaugural season in the West Coast League, and currently lead the league in average attendance. In a recent conversation, one of the team's staff mentioned that after the first game in early June, the fans started to come out when the sun...

## Advanced sab-R-metrics: Parallelization with the ‘mgcv’ Package

July 18, 2013

Carson Sievert (creator of the really neat pitchRx package) and Steamer Projections posed a question about reasonable run times of the mgcv package on large data in R yesterday, and I promised my Pitch F/X friends I would post here with a quick tip on sp...

## Simple and Advanced Time series with Oracle R Enterprise

July 18, 2013

This guest post from Marcos Arancibia describes how to use Oracle R Enterprise to analyze Time Series data. In this article, we give an overview of how to use Time Series Analysis against data stored in Oracle Database, using the Embedded R Execution capability to send time series computations to...

## Lilac Bloomsday Run in Graphs

July 17, 2013

The Lilac Bloomsday Run is a 7.46-mile (12 km) race held annually in Spokane, Washington. The run was started in 1977 by Don Kardong and it saw 51,613 people register for it in 2013. In this post, I use R to collect data from their website, clean and...

## 9th IMACS seminar on Monte Carlo Methods, Annecy

July 17, 2013

As astute ‘Og’s readers may have gathered (!), I am now in Annecy, Savoie, for the 9th IMACS seminar on Monte Carlo Methods, where I was kindly invited to give a talk on ABC. IMACS stands for “International Association for Mathematics and Computers in Simulation” and the conference gathers themes and sensibilities I am not

July 17, 2013

Even though the route might be circuitous, my seemingly random journeys all seem to lead back to finance.  My fun with rCharts sankey diagrams (Exploring Networks with Sankey) has led me into an exploration of the PIMCO network.  Although PIMCO is best known for its fixed income products, PIMCO has broadened its product offerings...

## Futile.logger 1.3.3 available on CRAN

July 17, 2013

The latest version of futile.logger, which I previewed back in May, is now on CRAN. Thanks to Kirill Müller, the documentation …

## Figuring out which Simpsons character is speaking

July 17, 2013

You probably have a favorite Simpsons character. Maybe you hope to someday block out the sun, Mr. Burns style, maybe you enjoy Homer’s skill in averting meltdowns, or maybe you identify with Lisa’s struggles for acceptance. Through its ch...

## Plotting Likert-Scales (net stacked distributions) with ggplot #rstats

July 17, 2013

First of all, credit for this script must go to Ethan Brown, whose ideas for creating Likert-scale-like plots with ggplot form the core of my sjPlotLikert.R script. All I did was some visual tweaking like having positive percentage values …

## UseR! Slides for “Classification Using C5.0”

July 17, 2013

I've had a lot of requests, so here they are.  Hopefully, all of the slides will be posted on the conference website.

## Subtle Variable Scoping in R

July 17, 2013

A language's manual usually defines how the language behaves, but does not warn you about cases where you assume a feature is supported when it isn’t. As an example, I will talk about the subtle variable scoping in the R language. {} code blocks: A lot of programmers coming from C/C++/Java will assume that code...
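One well-known surprise of this kind is that braces in R only group expressions and do not open a new scope, whereas a function call does get its own environment. The snippet below is a small illustration written for this listing, not code from the original post. All variable names are made up.

```r
# Braces only group expressions in R; they do not open a new scope.
x <- 1
{
  x <- 2    # overwrites the outer x
  y <- 10   # still visible after the closing brace
}

# A for loop behaves the same way: the loop variable and anything
# assigned in the body survive the loop.
for (i in 1:3) {
  last_square <- i^2
}

# Only a function call gets its own environment.
f <- function() {
  x <- 99          # local to this call, the outer x is untouched
  inner_only <- 5  # disappears when the call returns
  invisible(NULL)
}
f()

print(c(x = x, y = y, i = i, last_square = last_square))
print(exists("inner_only", inherits = FALSE))
```

After running this, `x` is 2 rather than 1 or 99, `y`, `i` and `last_square` all exist at the top level, and `inner_only` does not.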

## Evaluating the Potential Incorporation of R into Research Methods Education in Psychology

July 17, 2013

I was recently completing some professional development activities that required me to write a report on a self-chosen topic related to diversity in student backgrounds. I chose to use the opportunity to reflect on the potential for using R to teach psychology students research methods. I thought I'd share the report in case it interests anyone. Abstract...

## Deepen your R experience with Rcpp

July 17, 2013

by Joseph Rickert

It is very likely that even a very casual observer of what has been happening in the world of R these past few months would have come across some mention of Rcpp, the R package that greatly facilitates R and C++ integration. Rcpp is hot! Over 130 R packages now depend on Rcpp and it is likely to...

## Exploring Networks with Sankey

July 16, 2013

Motivated by a tweet from Tony Hirst (http://blog.ouseful.info/), I started experimenting with an rCharts implementation of the d3 sankey plugin.  While I was putting together examples, I found lots of gaps in my knowledge of sankeys and network a...

## What are my chances to talk to this girl? Fisher or Bayes

July 16, 2013

Robert Matthews said: "Ronald Fisher gave scientists a mathematical machine for turning baloney into breakthroughs, and flukes into funding. It is time to pull the plug." He's right. In a previous life, I wrote a thesis in philosophy, in a specific area: epistemology, also called the theory of knowledge because it questions what knowledge is and how it can...

## How R will turn into SQL

July 16, 2013

Up until very recently the only way of running R code was through the standard R distribution. Of course you could use another IDE, but somewhere underneath it all you would be running the same, standard R engine from the R-core team. This is no longer your only option. A couple of weeks ago Radford

## Optimising a Noisy Objective Function

July 16, 2013

I am busy with a project where I need to calibrate the Heston Model to some Asian options data. The model has been implemented as a function which executes a Monte Carlo (MC) simulation. As a result, the objective function is rather noisy. There are a number of algorithms for dealing with this sort of problem, and

## High Performing Predictive Analytics with R and Hadoop

July 16, 2013

I'm a bit late catching up on this, but Mario Inchiosa (Revolution Analytics US Chief Scientist) gave a standing-room-only talk on high-performance predictive analytics in R and Hadoop at last month's Hadoop Summit. In the talk, he described some of the progress we've made integrating the ScaleR parallel external-memory algorithms into the Hadoop platform. He described some of the...

## Spatial Playtime

July 16, 2013

There have been repeated calls for “space” in many fields of social science (all links are behind paywalls, sorry):

- Demography: (Voss 2007)
- Sociology: (Gieryn 2000)
- Epidemiology: for an early critical review (Jacquez 2000)
- Geography: obviously geographers were into space before it was cool. A couple of pieces I like are Doreen Massey’s book, For Space…

## Priors

July 16, 2013

Nick Firoozye writes: While I am absolutely sympathetic to the Bayesian agenda I am often troubled by the requirement of having priors. We must have priors on the parameters of an infinite number of models we have never seen before and I find this troubling. There is a similarly troubling problem in economics of utility...

## ngramr – an R package for Google Ngrams

July 16, 2013

The recent post How common are common words? made use of unusually explicit language for the Stubborn Mule. As expected, a number of email subscribers reported that the post fell foul of their email filters. Here I will return to the topic of n-grams, while keeping the language cleaner, and describe the R package I developed