Core minus one!

September 9, 2012
By
Core minus one!

Jean-Michel Marin visited me in Paris last week and, besides taking part in Pierre’s PhD defence, we made enough progress to close two more chapters of the new edition of Bayesian Core (soon to be Bayesian Essentials with R!) This follows the good work session we had in Carnon where we also completed two chapters

Read more »

How to embed a Gist in Tumblr

September 9, 2012
By

Here, both for the sake of posterity, as well as example, is an embedded Gist that describes how to embed a Gist in Tumblr: https://gist.github.com/1395926

Read more »

Football predictions display

September 9, 2012
By
Football predictions display

Having looked at the football data earlier, I wanted to look at predictions for new games. This consists of two parts, getting a predictive model, predicting and displaying the predictions. I decided to do this backwards, first to make the displays. Th...

Read more »

Implementing the CountSummary Procedure

Implementing the CountSummary Procedure

In my last post, I described and demonstrated the CountSummary procedure to be included in the ExploringData package that I am in the process of developing.  This procedure generates a collection of graphical data summaries for a count data sequence, based on the distplot, Ord_plot, and Ord_estimate functions from the vcd package.  The distplot function generates both the Poissonness...

Read more »

RInside 0.2.8

September 8, 2012
By

This morning version 0.2.8 of RInside arrived on the CRAN sites. RInside provides a set of convenience classes which facilitate embedding of R inside of C++ applications and programs, using the classes and functions provided by the Rcpp R and C++ in...

Read more »

Using R to connect to a SQL Server and MySQL Database using MS Windows

September 8, 2012
By

Connecting to MySQL and Microsoft SQL Server Connecting to a MySQL database or MS SQL Server from the R environment can be extremely useful.  It allows a researcher direct access to the data without have to first export it from a database and then import it from a csv file or entering it directly into

Read more »

Violence along Mexico’s Southern Border and Central America

September 7, 2012
By
Violence along Mexico’s Southern Border and Central America

Rates for Panama and Nicaragua are from 2009, all other countries 2010. Municipalities which are part of a metro area in Mexico are shown with the metro area homicide rate.Visit the interactive map of homicides Having just posted on violence along Mexico's northern border, I figured it's time to...

Read more »

Big Issue with System Backtests

September 7, 2012
By
Big Issue with System Backtests

Almost always, when I see a system backtested, the backtest assumes a static portfolio with no contributions or withdrawals.  This assumption only covers an extremely limited subset of my clients.  Cash flows in and out of a portfolio or syst...

Read more »

In praise of ProjectTemplate for reproducible research

September 7, 2012
By
In praise of ProjectTemplate for reproducible research

As you might know from some of my previous posts, I’m a big fan of making my scientific work reproducible. My main reasons for being so keen on this are: 1. Reproducibility is key to science – if it can’t be reproduced then it can not be verified (that is, the experiment can’t be tried again

Read more »

Simulation metamodeling with GNU R

September 7, 2012
By
Simulation metamodeling with GNU R

I am one of the organizers of ESSA2013 conference that will take place in September 2013 in Warsaw, Poland. The conference scope is social simulation and in particular methods of statistical analysis of simulation output (metamodeling). As we have just issued Call for Papers for the conference so I decided to post a simple example of a metamodel.Recently I had...

Read more »

More on fixed and random effects: Plotting and interpreting

September 7, 2012
By
More on fixed and random effects: Plotting and interpreting

In a recent post I showed how plotting model fits can help to interpret higher-order polynomial terms. The key comparison there was between a model that did and did not have the higher order fixed effect terms. If you're going to use this strategy, you need to...

Read more »

Video: R, RStudio, Rcmdr & rattle

September 7, 2012
By

I did a screencast for my co-workers to show how to get started with R, specifically what a base installation of R looks like, then showing how to improve your workflow using RStudio, Rcmdr or rattle.  The examples are somewhat … Continue reading →Video: R, RStudio, Rcmdr & rattle is an article from randyzwitch.com, a...

Read more »

Coming up: Two weeks of awesome guest bloggers

September 7, 2012
By

I'm heading out on vacation to my Australian homeland for the next two weeks, but fear not reader friends: we have an awesome lineup of guest bloggers to fill in while I'm away. I don't want to ruin the surprise, but you'll be seeing posts from: experts in data visualization about how they use R; members of the R...

Read more »

ggplot2 0.9.2 has been released!

September 7, 2012
By
ggplot2 0.9.2 has been released!

The main changes in this version are to the theming system. There are also a number of enhancements to the theming system that make it easier to modify themes and we’ve renamed a number of functions to have more informative names. Your existing code should continue to work, although you may receive warnings about functions

Read more »

Conversion of Meucci’s MatLab Code

September 7, 2012
By
Conversion of Meucci’s MatLab Code

You might remember a second proposal I put forward for this summer’s Google Summer of Code (GSoC). This project was ambitious, looking to convert a subset of Attillio Meucci’s MatLab code to R. Thankfully, Brian Peterson took the lead mentor position for this particular project. Coincidently, the day before GSoC started we received a very

Read more »

That damn R-squared !

September 7, 2012
By
That damn R-squared !

Another post about the R-squared coefficient, and about why, after some years teaching econometrics, I still hate when students ask questions about it. Usually, it starts with "I have a _____ R-squared... isn't it too low ?" Please, feel free to fi...

Read more »

Topic Modeling 1: Simulated LDA Corpus

September 6, 2012
By

Because I am self-taught in many of the areas of computer science and more advanced statistics and probability theory I am most interested in, and because I have a deep aversion both to looking foolish and being full of it...

Read more »

Add Text Annotations to ggplot2 Faceted Plot (an easier approach)

September 6, 2012
By
Add Text Annotations to ggplot2 Faceted Plot (an easier approach)

I recently posted a blog about adding text to a ggplot2 faceted plot (LINK). I was unhappy with the amount of time it takes to create the text data frame to then label the plot. And then yesterday when the … Continue reading →

Read more »

RcppArmadillo 0.3.4.0

September 6, 2012
By

A new major released of Armadillo came out earlier today. I prepared the corresponding RcppArmadillo package 0.3.4.0 which also arrived on CRAN earlier today. This released contains a few performance improvements, the beginnings of support of sparse matrices and more, see below. We also post the NEWS entry for the beta release which was prepared, but not uploaded to CRAN to...

Read more »

Kickstarter facilitates $50M in indie game funding

September 6, 2012
By
Kickstarter facilitates $50M in indie game funding

The social crowdfunding site Kickstarter announced today that it has enabled, via community contributions, $50M in funding in 2012 for new indie games. The second largest category of funding was for independent films. The blog post announcing the news includes analysis (using R, natch) of the breakdown in categories and funding sources. On a personal note, two Kickstarter projects...

Read more »

New Book on Probability with R

September 6, 2012
By

My first book on probability is completed. HTML version is freely available at theanalysisofdata.com and print version coming soon at a discounted price. Also, check out chapters 4 and 5 (available in pdf format) of the upcoming second volume on R programming and R graphics. http://theanalysisofdata.com/probability/viewer1.html (single page viewer) http://theanalysisofdata.com/probability/viewer2.html (two page viewer) http://theanalysisofdata.com/probability/0/0-2.pdf (table

Read more »

Get Long-Term Climate Data from KNMI Climate Explorer

September 6, 2012
By
Get Long-Term Climate Data from KNMI Climate Explorer

You can query global climate data from the KNMI Climate Explorer (the KNMI is the Royal Netherlands Metereological Institute) with R.Here's a little example how I retreived data for my hometown Innsbruck, Austria and plotted annual total precipitation....

Read more »

Kickfollower launches!

September 6, 2012
By

Hi internet, we’re happy to be here. We’ll be covering a range of topics in this blog — we’ll talk to inventors and creative people as they launch their projects from Kickstarter and IndieGoGo, we’ll talk about the crowd-funding industry and where it’s headed, and we’ll dig into the details of pricing and success rates … Continue reading...

Read more »

In case you missed it: August 2012 Roundup

September 6, 2012
By

In case you missed them, here are some articles from June of particular interest to R users. RStan is a new package for Bayesian modeling with R. It's faster and can fit more highly-correlated models than the MCMC sampler of BUGS and JAGS. Biostatistician Corey Chivers used R to animate the epidemic-like growth of retailer Walmart in the US....

Read more »

The future of Artificial Intelligence – as imagined in 1989

September 6, 2012
By
The future of Artificial Intelligence – as imagined in 1989

This image comes from the cover of Preliminary Papers of the Second International Workshop on Artificial Intelligence and Statistics (1989). Someone abandoned it in the lobby of my building at school. Whatever for, I’ll never know. I just love the idea of machine learning/AI/Statistics evoking a robot hand drawing a best fit line through some

Read more »

Export R plot to Illustrator or Inkscape

September 6, 2012
By
Export R plot to Illustrator or Inkscape

Have you ever exported an R plot as a PDF and tried to edit it further by importing the PDF into a vector graphics program like Adobe Illustrator or Inkscape? What typically happens is the points on the plot get...

Read more »

BaselR meetup

September 6, 2012
By

Mango Solutions host BaselR, a free, open and informal R user group for those using or interested in using R. There will be a few short R related presentations and an opportunity to meet and chat with other R users over a drink. Date: Thursday 13th September at 6.30pm. Venue : transBarent, Viaduktstrasse 3, Basel (by the train station)...

Read more »

Inference and autoregressive processes

September 6, 2012
By
Inference and autoregressive processes

Consider a (stationary) autoregressive process, say of order 2, for some white noise with variance . Here is a code to generate such a process, > phi1=.5 > phi2=-.4 > sigma=1.5 > set.seed(1) > n=240 > WN=rnorm(n,sd=sigma) > ...

Read more »

Visually weighted/ Watercolor Plots, new variants: Please vote!

September 6, 2012
By
Visually weighted/ Watercolor Plots, new variants: Please vote!

Update Oct-23: Added a new parameter add to the function. Now multiple groups can be plotted in a single plot (see example in my comment) As a follow-up on my R implementation of Solomon’s watercolor plots, I made some improvements to the function. I fine-tuned the graphical parameters (the median smoother line now diminishes faster

Read more »