A Graphical Approach to Showing the Result of Classification Models

June 4, 2013
By
A Graphical Approach to Showing the Result of Classification Models

This is one of my favorite charts, it easily allows one to see how many predictions are right, and it allows one to see where the wrong ones are as well. It is the equivalent of a confusion matrix, but sometimes a picture is worth a thousand words. Some sample code is included below.  

Read more »

Veterinary Epidemiologic Research: Modelling Survival Data – Semi-Parametric Analyses

June 4, 2013
By
Veterinary Epidemiologic Research: Modelling Survival Data – Semi-Parametric Analyses

Next on modelling survival data from Veterinary Epidemiologic Research: semi-parametric analyses. With non-parametric analyses, we could only evaluate the effect one or a small number of variables. To evaluate multiple explanatory variables, we analyze data with a proportional hazards model, the Cox regression. The functional form of the baseline hazard is not specified, which make

Read more »

How old is the oldest person you know?

June 4, 2013
By
How old is the oldest person you know?

Last week, we had a discussion with some colleagues about the fact that – in order to prepare for the SOA exams – we did not have time (so far) to mention results on extreme values in our actuarial program. I did gave an introduction in my nonlife actuarial models class, but it was only an introduction, in three...

Read more »

Webinar: Managing Data with R

June 4, 2013
By
Webinar: Managing Data with R

Before you can analyze data, it must be in the right form. Join Revolution Analytics and me this June 21st for a 4-hour webinar that shows how to perform the most commonly used data management tasks in R. We will work through … Continue reading →

Read more »

Collecting geocoded tweets with R and Java

June 4, 2013
By
Collecting geocoded tweets with R and Java

Number of tweets in different languages posted around GermanyThere are many thing one can do with tweets (sentiment analysis, maps, ...). This entry shows you how you can access the publicly available API using Java and how to analyse the data using R....

Read more »

IntR – Interactive GUI for performing geostatistical analysis in R

June 4, 2013
By
IntR – Interactive GUI for performing geostatistical analysis in R

In 2011 I presented at the UseR conference, held in Warwick (UK), a piece of software for easing the learning curve of R for geostatistical analysis. It is a very simple attempt to create an interactive interface in R, by using Python as GUI. F...

Read more »

PluginR v0.80 released & 2 new trainings in July 2013

June 4, 2013
By
PluginR v0.80 released & 2 new trainings in July 2013

I'm pleased to announce that the new version 0.80 of PluginR has just been released (as usual, available as a Tiki Mod) with a few minor bugs fixed, and a couple of new interesting features added: PluginR now uses a caching mechanism extending the ...

Read more »

PluginR v0.80 released & 2 new trainings in July 2013

June 4, 2013
By
PluginR v0.80 released & 2 new trainings in July 2013

I'm pleased to announce that the new version 0.80 of PluginR has just been released (as usual, available as a Tiki Mod) with a few minor bugs fixed, and a couple of new interesting features added: PluginR now uses a caching mechanism extending the ...

Read more »

UseR 2013

June 4, 2013
By
UseR 2013

Although the programme is quite interesting, I am not really involved in this conference (in fact I'm not even going $-$ even though, sometimes, I think it would be nice to spend the whole summer away at conferences!).But Vir has just put this pic...

Read more »

Value at Risk and Expected Shortfall, and other upcoming events

June 4, 2013
By
Value at Risk and Expected Shortfall, and other upcoming events

Highlighted Value at Risk and Expected Shortfall A two-day course exploring Value at Risk and Expected Shortfall, and their role in risk management. 2013 June 25 & 26, London. Lead by Patrick Burns. Details at the CFP Events site. New Events Thalesians — San Francisco 2013 June 5. Jesse Davis on “Risk Model Imposed Manager-to-Manager … Continue reading...

Read more »

Interactive slides with googleVis on shiny

June 4, 2013
By
Interactive slides with googleVis on shiny

Following on from last week's post, here are my slides on using googleVis on shiny from the Advanced R workshop at Lancaster University, 21 May 2013. googleVis on shiny Again, I wrote my slides in RMarkdown and I used slidify to create the HTML5 presentation. Unfortunately...

Read more »

A new Sudoku Solver in R. Part 1

June 3, 2013
By
A new Sudoku Solver in R. Part 1

Sudoku is nowadays probably the most widespread puzzle game in the world. As such, it has an interesting variety of solving techniques, not just with paper and pencil but also with computers. Of course, I am not the first one … Continue reading →

Read more »

Using the Ensembl Variant Effect Predictor with your 23andme data

June 3, 2013
By
Using the Ensembl Variant Effect Predictor with your 23andme data

I subscribe to the Ensembl blog and found, in my feed reader this morning, a post which linked to the Variant Effect Predictor (VEP). The original blog post, strangely, has disappeared. Not to worry: so, the VEP takes genotyping data in one of several formats, compares it with the Ensembl variation + core databases and

Read more »

Plotting Oracle RMAN backup durations with R

June 3, 2013
By
Plotting Oracle RMAN backup durations with R

  How long does your Oracle RMAN backup take to complete?  How does this vary over time?  Are there patterns by week, week of month, or day of week?   The gist below can help you evaluate questions like these.… Read more ›

Read more »

random sudokus

June 3, 2013
By
random sudokus

In a paper arXived on Friday, Roberto Fontana relates the generation of Sudoku grids to the one of Latin squares (which is unsurprising) and to maximum cliques of a graph (more surprising). The generation of a random Latin square proceeds in three steps: generate a random Latin square L with identity permutation matrix on symbol

Read more »

How to set up a reproducible R project

June 3, 2013
By

If you're thinking about starting a project (for example, a report or paper) using the R language for analysis, the Nice R code blog has some great advice. Following the principles of reproducible research, Macquarie University postdocs Rich FitzJohn and Daniel Falster suggest: Creating a directory structure to separate R code, data, reports, and output Treating data as read-only...

Read more »

Creating a zoomable map of tweets with R

June 3, 2013
By
Creating a zoomable map of tweets with R

Languages tweeted around Germany: red, blue, green, yellow, grey are for German, French, English, Dutch and other  respectively. See here for a zoomable version.Motivated by the project twitter languages of New York I wanted to...

Read more »

Understanding the value of Predictive Analytics on Web Data

June 3, 2013
By
Understanding the value of Predictive Analytics on Web Data

In this blogpost, I will be talking briefly about Predictive Analytics and why it holds value from a web analytics perspective. Broadly speaking, Predictive Analytics is a set of methodologies that assist us in anticipating customer behavior. The customer behavior of interest could be anything ranging from spend, buying habits, page views, response to a

Read more »

Creating Jekyll blog posts from R.

June 3, 2013
By
Creating Jekyll blog posts from R.

Adam Duncan Also avilable on R-bloggers.com Setting up a Jekyll/Jekyll Bootstrap blog site is a very worthwhile experience. Should you choose to use Jekyll as your blogging platform, you will find many resources out there describing the setup process. This post is not about getting set up using Jekyll or Jekyll Bootstrap. It’s about establishing a good workflow...

Read more »

A Few Tips for Writing an R Book

June 3, 2013
By
A Few Tips for Writing an R Book

I just finished fixing (hopefully all) the problems in the knitr book returned from the copy editor. David Smith has kindly announced this book before I do. I do not have much to say about this book: almost everything in the book can be found in the on...

Read more »

Chicken or the Egg? Granger-Causality for the masses

June 2, 2013
By
Chicken or the Egg?  Granger-Causality for the masses

When I first learned about Granger-causality this past February, I was bemused and quite skeptical of the whole procedure.  I felt it belonged on the scrapheap of impractical academic endeavors, preferring to possibly use an ARIMA transfer function model for the same task.  However, several contemporaries threw the red challenge flag and upon further review, my initial impressions have...

Read more »

Win Your Fantasy Football Auction Draft: Calculate the Optimal Players to Draft with this Shiny App in R

June 2, 2013
By

In this post, I use a Shiny app in R to determine the best possible players to pick in a fantasy football auction draft.  The app takes projections from FantasyPros, a site that averages across numerous sources of projections.  Based on your ...

Read more »

Cosmopolitan Public Spaces

June 2, 2013
By
Cosmopolitan Public Spaces

In my PhD and post-doc research projects at the university, I did a lot of research on the new cosmopolitanism together with Ulrich Beck. Our main goal was to test the hypothesis of an “empirical cosmopolitanization”. Maybe the term is confusing and too abstract, but what we were looking for were quite simple examples

Read more »

Facet wrapping multivariate data: reshape and ggplot

June 2, 2013
By
Facet wrapping multivariate data: reshape and ggplot

A common problem when trying to show data is that the attributes that you want to map for comparison are stored in multiple rather than single variables. For example, proportion of employment by type. This practical will achieve tis using … Continue reading →

Read more »

Using R: drawing several regression lines with ggplot2

June 2, 2013
By
Using R: drawing several regression lines with ggplot2

Occasionally I find myself wanting to draw several regression lines on the same plot, and of course ggplot2 has convenient facilities for this. As usual, don’t expect anything profound from this post, just a quick tip! There are several reasons we might end up with a table of  regression coefficients connecting two variables in different

Read more »

Cars in Netherlands

June 2, 2013
By
Cars in Netherlands

I am looking for a new car. So when I saw there was an update on vehicles in Statistics Netherlands I just had to go and look at the data. So, I learned the brown is getting more popular, often the number of cars from a certain construction year is lar...

Read more »

Grid Search for Free Parameters with Parallel Computing

June 1, 2013
By
Grid Search for Free Parameters with Parallel Computing

In my previous post (http://statcompute.wordpress.com/2013/05/25/test-drive-of-parallel-computing-with-r) on 05/25/2013, I’ve demonstrated the power of parallel computing with various R packages. However, in the real world, it is not straight-forward to utilize these powerful tools in our day-by-day computing tasks without carefully formulate the problem. In the example below, I am going to show how to use the

Read more »

Mapping a Revolution

June 1, 2013
By
Mapping a Revolution

Twitter has become an important communications tool for political protests. While mass media are often censored during large-scale political protests, Social Media channels remain relatively open and can be used to tell the world what is happening and to mobilize support all over the world. From an analytic perspective tweets with geo information are

Read more »

Loading Historical Stock Data

June 1, 2013
By
Loading Historical Stock Data

Historical Stock Data is critical for testing your investment strategies. I illustrated all my back-test examples with getSymbols function from quantmod package. For example, following is a back-test comparison for a few portfolio allocation methods: The getSymbols function, from quantmod package, downloads historical stock prices from Yahoo Fiance. I often get questions about alternative ways

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.