Steve Miller on R at Predictive Analytics World

February 26, 2010
By

At the Information Management blog, Steve Miller has provided two great reviews (here and here) of last week's Predictive Analytics World conference, including a recap of the Bay Area User's Group meeting featuring John Chambers. (My personal highlight from John's talk? A photograph of the very first sketch of what was to become the S system, which ultimately begat...

Read more »

Because it’s Friday: Visualizing an email chain

February 26, 2010
By
Because it’s Friday: Visualizing an email chain

We've all been there: someone sends an email to a mailing list with a Reply-To directing responses back to the mailing list. Before long, someone replies (unwittingly, to everyone) to ask to be taken of the list. And before long, the entire affair devolves into an endless cycle of requests to unsubscribe and pleas to stop mailing the entire...

Read more »

R tip: Finding the location of minimum and maximums

February 26, 2010
By

I can never remember this R command, so I am going to post it here which probably means I will always remember it and never have to look it up here again.I sometimes want to find the location of a minimum or maximum value in a vector, so I can look up the corresponding position in another vector, or...

Read more »

R tip: Finding the location of minimum and maximums

February 26, 2010
By

I can never remember this R command, so I am going to post it here which probably means I will always remember it and never have to look it up here again.I sometimes want to find the location of a minimum or maximum value in a vector, so I can look up the corresponding position in another vector, or...

Read more »

R and Sudoku solvers: Plus ca change…

February 25, 2010
By

Christian Robert blogged about a particularly heavy-handed solution to last Sunday's Sudoku puzzle in Le Monde. That had my symapthy as I like evolutionary computing methods, and his chart is rather pretty. From there, this spread on to the REvolutions blogs where David Smith riffed on it, and showed the acual puzzle. That didn't stop things as Christian blogged once more about...

Read more »

R and Sudoku solvers: Plus ca change…

February 25, 2010
By

Christian Robert blogged about a particularly heavy-handed solution to last Sunday's Sudoku puzzle in Le Monde. That had my symapthy as I like evolutionary computing methods, and his chart is rather pretty. From there, this spread on to the REvolutions...

Read more »

R and Sudoku solvers: Plus ca change…

February 25, 2010
By

Christian Robert blogged about a particularly heavy-handed solution to last Sunday's Sudoku puzzle in Le Monde. That had my symapthy as I like evolutionary computing methods, and his chart is rather pretty. From there, this spread on to the REvolutions blogs where David Smith riffed on it, and showed the acual puzzle. That didn't stop things as Christian blogged once more about...

Read more »

Welcome, Robin!

February 25, 2010
By
Welcome, Robin!

Robin Ryder started his new blog with his different solutions to Le Monde puzzle of last Saturday (about the algebraic sum of products…), solutions that are much more elegant than my pedestrian rendering. I particularly like the one based on the Jacobian of a matrix! (Robin is doing a postdoc in Dauphine and CREST—under my

Read more »

Responding to the Flowingdata GDP Graph Challenge

February 25, 2010
By
Responding to the Flowingdata GDP Graph Challenge

Nathan Yau of Flowingdata put up a challenge earlier today to improve upon a graph showing government spending as a percentage of GDP, published in the Economist. The underlying data wasn’t available. So I put on my graph-to-numbers glasses on and pulled out some data. Here it is in case you want to have a

Read more »

Nutritional supplements efficacy score – Graphing plots of current studies results (using R)

February 25, 2010
By
Nutritional supplements efficacy score – Graphing plots of current studies results (using R)

In this post I showcase a nice bar-plot and a balloon-plot listing recommended Nutritional supplements , according to how much evidence exists for thier benefits, scroll down to see it(and click here for the data behind it) * * * * The gorgeous blog “Information Is Beautiful” recently publish an eye candy post showing a “balloon race” image...

Read more »

Solving Sudoku with Simulated Annealing

February 25, 2010
By
Solving Sudoku with Simulated Annealing

How long would it take you to solve this devlishly hard Sudoku puzzle (from Le Monde)? You could do it the old-fashioned way -- with a pencil -- but Xi'an decided to solve it by programming a simulated annealing solver in R. The algorithm works by first guessing a solution at random -- filling in the empty cells above...

Read more »

inkblot: an alternative to stacked bar graphs

February 25, 2010
By
inkblot: an alternative to stacked bar graphs

Sometimes it is not easy to get useful information from a stacked bar chart, see for instance this blogpost at Support Analytics.So-called inkblot charts, as discussed at Kaiser Fung's Junk Charts, allow the reader to focus on the evolution of a time series.Now how to make this kind of charts with R? I asked on StackOverflow....

Read more »

Interaction plot from cell means

February 24, 2010
By
Interaction plot from cell means

I needed to produce a few a interaction plots for my book in R and, while the interaction.plot() function is useful it has a couple of drawbacks. First, the default output isn't very pretty. Second, it works from the raw data, whereas I often need plot...

Read more »

FFT (Fast Fourier Transform) of time series — promises and pitfalls towards trading

February 24, 2010
By
FFT (Fast Fourier Transform) of time series  — promises and pitfalls towards trading

Fig 1. FFT transformed time series (EBAY) reconstructed with first three and twenty harmonics, respectively.I see quite a few traders interested in advanced signal processing techniques. It is often instructive to see why they may or may not be useful....

Read more »

ggplot2: Plotting Dates, Hours and Minutes

February 24, 2010
By
ggplot2: Plotting Dates, Hours and Minutes

Plotting timeseries with dates on x-axis and times on y-axis can be a bit tricky in ggplot2. However, with a little trick this problem can be easily overcome. Let’s assume that I wanted to plot when the sun rises in London in 2010. sunriset function in maptools package calculates the sunrise times using algorithms provided

Read more »

PoRtable…

February 24, 2010
By
PoRtable…

Jobless as I might be, I do have some clients for data analysis. I try not to visit them in their office coz then things get really slow and time-consuming. When I can’t escape this, the worst thing is tuning data and software with client. So, I have a USB with portable versions of my

Read more »

Object types in R: The fundamentals

February 24, 2010
By

If you're a self-taught R programmer, you've probably grappled with the different kinds of objects you can use in the language. When should you use a list instead of a vector? What's the difference between a factor and character vector? These questions are easier to answer when you have some of the basics of R's object types down pat,...

Read more »

SoilWeb iPhone App: Beta-Testers?

February 23, 2010
By
SoilWeb iPhone App: Beta-Testers?

iPhone App Screenshot rev 0.2 - icon iphone App Screenshot rev 0.2 - in Fresno   More Updates: The application is now...

Read more »

Reminder: useR! 2010 abstracts due Monday

February 23, 2010
By

Don't forget, if you're planning to attend the R user conference useR! 2010 and are going to present a talk (and if not, why not?), abstracts are due for submission this coming Monday, March 1. That's also the deadline for early-bird registrations, so if you haven't registered yet, now is the time. useR! 2010: The R User Conference

Read more »

Numerical Integration/Differentiation in R: FTIR Spectra

February 23, 2010
By
Numerical Integration/Differentiation in R: FTIR Spectra

  Stumbled upon an excellent example of how to perform numerical integration in R. Below is an example of piece-wise linear and spline fits to FTIR data, and the resulting computed area under the curve. With a high density of points, it seems like the linear approximation is most efficient and sufficiently accurate. With very large...

Read more »

Slides from “R Productivity Environment” webinar

February 23, 2010
By

Thanks to everyone who attended for the great turnout at this morning's live webinar, 7 Ways to Increase your R Productivity. I really appreciate all the feedback and questions, seems like a lot of people are interested in a code editing and debugging environment for R. If you missed the webinar and want to learn about REvolution R Enterprise...

Read more »

Happy Birthday GGD! The 10 Most Popular Posts Since GGD’s Launch

February 23, 2010
By

The first post on Getting Genetics Done was one year ago today. To celebrate, here are the top 10 most viewed posts since GGD launched last year. Incidentally, nine of the ten are tutorials on how to do something in R. Thanks to all the readers and all...

Read more »

Getting Started with Sweave: R, LaTeX, Eclipse, StatET, & TeXlipse

February 23, 2010
By
Getting Started with Sweave: R, LaTeX, Eclipse, StatET, & TeXlipse

Being able to press a single button that runs all your statistical analyses and integrates the output into your final report is a beautiful thing. If you have not already heard, this is what Sweave can do for you. However, getting your computer to run ...

Read more »

Getting Started with Sweave: R, LaTeX, Eclipse, StatET, & TeXlipse

February 23, 2010
By
Getting Started with Sweave: R, LaTeX, Eclipse, StatET, & TeXlipse

Being able to press a single button that runs all your statistical analyses and integrates the output into your final report is a beautiful thing. If you have not already heard, this is what Sweave can do for you. However, getting your computer to run ...

Read more »

Mexico’s Economy

February 22, 2010
By
Mexico’s Economy

Yesterday the INEGI released the GDP figures for 2009, and since it was an annus horribilis for Mexico, I thought I'd put up a couple of charts. Looking through the Banco de Información Económica I found two series of historical seasonally adjusted GDP data available:GDP in 1993 pesos going from 1980 to 2007 GDP in 2003 pesos going...

Read more »

Mexico’s Economy

February 22, 2010
By
Mexico’s Economy

Yesterday the INEGI released the GDP figures for 2009, and since it was an annus horribilis for Mexico, I thought I'd put up a couple of charts. Looking through the Banco de Información Económica I found two series of historical seasonally adjusted GDP data available: GDP in 1993 pesos going from 1980 to 2007 GDP in 2003 pesos going...

Read more »

Time Series Calendar Heat Maps Using R

February 22, 2010
By
Time Series Calendar Heat Maps Using R

I came across an interesting blog that showcased Charting time series as calendar heat maps in R . It is based upon a great algorithm created by Paul Bleicher,CMO of Humedica. I'll let you link to the other blog to see more details on the background ...

Read more »

A quicky..

February 22, 2010
By

If you’re (and you should) interested in principal components then take a good look at this. The linked post will take you by hand to do everything from scratch. If you’re not in the mood then the dollowing R functions will help you. An example. # Generates sample matrix of five discrete clusters that have

Read more »

Sudoku via simulated annealing

February 22, 2010
By
Sudoku via simulated annealing

The Sudoku puzzle in this Sunday edition of Le Monde was horrendously difficult, so after spending one hour with only 4 entries filled, I decided to feed it to the simulated annealing R program I wrote while visiting SAMSI last year. The R program reached the exact (and only) solution in about 6000 iterations, as

Read more »