2009

India Census 2001 – Part 1

March 22, 2009 | anandram

I was trying – for the last few weeks – to get the 2001 Indian census data. Alas, the census website is under construction. But fortunately the Internet rewind button works! Thankfully the literacy data was online there. The raw data is available here. I cleaned up the data so that it is ... [Read more...]

Dianne Reeves at Dominican

March 16, 2009 | Thinking inside the box

Yesterday afternoon, we had another chance to see Dianne Reeves (wikipedia). This time, it almost felt like she came to us as she was headlining at the annual trustee benefit concert at Dominican University, a small college about a mile from our place. And as in 2007 and 2003, she did not ... [Read more...]

Identify Data Points in Off-Screen R Graphics Devices

March 16, 2009 | Yihui Xie

Today Ruya Gokhan Kocer asked me how to use the R function identify() in off-screen graphics devices. Actually it’s pretty easy as long as we obtain the list returned by identify(pos = TRUE). For example,

    # open a windows device
    x11()
    x = rnorm(20)
    y = rnorm(20)
    plot(x, y)
    # identify 5 points
    ... [Read more...]

Color: The Cinderella of dataviz

March 13, 2009 | mike

“Avoiding catastrophe becomes the first principle in bringing color to information: Above all, do no harm.”  — Envisioning Information, Edward Tufte, Graphics Press, 1990    Color is one of the most abused and neglected tools in data visualization. It is abused when we make poor color choices; it is neglected when we rely ... [Read more...]

Visualization of correlation matrix

March 12, 2009 | Cloud Wei

Color Image

    data(mtcars)
    fit = lm(mpg ~ ., mtcars)
    cor = summary(fit, correlation = TRUE)$correlation
    cor2 = t(cor[11:1, ])
    colors = c("#A50F15", "#DE2D26", "#FB6A4A", "#FCAE91", "#FEE5D9",
               "white", "#EFF3FF", "#BDD7E7", "#6BAED6", "#3182BD", "#08519C")
    image(1:11, 1:11, cor2, axes = FALSE, ann = F, col = colors)
    text(rep(1:11, 11), rep(1:11, each = 11), round(100 * cor2))

Ellipses

    library(... [Read more...]

no “Infinities”

March 12, 2009 | Paolo

Thanks to Pierre-Yves for the useful tip below! If you have a dataset from which you want the max or min, but they have to be real numbers and not "Inf" or "-Inf", there is a way to do it: data ... [Read more...]
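
The excerpt above is cut off before the tip itself, so the following is only a minimal sketch of one common way to do this in base R, not necessarily the approach from the post. The vector `x` is a hypothetical example; `is.finite()` and the `finite` argument of `range()` are standard base R.

```r
# Hypothetical example vector (an assumption, not data from the original post)
x <- c(3.2, Inf, -1.5, -Inf, 7.8, NA)

# is.finite() is FALSE for Inf, -Inf, NA and NaN,
# so a single filter drops all of them
finite_x <- x[is.finite(x)]

max(finite_x)  # largest finite value
min(finite_x)  # smallest finite value

# range() can do the same filtering itself
finite_range <- range(x, finite = TRUE)
```

Filtering once and reusing `finite_x` keeps the intent visible; `range(x, finite = TRUE)` is a shorter alternative when both extremes are needed.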

Andrews’ Curve And Parallel Coordinate Graph

March 11, 2009 | Cloud Wei

Unison graph and parallel coordinate graph share a similar idea for visualising differences in multidimensional data, though the former is much more complicated. Based on the iris data, we can see their performance. Parallel coordinate graph Andrews' Cur... [Read more...]

Scatterplots

March 11, 2009 | Cloud Wei

There are many types of scatterplots in R; here are some examples based on the famous Iris data. pairs() and coplot() in package graphics. gpairs() in package YaleToolkit. scatterplot.matrix() or spm() in package car. splom() in package lattice. [Read more...]
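
As a rough illustration of the first two functions in that list, here is a small sketch using only base R and the built-in `iris` data. The colour mapping `species_col` is an assumption added for readability and is not taken from the original post.

```r
# One integer colour code per flower, derived from the Species factor
# (hypothetical helper variable, not from the original post)
species_col <- as.integer(iris$Species)

# Scatterplot matrix of the four numeric measurements
pairs(iris[, 1:4], col = species_col)

# Conditioning plot: sepal length against sepal width, one panel per species
coplot(Sepal.Length ~ Sepal.Width | Species, data = iris)
```

The other functions mentioned (gpairs(), spm(), splom()) live in add-on packages and are left out here so that the sketch runs on a plain R installation.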

Choosing an SQL Engine for Analytics

March 9, 2009 | JD Long

I’ve been struggling for a while on which database to use for my working data. I used to use MS Access quite a lot. The problems with MS Access include but are not limited to: 2 GB file size limit, at least historically; versions change with each edition of MS ... [Read more...]

Repeated Measures ANOVA using R

March 9, 2009 | Paul Gribble

While so-called “between-subjects” ANOVA is absolutely straightforward in R, performing repeated measures (within-subjects) ANOVA is not so obvious. I have come across at least three different ways of performing repeated measures ANOVA in R. Which method you use depends on … [Read more...]

NREGA and Indian maps in R

March 8, 2009 | anandram

A few days ago I was reading an article by Jean Drèze and his colleagues on how the first two years of the National Rural Employment Guarantee Act (NREGA) have progressed (there was another article by Drèze on NREGA in 2007). The NREGA is empowering the rural people in a ... [Read more...]

Coimbatore Weather and Questioning Amma!

March 8, 2009 | anandram

A week ago, Amma was telling me the weather was getting hot in Coimbatore. I was telling her it is going to get worse in the next two months. She shot back saying that March is the hottest month, while April and May are less hot in Coimbatore. Growing up in ... [Read more...]

Dealing with missing values

March 8, 2009 | Paolo

Two new quick tips from 'almost regular' contributor Jason: Handling missing values in R can be tricky. Let's say you have a table with missing values you'd like to read from disk. Reading in the table with read.table( fileName ) might fail. If ... [Read more...]
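
The excerpt stops before Jason's actual tips, so what follows is only a hedged sketch of a typical way to read a table with missing values in base R. The inline table `raw`, its column names, and the "-999" sentinel are all hypothetical; `read.table()` and its `text` and `na.strings` arguments are standard.

```r
# Hypothetical comma-separated table: one empty field and one "-999" sentinel
raw <- "id,height,weight
1,170,65
2,,72
3,182,-999"

# na.strings lists every spelling that should be read as NA
dat <- read.table(text = raw, header = TRUE, sep = ",",
                  na.strings = c("NA", "", "-999"))

# Count missing values per column
na_counts <- colSums(is.na(dat))

# Most summary functions need na.rm = TRUE to skip missing values
mean_height <- mean(dat$height, na.rm = TRUE)

# Keep only the rows without any missing value
complete_rows <- dat[complete.cases(dat), ]
```

When reading from disk, the `text = raw` argument would be replaced by the file name; the `na.strings` handling stays the same.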

So here we have our 1st problem…

March 7, 2009 | the R user...

Hey all of you, I've got such an interesting problem: a friend of mine is modelling something using Bayesian statistics and she has an equation system to solve, but she's stuck, and she showed me such a long script to try to solve it with numeric approx... [Read more...]