### A Data Cleaning Example

September 27, 2015

For this particular example, the variables of interest are stored as key:value pairs, and a single data cell could contain an unknown number of key:value pairs. The objective is to separate these key-value pairs and store the values in co... [Read more...]

### A Data Cleaning Example

September 27, 2015

For this particular example, the variables of interest are stored as key:value pairs, and a single data cell could contain an unknown number of key:value pairs. Basically, we want to convert the input dataset on the left-hand side to the output dataset on the right-hand side, as illustrated in the graphic below ...

### Survival Analysis – 2

August 13, 2015

In my previous post, I went over the basics of survival analysis, including computing the Kaplan-Meier estimate for given time-to-event data. In this post, I'm exploring Cox's proportional hazards model for survival data. The KM estimator helps in figuring out whether survival function estimates for different groups are the same or ... [Read more...]

### Survival Analysis – 1

August 2, 2015

I was recently looking for methods to apply to time-to-event data and started exploring survival analysis models. In this post, I'm exploring the basic KM estimator, which is a nonparametric estimator of the survival function, using a real dataset (on time t... [Read more...]

### Survival Analysis – 1

August 2, 2015

I was recently looking for methods to apply to time-to-event data and started exploring survival analysis models. In this post, I'm exploring the basic KM estimator. It is a nonparametric estimator of the survival function. There are a couple of instances when the KM estimator comes in handy - When the survival ... [Read more...]

### ogdindiar: R package to easily access Open Government Data from India Portal

June 8, 2015

Following up on my earlier posts on accessing Open Government Data from R, I've wrapped this code into an R package - ogdindiar. It's available on GitHub at https://github.com/steadyfish/ogdindiar. It provides one simple function, fetch_data(), to download the required data resource from the https://data.gov.... [Read more...]

### dplyr Use Cases: Non-Interactive Mode

March 16, 2015

The current release of dplyr (v 0.4.1) offers a lot more flexibility regarding the use of important verbs in non-interactive mode. In this post, I'm exploring different possible use cases. group_by_, select_, rename_: For group_by_, select_ and rename_,... [Read more...]

### Accessing Open Data Portal (India) using APIs

April 15, 2014

As I mentioned in my previous blog post, the Government of India has started an Open Data Portal for making various data public. Most of the datasets on the portal are available for manual download. Some of the datasets, though, are also available to be a... [Read more...]

### Know India through Visualisations – 1

February 1, 2014

I'm going to produce just a couple of charts, a teaser of sorts, in this post. In the forthcoming posts I'll dig deeper. I was amazed by the existing list of R packages for working with spatial data, without needing to get into much of the technical detail...