3972 search results for "git"

Clone all your gists locally with R

January 2, 2013
By

I really like gists as a quick way to include more lengthly code snippets into my blog posts. However, I am not a git user as such, and so I was quite concerned when I noticed that all my gists on this blog had vanished after Christmas. I suppose this was a result of Github's downtime...

Read more »

Polarisation and Mobilisation indicators

January 1, 2013
By
Polarisation and Mobilisation indicators

This blog post makes available a set of indicators discussed in a forthcoming edition of Digital Icons. In brief, the script takes a text input and calculates polarisation and mobilisation indexes based on the number of pronouns featured.The hypothesised relationship between pronouns and polarisation is one discussed extensively by critical discourse analysts, social...

Read more »

Tips for R Package Creation

December 30, 2012
By
Tips for R Package Creation

I’m being tortured by the mistakes of my past self. I think I’ve made most every mistake possible in creating a package and I want to go back in time and tell year ago me all I know now. But … Continue reading →

Read more »

Misusage of the new shiny package: A nerdy drink tracker for your next party

December 30, 2012
By
Misusage of the new shiny package: A nerdy drink tracker for your next party

Currently a lot of people are talking about the new shiny package. So I got curious and built an own, more or less useful app: A drink trackerThis app can be used to track how much someone drank and therefore it is very useful for every party, especial...

Read more »

National idenftification number: Finland

December 30, 2012
By

The Finnish Social Security number (FSSn) is a common variable in a Finnish population based study. Within FSSn are individuals birthday, and gender. We can also check if the FSSn correct because it has a check digit. If the data doesn't have birthday ...

Read more »

Update to Graphing Non-Proportional Hazards in R

December 30, 2012
By
Update to Graphing Non-Proportional Hazards in R

Update 1 February 2013: I've moved all of the functionality described in this post into an R package called simtvc. Have a look. It is much easier to use. This is a quick update for a previous post on Graphing Non-Proportional Hazards in R. In the previous post I showed how to simulate and graph 1,000...

Read more »

Integration of R, RStudio and Hadoop in a VirtualBox Cloudera Demo VM on Mac OS X

December 29, 2012
By
Integration of R, RStudio and Hadoop in a VirtualBox Cloudera Demo VM on Mac OS X

MotivationI was inspired by Revolution's blog and step-by-step tutorial from Jeffrey Breen on the set up of a local virtual instance of Hadoop with R. However, this tutorial describes the implementation using VMware's application. One downside to using VMware is that it's not free. I know most of the people including me like to hear the words open-source and free,...

Read more »

Row-wise summary curves in faceted ggplot2 figures

December 29, 2012
By
Row-wise summary curves in faceted ggplot2 figures

I really enjoy reading the Junk Charts blog.  A recent post made me wonder how easy it would be to add summary curves for small-multiple type plots, assuming the “small multiples” to summarize were the X component of a ggplot2::facet_grid(Y ~ X) … Continue reading →

Read more »

High-Dimensional Microarray Data Sets in R for Machine Learning

December 29, 2012
By

Much of my research in machine learning is aimed at small-sample, high-dimensional bioinformatics data sets. For instance, here is a paper of mine on the topic. A large number of papers proposing new machine-learning methods that target high-dimensional data use the same two data sets and consider few others. These data sets are the 1) Alon colon cancer...

Read more »

Men who stare at needles

December 29, 2012
By

Buffon's needle problem is a question first posed in the 18th century by Georges-Louis Leclerc, Comte de Buffon:What is the probability that a needle thrown at a lined sheet of paper will cross a line?This problem can be used to estimate π. If we set the nail size and the line distance = 1, the estimator can be calculated...

Read more »