Blog Archives

useR 2015: Computational

July 1, 2015

These are my initial notes from useR 2015. I will/may revise when I have time. Computational Performance; Chair: Dirk Eddelbuettel. Running R+Hadoop using Docker Containers (E. James Harner). Introduction. Big data architectures: HDFS/Hadoop: software framework for distributed storage and distributed processing; Tachyon/Spark: uses in-memory Rc2 server (R cloud computing). Has an editor & output panel.


useR 2015: Networks

July 1, 2015

These are my initial notes from useR 2015. Will revise when I have time. fbRads: Analyzing and managing Facebook ads from R (Gergely Daroczi). Modern advertising: Google/Amazon/Facebook use our information. Ad platforms: Google: RAdwords, Facebook likes: fbRads. You can use the Facebook API to get information from Facebook. Get hashes of email address, not the


useR 2015: Romain Francois: My R adventures

July 1, 2015

Using R since 2002 and has been working on Rcpp, Rcpp11, Rcpp14 and dplyr internals. Worked on a number of big projects: 2005 he set up the R Graph Gallery; 2009 worked on rJava; 2010 Rcpp; 2013 dplyr. Key themes are performance and usability. rJava 0.7-*: creating objects was messy: d <-jnew("java/lang/Double", 42 .jcal(d, "D",


Standardising Function Names in R

March 31, 2015

The renamer Package. Tired of the disparate naming systems in R? Then this is the package for you. Installing the package: the package is located in my drat. To install: install.packages("renamer", repos="http://csgillespie.github.io/drat", type="source") or, if you have drat installed: drat::addRepo("csgillespie") install.packages("renamer", type="source") The source is available on my GitHub page. Example: The CamelCaseR. If you have an


Analysing time course microarray data using Bioconductor: a case study using yeast2 Affymetrix arrays

July 13, 2012

A few years ago I was involved in analysing some time-course microarray data. Our biological collaborators were interested in how we analysed their data, so this led to the creation of a tutorial, which in turn led to a paper. When we submitted the paper, one of the referees “suggested” that we write the paper using Sweave;


UK R Courses – 2012

September 17, 2011

The School of Mathematics & Statistics at Newcastle University (UK) is again running some R courses. In January 2012, we will run: January 16th: Introduction to R; January 17th: Programming with R; January 18th & 19th: Advanced graphics with R. The courses aren’t aimed at teaching statistics, rather they aim to go through the fundamental


Development of R (useR! 2011)

August 19, 2011

Michael Rutter – R for Ubuntu. Ubuntu 10.10 uses 2.10.1. Backports are newer versions of software for old releases. R backports are available on CRAN (link). Launchpad is a website for users to develop and maintain software (Canonical). One of Launchpad’s services is the personal package archive (PPA). This allows users to upload .deb source files, allowing


Simon Urbanek – R Graphics: supercharged

August 18, 2011

New features: rasterImage() (R2.11): bitmap raster drawing; have maps as data backdrops. Polygons with holes: polypath() (R2.12). At present there is no way to tell when to actually show the plot. For example: plot(x); lines(x). Should we display the plot after plot or after lines? Solution: dev.hold() and dev.flush(). Better performance and useful for animations –

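As a rough illustration of the dev.hold() / dev.flush() idea mentioned in the excerpt above, here is a minimal sketch. The helper names draw_frame() and animate() are hypothetical and not from the talk; the sketch only assumes the base grDevices functions dev.hold() and dev.flush(), which do nothing on devices that do not support holding output.

```r
# Hypothetical helper (not from the talk): draw points and a line as one update.
draw_frame <- function(x) {
  dev.hold()             # ask the device to hold screen updates
  on.exit(dev.flush())   # release the hold, even if plotting fails
  plot(x, type = "p")    # without the hold, the device could redraw here ...
  lines(x)               # ... and again here
  invisible(length(x))
}

# Hypothetical animation loop: each frame is shown only once it is complete.
animate <- function(n_frames = 5, n_points = 20) {
  frames_drawn <- 0L
  for (i in seq_len(n_frames)) {
    draw_frame(cumsum(rnorm(n_points)))
    frames_drawn <- frames_drawn + 1L
  }
  frames_drawn
}
```

Calling animate() on an interactive device that supports holding should show each random walk as one complete frame rather than in two steps.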

Kaleidoscope IIIb (useR! 2011)

August 18, 2011

O. Mersmann - The microbenchmark package. Slides and code (link). SURGEON GENERAL’s WARNING: Microbenchmarks can lead to a distorted view of reality and massive loss of productivity. For a higher-order benchmarking package check out the rbenchmark package on R (suggestion from the speaker). Why do we need micro-benchmarking? A simple example showed that it is currently very


Big data (useR! 2011)

August 18, 2011

Unfortunately, I missed the first and last talks. My notes from a session on Thursday morning. J. Demmler – Challenges of working with a large database of routinely collected health data. The SAIL data bank holds over 1.9 billion (anonymous) entries. To use the data for research, they need to ensure that proper data security is
