One of the neat tools available via a variety of packages in R is the creation of beautiful tables using data frames stored in R. In what follows, I’ll discuss these different options using data on departing flights from Seattle …

Introduction: The objective of this blog post is to help you get started with the Apache Zeppelin notebook for your R data science requirements. Zeppelin is a web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with Scala (with Apache Spark), Python (with Apache Spark), SparkSQL, Hive, Markdown, Shell and more.…

Previously in this series: Understanding the beta distribution (using baseball statistics); Understanding empirical Bayes estimation (using baseball statistics). In my last post, I explained the method of empirical Bayes estimation, a way to calculate useful proportions out of many pairs of success/total counts (e.g. 0/1, 3/10, 235/1000). I used the example of estimating baseball batting averages based...

Extending genomation to work with paired-end BAM files; Accelerate functions responsible for reading genomic files; Parallelizing data processing in ScoreMatrixList; Arithmetic, indicator and logic operations as well as subsetting work on ScoreMatrix objects; Improvements and new arguments in visualization functions; Faster...

Background: I was honoured by a site visit by Drs. Yeo-Min Yun and Junghan Song of the Korean Society for Clinical Chemistry a few weeks ago. As both professors are on the organizing committee of the Cherry Blossom Symposium for Lab Automation in Seoul in Spring 2016, their primary motivation for visiting was to discuss …

In case you missed them, here are some articles from September of particular interest to R users. A tutorial on using R with Jupyter Notebooks and how to control the size of R graphics therein. A new version of Revolution R Open is available, featuring multi-threaded computing for R 3.2.2. One benefit of fitting statistical models to large data...

A long time ago I wondered whether there was an R package for chess. Python, JS, Java, and C have chess libraries, so why not R? But then the htmlwidgets and V8 packages were born, and now the possibilities are almost endless. Anyone can make a wrapper for a JS library :D. And this is how rchess came about. The rchess package is a...

The jobs: Within the New Zealand Ministry of Business, Innovation and Employment, the Sector Trends team has recently secured resourcing for additional analysts on a range of statistical programmes. That’s the team that I usually manage, although for the next few months I’m doing a stint on a similar team, different topics. The formal details and position descriptions...

The other day, I picked up the latest copy of the CAS’ journal, Variance, and skipped to the back where Leigh Halliwell had an article. I hope that I’m well on record as being one of his biggest fans, but if not, let me remedy that now. Leigh Halliwell has done really tremendous stuff. He’s mathematically sophisticated, but addresses...
