20 (!) new R jobs (for 2015-10-21)
<img src=' [Read more...]
How well can you scale your strategy?
This post will deal with a quick, finger in the air way of seeing how well a strategy scales–namely, how … Continue reading →

Data Mining with R: Generate knowledge from data with the eoda R-Academy
The course “Data Mining with R”, which takes place from 9th to 10th November 2015 in Kassel, Germany, teaches the most important methods to expose correlations in data and inherent patterns. The wide set of methods can be used for almost every application area. From customer segmentation to timely recognition of ... [Read more...]
use simplify to remove redundancy of enriched GO terms
To simplify enriched GO result, we can use slim version of GO and use enricher function to analyze.
Another strategy is to use GOSemSim to calculate similarity of GO terms and remove those highly similar terms by keeping one representative term. To make this feature available to clusterProfiler users, I ...

Novelty: an update
A recent tweet: @neilfws I enjoyed this: https://t.co/ynyHRbgpLN Have you published (or are you thinking about publishing) this analysis anywhere? — Marcus Munafo (@MarcusMunafo) October 7, 2015 made me think (1) has it really been 5 years, (2) gee, my ggplot skills were dreadful back then and (3) did I really not know how ...

Warsaw R-Users Group Meeting #12
After summer holidays we are back with two talks: 6pm-6:30 – Adolfo Álvarez PhD ,,5 lessons I have learned at Analyx”. 7pm-7:30 – Piotr Migdał, PhD ,,Jupyter – the environment for learning and doing data analysis’’. See you tomorrow (22/10/2015) at 6 pm, Department of Mathematics, Warsaw University of Technology, Koszykowa 75 room 329. You … Czytaj dalej Warsaw ... [Read more...]
Smoothing Techniques using basis functions: Gaussian Basis
In my previous post, I introduced the concept of smoothing using Fourier basis functions and I applied them onto temperature data. It is important to note the that a similar kind of analysis can be replicated using B-splines (see this page). In this post, I extend the concept to an ... [Read more...]
Paging Widget for Shiny Apps
In my last post I described how I built a shiny application called “DFaceR” that used Chernoff Faces to plot multidimensional data. To improve application response time during plotting, I needed to split large datasets into more manageable “pag... [Read more...]
Paging Widget for Shiny Apps
In my last post I described how I built a shiny application called “DFaceR” that used Chernoff Faces to plot multidimensional data. To improve application response time during plotting, I needed to split large datasets into more manageable “pages” to be plotted. Rather than take the path of least resistance ... [Read more...]
Simple Python performance timing by checkpoints
SummaryThis is a simple python script that can measure python program running time in fine levels.
It’s simpler than a full profiler, easier to use than other currently available similar scripts.
[Read more...]
Argentina’s 2015 Presidential Election Forecasts
The model indicates the official candidate have chance of making it right way this Sunday, but the odds are short.
Predictions
The model I’m using indicates the official candidate have some chance of making it right way this Sunday, avoiding a runoff with Mauricio Macri in late November. The ...

Understanding credible intervals (using baseball statistics)
Previously in this series
Understanding the beta distribution (using baseball statistics)
Understanding empirical Bayes estimation (using baseball statistics)
In my last post, I explained the method of empirical Bayes estimation, a way to calculate useful proportions out of many pairs of success/total counts (e.g. 0/1, 3/10, 235/1000). I used the example ... [Read more...]
Argentine’s 2015 Presidential Election Forecasts
Predictions
The model I built to forecast the Argentine’s 2015 presidential election indicates the official candidate have a significant chance of making it right way this Sunday, avoiding a runoff with Mauricio Macri in November. The electoral preference distributions are quite apart from each other, with the distribution of Daniel ...

Installing R on OS X
I was in a conversation with an academic colleague (wicked smart dude) and the subject of installing R came up (NOTE: this will happen to you, too, if you ever have the misfortune to have a face-to-face convo with me ;-). They noted that getting up and running with R ...

Introducing the dplyrXdf package
The dplyr package is a popular toolkit for data transformation and manipulation. Over the last year and a half, dplyr has become a hot topic in the R community, for the way in which it streamlines and simplifies many common data manipulation tasks. Out of the box, dplyr supports data ... [Read more...]
Rblpapi 0.3.1
The first update to the Rblpapi package since the initial CRAN upload in August is now available.
Rblpapi connects R to the Bloomberg system, giving access to a truly vast array of time series data and custom calculations.
This release brings a new b... [Read more...]
Statistical Tests: Asymptotic, Exact, ou based on Simulations?
This morning, in our mathematical statistics course, we’ve been discussing the ‘proportion test‘, i.e. given a sample of Bernoulli trials, with , we want to test against A natural test (which can be related to the maximum likelihood ratio test) is based on the statistic The test function is ... [Read more...]
Happy World Statistics Day 2015!
By Paulin Shek Today is World Statistics Day . In celebration of this, here are my top 5 examples of fun applications of statistics. 1. The British Medical Journal publishes a Christmas edition every year. My favourite article dates back … Continue reading → [Read more...]
R and the Nobel Prize API
The Nobel Prizes. Love them? Hate them? Are they still relevant, meaningful? Go on admit it, you always imagined you would win one day. Whatever you think of them, the 2015 results are in. What’s more, the good people of the Nobel Foundation offer us free access to data via ...

forecast package v6.2
It is a while since I last updated the CRAN version of the forecast package, so I uploaded the latest version (6.2) today. The github version remains the most up-to-date version and is already two commits ahead of the CRAN version.
This update is mostly bug fixes and additional error traps. ... [Read more...]