Some Impressions from R Finance 2015

[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

by Joseph Rickert

The R/Finance 2015 Conference wrapped up last Saturday at UIC. It has been seven years already, but R/Finance still has the magic! – mostly very high quality presentations and the opportunity to interact and talk shop with some of the most accomplished R developers, financial modelers and even a few industry legends such as Emanuel Derman and Blair Hull.

Emanuel Derman led off with a provocative but extraordinary keynote talk. Derman began way out there, somewhere well beyond the left field wall recounting the struggle of Johannes Kepler to formulate his three laws of planetary motion and closed with some practical advice on how to go about the business of financial modeling. Along the way he shared some profound, original thinking in an attempt to provide a theoretical context for evaluating and understanding the limitations of financial models. His argument hinged on making and defending the distinction between theories and models. Theories such as physical theories of Kepler, Newton and Einstein are ontological: they attempt to say something about how the world is. A theory attempts to provide "absolute knowledge of the world". A model, on the other hand, "tells you about what some aspect of the world is like". Theories can be wrong, but they are not the kinds of things you can interrogate with "why" questions.

Models work through analogies and similarities. They compare something we understand to something we don't. Spinoza's Theory of emotions is a theory because it attempts to explain human emotions axiomatically from first principles.


The Black Scholes equation, by contrast, is a model that tries to provide insight through the analogy with Brownian motion. As I understood it, the practical advice from all of this is to avoid the twin traps of attempting to axiomatize financial models as if they directly captured reality, and of believing that analyzing data, no matter how many terabytes you plow through, is a substitute for an educated intuition about how the world is.

The following table lists the remaining talks in alphabetical order by speaker.

 Presentation PackagePackage Location
1Rohit Arora:   Inefficiency of Modified VaR and ES  
2Kyle Balkissoon: A Framework for Integrating Portfolio-level Backtesting with   Price and Quantity InformationPortFolioAnalytics 
3Mark Bennett:   Gaussian Mixture Models for Extreme Events  
4Oleg Bondarenko: High-Frequency Trading Invariants for Equity Index Futures  
5Matt Brigida:   Markov Regime-Switching (and some State Space) Models in Energy Marketscode for regime   switchingGitHub
6John Burkett:   Portfolio Optimization: Price Predictability, Utility Functions,   Computational Methods, and ApplicationsDEoptimCRAN
7Matthew Clegg:   The partialAR Package for Modeling Time Series with both Permanent and   Transient ComponentspartialARCRAN
8Yuanchu Dang:   Credit Default Swaps with R (with Zijie Zhu)CDSGitHub
9Gergely Daroczi: Network analysi​s of the Hungarian interbank lending market  
10Sanjiv Das:   Efficient Rebalancing of Taxable Portfolios  
11Sanjiv Das:   Matrix Metrics: Network-Based Systemic Risk Scoring  
12Emanuel Derman:   Understanding the World   
13Matthew Dixon:   Risk Decomposition for Fund Managers  
14Matt Dowle:   Fast automatic indexing with data.tabledata.tableCRAN
15Dirk Eddelbuettel: Rblpapi: Connecting R to the data service that shall not be   namedRblpapiGitHub
16Markus Gesmann:   Communicating risk – a perspective from an insurer  
17Vincenzo Giordano: Quantifying the Risk and Price Impact of Energy Policy Events   on Natural Gas Markets Using R (with Soumya Kalra)  
18Chris Green:   Detecting Multivariate Financial Data Outliers using Calibrated Robust   Mahalanobis DistancesCerioliOutlierDetectionCRAN
19Rohini Grover:   The informational role of algorithmic traders in the option market  
20Marius Hofert:   Parallel and other simulations in R made easy: An end-to-end studysimsalaparCRAN
21Nicholas James:   Efficient Multivariate Analysis of Change PointsecpCRAN
22Kresimir Kalafatic: Financial network analysis using SWIFT and R  
23Michael Kapler:   Follow the Leader – the application of time-lag series analysis to discover   leaders in S&P 500SITother
24Ilya Kipnis:   Flexible Asset Allocation With Stepwise Correlation Rank  
25Rob Krzyzanowski: Building Better Credit Models through Deployable Analytics in   R  
26Bryan Lewis:   More thoughts on the SVD and Finance  
27Yujia Liu and Guy Yollin: Fundamental Factor Model DataBrowser using Tableau and RfactorAnalyticsRFORGE
28Louis Marascio:   An Outsider's Education in Quantitative Trading   
29Doug Martin:   Nonparametric vs Parametric Shortfall: What are the Differences?  
30Alexander McNeil: R Tools for Understanding Credit Risk Modelling   
31William Nicholson: Structured Regularization for Large Vector AutoregressionBigVARGitHub
32Steven Pav:   Portfolio Cramer-Rao Bounds (why bad things happen to good quants)SharpeRCRAN
33Jerzy Pawlowksi: Are High Frequency Traders Prudent and Temperate?HighFreqGitHub
34Bernhard Pfaff:   The sequel of cccp: Solving cone constrained convex programscccpCRAN
35Stephen Rush:   Information Diffusion in Equity Markets  
36Mark Seligman:   The Arborist: a High-Performance Random Forest ImplementationRboristCRAN
37Majeed Simaan:   Global Minimum Variance Portfolio: a Horse Race of Volatilities  
38Anthoney Tsou:   Implementation of Quality Minus JunkqmjGitHub
39Marjan Wauters:   Characteristic-based equity portfolios: economic value and dynamic style   allocation  
40Hadley Wickham:   Data ingest in RreadrCRAN
41Eric Zivot:   Price Discovery Share-An Order Invariant Measure of Price Discovery with   Application to Exchange-Traded Funds  

I particularly enjoyed Sanjiv Das' talks on Efficient Rebalancing of Taxable Portfolios and Matrix Metrics: Network Based Systemic Risk Scoring, both of which are approachable by non-specialists. Sanjiv became the first person to present two talks at an R/Finance conference, and thus the first person to win one of the best presentation prizes with the judges unwilling to say which of his two presentations secured the award.

Bryan Lewis' talk: More thoughts on the SVD and Finance was also notable for its exposition. Listening to Bryan you can almost fool yourself into believing that you could develop a love for numerical analysis and willingly spend an inordinate amount of your time contemplating the stark elegance of matrix decompositions.

Alexander McNeil's talk: R Tools for Understanding Credit Risk Modeling was a concise and exceptionally coherent tutorial on the subject, an unusual format for a keynote talk, but something that I think will be valued by students when the slides for all of the presentations become available.

Going out on a limb a bit, I offer a few un-researched, but strong impressions of the conference. This year, to a greater extent than I remember in previous years, talks were built around particular packages; talks 5, 7 and 8 for example. Also, it seemed that authors were more comfortable hightlighting  and sharing packages that are work in progress; residing not on CRAN but on GitHub, R-Forge and other platforms. This may reflect a larger trend in R culture.

This is the year that cointegration replaced correlation as the operative concept in many models. The quants are way out ahead of the statisticians and data scientists on this one. Follow the money!

Speaking of data scientists: if you are a Random Forests fan do check out Mark Seligman's Rborist package, a high-performance and extensible implementation of the Random Forests algorithm.

Network analysis also seemed to be an essential element of many presentations. Gergely Daróczi's Shiny app for his analysis of the Hungarian interbank lending network is a spectacular example of how interactive graphics can enhance an analysis.

Finally, I'll finish up with some suggested reading in preparation for studying the slides of the presentations when they become available.

Sanjiv Das: Efficient Rebalancing of Taxable Portfolios
Sanjiv Das: Matrix Metrics: Network-based Systematic Risk Scoring
Emanuel Derman: Models.Behaving.Badly
Jurgen A. Doornik and R.J. O'Brien: Numerically Stable Cointegration Analysis (A recommendation from Bryan Lewis)
Arthur Koestler: The Sleepwalkers (I am certain this is the book whose title Derman forgot.)
Alexander J. McNeil and Rudiger Frey: Quantitative Risk Management Concepts, Techniques and Tools
Bernhard Pfaff: Analysis of Integrated and Cointegrated Time Series with R (Use R!)

To leave a comment for the author, please follow the link and comment on their blog: Revolutions. offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)