(This article was first published on

**Commodity Stat Arb**, and kindly contributed to R-bloggers)

Much has been written and said about the NG / CL relationship. For most of 2009 pundits touted NG as historically underpriced relative to CL. As always the reality is more complex. Comparing a largely domestic USD commodity with an internationally traded commodity increasingly consumed by emerging markets has inherent risks.

More subtly, most commentators ignore the roll issues in both markets in making comparisons of relative value. For a long / short convergence trade to make sense over any time frame of more than one month, the reality of dealing with a periodically expiring contract has to be addressed.

Recently I was experimenting with applying cointegration metrics to the NG & CL markets. More can be found about cointegration elsewhere [http://en.wikipedia.org/wiki/Cointegration] but in a sentence cointegration measures the likelihood of convergence as opposed to correlation which measures the likelihood of comovement. Convergence provides a more useful measure of potential profitability of a long/short trade. In theory cointegration provides a useful filter against spurious correlations.

[Also for both a great exposition and elegant implementation in R, see: Pfaff, “Analysis of Integrated and Cointegrated Time Series in R”.]

So in order to test for cointegration you need a time series of the investable asset on both sides. In the case of CL & NG, with monthly expirations, you need to create the time series of what a long or short futures contract return would look like including the monthly roll.

For simplicity I assumed roll on the penultimate date at the closing price. This is easily achieved via the extremely liquid TAS market which is usually clears within a single tick of the closing price.

What is interesting is that over a reasonably long time frame, the two price series fail the test of cointegration. A plot of the spread relationship can be seen here:

The huge run up and subsequent collapse in CL is even more apparent when compared with NG on a roll adjusted basis.

Even on a shorter YTD time frame the results are still not passing the ADF test for cointegration:

Why does the test fail so conclusively? A closer examination of the return and price data yields the following interesting result. Starting with yesterday’s closing price if we back-cast prices based on returns, including the roll yield we get the following chart:

What this shows is that to end up at yesterday’s gas price if $4.059 for example, gas would have been over $30 in mid 2008 once the roll return is incorporated. Oil is a similar picture, to end up at yesterday’s settle of $89.92, oil would have to have been over $230 once the roll is accounted for. [Investors in oil and gas ETFs like USO & UNG will be familiar with this picture.]

From a cointegration standpoint what is apparent is that once the roll yield is incorporated the two time series have continued to diverge on a roll adjusted basis. This explains why no oil vs. gas long / short strategy has worked for some time.

This month looks like the first time that the winter prompt contract roll on NG is going to settle at less than a 10c discount… maybe this is the turning point for NG on a roll adjusted basis.

Good luck for 2011.

To

**leave a comment**for the author, please follow the link and comment on their blog:**Commodity Stat Arb**.R-bloggers.com offers

**daily e-mail updates**about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...