Monthly Archives: June 2014

Updates to R package raincpc: Global Daily Rainfall for over 35 years

June 26, 2014
By
Updates to R package raincpc: Global Daily Rainfall for over 35 years

The Climate Prediction Center's  (CPC) global rainfall data, 1979 - present, 50 km resolution, is one of the few high-quality, long-term, observation-based, daily rainfall products available for free. Although raw data is available at&nb...

Read more »

Review of Applied Predictive Modeling by Kuhn and Johnson

June 26, 2014
By
Review of Applied Predictive Modeling by Kuhn and Johnson

by Joseph Rickert Predictive Modeling or “Predictive Analytics”, the term that appears to be gaining traction in the business world, is driving the new “Big Data” information economy. Predictably, there is no shortage of material to be found on this subject. Some discussion of predictive modeling is sure to be found in any reasonably technical presentation of business decision...

Read more »

Maybe I Don’t Really Know R After All

June 26, 2014
By
Maybe I Don’t Really Know R After All

Lately, I’ve been feeling that I’m spreading myself too thin in terms of programming languages. At work, I spend most of my time in Hive/SQL, with the occasional Python for my smaller data. I really prefer Julia, but I’m alone at work on that one. And since I maintain a package on CRAN (RSiteCatalyst), I frequently spend Related posts:

Read more »

Jun 26-27, 2014 – Introduction to Data Science with R in NYC

June 26, 2014
By
Jun 26-27, 2014 – Introduction to Data Science with R in NYC

You can either register from eventbrite or our school site NYC Data Science Academy. Date: Thursday/Friday , June 26th and 27th, 2014 Time:  9:00am to 5:00pm Location: 500 7th Ave, 17th Floor, glass door classroom, New York, NY 10018 NYC Data Science Academy, training subbrand of SupStat (Official Training partner with RStudio Inc) is hosting our... Read more »

Tailoring univariate probability distributions

June 26, 2014
By
Tailoring univariate probability distributions

This post shows how to build a custom univariate distribution in R from scratch, so that you end up with the essential functions: a probability density function, cumulative distribution function, quantile function and random number generator. In the beginning all you need is an equation of the probability density function, … Continue reading →

Read more »

Be Careful with Using Model Design in R

June 25, 2014
By
Be Careful with Using Model Design in R

In R, useful functions for making design matrices are model.frame and model.matrix. I will to discuss some of the differences of behavior across and within the two functions. I also have an example where I have run into this problme and it caused me to lose time. Using model.frame for a design matrix Whenever I

Read more »

A Simple Shiny App for Monitoring Trading Strategies

June 25, 2014
By
A Simple Shiny App for Monitoring Trading Strategies

In a previous post I showed how to use  R, Knitr and LaTeX to build a template strategy report. This post goes a step further by making  the analysis  interactive. Besides the interactivity, the Shiny App also solves two problems : I can now access all my trading strategies from a single point regardless of the instrument traded.

Read more »

Boolean 3 (finally) on CRAN

June 25, 2014
By

I have finally managed to get boolean3 accepted to CRAN. You can find it here: boolean3 on CRAN. To summarize: boolean3 provides a means of estimating partial-observability binary response models following boolean logic. boolean3 was developed by Jason W. Morgan under the … Continue reading →

Read more »

What Would Cohen Have Titled “The Earth is Round (p < .05)” in 2014?

June 25, 2014
By
What Would Cohen Have Titled “The Earth is Round (p < .05)” in 2014?

The area of bibliometrics is not my area of expertise but is still of interest as a researcher. I sometimes think about how Google has impacted the way we title articles. Gone are the days of witty, snappy titles. Title … Continue reading →

Read more »

R Scrabble: Part 2

June 25, 2014
By
R Scrabble: Part 2

Ivan Nazarov and Bartek Chroł gave very interesting comments to my last post on counting number of subwords in NGSL words. In particular they proposed large speedups of my code. So I thought to try checking a larger data set. So today I will work with TWL2006 - the official word authority for tournament Scrabble...

Read more »