## Stata Fully Mapped into R

April 1, 2014
By

Hello all of you Stata loving statistical analysts out there!  I have great news.  I am finally nearly done with the package I have been working on which provides the mechanism for Stata users to seamlessly move from Stata to R though use of ...

Read more »

## Melbourne’s Weather and Cross Correlations

April 1, 2014
By

During a lunchtime discussion among recent GCaP class attendees, the topic of weather came up and I casually mentioned that the weather in Melbourne, Australia, can be very changeable because the continent is so old that there is very little geographical relief to moderate the prevailing winds coming from the west.In general, Melbourne...

Read more »

## Mapping the March 2014 California Earthquake with ggmap

April 1, 2014
By

I had no intention to blog this, but @jayjacobs convinced me otherwise. I was curious about the recent (end of March, 2014) California earthquake “storm” and did a quick plot for “fun” and personal use using ggmap/ggplot. I used data from the Southern California Earthquake Center (that I cleaned up a bit and that you

Read more »

## You don’t need to understand pointers to program using R

April 1, 2014
By

R is a statistical analysis package based on writing short scripts or programs (versus being based on GUIs like spreadsheets or directed workflow editors). I say “writing short scripts” because R’s programming language (itself called S) is a bit of an oddity that you really wouldn’t be using except it gives you access to superiorRelated posts:

Read more »

## A look at R vectorization through the Collatz Conjecture

April 1, 2014
By

by Seth Mottaghinejad, Analytic Consultant for Revolution Analytics You may have heard before that R is a vectorized language, but what do we mean by that? One way to read that is to say that many functions in R can operate efficiently on vectors (in addition to singletons). Here are some examples: > log(1) # input and output are...

Read more »

## Do Not Play With Mr. Penney

April 1, 2014
By

Facts do not speak (Henry Poincare) Mr. Penney is my best friend. He is maths teacher and loves playing. Yesterday we were in his office at the university when he suggested me a game: When you toss a coin three times, you can obtain eight different sequences of tails and heads: TTT, TTH, THT, HTT, THH,

Read more »

## IV Estimates via GMM with Clustering in R

April 1, 2014
By

In econometrics, generalized method of moments (GMM) is one estimation methodology that can be used to calculate instrumental variable (IV) estimates. Performing this calculation in R, for a linear IV model, is trivial. One simply uses the gmm() function in the excellent gmm package like an lm() or ivreg() function. The gmm() function will estimate

Read more »

## From Random Walks to Personalized PageRank

April 1, 2014
By

In graph theory (and its applications) it is often required to model how information spreads within a given graph. This is interesting for many applications, such as attack prediction, sybil detection, and recommender systems - just to name a...

Read more »

## Daylight Saving Effect on S&P500 and FTSE100

April 1, 2014
By

Does the transition to and from Daylight Saving Time (DST) have a (significant) effect on the stock market? In a recent blog post on The UK Stock Market Almanac, the author found that the average return of the FTSE100 index for the days following the start of British Summer Time (BST) was -0.07% during the

Read more »

## Include uncertainty in a financial model

April 1, 2014
By

Here’s a post that appears on my new website, ragscripts.com. On-line resources for analysts are often either too general to be of practical use or too specialised to be accessible. The aim of ragscripts.com is to remedy this by providing start to finish directions for complex analytical tasks. The site is under construction at the … Continue reading...

Read more »

## Calendar charts with googleVis

April 1, 2014
By

My little series of posts about the new googleVis charts continues with calendar charts. Google's calendar charts are still in beta, but they provide already a nice heat map visualisation of calendar year data. The current development version of google...

Read more »

## analyze the european social survey (ess) with r

March 31, 2014
By

with more than a decade of microdata aimed at gauging the political mood across european nations, the european social survey (ess) allows scientists like you to examine socio-demographic shifts among broad groups all the way down to pirate party (pirat...

Read more »

## Correlation with constraints on pairs

March 31, 2014
By
$\text{cov}(X,Y)$

An interesting question was posted on http://math.stackexchange.com/726205/…: if one knows the covariances  and , is it possible to infer ? I asked myself a question close to this one a few weeks ago (that I might also relate to a question I asked a long time ago, about possible correlations between three exchange rates, on financial markets). More precisely, if one knows the...

Read more »

## Capturing Intraday data, Backup plan

March 31, 2014
By

In the Capturing Intraday data post, I outlined steps to setup your own process to capture Intraday data. But what do you do if you missed some data points due for example internet being down or due to power outage your server was re-started. To fill up the gaps in the Intraday data, you could

Read more »

## Predictive analysis in ecommerce

March 31, 2014
By

Welcome to the blog post! We all know the predictive analysis is very hot topic now days. Everyone is looking for how the power of predictive analysis can be used in their business and get their business questions solved.  Recently, I was doing study on the predictive analysis in ecommerce. I found many interesting things The post Predictive...

Read more »

## April Fools’ Day: The 7 Funniest Data Cartoons

March 31, 2014
By

To give this years April Fools’ day a more analytical touch, we decided last week do a little poll on internet cartoons. We asked our friends and colleagues to select their favourite data related cartoon on the web, and organized a voting session to construct a top 5 list. (You can always share your own

Read more »

## Process and observation uncertainty explained with R

March 31, 2014
By
$Process and observation uncertainty explained with R$

Once up on a time I had grand ambitions of writing blog posts outlining all of the examples in the Ecological Detective.1 A few years ago I participated in a graduate seminar series where we went through many of the examples in this book. I am not a population biologist by trade but many of

Read more »

## Exploratory data analysis on P/E ratio of Indian Stocks

March 31, 2014
By

Price Earnings ratio (P/E) is one of the very popular ratios reported with all stocks.  Very simply this is thought as - Current Market Price / Earning per Share.   An operational definition of Earning per Share would be Total profit divided by # of Shares .  I will redirect interested readers for further reading towww.investopedia.com/terms/p/price-earningsratio.aspIn this post,...

Read more »

## Moustache target distribution and Wes Anderson

March 31, 2014
By
$Moustache target distribution and Wes Anderson$

Today I am going to introduce the moustache target distribution (moustarget distribution for brievety). Load some packages first. Let’s invoke the moustarget distribution. This defines a target distribution represented by a SVG file using RShapeTarget. The target probability density function is defined on and is proportional to on the segments described in the SVG files,

Read more »

## R: Free, Popular, Powerful, Flexible and Supported

March 31, 2014
By

Francis Smart offers five excellent reasons to use R, in a well-researched post ideal for sharing with anyone thinking about making the switch to R. (You might also share this YouTube video for a quick 90-second introduction to R.) The post also includes a novel analysis of interest in R, as tracked by Google Trends. Given its single-letter name,...

Read more »

## Probabilistic Momentum with Intraday data

March 30, 2014
By

I want to follow up the Intraday data post with testing the Probabilistic Momentum strategy on Intraday data. I will use Intraday data for SPY and GLD from the Bonnot Gang to test the strategy. Next, let’s examine the hourly perfromance of the strategy. There are lots of abnormal returns in the 9:30-10:00am box due

Read more »

## Bayesian Data Analysis [BDA3 - part #2]

March 30, 2014
By

Here is the second part of my review of Gelman et al.’ Bayesian Data Analysis (third edition): “When an iterative simulation algorithm is “tuned” (…) the iterations will not in general converge to the target distribution.” (p.297) Part III covers advanced computation, obviously including MCMC but also model approximations like variational Bayes and expectation propagation

Read more »

## The freqparcoord Package for Multivariate Visualization

March 30, 2014
By

Recently my student Yingkang Xie and I have developed freqparcoord, a novel approach to the parallel coordinates method for multivariate data visualization.  Our approach: Addresses the screen-clutter problem in parallel coordinates, by only plotting the “most typical” cases, meaning those with the highest estimated multivariate density values. This makes it easier to discern relations between variables.

Read more »

## New Blog on R, Statistics, Data Science and So On

March 30, 2014
By

Hi, Norm Matloff here. I’m a professor of computer science at UC Davis, and was a founding member of the UCD Dept. of Statistics. You may know my book, The Art of R Programming (NSP, 2011).  I have some strong views on statistics–which you are free to call analytics, data science, machine learning or whatever your favorite term is–so

Read more »

## Looking at Measles Data in Project Tycho

March 30, 2014
By

Project Tycho includes data from all weekly notifiable disease reports for the United States dating back to 1888. These data are freely available to anybody interested. I wanted to play around with the data a bit, so I registered.MeaslesMeasles a...

Read more »

## President Approval Ratings from Roosevelt to Obama

March 29, 2014
By

I have been watching the awesome Netflix show “House of Cards” and been fascinated by the devious schemes that Underwood is constantly plotting. The show often mentions approval ratings and it got me to wondering what Obama’s ratings currently were, and all other past US president  for that matter. However, I didn’t have much chance

Read more »

## R / Finance 2014 Open for Registration

March 29, 2014
By

The annoucement below just went to the R-SIG-Finance list. More information is as usual at the R / Finance page:Now open for registrations: R / Finance 2014: Applied Finance with R May 16 and 17, 2014 Chicago, IL, USA The reg...

Read more »

## Introduction to PortfolioAnalytics

March 29, 2014
By

PortfolioAnalytics Basics This is a guest post by Ross Bennett. Ross is currently enrolled in the University of Washington Master of Science in Computational Finance & Risk Management program with an expected graduation date of December 2014. He worked on the PortfolioAnalytics...

Read more »

## R/Finance 2014 Registration Open

March 29, 2014
By

As announced on the R-SIG-Finance mailing list, registration for R/Finance 2014 is now open! The conference will take place May 17 and 18 in Chicago.Building on the success of the previous conferences in 2009-2013, we expect more than 250 attendees fro...

Read more »