319 search results for "boxplot"

Dining in San Francisco – Let R Guide You

May 6, 2014

I’m frequently asked by newcomers to R to provide an easy-to-follow, generic set of instructions on how to download data, transform it, aggregate it, make graphs, and write it all up for publication in a high-impact journal – all by the end of the day! While such a request is somewhat

Read more »

Comrades Marathon: Negative Splits and Cheating

May 6, 2014

With this year’s Comrades Marathon less than a month away, I was reminded of a story from earlier in the year. Mark Dowdeswell, a statistician at Wits University, found evidence of cheating by some middle- and back-of-the-pack Comrades runners. He identified a group of 20 athletes who had suspicious negative splits:

Read more »

There is no “Too Big” Data, is there?

April 23, 2014

A few years ago, a former classmate came back to me with a simple problem. He was working for an insurance company (and still is, don’t worry, chatting with me is not yet a reason for dismissal), and his problem was that their dataset was too large to run (standard) code to fit a regression and get some predictions. My...

Read more »

Overlaying species occurrence data with climate data

April 22, 2014

One of the goals of rOpenSci is to facilitate interoperability between different data sources around the web with our tools. We can achieve this by providing functionality within our packages that converts data coming down via web APIs in one format (often a provider-specific schema) into a standard format. The new version of rWBclimate that...

Read more »

Side-by-Side Box Plots with Patterns From Data Sets Stacked by reshape2 and melt() in R

Introduction: A while ago, one of my co-workers asked me to group box plots by plotting them side-by-side within each group, and he wanted to use patterns rather than colours to distinguish between the box plots within a group; the publication that will display his plots prints in black-and-white only. I gladly investigated how to

Read more »

Looking at Measles Data in Project Tycho, part II

April 6, 2014

Continuing from last week, I will now look at incidence rates of measles in the US. To recap, Project Tycho contains data from all weekly notifiable disease reports for the United States dating back to 1888. These data are freely available to any...

Read more »

Daylight Saving Effect on S&P500 and FTSE100

April 1, 2014

Does the transition to and from Daylight Saving Time (DST) have a (significant) effect on the stock market? In a recent blog post on The UK Stock Market Almanac, the author found that the average return of the FTSE100 index for the days following the start of British Summer Time (BST) was -0.07% during the

Read more »

Exploratory data analysis on P/E ratio of Indian Stocks

March 31, 2014

The price-earnings ratio (P/E) is one of the most popular ratios reported for stocks. Put simply, it is the current market price divided by earnings per share. An operational definition of earnings per share is total profit divided by the number of shares. For further reading, I refer interested readers to www.investopedia.com/terms/p/price-earningsratio.asp. In this post,...
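
The two definitions in this excerpt can be written out as formulas. This is only a sketch of the excerpt's own wording; the figures in the worked line are hypothetical and chosen purely for illustration, not taken from the post.

```latex
\[
\text{P/E} = \frac{\text{Current Market Price}}{\text{Earnings per Share}},
\qquad
\text{Earnings per Share} = \frac{\text{Total Profit}}{\text{Number of Shares}}
\]

% Hypothetical example: total profit 1{,}000{,}000, 100{,}000 shares, price 200
\[
\text{Earnings per Share} = \frac{1\,000\,000}{100\,000} = 10,
\qquad
\text{P/E} = \frac{200}{10} = 20
\]
```

Substituting the second definition into the first shows that P/E is equivalently the price multiplied by the number of shares, divided by total profit.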

Read more »

Probabilistic Momentum with Intraday data

March 30, 2014

I want to follow up the Intraday data post by testing the Probabilistic Momentum strategy on intraday data. I will use intraday data for SPY and GLD from the Bonnot Gang to test the strategy. Next, let’s examine the hourly performance of the strategy. There are lots of abnormal returns in the 9:30-10:00am box due

Read more »