±∞

February 21, 2013
By
±∞

The Cauchy distribution (?dcauchy in R) nails a flashlight over the number line and swings it at a constant speed from 9 o’clock down to 6 o’clock over to 3 o’clock. (Or the other direction, from 3→6→9.) Then counts Read more »

Removing white space around R figures

February 21, 2013
By

When I want to insert figures generated in R into a LaTeX document, it looks better if I first remove the white space around the figure. Unfortunately, R does not make this easy as the graphs are generated to look good on a screen, not in a document. There are two things that can be done to fix this...

Read more »

Le Monde puzzle [#809]

February 21, 2013
By
Le Monde puzzle [#809]

Another number theory puzzle, completed in the plane to Hamburg: Integers n are called noble if they can be decomposed as a sum n=a+b+… of distinct integers such that 1/a+1/b+…=1. They are called bourgeois if they are not noble but can be decomposed as a sum n=a+b+… of integers, some of them identical, such that

Read more »

Additional Plots on French Breakpoints as Valuation

February 21, 2013
By
Additional Plots on French Breakpoints as Valuation

I feel like there might be some merit in Slightly Different Measure of Valuation using Ken French’s Market(ME) to Book(BE) Breakpoints by percentile to offer an additional valuation metric for US stocks.  I thought some additional plots might he...

Read more »

Elevation Profiles in R

February 21, 2013
By
Elevation Profiles in R

First, let's load up our data. The data are available in a gist. You can convert your own GPS data to .csv by following the instructions here, using gpsbabel.gps <- read.csv("callan.csv",  header = TRUE)Next, we can use the function SMA fr...

Read more »

A slightly different introduction to R, part IV

February 21, 2013
By
A slightly different introduction to R, part IV

Now, after reading in data, making plots and organising commands with scripts and Sweave, we’re ready to do some numerical data analysis. If you’re following this introduction, you’ve probably been waiting for this moment, but I really think it’s a good idea to start with graphics and scripting before statistical calculations. We’ll use the silly

Read more »

Plot ranges of data in R

February 21, 2013
By
Plot ranges of data in R

How to control the limits of data values in R plots. R has multiple graphics engines.  Here we will talk about the base graphics and the ggplot2 package. We’ll create a bit of data to use in the examples: one2ten <- 1:10 ggplot2 demands that you have a data frame: ggdat <- data.frame(first=one2ten, second=one2ten) Seriously The post Plot...

Read more »

plot textual differences in Shiny

February 21, 2013
By
plot textual differences in Shiny

Wordclouds such as Wordle are pretty rubbish, so I thought I'd try to make a better one, one that actually produces (statistically) meaningful results. I was so happy with the outcome I decided to make it interactive, so go on, have a play!Compare any...

Read more »

Zurich, Feb 2013 – Spring Lecture

February 21, 2013
By

(This article was first published on Rmetrics blogs, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on their blog: Rmetrics blogs. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave,...

Read more »

Examining Overlapping Meetup Memberships with Venn Diagrams

February 21, 2013
By
Examining Overlapping Meetup Memberships with Venn Diagrams

As of the beginning of 2013, Data Community DC ran three Meetup groups: Data Science DC, Data Business DC, and R Users DC. We’ve often wondered how much these three groups overlapped. In this post, I’m going to show you … Continue reading → The post Examining Overlapping Meetup Memberships with Venn Diagrams appeared first on Data...

Read more »

Social Media Monitoring tools in R with just a few lines

February 21, 2013
By
Social Media Monitoring tools in R with just a few lines

Social Media Analysis has become some kind of new obsession in Marketing. Every company wants to engage existing customers or attract new ones through this communication channel. Therefore, they hire designers, editors, community managers, etc. However, when it comes to … Continue reading →

Read more »

Social Media Monitoring tools in R with just a few lines

February 21, 2013
By
Social Media Monitoring tools in R with just a few lines

Social Media Analysis has become some kind of new obsession in Marketing. Every company wants to engage existing customers or attract new ones through this communication channel. Therefore, they hire designers, editors, community managers, etc. However, when it comes to … Continue reading →

Read more »

Video: Survey Package in R

February 20, 2013
By
Video: Survey Package in R

Sebastián Duchêne presented a talk at Melbourne R Users on 20th February 2013 on the Survey Package in R. Talk Overview: Complex designs are common in survey data. In practice, collecting random samples from a populations is costly and impractical. … Continue reading →

Read more »

RcppArmadillo 0.3.6.3

February 20, 2013
By

A new Armadillo version 3.6.3 came out this morning, and the corresponding RcppArmadillo version is now on CRAN. Changes are incremental: Changes in RcppArmadillo version 0.3.6.3 (2013-02-20) Upgraded to Armadillo release Version 3.6.3 ...

Read more »

Model Selection and Multi-Model Inference

February 20, 2013
By
Model Selection and Multi-Model Inference

At D-RUG this week Rosemary Hartman presented a really useful case study in model selection, based on her work on frog habitat. Here is her code run through ‘knitr’. Original code and data are posted here. (yes, I am just doing this for the flying monkey) Editor’s note: we’re giving away flying monkey dolls from our...

Read more »

Quandl: A Wikipedia for Time Series Data

February 20, 2013
By

This guest post is by Tammer Kamel, Founder of Quandl Finding and formatting numerical data for analysis in R or Excel or indeed any application is a pain that all real world data analysts know all too well. In aggregate I have probably spent weeks of my life trying to find data on the web. And several more weeks...

Read more »

Analysis of Public .Rhistory Files

February 20, 2013
By
Analysis of Public .Rhistory Files

GitHub recently launched a more powerful search feature which has been used on more than one occasion to identify sensitive files that may be hosted in a public GitHub repository. When used innocently, there are all sorts of fun things you can find with this search feature. Inspired by Aldo Cortesi's post documenting his exploration

Read more »

Fixing My Internet With R and Python

February 20, 2013
By
Fixing My Internet With R and Python

Last summer, I had some internet connectivity problems. Specifically, I would have massive latency issues that affected my conversations on Skype and my relatively pathetic under the best of circumstances efforts at online gaming. It was driving me up a wall and I couldn't figure it out. It hadn't...

Read more »

Fixing My Internet With R and Python

February 20, 2013
By
Fixing My Internet With R and Python

Last summer, I had some internet connectivity problems. Specifically, I would have massive latency issues that affected my conversations on Skype and my relatively pathetic under the best of circumstances efforts at online gaming. It was driving me up a wall and I couldn't figure it out. It hadn't...

Read more »

To pre-pay or not to pre-pay for gas when renting a car?

February 20, 2013
By
To pre-pay or not to pre-pay for gas when renting a car?

One question we get asked a lot is whether it's worth it to pre-pay for the tank of gas when renting a car. At first, blush it seems like something you should never do. In the best case, you pay market rate for gas, and in the worst case, you pay for a tank of gas you never consume (what...

Read more »

Interactive two-host SIR model

February 20, 2013
By
Interactive two-host SIR model

This is an example of interfacing R, shiny, and deSolve to produce an interactive environment where users can explore model behavior by altering parameters in an easy to use GUI. The model tracks the number of susceptible, infectious, and recovered individuals in two co-occuring host species. The rates of change for each class are represented as a system of differential...

Read more »

Another Way to Look at Vanguard and Pimco

February 20, 2013
By
Another Way to Look at Vanguard and Pimco

I like the results of the analysis shown in my post Applying Tradeblotter’s Nice Work Across Manager Rather than Time, but I was not satisfied that the plot allowed a quick summary comparison of the two massive fund complexes.  I am much more pl...

Read more »

Progress bar in R

February 20, 2013
By

A decent percentage of working time in R, I spend looping over chromosomes, transcription factors or tissues, usually, using parallelization.To get the stuff to run simultaneously I use the foreach function from the doMC package, and for monitoring of ...

Read more »

Mapped: Twitter Languages in New York

February 20, 2013
By
Mapped: Twitter Languages in New York

Following the interest in our Twitter Tongues map for L

Read more »

Interactive two-host SIR model

February 20, 2013
By

This is an example of interfacing R, shiny, and deSolve to produce an interactive environment where users can explore model behavior by altering parameters in an easy to use GUI. The model tracks the number of susceptible, infectious, and recovered individuals in two co-occuring host species. The rates of change for each class are represented as a system of...

Read more »

Momentum in R: Part 4 with Quantstrat

February 19, 2013
By
Momentum in R: Part 4 with Quantstrat

The past few posts on momentum with R focused on a relatively simple way to backtest momentum strategies. In part 4, I use the quantstrat framework to backtest a momentum strategy. Using quantstrat opens the door to several features and options as well as an order book to check the trades at the completion of … Continue reading...

Read more »

Onepager Now with knitR

February 19, 2013
By

Since at some point I had trouble with a conflict between knitr and the latex package textpos, I used the lesser Sweave in Another Experiment with R and Sweave.  I ran the Sweave2knitr command and discovered that textpos and knitr play well togeth...

Read more »

Visualize major league pitching data with PitchRx

February 19, 2013
By

Anyone interested in playing around with the data generated by the PITCHf/x cameras at major league baseball games should definitely check out the pitchRx package from Carson Sievert. Major League Baseball Advanced Media makes the data available for download, and this package provides an interface from R to the speed, position and pitcher data for just about every MLB...

Read more »

Better modelling and visualisation of newspaper count data

February 19, 2013
By
Better modelling and visualisation of newspaper count data

<!-- Styles for R syntax highlighter In this post I outline how count data may be modelled using a negative binomial distribution in order to more accurately present trends in time series count data than using linear methods. I also show how to...

Read more »

Sponsors