Analyzing Twitter Data in R – Part 1

February 8, 2012
By

I recently began using the TwitteR package in R to examine my tweeting patterns. One of my first projects was to identify each of my Twitter followers, where they were located, how many tweets they had, and then plot their location on a map using a bubble which was related to their total number of

Read more »

Trust in the EU and National Parliaments

February 8, 2012
By
Trust in the EU and National Parliaments

I have been playing around with some data from Eurobarometer, to support some arguments for a small comment I am writing for the Maastricht Law Review. I got the data for the following two questions: I would like to ask you a question about how much ...

Read more »

What is the Potential Audience Size for a Hashtag Community?

February 8, 2012
By
What is the Potential Audience Size for a Hashtag Community?

What’s the potential audience size around a Twitter hashtag? Way back when, in the early days of webs stats, reported figures tended to centre around the notion of hits, the number of calls made to a server via website activity. I forget the details, but the metric was presumably generated from server logs. This measure

Read more »

Oracle’s strange understanding of R users

February 8, 2012
By

After reading David Smith’s tweet on the price of Oracle R Enterprise (actually free, but it requires Oracle Data Mining at $23K/core as pointed out by Joshua Ulrich.) I went to Oracle’s site to see what was all about. Oracle … Continue reading →

Read more »

discrimination between CpG islands and random sequences using Markov chains

February 8, 2012
By
discrimination between CpG islands and random sequences using Markov chains

Major part of modern research is trying to find patterns in the given dataset using learning methods. One of the methods that can use a priori information for such purpose is Markov chains, in which the probability of symbol occurrence … Continue reading →

Read more »

Revolution R update adds Red Hat 6 support

February 8, 2012
By

The Dev Team at Revolution Analytics recently released an update to the Revolution R 5 family. Version 5.0.1 adds compatibility with Red Hat Enterprise Linux 6 for all editions (Community, Academic and Enterprise). This expands the platform support to Red Hat 5, Red Hat 6 and Microsoft Windows. For Revolution R Enterprise customers and users of the free Academic...

Read more »

"R": PLS Regression (Gasoline) – 005

February 8, 2012
By
"R": PLS Regression (Gasoline) – 005

Let´s see know how to plot the scores for the 3 PLS Components:  We can see the explained variance from each component in the diagonal.We can get it from R with:> explvar(gas1)   Comp 1      Comp 2  &nbs...

Read more »

Zero rates with futile.paradigm

February 8, 2012
By
Zero rates with futile.paradigm

Here’s a short example of calculating zero rates and discount factors from cash rates using futile.paradigm. Of note is how …Continue reading »

Read more »

We Keep Our Vehicles Longer

February 8, 2012
By
We Keep Our Vehicles Longer

Description: Average age of passenger cars and light trucks in the United States since 1995. The gray area represents the possible variance of the trend line with 95% confidence. Data: https://www.polk.com/company/news/average_age_of_vehicles_...

Read more »

OpenCPU, R in the Cloud

February 8, 2012
By
OpenCPU, R in the Cloud

I ran across OpenCPU today. If you have any intest in R and reproducible research this is definitely worth checking out. Also, it looks like I might want to explore the potential of embedding functions in websites. Hm . . . .

Read more »

Hadley Wickham: ggplot2 Webinar (Today!)

February 8, 2012
By

Title: A Backstage Tour of ggplot2 with Hadley WickhamDate: Wednesday, February 8, 2012Time: 11:00AM - 12:00PM PacificPresenter: Hadley Wickham, Professor of Statistics, Rice UniversityRegister here.I used ggplot2 extensively a few years ago, but rever...

Read more »

recents advances in Monte Carlo Methods

February 8, 2012
By
recents advances in Monte Carlo Methods

Next Thursday (Jan. 16), at the RSS, there will be a special half-day meeting (afternoon, starting at 13:30) on Recent Advances in Monte Carlo Methods organised by the General Application Section. The speakers are Richard Everitt, University of Oxford, Missing data, and what to do about it Anthony Lee, Warwick University, Auxiliary variables and many-core

Read more »

RStudio Server part 3: using an ssh tunnel for high performance

February 8, 2012
By

In part 2 of this series of posts on RStudio Server, I commented that I suspected that RStudio Server would be fast. The first time I tried this from a remote connection, I was disappointed with the performance. Many companies… See more ›

Read more »

A spell-checker in R

February 7, 2012
By
A spell-checker in R

I came across Dr. Peter Norvig’s blog about writing a basic spell-checker (http://norvig.com/spell-correct.html), and just had to try to implement it in R. Please excuse the ugly-ish code (I have not optimized it or commented it adequately at this point, but you can get the idea of what it does by reading Dr. Norvig’s blog).

Read more »

Two incredibly useful functions to throw into your .rprofile

February 7, 2012
By

I’ve neglected this blog for quite some time but I’m getting around to finishing up a bunch of draft posts. But here is a quick one: Listing objects in your global environment A simple ls() doesn’t really tell you enough useful information at a glance. Most often I just want to know what I named

Read more »

What’s new in futile.matrix 1.1.2

February 7, 2012
By
What’s new in futile.matrix 1.1.2

This is an exciting release of futile.matrix, which in some ways the package grows up and finds its purpose. It …Continue reading »

Read more »

updated slides for ABC PhD course

February 7, 2012
By
updated slides for ABC PhD course

Over the weekend, I have added a few slides referring to recent papers mentioning the convergence of ABC algorithms, in particular the very relevant paper by Dean et al. I had already discussed in an earlier post. (This is taking a larger chunk of my time than expected! I am glad I will use the

Read more »

Example 9.20: visualizing Simpson’s paradox

February 7, 2012
By
Example 9.20: visualizing Simpson’s paradox

Simpson's paradox is always amazing to explain to students. What's bad for one group, and bad for another group is good for everyone, if you just collapse over the grouping variable. Unlike many mathematical paradoxes, this arises in a number of real...

Read more »

"R": PLS Regression (Gasoline) – 004

February 7, 2012
By
"R": PLS Regression (Gasoline) – 004

In the previous post we plot the Cross Validation predictions with:> plot(gas1, ncomp = 3, asp = 1, line = TRUE)We can plot the fitted values instead with:> plot(gas1, ncomp = 3, asp = 1, line = TRUE,which=train) Graphics are different:Of course, using "train" we get  overoptimisc statistics and we should look...

Read more »

Gauging Interest in a Montreal R User Group

February 7, 2012
By
Gauging Interest in a Montreal R User Group

Some of us over at McGill’s Biology Graduate Student Association have been developing and delivering R/Statistics workshops over the last few years. Through invited graduate students and faculty, we have tackled  everything from multi-part introductory workshops to get your feet wet, to special topics such as GLMs, GAMs, Multi-model inference, Phylogenetic analysis, Bayesian modeling, Meta-analysis,

Read more »

What are the most popular bike routes in London?

February 7, 2012
By
What are the most popular bike routes in London?

James Cheshire, R user and lecturer at the UCL Centre for Advanced Spatial Analysis, created this map of the most trafficked cycle routes in London: Click to enlarge, it's a gorgeous chart. The bicycle traffic data comes from London's public cycle-hire facility, which currently is only available in central London (which is why you don't see much activity in...

Read more »

MAT8886 a short word on profile likelihood

February 7, 2012
By
MAT8886 a short word on profile likelihood

Profile likelihood is an interesting theory to visualize and compute confidence interval for estimators (see e.g. Venzon & Moolgavkar (1988)). As we will use is, we will plot But more generally, it is possible to consider where . Then (...

Read more »

Early-February flotsam

February 7, 2012
By
Early-February flotsam

Mike Croucher at Walking Randomly points out an interesting difference in operator precedence for several mathematical packages to evaluate a simple operation 2^3^4. It is pretty much a divide between Matlab and Excel (does the later qualify as mathematical software?) … Continue reading →

Read more »

Update to Partisan Bias in Fed Inflation Forecasts

February 7, 2012
By
Update to Partisan Bias in Fed Inflation Forecasts

Since I'm in the depths of PhD thesis revisions I haven't had much time to do much other than update previous posts (see my Stata Country Standardizer Update).Here is an update of an earlier post about possible partisan biases in US Federal Reserve sta...

Read more »

Workshop on Mixed and Multilevel Modelling with R in Toronto

February 7, 2012
By

Summer Program In Data Analysis (SPIDA): May 24th – June 1st, 2012 In its thirteenth season this year, ISR’s Summer Program in Data Analysis focuses on linear models, beginning with “standard” regression through generalized linear models, and extending to mixed or multilevel models, linear and non-linear and generalized, which incorporate two or more hierarchical levels of data or longitudinal...

Read more »

What’s new in futile.paradigm 2.0.4

February 6, 2012
By
What’s new in futile.paradigm 2.0.4

Well this certainly took a while but the latest installment of my functional dispatching library for R is finally released …Continue reading »

Read more »

how to create a variable with r

February 6, 2012
By

(This article was first published on twotorials by anthony damico, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: twotorials by anthony damico. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

how to do simple arithmetic in r

February 6, 2012
By

(This article was first published on twotorials by anthony damico, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: twotorials by anthony damico. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

More Beautiful Growth of $1 Chart

February 6, 2012
By
More Beautiful Growth of $1 Chart

With all my recent focus on reporting and visualization, you might think that I have the investments all figured out.  Unfortunately, that is not the case, and I will resume more standard investment and systems posts soon.  I did want to shar...

Read more »