R-Studio

August 28, 2012
By
R-Studio

A post over on Dang, another error (show me yours and I’ll show you mine) has a method of working with R which uses an IDE called Eclipse in conjunction with a plugin called StatET. Eclipse is one of a number of IDEs that I’m aware of (Tinn-R being another, but this Sciviews pages has

Read more »

COMPSTAT2012

August 28, 2012
By

This week I’m in Cyprus attending the COMPSTAT2012 conference. There’s been the usual interesting collection of talks, and interactions with other researchers. But I was struck by two side comments in talks this morning that I’d like to mention. Stephen Pollock: Don’t imagine your model is the truth Actually, Stephen said something like “economists (or was it econometricians?) have...

Read more »

Are career motivations changing?

August 28, 2012
By
Are career motivations changing?

The German news magazine Der Spiegel published a series of articles around career developments. The stories suggest that career aspirations of young professionals today are somewhat different to those of previous generations in Germany. Apparen...

Read more »

Horizon on ggplot2

August 27, 2012
By
Horizon on ggplot2

SocialDataBlog’s kind reference in post Horizon plots with ggplot (not) motivated me to finish what the post started.  I knew that ggplot2 would be a little more difficult to use for the purpose of a horizon plot, but I felt compelled to provide...

Read more »

Addendum to yesterday’s post on custom CSS and R Markdown

August 27, 2012
By
Addendum to yesterday’s post on custom CSS and R Markdown

Updates from RStudio support: (1) “Thanks for reporting and I was able to reproduce this as well. I’ve filed a bug and we’ll take a look.” (2) Taking a further look, this is actually a bug in the Markdown package and we’ve asked the maintainer (Jeffrey Horner) to look into it. As juejung points out

Read more »

Two R community milestones

August 27, 2012
By

Two significant R community milestones were achieved over the weekend. Firstly, the number of community-contributed R packages on CRAN is now above 4000. (As of this writing, it's 4004.) Figure 10 of The Popularity of Data Analysis Software charts the exponential growth of R packages: at the end of last year the figure stood at 3500, and the number...

Read more »

Horizon plots with ggplot (not)

August 27, 2012
By
Horizon plots with ggplot (not)

The Timely Portfolio blog via R-bloggers has recently published some interesting entries about the value of horizon plots for visual comparison of a number of time series. Very nice it looks too. You can read more about them here. The trick to understanding them is to imagine that each row was orginally a line chart

Read more »

Heatmap tables with ggplot2, sort-of

August 27, 2012
By
Heatmap tables with ggplot2, sort-of

I wrote before about heatmap tables as a better way of producing frequency or other tables, with a solution which works nicely in latex. It is possible to do them much more easily in ggplot2, like this library(Hmisc) library(ggplot2) library(reshape) data(HairEyeColor) P=t(HairEyeColor) Pm=melt(P) ggfluctuation(Pm,type="heatmap")+geom_text(aes(label=Pm$value),colour="white")+ opts(axis.text.x=theme_text(size = 15),axis.text.y=theme_text(size = 15)) Note that ggfluctuation will also take

Read more »

The Art of R Programming – Matloff (2011)

August 27, 2012
By
The Art of R Programming – Matloff (2011)

It’s difficult to write a book on an entire programming language and keep it manageable and concise, but The Art of R Programming does it as well as any text I’ve seen. Matloff covers, in detail and among other things, R data structures, programming idioms, performance enhancements, interfaces with other languages, debugging and graphing. Title:The The post The...

Read more »

garch and long tails

August 27, 2012
By
garch and long tails

How much does garch shorten long tails? Previously Pertinent blog posts include: “A practical introduction to garch modeling” “The distribution of financial returns made simple” “Predictability of kurtosis and skewness in S&P constituents” Induced tails Part of the reason that the distributions of returns have long tails is because of volatility clustering.  It’s not really … Continue reading...

Read more »

Football (Eredivisie) goals

August 27, 2012
By
Football (Eredivisie) goals

The football season has started in Netherlands, so I went and had a look at last year's scores. I did not find downloadable data, at http://www.eredivisiestats.nl/wedstrijden.php I could copy last season's data and paste into a spreadsheet. T...

Read more »

Custom CSS for HTML generated using RStudio

August 26, 2012
By
Custom CSS for HTML generated using RStudio

People have been telling me for a while that the latest version of RStudio, the IDE for R, is a great way to generate reports. I finally got around to trying it out and for once, the hype is justified. Start with this excellent tutorial from Jeremy Anglim. Briefly: the process is not so different

Read more »

Live Autoregressive Process

August 26, 2012
By
Live Autoregressive Process

This post join two things that I like: Highcharts and the Time series. So I have a lot of fun making this minipost to show how works an autoregressive process. Well, let's remember the structure of an autoregresive process. $y_t = varphi,y_{t-1}+epsilon_t$, where $epsilon_t$ is a white noise, i.e, a $cov(epsilon_j, epsilon_i) = 0$ if $i neq...

Read more »

Structural Equation Modeling: Separating the General from the Specific (Part II)

August 26, 2012
By
Structural Equation Modeling: Separating the General from the Specific (Part II)

As promised in Halo Effects and Multicollinearity (my last post), I will show how to run a confirmatory factor analysis in R to test our bifactor model.  In addition, I will include a dependent variable and fit a structural equation mode...

Read more »

A Chlorpleth Map of Free and Reduced Price Lunch in R

August 26, 2012
By
A Chlorpleth Map of Free and Reduced Price Lunch in R

Charles Blow has an excellent op-ed in the New York Times about public education this week. The most important point he makes is that the defunding of public education is coming at precisely the time when American school children are most vulnerable:No...

Read more »

Walmart Invasion

August 26, 2012
By
Walmart Invasion

As an invasion biologist, the process of spatial spread is at the heart of what I do. When I came across this dataset of Walmart store openings since 1962 I couldn’t help but see it as an invasion front which looks a lot like a biological invasion or (albeit slow) epidemic. The video shows monthly

Read more »

Kaggle Prospect – Harvard Business Review

August 25, 2012
By

This post is meant for submitting visual analysis for the Harvard Business Review Contest on KaggleI used the subject lines for all the articles and all the years and mapped the articles into one of the following 18 categories  Business Ethics  Business Management  Crisis  Emerging Markets  Financial Performance  Health Care  Information Technology  Labor  Leadership  Management Systems  Marketing Strategy  Regulation  Social Media  Stock Market  Strategic Planning  Supply Chain  United States & World  Women & Management Changes in...

Read more »

Economic geography of the eastern USA circa 1999, median incomes…

August 25, 2012
By
Economic geography of the eastern USA
circa 1999, median incomes…

Economic geography of the eastern USA circa 1999, median incomes by zip code Code and data source to follow in a longer post.

Read more »

Why R for Mass Spectrometrist and Computational Proteomics

August 25, 2012
By
Why R for Mass Spectrometrist and Computational Proteomics

Why R:Actually, It is a common practice the integration of the statistical analysis of the resulted data and in silico predictions of the data generated in your manuscript and your daily research. Mass spectrometrist, biologist and bioinformaticians c...

Read more »

Love for ProjectTemplate

August 25, 2012
By
Love for ProjectTemplate

The advantage about writing a blog post about the tools you wish that you’d used throughout grad school is that, well, it makes you check them out. I went through the ProjectTemplate tutorial, and I’m hooked. Here’s the advantages as … Continue reading →

Read more »

London 2012 Olympics — Medals vs GDP and population

August 25, 2012
By
London 2012 Olympics — Medals vs GDP and population

It’s already midnight. I’m sitting near my bed. And before going to bed, I’ll type my last post on London 2012 Olympics. Olympic games are not only individual competitions, but also the reflections of countries’ strength. This is one reason why Olympics data … Continue reading →

Read more »

Exporting ctree object to Asymptote

August 25, 2012
By
Exporting ctree object to Asymptote

When producing regression or classification trees (standard rpart or ctree from party package) in GNU R I am often unsatisfied with the default plots they produce. One of many possible solutions is to export a tree plot to Asymptote.The ...

Read more »

Count data and GLMs: choosing among Poisson, negative binomial, and zero-inflated models

August 24, 2012
By

Ecologists commonly collect data representing counts of organisms. Generalized linear models (GLMs) provide a powerful tool for analyzing count data. The starting point for count data is a GLM with Poisson-distributed errors, but

Read more »

Commandeering a map from PDF or EPS, using Inkscape and R

August 24, 2012
By
Commandeering a map from PDF or EPS, using Inkscape and R

I love Nathan Yau’s tutorial on making choropleths from a SVG file. However, if you don’t have a SVG handy already and instead you want to repurpose a map from another vector format such as PDF or EPS, there are … Continue reading →

Read more »

Toy Example with GScholarScraper_3.1

August 24, 2012
By
Toy Example with GScholarScraper_3.1

A commentator on my blog brought up this nice idea of how to use the GScholarScraper function for bibliometrics..I altered the code a little bit which enables to set a year since when results should be returned and added a field to the output collectin...

Read more »

MPK Analytics – putting the R into analytics

August 24, 2012
By
MPK Analytics – putting the R into analytics

Welcome to the blog of MPK Analytics – the consulting and training company whose mission it is to help clients in academia, industry and government to transform their data into insight using

Read more »

Does playing baseball shorten your lifespan? (Answer: No.)

August 24, 2012
By
Does playing baseball shorten your lifespan? (Answer: No.)

A National Institute for Occupational Safety and Health study, published in March, found that professional American football (NFL) players lived longer, on average, than similar "mere mortals" in the general population. Football is a dangerous sport, so that might seem surprising at first, until you consider the fact that NFL players are elite sportsmen: only the strongest, fastest and...

Read more »

Finding the Best Subset of a GAM using Tabu Search and Visualizing It in R

August 24, 2012
By
Finding the Best Subset of a GAM using Tabu Search and Visualizing It in R

Finding the best subset of variables for a regression is a very common task in statistics and machine learning. There are statistical methods based on asymptotic normal theory that can help you decide whether to add or remove a variable at a time. The ...

Read more »

Visualizing the Arctic Sea Ice Extent Decline

August 24, 2012
By
Visualizing the Arctic Sea Ice Extent Decline

Understanding what is happening to Arctic sea ice is critical to recognizing the serious consequences of global warming. So I want to help people visualize the 30+ year trend in Arctic sea ice extent. The source data file is here: … Continue reading →

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.