Quick dprint Experiment

May 24, 2012

As a quick dprint experiment, I thought I would try a quarterly return table that might fit in knitR Performance Report 3 (really with knitr) and dprint. Although I do not think I will use it in the final report, I do think it i...

RStudio v0.96.225 Update

May 24, 2012

There’s an updated release of RStudio v0.96 available that includes some small enhancements and bugfixes, including: comment/uncomment for Sweave and LaTeX; additional in-product documentation for R Markdown; offline support for MathJax previews; and more flexible handling of MathJax inline equations. The release notes include a full list of all the changes. We’ve also published some additional documentation on

Slides for R/Finance 2012

May 23, 2012

Another successful* year of R/Finance is behind us. It was certainly more: a larger crowd, a longer session, more seminars, more presentations, more sponsors – perhaps even to the point where we’re reaching a certain capacity. What began as an interesting idea among a few friends has more than credible momentum – it’s now more

If You are a R Developer, Then You Must Try SAP HANA for Free.

May 23, 2012

This is a guest blog from Alvaro Tejada Galindo, my colleague and fellow R and SAP HANA enthusiast. I am thankful to Alvaro for coming and posting on "AllThingsR". Are you an R developer? Have you ever heard of SAP HANA? Would you like to test SAP HANA for free? SAP HANA is an in-memory database technology that allows developers to analyze big data in real time. Processes that...

NYT charts the Facebook IPO with R

May 23, 2012

In conjunction with Facebook's record-setting IPO last Thursday, the New York Times created an infographic to put the size of the offer in context with other recent IPOs. A detail of the graphic as it appeared in the print edition appears below: ChartsNThings gives a fascinating peek into the weeklong process that went into creating this chart, where about...

Global Fires, the Amazon and Humans

May 23, 2012

Fires are natural, most of the time. Natural Global Fires: Many plants and animals have evolved to depend on fires occurring periodically in certain parts of the world. This phenomenon has been occurring for...

knitR Performance Report 3 (really with knitr) and dprint

May 23, 2012

Please see knitr Performance Report–Attempt 3, knitr Performance Report-Attempt 2 and knitr Performance Report-Attempt 1. alstated asked a very good question in his comment on knitr Performance Report–Attempt 3, and I’m not sure I could have a...

R-NOLD 2012-05-23 05:48:00

May 23, 2012

Mapping Global Earthquake using XML and Maptools. Every day the US Geological Survey (USGS) publishes earthquake data (http://earthquake.usgs.gov/earthquakes/recenteqsww/Quakes/quakes_all.html) from all over the globe. Using the XML and maptools packages of R I d...

Are scatterplots too complex for lay folks?

May 23, 2012

Usually, I like to write about the solutions to problems I’ve had, but today I only have a problem to write about. This is the second research job I’ve had outside of academia, and in both cases I’ve met with …

My On-Job Training Analytics

May 22, 2012

I have been working at the Provincial Statistics Office of Tawi-Tawi (Philippines) as part of my OJT (On-Job Training). One of the requirements of the training is at least 80 hours of service, so I decided to work from April 19 to M...

My new forecasting textbook

May 22, 2012

After years of saying that I was going to write a book to replace Makridakis, Wheelwright and Hyndman (1998), I’m finally ready to make an announcement! My new book is Forecasting: principles and practice, co-authored with George Athanasopoulos. It is available online and free-of-charge. We have written about 2/3 of the book so far (all of which is already...

The grade level of Congress speeches, analyzed with R

May 22, 2012

As widely reported by CNN, the Huffington Post, and Talking Points Memo, the sophistication of speeches by US politicians has declined in recent years, dropping from an 11th-grade level in 2005 to a 10th-grade level today. The reports draw on an analysis by the Sunlight Foundation, based on textual analysis of congressional speeches given since 1996 provided by the...

knitr Performance Report–Attempt 3

May 22, 2012

Please see knitr Performance Report-Attempt 2 and knitr Performance Report-Attempt 1. Since the time of my last reporting post, RStudio, knitr, and Sweave have worked extremely hard to make document creation easier by becoming even more streamlined and ...

read.odbc.ffdf & read.dbi.ffdf for fetching large corporate SQL data

If you are into large data, but not the enormously big data everyone is talking about, and you are tired of looking for a way to get data with several tens of millions of records into R without RAM issues, having a look at the packages ff, ffbase and ETLUtils might be the solution to your...

Sunlight foundation analyses complexity of congressional speech

May 22, 2012

A complete Bayesian model for sensory profiling data

May 22, 2012

In this post I will try to add an important part to the sensory profiling model I have been building. This concerns the question: 'Are all panelists equally reproducible?'. Obviously the answer is no, some are better than others. From this observation...

Adding watermarks to plots

May 22, 2012

A question was raised today on the mailing list: Is there an easy way to add a watermark to a ggplot? There are several options, depending on the type of watermark and the required level of control over the output: add a text label using annotate (th...

Correlations and positive-definiteness

May 22, 2012

On the way to another destination, I found some curious behavior with average correlations. The data: daily log returns from almost all of the constituents of the S&P 500 for years 2006 through 2011. The behavior: Figure 1 shows the actual mean correlation among stocks for the set of years and the mean correlation with …

LME summary data – results table

May 21, 2012

UPDATE: Based on the comment from ‘linuxizer’, I’ve updated this to stay in line with the S3 classes, something I didn’t have my head around at the time and still don’t know inside out. Brief post. One thing I do often …

Classical Technical Patterns

May 21, 2012

In my presentation about Seasonality Analysis and Pattern Matching at the R/Finance conference, I used examples that I have previously covered in my blog: Month of the Year Seasonality – I introduced the Seasonality charts in the Historical Seasonality Analysis: What company in DOW 30 is likely to do well in January? post. I also

When SAP HANA met R – First kiss

If you follow my blogs (I hope you do) then you know I really love the R programming language but I also love SAP HANA and in the past I have dealt with integration between those two: HANA meets R, R meets HANA, and Sanitizing data in SAP HANA with R. But... those integrations were not done using the...

A visual data summary for data frames

May 21, 2012

If you want to get a quick numerical summary of a data set, the summary function gives a nice overview for data frames: > require(ggplot2) Loading required package: ggplot2 > data(diamonds) > summary(diamonds) carat cut color clarity depth table Min. :0.2000 Fair : 1610 D: 6775 SI1 :13065 Min. :43.00 Min. :43.00 1st Qu.:0.4000 Good : 4906 E: 9797...

Inferring the community structure of networks

May 21, 2012

I continue my little excursion into network science. In the last post, I gave a little introduction to simulating and visualizing undirected networks with community structure in R. In this post I want to explore a method to infer the community structure of a network from its adjacency matrix. That is, given that I know

R development master class: NYC June 21-22, Bay Area June 28-29

May 21, 2012

(A guest post by Hadley Wickham) Hi all, I’m going to be teaching R development master classes in NYC June 21-22 and in the Bay Area June 28-29. The basic idea of the class is to help you write better code, focused on the mantra of “do not repeat yourself”. In day one you will learn powerful new...

Permutation tests in R

May 21, 2012

Permutation tests (also called randomization or re-randomization tests) have been around for a long time, but it took the advent of high-speed computers to make them practically available. They can be particularly useful when your data are sampled from unknown …
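
For readers who have not met the idea, here is a minimal sketch of a two-sample permutation test in base R. It is not taken from the linked post: the simulated data, the function name `perm_test`, and the choice of the difference in means as the test statistic are all assumptions made for illustration.

```r
# Two-sample permutation test on the difference in means.
# Data and function name are hypothetical, for illustration only.
set.seed(42)
x <- rnorm(20, mean = 0.5)   # hypothetical group A
y <- rnorm(20, mean = 0.0)   # hypothetical group B

perm_test <- function(x, y, n_perm = 5000) {
  observed <- mean(x) - mean(y)
  pooled <- c(x, y)
  n_x <- length(x)
  perm_stats <- replicate(n_perm, {
    idx <- sample(length(pooled), n_x)       # random relabelling of the pooled data
    mean(pooled[idx]) - mean(pooled[-idx])
  })
  # Two-sided p-value; the +1 terms count the observed statistic itself
  (sum(abs(perm_stats) >= abs(observed)) + 1) / (n_perm + 1)
}

p_value <- perm_test(x, y)
print(p_value)
```

The p-value is the share of random relabellings whose statistic is at least as extreme as the observed one, so no distributional assumption is needed beyond exchangeability under the null hypothesis.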

RcppArmadillo 0.3.2.0

A new stable release 3.2.0 of Armadillo is now available. As usual, we have wrapped this into a new RcppArmadillo package, now at 0.3.2.0; and this version is now available via CRAN. The short NEWS entry follows below. For those interested in follo...

Project Euler — problem 2

May 21, 2012

Almost time for bed, so just a quick solution to the second problem of Project Euler. Here it is. Each new term in the Fibonacci sequence is generated by adding the previous two terms. By starting with 1 and 2, …

The Simple Gibbs example in Julia

May 21, 2012

The Gibbs sampler discussed on Darren Wilkinson's blog and also on Dirk Eddelbuettel's blog has been implemented in several languages, the first of which was R. In preparation for a session at useR!2012 on "What other languages should R user...

Example 9.32: Multiple testing simulation

May 21, 2012

In examples 9.30 and 9.31 we explored corrections for multiple testing and then extracted p-values adjusted by the Benjamini and Hochberg (or FDR) procedure. In this post we'll develop a simulation to explore the impact of "strong" and "weak" control of the family-wise error rate offered in multiple comparison corrections. Loosely put, weak control procedures...
