RTextTools v1.2 Available on CRAN + useR! 2011 Kaleidoscope Session

RTextTools v1.2 was released today and we're pleased to announce that the package is finally available on CRAN. Additionally, this update brings minor changes to the API, improvements to the GLMNET algorithm, and more comprehensive documentation. Get started by following our installatio

The fun Package: Use R for Fun!

August 16, 2011

A couple of days ago we released a package named fun to CRAN, but I did not dare to send an announcement to [email protected] as usual. This package is a collection of some classic computer games (e.g. Minesweeper and Five in a Row) as well as other funny stuff. Some examples: ## install.packages('fun')

Forecasting in R: Starting From Square One

August 16, 2011

Okay, in the past few posts I jumped the gun a little bit. Errors I made include rushing everything, not explaining anything, and not giving my blog readers the love and respect they deserve. What am I talking about? Well, before we do anythin...

ttrTests Experimentation

August 16, 2011

I was intrigued by the CRAN update on a package ttrTests, especially since quantstrat is not built for backtesting system parameters and analyzing system performance as I mentioned in A Quantstrat to Build On Part 6.  ttrTests offers a nice start ...

R Code Optimization

August 16, 2011

Handling Large Data with R. The following experiments are inspired by this excellent presentation by Ryan Rosario: http://statistics.org.il/wp-content/uploads/2010/04/Big_Memory%20V0.pdf. R offers users many I/O functions for reading and writing data, such as ‘read.table’ and ‘write.table’ (see http://cran.r-project.org/doc/manuals/R-intro.html#Reading-data-from-files). With data growing larger by the day, many new methodologies are available for achieving faster I/O operations.
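
As a rough illustration of the kind of I/O tuning the excerpt alludes to, here is a minimal base R sketch. It is not taken from the linked post or presentation: the data are synthetic, and the names `df`, `path`, `slow` and `fast` are made up for the example. It only shows the documented `colClasses` and `nrows` hints to `read.table`, which spare R from guessing column types and re-growing its buffers.

```r
# Write a small synthetic table to a temporary file (hypothetical data).
set.seed(1)
n <- 1000
df <- data.frame(id = seq_len(n),
                 x  = rnorm(n),
                 g  = sample(letters, n, replace = TRUE))
path <- tempfile(fileext = ".txt")
write.table(df, path, row.names = FALSE, quote = FALSE, sep = "\t")

# Default read: R has to scan the file to guess each column's type.
slow <- read.table(path, header = TRUE, sep = "\t")

# Hinted read: declare column types and the row count up front,
# and switch off comment scanning, which this file does not need.
fast <- read.table(path, header = TRUE, sep = "\t",
                   colClasses = c("integer", "numeric", "character"),
                   nrows = n, comment.char = "")

unlink(path)  # remove the temporary file
```

On a file this small the difference is negligible; wrapping each call in `system.time()` on a much larger file is one way to see whether the hints pay off.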

Brian Ripley on The R Development Process

August 16, 2011

R Core member Professor Brian Ripley from Oxford University gave the first keynote presentation of useR! 2011 today, and gave some insights into what goes on behind the scenes to create two updates to R (plus several patches) every year. He began with some facts about the history of R (noting that if they'd known R would take off...

The R Ecosystem

August 16, 2011

I gave my talk to the useR! 2011 conference this morning: The R Ecosystem. The goal of the talk was to show R in context: that the combination of the R project and its leadership, the R userbase, and the companies supporting and using R makes for a thriving ecosystem and is indicative of an extremely successful open source...

ggplot2 Version of Figures in “25 Recipes for Getting Started with R”

August 16, 2011

To make it easy to compare graphs produced by the base plotting functions with those from ggplot2, I recreated the figures in the book, 25 Recipes for Getting Started with R, with ggplot2. The code used to create the images is in separate paragraphs, allowing easy comparison. Read...

Jonathan Rougier – Nomograms for visualising relationships between three variables (useR! 2011)

August 16, 2011

Background: Donkeys in Kenya. Tricky to find the weight of a donkey in the “field” – no pun intended! So using a few measurements,  estimate the weight. Other covariates include age. Standard practice is to fit: for adult donkeys, and other slightly different models for young/old and ill donkeys. What can a statistician add: Add

Ulrike Gromping – Design of Experiments in R

August 16, 2011

Example: Car seat occupation. Algorithm must decide whether airbag opens: must open for adult but not for small child or if the seat is empty (plus a few others I missed). Key questions are: What type of design: 32-run regular fractional factorial. Response measurement – depends on dummy position, so repeat for 3 different dummy

High Performance Computing

August 16, 2011

Willem Ligtenberg – GPU computing and R. Why GPU computing – theoretical GFLOPs for a GPU is three times greater than a CPU. Use GPUs for single instruction, multiple data (SIMD) problems. Initially GPUs were developed for texture problems. For example, a wall smashed into lots of pieces. Each core handled a single piece. CUDA

Using Deducer to work with R

August 16, 2011

If one checks out the initial question that prompted this series, a common theme in the answers is that one should use the GUI as a tool to help one build code (and not just as a crutch to do the analysis). Being able to view the code produced by the GUI should help beginner R users

R Code Examples on Graphics

August 16, 2011

Some useful R code examples on graphics are: Learn R Toolkit: It contains PowerPoint slideshows, videos, R scripts and data files to help Excel users move up to R. R code examples are provided for panel charts, conditional format, dot …

Kaleidoscope Ic (useR! 2011)

August 16, 2011

These are my rough notes on the Kaleidoscope Ic session. David Smith – The R Ecosystem (useR! 2011) David Smith works for Revolution Analytics. Quick overview of the R project – useR, r-journal, and r-forge. Social media starting to play a part in R – Google+, twitter, stackoverflow, and the traditional R mailing list. The

Brian Ripley – The R Development Process (useR! 2011)

August 16, 2011

These are my notes on the useR! 2011 invited talk. Brian Ripley has been a member of R core since 1998. The R Development Process – An insideR’s view. R Timeline: JCGS paper submitted in 1995. 1997: CRAN (Mar), Core team (Aug), CVS (Sept). R 1.0.0 Feb 2000 – 2.8MB. Many people don’t take 0.X.X seriously. R 2.0.0 Oct

Rchievement of the day

August 16, 2011

Today ‘rjags’ was annoying me. The function ‘update.jags’ was (without asking me) disabling the progress bar when I was running R non-interactively. It was annoying, as I was sending R output to a log file in my dropbox, and having …

Use geom_rect() to add recession bars to your time series plots #rstats #ggplot

August 15, 2011

ggplot2's geom_rect() layer makes it easy to highlight portions of your graph, such as recessions on an economic time series.
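
A minimal sketch of the idea, assuming ggplot2 is installed. This is not the code from the linked post: it uses the `economics` dataset that ships with ggplot2, and the two recession windows in the hypothetical `recessions` data frame are illustrative dates typed in by hand.

```r
library(ggplot2)

# Illustrative recession windows (assumed dates, entered by hand).
recessions <- data.frame(
  start = as.Date(c("2001-03-01", "2007-12-01")),
  end   = as.Date(c("2001-11-01", "2009-06-01"))
)

# `economics` ships with ggplot2: monthly US series with a `date` column.
p <- ggplot(economics, aes(x = date, y = unemploy)) +
  # One shaded rectangle per recession, spanning the full height of the panel.
  geom_rect(data = recessions,
            aes(xmin = start, xmax = end, ymin = -Inf, ymax = Inf),
            inherit.aes = FALSE, fill = "grey60", alpha = 0.4) +
  geom_line()
```

Calling `print(p)` draws the chart. Setting `inherit.aes = FALSE` keeps the rectangle layer from looking for `date` and `unemploy` in the `recessions` data frame, and adding the rectangles before `geom_line()` keeps the series drawn on top of the shading.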

lm System on Nikkei with New Chart

August 15, 2011

I got a great idea from the zoo-overplot demo to make a very helpful visualization of system entry and exit.  Since the lm-based system presented in Unrequited lm Love is newest, I will use this system, but apply to the Nikkei 225 instead of the R...

MySQL and R

August 15, 2011

Using MySQL with R is pretty easy, with RMySQL. Here are a few notes to keep me straight on a few things I always get snagged on. Typically, most folks are going to want to analyze data that's already in a MySQL database. Being a little bass-ackwards, I often want to go the other way. One reason to do...

Paul Murrell – Introduction to Grid graphics (useR! 2011)

August 15, 2011

Typically, I’m very bad at taking notes at conferences. This time around, I intend to make notes for some of the talks I attend at the useR! 2011 conference. Below are the notes that I made during this afternoon’s tutorial. Note: these are just notes I made and aren’t meant to be a

How many NYC restaurants get As on their health inspections?

August 15, 2011

Decision Science News is no stranger to misleading infographics in free New York newspapers. We could stop reading them entirely, but we find that playing "spot the infographic flaw" makes time fly on the subway. Recently we saw the above graphic in a paper called Metro. Can you spot the goof?

Sync Your Rprofile Across Multiple R Installations

August 15, 2011

Your Rprofile is a script that R executes every time you launch an R session. You can use it to automatically load packages, set your working directory, set options, define useful functions, set up database connections, and run any other code you w...
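
For readers who have not written one, here is a small sketch of what such a file might contain. It is an assumption-laden example, not the profile from the linked post: the option values, the helper name `hh`, and the startup message are all made up for illustration.

```r
# Hypothetical contents of ~/.Rprofile (illustrative values only).

# Set session-wide options: a default CRAN mirror and fewer printed digits.
options(repos = c(CRAN = "https://cloud.r-project.org"), digits = 4)

# A small helper (made-up name): show the top-left corner of a data frame.
hh <- function(d, n = 5) {
  d[seq_len(min(n, nrow(d))), seq_len(min(n, ncol(d))), drop = FALSE]
}

# R calls .First() at the end of startup, if it is defined.
.First <- function() {
  message("Profile loaded at ", date())
}
```

Keeping this file in one synced location and pointing each installation at it is the sort of setup the post's title suggests; how the original author does the syncing is not shown in the excerpt.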

Using Emacs as a front-end for R

August 15, 2011

Back when I was a grad student, I was a devoted Emacs user. I basically used it like an operating system: it wasn't just my text editor, but also my mail reader, my Web browser, my news reader, and so much more. (I once even asked our sysadmin to change my default shell to /usr/bin/emacs. He refused.) So when...

Live from useR!

August 15, 2011

The Revolution Analytics team is at the R user conference useR! at Warwick University this week. We'll bring you the updates from the conference with the latest from the R community as we go, but as we're on UK time blogging will be at unusual hours for the next few days. Things are off to a great start, with...

Making Stuff is Scary

August 15, 2011

My daughter's best friend lives just down the street. Her mother runs a cupcake shop that's just a little further down the street. Being eleven going on sixteen, my daughter fancies herself a "quote" -- worker at the shop. She's not paid in actual mone...

Unrequited lm Love

August 14, 2011

In System Failure-Maybe it Will Help I presented the initial trials of a linear model system for stocks, and even though they were not a resounding success, I have been strangely determined to discover a working version of this framework.  Maybe t...

Breaking it up into trend and seasonal and error components

August 14, 2011

> GDP=scan("/Users/stevensabol/Desktop/R/gdp.csv")
Read 258 items
> GDP=ts(GDP,start=1,frequency=4)
> dlGDP=diff(log(GDP))
> plot(stl(log(GDP),"per"))

This allows us to do a structural decomposition:

log(GDP) = trend + season + error

Here is...
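
The identity log(GDP) = trend + season + error can be checked directly on the object that `stl()` returns. The sketch below is a stand-in, not the author's analysis: the original `gdp.csv` is not available here, so it builds a synthetic quarterly series (the hypothetical `gdp_like`) with a trend, a seasonal wiggle and noise.

```r
# Synthetic quarterly series standing in for GDP (made-up data).
set.seed(42)
n  <- 80
tt <- seq_len(n)
gdp_like <- ts(100 * exp(0.01 * tt + 0.05 * sin(2 * pi * tt / 4) +
                           rnorm(n, sd = 0.01)),
               start = 1, frequency = 4)

# Same call shape as in the post; "per" there is short for "periodic".
fit <- stl(log(gdp_like), s.window = "periodic")

# The fitted components: seasonal, trend and remainder, one column each.
parts <- fit$time.series

# Adding the three columns back together should recover log(gdp_like).
recon <- rowSums(parts)
max_gap <- max(abs(recon - as.numeric(log(gdp_like))))
```

Here `max_gap` should be zero up to floating-point error, which is what "structural decomposition" means in practice: nothing is lost, the series is only split into three additive pieces. `plot(fit)` draws the four-panel chart that the post's `plot(stl(...))` call produces.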

Get those plots

August 14, 2011

Type in the following to get a Q-Q plot and a histogram on top of each other:

> par(mfrow=c(2,1))
> hist(dlGDP,prob=T,12)
> lines(density(dlGDP))
> qqnorm(dlGDP)
> qqline(dlGDP)

The top graph says that the errors are pretty nicely distributed around...
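
The same two-panel layout can be tried without the author's data. This is a minimal sketch under the assumption that normally distributed random numbers are an acceptable stand-in for `dlGDP`; the variable name `growth` is made up, and the arguments are spelled out in full (`probability`, `breaks`) rather than abbreviated or passed by position as in the post.

```r
# Stand-in for dlGDP: simulated quarterly log growth rates (made-up data).
set.seed(7)
growth <- rnorm(200, mean = 0.008, sd = 0.01)

# Two rows, one column; remember the old settings so they can be restored.
op <- par(mfrow = c(2, 1))

# Top panel: density-scaled histogram with a kernel density curve on top.
h <- hist(growth, probability = TRUE, breaks = 12)
lines(density(growth))

# Bottom panel: normal Q-Q plot with its reference line.
qqnorm(growth)
qqline(growth)

par(op)  # restore the previous layout
```

With `probability = TRUE` the bar areas sum to one, which is what lets the density curve sit on the same scale as the histogram.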
