1195 search results for "tutorial"

Reminder: One week til Rcpp class in San Francisco

October 1, 2011
By

Just a quick note to remind everyone that the Rcpp class in San Francisco, which I am holding together with Revolution Analytics, will take place a week from today. We are happy to report that the number of registrations has met our initial target...

Read more »

Simulation studies in R – Using all cores and other tips

September 16, 2011
By

After working more seriously with simulations I noticed some updates were necessary to my previous setup. Most notably are the following three: It is very handy to explicitly call the different scenarios instead of using nested loops Storing intermediate results in single files obliviates the need to rerun an almost finished but crashed analysis and

Read more »

How to program MapReduce jobs in Hadoop with R

September 13, 2011
By

MapReduce is a powerful programming framework for efficiently processing very large amounts of data stored in the Hadoop distributed filesystem. But while several programming frameworks for Hadoop exist, few are tuned to the needs of data analysts who typically work in the R environment as opposed to general-purpose languages like Java. That's why the dev team at Revolution Analytics...

Read more »

Alternately coloured line environment with fancyvrb

September 11, 2011
By

Recently, while typing up an R tutorial, I used the LaTeX fancyvrb package to create two environments—one coloured blue for R commands, and one coloured red to display R output. This worked well for large blocks of each type. Then I decided I wan...

Read more »

A simple example for writting parallel code

September 7, 2011
By
A simple example for writting parallel code

Today, programmers have to deal with multi-core and multi-computer technologies. Several people claim that software developers are far behind hardware technologies. My two favorite posts for this statement are Editor’s Desk: Software Lags Behind Hardware, But That’s a Good Thing A Hacker’s Craic -Why is software so far behind hardware? Parallel computing is not that

Read more »

Google Spreadsheets API: Listing Individual Spreadsheet Sheets in R

September 7, 2011
By
Google Spreadsheets API: Listing Individual Spreadsheet Sheets in R

In Using Google Spreadsheets as a Database Source for R, I described a simple Google function for pulling data into R from a Google Visualization/Chart tools API query language query applied to a Google spreadsheet, given the spreadsheet key and worksheet ID. But how do you get a list of sheets in spreadsheet, without opening

Read more »

Getting Started with Latent Dirichlet Allocation using RTextTools + topicmodels

RTextTools bundles a host of functions for performing supervised learning on your data, but what about other methods like latent Dirichlet allocation? With some help from the topicmodels package, we can get started with LDA in just five steps. Text in

Read more »

HPC for biological research

August 28, 2011
By

In early May I had the opportunity to attend a workshop on using high performance computing in R hosted at Nimbios. I’ve been meaning to write a summary of the meeting ever since but got sidetracked by various other projects. Since a collaborator recently asked for meeting notes I finally took the time to write

Read more »

FishBASE from R

August 26, 2011
By
FishBASE from R

In lab known for its quality data collection, high-speed video style, writing the weekly blog post can be a bit of a challenge for the local code monkey. That’s right, no videos today. But lucky for me, even this group … Continue reading →

Read more »

Maiden voyage

August 23, 2011
By
Maiden voyage

Who Me. I'm an associate professor of Statistics at Youngstown State University in Youngstown, Ohio, USA. I've been using R for about 7 years, Emacs about 3 years, git about 1 year, and Org-Mode for less than a year. What I want this blo...

Read more »