807 search results for "parallel"

Data Analysis Training

March 20, 2012
By
Data Analysis Training

I'm training some of my colleagues on Big'ish data analysis this week. Here's how I'm running the class. Would love your ideas to make it better. CLASS OBJECTIVES (LEARNING OUTCOMES)After completion of the course, you will be able to:Understand concept...

Read more »

find | xargs … Like a Boss

March 9, 2012
By

*Edit March 12* Be sure to look at the comments, especially the commentary on Hacker News - you can supercharge the find|xargs idea by using find|parallel instead.---Do you ever discover a trick to do something better, faster, or easier, and wish you c...

Read more »

Mining Twitter for consumer attitudes towards hotels

March 9, 2012
By
Mining Twitter for consumer attitudes towards hotels

Couple of months back I read Jeffrey Breen’s presentation on mining Twitter for consumer attitudes towards airlines, so I was just curious how it would look if I estimate the sentiment toward major hotels. So here it is: # load twitter library > library(twitteR) # search for all the hilton tweets > hilton.tweets=searchTwitter('@hilton',n=1500) > length(hilton.tweets)

Read more »

Big-data Naive Bayes and Classification Trees with R and Netezza

March 8, 2012
By

The IBM Netezza analytics appliances combine high-capacity storage for Big Data with a massively-parallel processing platform for high-performance computing. With the addition of Revolution R Enterprise for IBM Netezza, you can use the power of the R language to build predictive models on Big Data. In the demonstration below, Revolution Analytics' Derek Norton analyzes loan approval data stored on...

Read more »

Montreal R workshop: Plyr, reshape and other data manipulation goodies

March 8, 2012
By
Montreal R workshop: Plyr, reshape and other data manipulation goodies

March 12, 2012 14h-16h N4/17 Stewart Biology Building, McGill University Étienne Low-Decarie, McGill University This workshop is organized by the BGSA and is free of charge (!), but space is limited. Register early to ensure your spot! From Étienne: Ever want to split your data according to factors, apply a function on each part and

Read more »

lembarrasduchoix asked: thank you for the introduction to…

March 6, 2012
By
lembarrasduchoix asked:
thank you for the introduction to…

lembarrasduchoix asked: thank you for the introduction to Newcomb’s paradox! Could you do a post on your favorite paradoxes?    The decision theory paradoxes I’m familiar with are: Ellsberg Paradox— Theorists encode bothsituations with unknown...

Read more »

doSMP pulled

March 1, 2012
By
doSMP pulled

They have finally pulled that buggy unreliable piece of code that was doSMP from the CRAN mirrors while (I hear) Revolutions are re-writing it. To use all your cores for analysis on the Windows platform, you can try doSNOW instead; my code is something like the fragment...

Read more »

doSMP pulled

March 1, 2012
By
doSMP pulled

They have finally pulled that buggy unreliable piece of code that was doSMP from the CRAN mirrors while (I hear) Revolutions are re-writing it. To use all your cores for analysis on the Windows platform, you can try doSNOW instead; my code is something like the fragment below. Neither option is as attractive...

Read more »

Custom Amazon EC2 config for Rstudio

February 29, 2012
By

IntroductionThis post is a work in progress building on the previous post. It's my attempt to simultaneously learn Amazon's AWS tools and set up R and Rstudio Server on a customized "cloud" instance. I look forward to testing some R jobs that have la...

Read more »

Webinar tomorrow: Big-data statistics with Revolution R with IBM Netezza

February 28, 2012
By

As explained in detail by Michele Chambers at the IBM Netezza blog, there are two keys to getting fast performance with statistical analysis on massive data sets with R: Massive parallelization: break the problem down into small pieces, and run them in parallel Bring the R engine to the data (not the other way around), to avoid data transfer...

Read more »