418 search results for "hadoop"

partools: a Sensible R Package for Large Data Sets

August 5, 2015
By
partools: a Sensible R Package for Large Data Sets

As I mentioned recently, the new, greatly extended version of my partools package is now on CRAN. (The current version on CRAN is 1.1.3, whereas at the time of my previous announcement it was only 1.1.1. Note that Unix is NOT required.) It is my contention that for most R users who work with large … Continue reading...

Read more »

Getting Started: Adobe Analytics Clickstream Data Feed

August 4, 2015
By
Getting Started: Adobe Analytics Clickstream Data Feed

“Well, first you need a TMS and a three-tiered data layer, then some jQuery with a node backend to inject customer data into the page asynchronously if you want to avoid cookie-based limitations with cross-domain tracking and be Internet Explorer 4 compatible…” Blah Blah Blah. There’s a whole cottage industry around jargon-ing each other to

Read more »

Predict Social Network Influence with R and H2O Ensemble Learning

July 29, 2015
By
Predict Social Network Influence with R and H2O Ensemble Learning

What is H2O? H2O is an awesome machine learning framework. It is really great for data scientists and business analysts “who need scalable and fast machine learning”. H2O is completely open source and what makes it important is that works right of the box. There seems to be no easier way to start with scalable The post

Read more »

Installing and Starting SparkR Locally on Windows OS and RStudio

July 26, 2015
By
Installing and Starting SparkR Locally on Windows OS and RStudio

Introduction With the recent release of Apache Spark 1.4.1 on July 15th, 2015, I wanted to write a step-by-step guide to help new users get up and running with SparkR locally on a Windows machine using command shell and RStudio. SparkR provides an R frontend to Apache Spark and using Spark’s distributed computation engine allows

Read more »

CACM Highlights R

July 23, 2015
By
CACM Highlights R

The Association for Computing Machinery is the main professional organization for computer science, largely for academia but still with a broad membership. ACM publishes a number of journals, most of them for research but its flagship publication is a magazine, the Communications of the ACM. The current issue of the CACM includes an article, “Bringing … Continue reading...

Read more »

19 new R jobs! (from R-users.com ; 2015-07-21)

July 21, 2015
By
19 new R jobs! (from R-users.com ; 2015-07-21)

This is the bimonthly post (for 2015-07-21) for new R Jobs from R-users.com. Employers: visit this link to post a new R job to the R community (it’s free and quick). Job seekers: please follow the links below to learn more and apply for your job of interest (or visit previous R jobs posts). Full-Time FPS (game expert) Account Specialist (@Texas) PVP Live – Posted by Kierisi Frisco Texas, United States...

Read more »

Spark 1.4 for RStudio

July 14, 2015
By
Spark 1.4 for RStudio

Today’s guest post is written by Vincent Warmerdam of GoDataDriven and is reposted with Vincent’s permission from blog.godatadriven.com. You can learn more about how to use SparkR with RStudio at the 2015 EARL Conference in Boston November 2-4, where Vincent will be speaking live. This document contains a tutorial on how to provision a spark

Read more »

In case you missed it: June 2015 roundup

July 10, 2015
By

In case you missed them, here are some articles from June of particular interest to R users. The R Consortium, a trade group dedicated to the support and growth of the R Community, has launched with the R Foundation, Microsoft, RStudio and others as founding members. A detailed FAQ for fitting Generalized Linear Models in R. My presentation on...

Read more »

Review: Machine Learning with R Cookbook

July 3, 2015
By

"Machine Learning with R Cookbook" by Chiu Yu-Wei is nothing more or less than it purports to be: a collection of 110 recipes for applying Data Analysis and Machine Learning techniques in R. I was asked by the publishers to review this book and found it to be an interesting and informative read. It will The post

Read more »

useR 2015: it’s a wrap!

July 3, 2015
By

The latest worldwide R user conference has just wrapped up in Aalborg, Denmark and useR! 2015 was the best yet. A hearty round of applause to the organizers for a smoothly run, informative and fun event. To the organizers of next year's event in the Stanford, California: the bar has been raised. As I was chatting to various participants,...

Read more »