501 search results for "hadoop"

Benchmarking Random Forest Implementations

May 19, 2015
By
Benchmarking Random Forest Implementations

I currently have the need for machine learning tools that can deal with observations of...

Read more »

What’s new in Revolution R Enterprise 7.4

May 18, 2015
By

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics Without missing a beat, the engineers at Revolution Analytics have brought another strong release to users of Revolution R Enterprise (RRE). Just a few weeks after acquisition of Revolution Analytics by Microsoft, RRE 7.4 was released to customers on May 15 adding new capabilities, enhanced performance and security, ann faster...

Read more »

Oracle R Enterprise issues in Oracle Big Data Lite VM 4.1.0

Oracle R Enterprise issues in Oracle Big Data Lite VM 4.1.0

In the previous post, we examined some configuration issues with Cloudera Manager and Hadoop services in the latest release of Oracle Big Data Lite VM (4.1.0). In this post we report issues with Oracle R Enterprise, and the remedies we applied. It turns out that if we load the ORE package in R, we subsequently cannot use the help...

Read more »

What data science software tools do you use?

May 11, 2015
By

KDnuggets is once again running its annual poll of data science software tools, now in its 16th year. If you'd like to participate, visit the KDnuggets poll page and answer the question, "What Predictive Analytics, Data Mining, Data Science software/tools you used in the past 12 months?". The poll allows you to select up to 20 tools from the...

Read more »

In case you missed it: April 2015 roundup

May 8, 2015
By

In case you missed them, here are some articles from April of particular interest to R users. Joseph Rickert reviews the inaugural New York City R User Conference, featuring Andrew Gelman. Engineer Vineet Abraham compares performance benchmarks for R and Revolution R Open on OS X and Ubuntu. R was featured in the keynotes for the BUILD developer’s conference....

Read more »

useR! 2015 conference in Aalborg

May 6, 2015
By
useR! 2015 conference in Aalborg

The annual R conference bringing together users and developers from academia and industry is going to be held in Aalborg, Denmark, this summer, 1-3 July. The day prior to the conference 16 R tutorials are offered free of charge to the participants. The list of topics include dplyr, Tessera, Bioconductor, Grid graphics and RHadoop. In addition to our six...

Read more »

See R in action at the BUILD conference

April 29, 2015
By
See R in action at the BUILD conference

Build 2015, the Microsoft conference which brings around 5,000 developers to the Moscone Center in San Francisco, begins tomorrow. The conference is sold out, but you can livestream the keynote presentations from buildwindows.com to catch all the big announcements. You can also follow along on Twitter at the #Build2015 hashtag. There will be a major keynote presentation featuring CEO...

Read more »

The Intersection of “Data Capital” and Advanced Analytics

April 17, 2015
By

We’ve heard about the Three Laws of Data Capital from Paul Sonderegger at Oracle: data comes from activity, data tends to make more data, and platforms tend to win. Advanced analytics enables enterprises to take full advantage of the data their activity produces, ranging from IoT sensors and PoS transactions to social media and image/video. Traditional BI tools...

Read more »

Coarse Grain Parallelism with foreach and rxExec

April 2, 2015
By

by Joseph Rickert I have written a several posts about the Parallel External Memory Algorithms (PEMAs) in Revolution Analytics’ RevoScaleR package, most recently about rxBTrees(), but I haven’t said much about rxExec(). rxExec() is not itself a PEMA, but it can be used to write parallel algorithms. Pre-built PEMAs such as rxBTrees(), rxLinMod(), etc are inherently parallel algorithms designed...

Read more »

A first look at rxBTrees

March 19, 2015
By
A first look at rxBTrees

by Joseph Rickert The gradient boosting machine as developed by Friedman, Hastie, Tibshirani and others, has become an extremely successful algorithm for dealing with both classification and regression problems and is now an essential feature of any machine learning toolbox. R’s gbm() function (gbm package) is a particularly well crafted implementation of the gradient boosting machine that served as...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)