399 search results for "hadoop"

SparkR preview by Vincent Warmerdam

May 28, 2015
By
SparkR preview by Vincent Warmerdam

SparkR preview in Rstudio Apache Spark is the hip new technology on the block. It allows you to write scripts in a functional style and the technology behind it will allow you to run iterative tasks very quickly on a cluster of machines. It’s benchmarked to be quicker than hadoop for most machine learning use

Read more »

RevoScaleR’s Naive Bayes Classifier rxNaiveBayes()

May 28, 2015
By
RevoScaleR’s Naive Bayes Classifier rxNaiveBayes()

by Joseph Rickert, Because of its simplicity and good performance over a wide spectrum of classification problems the Naïve Bayes classifier ought to be on everyone's short list of machine learning algorithms. Now, with version 7.4 we have a high performance Naïve Bayes classifier in Revolution R Enterprise too. Like all Parallel External Memory Algorithms (PEMAs) in the RevoScaleR...

Read more »

R tops 2015 KDnuggets Software Poll

May 27, 2015
By
R tops 2015 KDnuggets Software Poll

R is the leading choice for Predictive Analytics / Data Mining / Data Science software according to the results of the 2015 KDnuggets Software Poll, now in its 16th year. Each of the 28,000 participants selected one or more tools they had used in the last year from a list of 93 options, and R was selected by 46.9%...

Read more »

R #1 by Wide Margin in Latest KDnuggets Poll

May 27, 2015
By
R #1 by Wide Margin in Latest KDnuggets Poll

The results of the latest KDnuggets Poll on software for Analytics, Big Data and Data Mining are out, and R has moved into the #1 position by a wide margin. I’ve updated the Surveys of Use section of The Popularity of Data … Continue reading →

Read more »

First Day Highlights from the Extremely Large Databases Conference

May 21, 2015
By
First Day Highlights from the Extremely Large Databases Conference

by Joseph Rickert The 8th XLDB (Extremely Large Databases) Conference open at Stanford on Tuesday with an outstanding program. This conference has been providing leadership in the "Big Data" world since its first workshop which was held in 2007. For example, the summary report for that year notes: "Both communities (industry and science) are moving towards parallel ... architectures...

Read more »

Course on using Oracle R Enterprise

Course on using Oracle R Enterprise

BNOSAC will be giving from June 08 up to June 12 a 5-day crash course on the use of R using Oracle R Enterprise. The course is given together with our Oracle Partner in Leuven, Belgium. If you are interested in attending, contact us for further details. For R users who aren't aware of this yet....

Read more »

Open soure software has changed the way we do business

May 20, 2015
By

Earlier this month TechCrunch published an article of mine, "The Business Economics And Opportunity Of Open-Source Data Science". With this article I wanted to share how open-source software has disrupted the economics of doing business, now that data is a fundamental component of every businesses' operations. Open source projects like Hadoop and R, coupled with commodity hardware, have fundamentally...

Read more »

Benchmarking Random Forest Implementations

May 19, 2015
By
Benchmarking Random Forest Implementations

I currently have the need for machine learning tools that can deal with observations of...

Read more »

What’s new in Revolution R Enterprise 7.4

May 18, 2015
By

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics Without missing a beat, the engineers at Revolution Analytics have brought another strong release to users of Revolution R Enterprise (RRE). Just a few weeks after acquisition of Revolution Analytics by Microsoft, RRE 7.4 was released to customers on May 15 adding new capabilities, enhanced performance and security, ann faster...

Read more »

What data science software tools do you use?

May 11, 2015
By

KDnuggets is once again running its annual poll of data science software tools, now in its 16th year. If you'd like to participate, visit the KDnuggets poll page and answer the question, "What Predictive Analytics, Data Mining, Data Science software/tools you used in the past 12 months?". The poll allows you to select up to 20 tools from the...

Read more »