467 search results for "hadoop"

Set up Sublime Text for light-weight all-in-one data science IDE

December 23, 2015
By
Set up Sublime Text for light-weight all-in-one data science IDE

tl;dr Sublime Text is a powerful text editor. Here I introduce how to add custom REPL config for remote/local R, Python, Scala, Spark, Hive, you name it (this is only tested for OS X). The example below interprets local Python (top), R (middle) and Hive (bottom) code on remote. IDE for everything Good IDEs are everywhere. RStudio for R, Pycharm for...

Read more »

All I want for Christmas is you big data analytics!

December 21, 2015
By
All I want for Christmas is you big data analytics!

By Hannah Evans Sound familiar? All businesses have data. But whether it is used to drive business value is another question entirely. Traditionally, technical analysts have made decisions about data technology, without truly understanding the business challenges beforehand, meaning that … Continue reading →

Read more »

The Data-Driven Weekly #1.6

December 16, 2015
By
The Data-Driven Weekly #1.6

Right on cue, this past week heralded in an announcement of OpenAI, a new non-profit started by a number of …Continue reading →

Read more »

Trade-offs to consider when reading a large dataset into R using the RevoScaleR package

December 15, 2015
By
Trade-offs to consider when reading a large dataset into R using the RevoScaleR package

by Seth Mottaghinejad, Data Scientist at Microsoft R and big data There are many R packages dedicated to letting users (or useRs if you prefer) deal with big data in R. (We will intentionally avoid using proper case for 'big data', because (1) the term has been somewhat hackneyed, and (2) for the sake of this article we can...

Read more »

Data Science Radar – Technologist Profile

December 14, 2015
By
Data Science Radar – Technologist Profile

    by Mark Sellors, Mango Solutions @sellorm  Mark Sellors from Mango took the Data Science Radar Challenge and his dominant skill was a Technologist, so we asked him a few questions. 1.  Tell us a bit about your background … Continue reading →

Read more »

Data Science Radar – Data Wrangler Profile

December 7, 2015
By
Data Science Radar – Data Wrangler Profile

by Steph Locke, Mango Solutions @SteffLocke Steph Locke Data Science Radar – Nov 2015 1. Tell us a bit about your background in Data Science I started off as a Product Analyst doing a bit of this, a bit of … Continue reading →

Read more »

The Case for a Data Science Lab

December 1, 2015
By

By Mark Sellors, Technical Architect – Mango Solutions As more and more Data Science moves from individuals working alone, with small data sets on their laptops, to more productionised, or analytically mature settings, an increasing number of restrictions are being … Continue reading →

Read more »

Experiments on english wikipedia. GloVe and word2vec.

November 30, 2015
By
Experiments on english wikipedia. GloVe and word2vec.

Today I will start to publish series of posts about experiments on english wikipedia. As I said before, text2vec is inspired by gensim - well designed and quite efficient python library for topic modeling and related NLP tasks. Also I found very useful Radim’s posts, where he tried to evaluate some algorithms on english wikipedia dump....

Read more »

R online classes with leading experts at statistics.com (33% discount)

November 24, 2015
By
2015-11-25 00_49_11-Clipboard

Statistics.com is an online learning website with 100+ courses in statistics, analytics, data mining, text mining, forecasting, social network analysis, spatial analysis, etc. They have kindly agreed to offer R-Bloggers readers a reduced rate of $399 for any of their 23 courses in R, Python, SQL or SAS.  These are high-impact courses, each 4-weeks long (normally costing up to $589).  They...

Read more »

The R-Podcast Episode 14: Tips and Tricks for using R-Markdown

November 18, 2015
By

The R-Podcast is back up and running! In this episode I discuss some useful resources and helpful tips/extensions that have greatly enhanced my work flow in creating reproducible analysis documents via R-Markdown. I also highlight some exciting new endeavors in the R community as well as provide my take on two key events that further

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)