515 search results for "Hadoop"

The Data-Driven Weekly #1.6

December 16, 2015
By
The Data-Driven Weekly #1.6

Right on cue, this past week heralded in an announcement of OpenAI, a new non-profit started by a number of …Continue reading →

Read more »

Trade-offs to consider when reading a large dataset into R using the RevoScaleR package

December 15, 2015
By
Trade-offs to consider when reading a large dataset into R using the RevoScaleR package

by Seth Mottaghinejad, Data Scientist at Microsoft R and big data There are many R packages dedicated to letting users (or useRs if you prefer) deal with big data in R. (We will intentionally avoid using proper case for 'big data', because (1) the term has been somewhat hackneyed, and (2) for the sake of this article we can...

Read more »

Data Science Radar – Technologist Profile

December 14, 2015
By
Data Science Radar – Technologist Profile

    by Mark Sellors, Mango Solutions @sellorm  Mark Sellors from Mango took the Data Science Radar Challenge and his dominant skill was a Technologist, so we asked him a few questions. 1.  Tell us a bit about your background … Continue reading →

Read more »

R tutorials

December 10, 2015
By
image02

There are tons of resources to help you learn the different aspects of R, and as a beginner this can be overwhelming. It’s also a dynamic language and rapidly changing, so it’s important to keep up with the latest tools and technologies. That’s why R-bloggers and DataCamp have worked together to bring you a learning path for R. Each section...

Read more »

Data Science Radar – Data Wrangler Profile

December 7, 2015
By
Data Science Radar – Data Wrangler Profile

by Steph Locke, Mango Solutions @SteffLocke Steph Locke Data Science Radar – Nov 2015 1. Tell us a bit about your background in Data Science I started off as a Product Analyst doing a bit of this, a bit of … Continue reading →

Read more »

The Case for a Data Science Lab

December 1, 2015
By

By Mark Sellors, Technical Architect – Mango Solutions As more and more Data Science moves from individuals working alone, with small data sets on their laptops, to more productionised, or analytically mature settings, an increasing number of restrictions are being … Continue reading →

Read more »

Experiments on english wikipedia. GloVe and word2vec.

November 30, 2015
By
Experiments on english wikipedia. GloVe and word2vec.

Today I will start to publish series of posts about experiments on english wikipedia. As I said before, text2vec is inspired by gensim - well designed and quite efficient python library for topic modeling and related NLP tasks. Also I found very useful Radim’s posts, where he tried to evaluate some algorithms on english wikipedia dump....

Read more »

GloVe vs word2vec revisited.

November 30, 2015
By
GloVe vs word2vec revisited.

Today I will start to publish series of posts about experiments on english wikipedia. As I said before, text2vec is inspired by gensim - well designed and quite efficient python library for topic modeling and related NLP tasks. Also I found very useful Radim’s posts, where he tried to evaluate some algorithms on english wikipedia dump....

Read more »

R online classes with leading experts at statistics.com (33% discount)

November 24, 2015
By
2015-11-25 00_49_11-Clipboard

Statistics.com is an online learning website with 100+ courses in statistics, analytics, data mining, text mining, forecasting, social network analysis, spatial analysis, etc. They have kindly agreed to offer R-Bloggers readers a reduced rate of $399 for any of their 23 courses in R, Python, SQL or SAS.  These are high-impact courses, each 4-weeks long (normally costing up to $589).  They...

Read more »

The R-Podcast Episode 14: Tips and Tricks for using R-Markdown

November 18, 2015
By

The R-Podcast is back up and running! In this episode I discuss some useful resources and helpful tips/extensions that have greatly enhanced my work flow in creating reproducible analysis documents via R-Markdown. I also highlight some exciting new endeavors in the R community as well as provide my take on two key events that further

Read more »

Recent popular posts

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)