510 search results for "hadoop"

GloVe vs word2vec revisited.

November 30, 2015
By
GloVe vs word2vec revisited.

Today I will start to publish series of posts about experiments on english wikipedia. As I said before, text2vec is inspired by gensim - well designed and quite efficient python library for topic modeling and related NLP tasks. Also I found very useful Radim’s posts, where he tried to evaluate some algorithms on english wikipedia dump....

Read more »

R online classes with leading experts at statistics.com (33% discount)

November 24, 2015
By
2015-11-25 00_49_11-Clipboard

Statistics.com is an online learning website with 100+ courses in statistics, analytics, data mining, text mining, forecasting, social network analysis, spatial analysis, etc. They have kindly agreed to offer R-Bloggers readers a reduced rate of $399 for any of their 23 courses in R, Python, SQL or SAS.  These are high-impact courses, each 4-weeks long (normally costing up to $589).  They...

Read more »

The R-Podcast Episode 14: Tips and Tricks for using R-Markdown

November 18, 2015
By

The R-Podcast is back up and running! In this episode I discuss some useful resources and helpful tips/extensions that have greatly enhanced my work flow in creating reproducible analysis documents via R-Markdown. I also highlight some exciting new endeavors in the R community as well as provide my take on two key events that further

Read more »

Big RAM is eating big data – Size of datasets used for analytics

November 18, 2015
By
Big RAM is eating big data – Size of datasets used for analytics

With so much hype about “big data” and the industry pushing for “big data” analytical...

Read more »

The Data-Driven Weekly #1.2

November 17, 2015
By
The Data-Driven Weekly #1.2

Last week witnessed a number of exciting announcements from the big data and machine learning space. What it shows is …Continue reading →

Read more »

R and Impala: it’s better to KISS than using Java

October 29, 2015
By
R and Impala: it’s better to KISS than using Java

One of the best things I like in working at CARD.com is that I am...

Read more »

Log files exploration with Oracle Big Data Discovery 1.1

Log files exploration with Oracle Big Data Discovery 1.1

In a previous post, we described how we performed exploratory data analysis (EDA) in real-world log files, as provided by Skroutz.gr, the leading online company in Greece for online price comparison, in the context of Athens Datathon 2015. In the present post we will have a look at the same job as performed with Oracle Big Data Discovery (v....

Read more »

SparkR quick start that works

October 4, 2015
By
SparkR quick start that works

If you’re following along the SparkR Quick Start, you’ll notice that the instructions are not consistent with a more recent …Continue reading →

Read more »

Are you headed to Strata? It’s next week!

September 23, 2015
By
Are you headed to Strata? It’s next week!

RStudio will again teach the new essentials for doing (big) data science in R at this year’s Strata NYC conference, September 29 2015 (http://strataconf.com/big-data-conference-ny-2015/public/schedule/detail/44154).  You will learn from Garrett Grolemund, Yihui Xie, and Nathan Stephens who are all working on fascinating new ways to keep the R ecosystem apace of the challenges facing those who work with data. Topics include: R Quickstart: Wrangle,

Read more »

Notes from the Kölner R meeting, 18 September 2015

September 22, 2015
By
Notes from the Kölner R meeting, 18 September 2015

Last Friday the Cologne R user group came together for the 15th time. Since its inception over three years ago the group evolved from a small gathering in a pub into an active data science community, covering wider topics than just R. Still, R is the link and clue between the different interests. Last Friday's agenda was a...

Read more »

Sponsors

Mango solutions



plotly webpage

dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)