568 search results for "register"

How to extract Google Analytics data in R using RGoogleAnalytics

November 5, 2014
By
How to extract Google Analytics data in R using RGoogleAnalytics

I am extremely thrilled to announce that RGoogleAnalytics was released recently by CRAN. R is already a swiss army knife for data analysis largely due its 6000 libraries. What this means is that digital analysts can now fully use the analytical capabilities of R to fully explore their Google Analytics Data. In this post, we will... Read More The post...

Read more »

A Walk-Forward Attempt on FAA

October 29, 2014
By
A Walk-Forward Attempt on FAA

So in the first post about FAA, I was requested to make a walk-forward test of FAA. While the results … Continue reading →

Read more »

Call for participation: AusDM 2014, Brisbane, 27-28 November

October 25, 2014
By
Call for participation: AusDM 2014, Brisbane, 27-28 November

********************************************************* 12th Australasian Data Mining Conference (AusDM 2014) Brisbane, Australia 27-28 November 2014 http://ausdm14.ausdm.org/ ********************************************************* The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. Since AusDM’02 the conference … Continue reading →

Read more »

Happening just now… 6th Conference of the R Spanish User Community

October 23, 2014
By
Happening just now… 6th Conference of the R Spanish User Community

The R-Spain Conferences have been taking place since 2009 as an expression of the growing interest that R elicits in many fileds. The organisers are the Comunidad R Hispano (R-es). The community supports many groups and initiatives aimed to develop … Sigue leyendo →

Read more »

A first look at Distributed R

October 23, 2014
By
A first look at Distributed R

by Joseph Rickert One of the most interesting R related presentations at last week’s Strata Hadoop World Conference in New York City was the session on Distributed R by Sunil Venkayala and Indrajit Roy, both of HP Labs. In short, Distributed R is an open source project with the end goal of running R code in parallel on data...

Read more »

Making an R Package to use the HERE geocode API

October 23, 2014
By

HERE is a product by Nokia, formerly called Nokia maps and before that, Ovi maps. It's the result of the acquisition of NAVTEQ in 2007 combined with Plazes and Metacarta, among others. It has a geocoding API, mapping tiles, routing services, and other things. I'm focused on the geocoding service. Under the “Base” license,...

Read more »

Analyze Instagram with R

October 13, 2014
By
Analyze Instagram with R

This tutorial will show you how you create an Instagram app, create an authentication process with R and get data via the Instagram API. There is no R package for this yet so we... The post Analyze Instagram with R appeared first on ThinkToStart.

Read more »

Building a DGA Classifier: Part 3, Model Selection

October 6, 2014
By
Building a DGA Classifier: Part 3, Model Selection

This is part two of a three-part blog series on building a DGA classifier and it is split into the three phases of building a classifier: 1) Data preparation 2) Feature engineering and 3) Model selection (this post) Back in part 1, we prepared the data and we are starting with a nice clean list of domains labeled as either legitimate (“legit”) or generated by an algorithm (“dga”)....

Read more »

By-Group Aggregation in Parallel

October 4, 2014
By
By-Group Aggregation in Parallel

Similar to the row search, by-group aggregation is another perfect use case to demonstrate the power of split-and-conquer with parallelism. In the example below, it is shown that the homebrew by-group aggregation with foreach pakage, albeit inefficiently coded, is still a lot faster than the summarize() function in Hmisc package.

Read more »

Building a DGA Classifier: Part 1, Data Preparation

September 30, 2014
By

This will be a three-part blog series on building a DGA classifier and will be split into three logical phases of building a classifier: 1) Data preparation (this) 2) Feature engineering and 3) Model selection. And before I get too far into this, I want to give a huge thank you to Click Security for releasing a DGA classifier in python as part of...

Read more »