September 2017

News Roundup from Microsoft Ignite

September 25, 2017 | David Smith

It's been a big day for the team here at Microsoft, with a flurry of announcements from the Ignite conference in Orlando. We'll provide more in-depth details in the coming days and weeks, but for now here's a brief roundup of the news related to data science: Microsoft ML Server 9.2 ... [Read more...]

Custom Level Coding in vtreat

September 25, 2017 | Nina Zumel

One of the services that the R package vtreat provides is level coding (what we sometimes call impact coding): converting the levels of a categorical variable to a meaningful and concise single numeric variable, rather than coding them as indicator variables (AKA "one-hot encoding"). Level coding can be computationally and ... [Read more...]

The Battle of Bayesian Home Run Models

September 25, 2017 | jameshunterbr

The regular Major League Baseball season is coming to an end. Next week, we move into the playoffs and eventually the World Series. However, we have a nice statistical modeling question playing out in this last week. Giancarlo Stanton of … Continue reading → [Read more...]

Speeding Up Digital Arachnids

September 25, 2017 | hrbrmstr

spiderbar, spiderbar Reads robots rules from afar. Crawls the web, any size; Fetches with respect, never lies. Look Out! Here comes the spiderbar. Is it fast? Listen bud, It's got C++ under the hood. Can you scrape, from a site? Test with can_fetch(), TRUE == alright Hey, there There goes ...
[Read more...]

Survival Analysis with R

September 24, 2017 | R Views

With roots dating back to at least 1662 when John Graunt, a London merchant, published an extensive set of inferences based on mortality records, survival analysis is one of the oldest subfields of Statistics [1]. Basic life-table methods, including techniques for dealing with censored data, were discovered before 1700 [2], and in the early ...
[Read more...]

Super excited for R promises

September 24, 2017 | Appsilon Data Science Blog

We at Appsilon are excited about RStudio introducing promises in R quite soon which is going to be a huge step forward in programming in R (we have already used futures and similar libraries to run code asynchronously, however this is going to be a sta... [Read more...]

eXtremely Boost your machine learning Exercises (Part-1)

September 24, 2017 | Jakub Kwiecien

eXtreme Gradient Boosting is a machine learning model which became really popular few years ago after winning several Kaggle competitions. It is very powerful algorithm that use an ensemble of weak learners to obtain a strong learner. Its R implementation is available in xgboost package and it is really worth ... [Read more...]

RcppGSL 0.3.3

September 24, 2017 | Thinking inside the box

A maintenance update RcppGSL 0.3.3 is now on CRAN. It switched the vignette to the our new pinp package and its two-column pdf default. The RcppGSL package provides an interface from R to the GNU GSL using the Rcpp package. No user-facing new code or... [Read more...]

Automatic Crack Detection – with Deep Learning

September 23, 2017 | recurrentnull

On Friday at DOAG Big Data Days, I presented one possible application of deep learning: using deep learning for automatic crack detection – with some background theory, a Keras model trained from scratch, and the use of VGG16 pretrained on Imagenet. The amount of input data really was minimal, and the ...
[Read more...]

Building Your Own Brownlow Model

September 23, 2017 | Analysis of AFL

As some of you may be aware, this is the best time of year. Not only is it finals time but its also Brownlow week. During my honours year my thesis was on trying to predict who would win that years Brownlow medal. I have been running the model ever ...
[Read more...]

Postgresql + R Sandbox

September 23, 2017 | --Jean Arreola--

ElephantSQL ElephantSQL offers a free instance of Postgresql, with a limit of 20 MB and 5 concurrent connections. For example, you can upload a shiny application that depends on data from ElephantSQL. You only need to register to the site and automat... [Read more...]

RcppCNPy 0.2.7

September 23, 2017 | Thinking inside the box

A new version of the RcppCNPy package arrived on CRAN yesterday. RcppCNPy provides R with read and write access to NumPy files thanks to the cnpy library by Carl Rogers. This version updates internals for function registration, but otherwise mostly s... [Read more...]

RcppClassic 0.9.8

September 23, 2017 | Thinking inside the box

A bug-fix release RcppClassic 0.9.8 for the very recent 0.9.7 release which fixes a build issue on macOS introduced in 0.9.7. No other changes. Courtesy of CRANberries, there are changes relative to the previous release. Questions, comments etc shoul... [Read more...]

Upcoming data preparation and modeling article series

September 23, 2017 | John Mount

I am pleased to announce that vtreat version 0.6.0 is now available to R users on CRAN. vtreat is an excellent way to prepare data for machine learning, statistical inference, and predictive analytic projects. If you are an R user we strongly suggest you incorporate vtreat into your projects. vtreat handles, ...
[Read more...]

Thoughts on nest()

September 23, 2017 | Jocelyn Ireson-Paine

I’ve been experimenting with the Tidyverse’s nest function, because it may be useful when, for example, using households together with benefit units. Below are some thoughts that I first posted as a comment to Hadley Wickham’s blog entry “tidyr 0.4.0”. More on this in future posts. First, this ... [Read more...]
1 2 3 4 5 6 15

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)