One thing critical for success in the age of AI (and who has it)

February 14, 2017
By
One thing critical for success in the age of AI (and who has it)

Data science, artificial intelligence, automation, and other advanced technologies are reshaping the world. As the world changes who's likely to succeed? The post One thing critical for success in the age of AI (and who has it) appeared first on SHARP SIGHT LABS.

Read more »

30 issues of Demographic Digest – the most frequent journals

February 13, 2017
By
30 issues of Demographic Digest – the most frequent journals

Today, the 30-th issue of my Demographic Digest was published.

Read more »

To Eat, or Not to Eat…WHERE is the question

February 13, 2017
By
To Eat, or Not to Eat…WHERE is the question

According to NYC Health, “Each year, thousands of New York City residents become sick from consuming foods or drinks that are contaminated with harmful bacteria, viruses The post To Eat, or Not to Eat...WHERE is the question appeared first on NYC Data Science Academy Blog.

Read more »

Labor Market Analysis with R: Will Obama Ever be Beat?

February 13, 2017
By
Labor Market Analysis with R: Will Obama Ever be Beat?

No matter how many jobs are promised, Barak Obama’s administration will be nearly impossible to beat when it comes to employment growth. The following code uses the blcrapeR package, which is available on CRAN. Politicians talk a lot about jobs and unemployment, even though the actual power they have over the labor market is up Read More

Read more »

See RStudio + sparklyr for big data at Strata + Hadoop World

February 13, 2017
By
See RStudio + sparklyr for big data at Strata + Hadoop World

If big data is your thing, you use R, and you’re headed to Strata + Hadoop World in San Jose March 13 & 14th, you can experience in person how easy and practical it is to analyze big data with R and Spark. In a beginner level talk by RStudio’s Edgar Ruiz and an intermediate

Read more »

MODIStsp v.1.3.1 released !

February 13, 2017
By
MODIStsp v.1.3.1 released !

 "MODIStsp" is a R package allowing automatic download and preprocessing of MODIS Land Products time series, available at this https://github.com/lbusett/MODIStsp github page (See also here for additional information) v1.3.1 adds functionality for processing of MODIS snow cover products, accelerated download, processing of specified portions of years, plus various bug fixing and improvements.

Read more »

A comparison of deep learning packages for R

February 13, 2017
By

Oksana Kutina and Stefan Feuerriegel fom University of Freiburg recently published an in-depth comparison of four R packages for deep learning. The packages reviewed were: MXNet: The R interface to the MXNet deep learning library. (The blog post refers to an older name for the package, MXNetR.) darch: An R package for deep architectures and restricted Boltzmann machines. deepnet:...

Read more »

Data Science for Doctors – Part 3 : Distributions

February 13, 2017
By
Data Science for Doctors – Part 3 : Distributions

Data science enhances people’s decision making. Doctors and researchers are making critical decisions every day. Therefore, it is absolutely necessary for those people to have some basic knowledge of data science. This series aims to help people that are around medical field to enhance their data science skills. This is the third part of the Related exercise sets:

Read more »

A Graphic Look at Bay Area Bike Share

February 13, 2017
By
A Graphic Look at Bay Area Bike Share

Introduction This post is about the first of the four projects we are supposed to deliver at the NYC Data Science Academy Data Science Bootcamp program. The The post A Graphic Look at Bay Area Bike Share appeared first on NYC Data Science Academy Blog.

Read more »

Production R at ONS

February 13, 2017
By
Production R at ONS

Mark Sellors Head of Data Engineering I’ve recently been working at the Office for National Statistics, under the very broad umbrella of a SAS to R transition project. As you might imagine, ONS, as the UK’s largest producer of official … Continue reading →

Read more »

Visualizing Streaming Data And Alert Notification with Shiny

February 13, 2017
By
Visualizing Streaming Data And Alert Notification with Shiny

Streaming Data is data that is generated continuously and it includes various sources such as sensors, log files, geospatial services, etc. The data may come at regular intervals and we may want to have a dashboard which updates by itself and incorporates the newly added data so that we can use it for deta driven Email and Text...

Read more »

RcppTOML 0.1.1

February 13, 2017
By

Following up on the somewhat important RcppTOML 0.1.0 releaseas which brought RcppTOML to Windows, we have a first minor update 0.1.1. Two things changed: once again updated upstream code from Chase Geigle's cpptoml which now supports Date types too, and we added the ability to parse TOML from strings as opposed to only from files.

Read more »

Using R to study tennis players

February 13, 2017
By
Using R to study tennis players

Looking at winning records of ATP top players and Brazilian tennists - In the previous post about tennis, we studied how changes in ball’s composition in hard and grass courts affected the game back in 2000. In this post,...

Read more »

Venture Capital Deals in 2016 – An Overview (2/2)

February 13, 2017
By
Venture Capital Deals in 2016 – An Overview (2/2)

  In the previous post (mostly based on EDA principles) I highlighted the main features of 1,720 Venture Capital deals that took place in 2016 in 50 different countries. It is of a central importance to once again underline that the dataset that I used is not a representative sample Read More ...

Read more »

How to Find Equidistant Coordinates Between Two Locations on Earth

February 13, 2017
By
How to Find Equidistant Coordinates Between Two Locations on Earth

Here’s to the ones who dream foolish, as they may seem (The Fools Who Dream, ‘La La Land’ OST) One of the key points of The Meeting Point Locator is to obtain an orthogonal great circle to the bearing defined by any two given locations on Earth. A great circle is the intersection of the … Continue...

Read more »

mlr Google Summer of Code 2017

February 12, 2017
By

We are happy to announce that we applied for a another Google Summer of Code project in 2017.

Read more »

mlr 2.10

February 12, 2017
By

mlr 2.10 is now on CRAN. Please update your package if you haven’t done so in a while. Here is an overview of the changes:

Read more »

a knapsack riddle?

February 12, 2017
By
a knapsack riddle?

The riddle of the week is a sort of multiarmed bandits optimisation. Of sorts. Or rather a generalised knapsack problem. The question is about optimising the allocation of 100 undistinguishable units to 10 distinct boxes against a similarly endowed adversary, when the loss function is and the distribution q of the

Read more »

Dataframes and the tidyverse

February 12, 2017
By
Dataframes and the tidyverse

The data frame is the primary structure for working with data in R. Whenever you have data that is arranged in a spreadsheet-like fashion, the default receptacle for that data in R is the data frame. In a data frame,

Read more »

Video Introduction to Bayesian Data Analysis, Part 1: What is Bayes?

February 12, 2017
By
Video Introduction to Bayesian Data Analysis, Part 1: What is Bayes?

This is video one of a three part introduction to Bayesian data analysis aimed at you who isn’t necessarily that well-versed in probability theory but that do know a little bit of programming. I gave a version of this tutorial at the UseR 2015 conf...

Read more »

Happy pbirthday class of 2016

February 12, 2017
By
Happy pbirthday class of 2016

Abstract Continuing the analysis of first names given to newborns in Berlin 2016, we solve the following problem: what is the probability, that in a school class of size \(n\) of these kids there will be at least two kids having the same first name? The answer to the problem for classes of size 26 is 34% and...

Read more »

Implementing the Gradient Descent Algorithm in R

February 12, 2017
By
Implementing the Gradient Descent Algorithm in R

A Brief Introduction Linear regression is a classic supervised statistical technique for predictive modelling which is based on the linear hypothesis: y = mx + c where y is the response or outcome variable, m is the gradient of the linear trend-line, x is the predictor variable and c is the intercept. The intercept is… Continue reading...

Read more »

Using R: tibbles and the t.test function

February 12, 2017
By
Using R: tibbles and the t.test function

A participant in the R course I’m teaching showed me a case where a tbl_df (the new flavour of data frame provided by the tibble package; standard in new RStudio versions) interacts badly with the t.test function. I had not seen this happen before. The reason is this: Interacting with legacy code A handful of

Read more »

ROPE and Equivalence Testing: Practically Equivalent?

February 12, 2017
By
ROPE and Equivalence Testing: Practically Equivalent?

In a previous post, I compared equivalence tests to Bayes factors, and pointed out several benefits of equivalence tests. But a much more logical comparison, and one I did not give enough attention to so far, is the ROPE procedure using Bayesian estimation. I’d like to thank John Kruschke for feedback on a draft of this blog post....

Read more »

Letting Travis keep a secret

February 12, 2017
By

More and more packages, be it for R or another language, are now interfacing different application programming interfaces (API) which are exposed to the web. And many of these may require an API key, or token, or account and password. Which traditionally poses a problem in automated tests such as those running on the popular Travis CI service...

Read more »

R in Open Data: Complaints in The Field of Freedom of Information data set from data.gov.rs

February 12, 2017
By
R in Open Data: Complaints in The Field of Freedom of Information data set from data.gov.rs

The notebooks (R, Rmd, and HTML files are provided in my GitHub repository) focus on an exploratory analysis of the open data set on the complaints in the field of freedom of information, provided at the Open Data Portal of the Republic of Serbia that is currently under development. The data set was kindly provided to the...

Read more »

Text mining and word cloud fundamentals in R : 5 simple steps you should know

February 11, 2017
By
Text mining and word cloud fundamentals in R : 5 simple steps you should know

Text mining methods allow us to highlight the most frequently used keywords in a paragraph of texts. One can create a word cloud, also referred as text cloud or tag cloud, which is a visual representation of text data. The procedure of creating...

Read more »

Who were the notable dead of Wikipedia?

February 11, 2017
By
Who were the notable dead of Wikipedia?

As described in my last post, I extracted all notable deaths from Wikipedia over the 2004-2016 period. In this post I want to explore this study population. Who were the notable dead? How old were notable dead? Let me assume here most entries of th...

Read more »

Extracting notable deaths from Wikipedia

February 11, 2017
By

I like Wikipedia. My husband likes it even more, he included it in his PhD thesis acknowledgements! I appreciate the efforts done for sharing knowledge, and also the apparently random stuff you can find on the website. In particular, I’ve been intrig...

Read more »

Sponsors

Mango solutions







Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.