An Iterative Approach to Data Science

August 3, 2017

It is the nature of boot camp. We drink from the firehose because we only have 12 weeks to learn what university programs would spread out... The post An Iterative Approach to Data Science appeared first on NYC Data Science Academy Blog.

How we voted in South Carolina

August 3, 2017

Purpose: This post explores how Greenville, SC and surrounding areas voted in the 2016 election. It also demonstrates how to retrieve data from the Data.World site. To retrieve data from this site using the tools in this post, you have to create an account (easy to do if you have a Facebook, Twitter, or GitHub account). You...

Initiating development of a chatbot with plumber and ngrok

August 3, 2017

Chatbots have been all the rage for some time now, with firms from various sectors investing in bots that would reduce or remove the need to employ a call centre whilst maintaining similar, if not greater, levels of efficiency. A notable example i...

Passing user-supplied C++ functions with RcppXPtrUtils

August 3, 2017

Sitting on top of R’s external pointers, the Rcpp::XPtr class provides a powerful and generic framework for passing user-supplied C++ functions to a C++ backend. This technique is exploited in the RcppDE package, an efficient C++ based implementation of the DEoptim package that accepts optimisation objectives as both R and compiled functions (see demo("compiled", "RcppDE") for further details). This solution has a couple of issues though: Some repetitive scaffolding...

Impact of the conservation optimism hashtag

August 3, 2017

The hashtag #conservationoptimism became popular during the recent International Congress for Conservation Biology symposium. Michael Burgass asked me what its Twitter impact was, so here is a quick analysi...

Text categorization with deep learning, in R

August 3, 2017

Given a short review of a product, like "I couldn't put it down!", can you predict what the product is? In that case it's pretty easy — it's for a book — but this general problem of text categorization comes up in a lot of natural language analysis problems. In his talk at useR!2017 (shown below), Microsoft data scientist...

Numerical Differentiation with Finite Differences in R

August 3, 2017

Part 1 of 7 in the series Numerical Analysis. Numerical differentiation is a method of approximating the derivative of a function at a particular value. Often, particularly in physics and engineering, a function may be too complicated to merit the work necessary to find the exact derivative, or the function itself... The post Numerical Differentiation with Finite Differences in R appeared...

Parallel Computing Exercises: Snow and Rmpi (Part-3)

August 3, 2017

The foreach statement, which was introduced in the previous set of exercises of this series, can work with various parallel backends. This set lets you practice working with backends provided by the snow and Rmpi packages (on a single machine with multiple CPUs). The name of the former package stands for “Simple Network of... Related exercise sets: Parallel Computing...

Rborist version 0-1.8 available from CRAN

August 3, 2017

Version 0-1.8 of the Rborist implementation of the Random Forest (TM) algorithm is now available from CRAN. Although most changes involve refactoring to accommodate future updates, there are several bug fixes and enhancements worth mentioning. New option maxLeaf allows a limit to be set on the number of terminal nodes (i.e., leaves) in each trained tree. In order not to introduce behavior dependent upon...

Generating Quadratic Primes: Euler Problem 27

August 2, 2017

Solution to Euler Problem 27 using the R language. Find the product of the coefficients for the quadratic expression that produces the most primes. The post Generating Quadratic Primes: Euler Problem 27 appeared first on The Devil is in the Data.

Fun data: open data that is fun to analyse

August 2, 2017

Joe Russell, Adnan Fiaz. Jeremy Singer-Vine sends out a newsletter every week where he highlights a number of interesting open datasets (you can explore all the datasets here). At Mango we are all for open data, so we thought we would also share some of the open datasets we think are fun to explore: Open Food Facts, Food prices, North Korea Missile Tests, Flight...

RStudio Connect v1.5.4 – Now Supporting Plumber!

August 2, 2017

We’re thrilled to announce support for hosting Plumber APIs in RStudio Connect: version 1.5.4. Plumber is an R package that allows you to define web APIs by adding special annotations to your existing R code – allowing you to make your R functions accessible to other systems. Below you can see the auto-generated “swagger” interface for a web API written...

Applications in energy, retail and shipping

August 2, 2017

The Solutions section of the Cortana Intelligence Gallery provides more than two dozen working examples of applying machine learning, data science and artificial intelligence to real-world problems. Each solution provides sample data, scripts for model training and evaluation, and reporting of predictions. You can deploy a complete stack in Azure to implement the solution with the click of a...

What makes an R talk popular? Scraping useR2017 attendance information to find out!

August 2, 2017

First off, I’ll admit that was my poor attempt at a click-bait title. But if you’re still reading the next paragraph, that means it was successful! Table of contents: Background, Am I...

Data wrangling : Transforming (3/3)

August 2, 2017

Data wrangling is a task of great importance in data analysis. It is the process of importing, cleaning and transforming raw data into actionable information for analysis. It is a time-consuming process, estimated to take about 60-80% of an analyst’s time. In this series we will go through this process. It will be... Related exercise sets: Data table...

[R] Kenntnis-Tage 2017: Register now and benefit from the Summer Special

August 2, 2017

On November 8 and 9, Kassel will once more become the meeting point for the German-speaking R community. From the usage of R in the automotive industry to risk analysis, from data mining with caret to R Markdown: the Kenntnis-Tage 2017 once again promise an exciting program, always with a focus on … Kenntnis-Tage 2017:...

Twitter Coverage of the ISMB/ECCB Conference 2017

August 2, 2017

ISMB (Intelligent Systems for Molecular Biology – which sounds rather old-fashioned now, doesn’t it?) is the largest conference for bioinformatics and computational biology. It is held annually and, when in Europe, jointly with the European Conference on Computational Biology (ECCB). I’ve had the good fortune to attend twice: in Brisbane 2003 (very enjoyable early in …

elastic – Elasticsearch for R

August 2, 2017

elastic is an R client for Elasticsearch. elastic has been around since 2013, with the first commit in November 2013. Sidebar: 'elastic' was picked as the package name before the company now known as Elastic changed its name to Elastic. What is Elasticsearch? If you aren't familiar with Elasticsearch, it is a distributed, RESTful search and analytics engine. It's similar to Solr. It falls...

F-Test: Compare Two Variances in R

August 2, 2017

The F-test is used to assess whether the variances of two populations (A and B) are equal. Contents: When to use the F-test?; Research questions and statistical hypotheses; Formula of the F-test; Compute the F-test in R; R function; Import and check your data into R; Preliminary test to check F-test assumptions; Compute F-test; Interpretation of the result; Access to the values returned by var.test() function; Infos. When to...

Singular Value Decomposition (SVD): Tutorial Using Examples in R

August 1, 2017

If you have ever looked with any depth at statistical computing for multivariate analysis, there is a good chance you have come across the singular value decomposition (SVD). It is a workhorse for techniques that decompose data, such as correspondence analysis and principal...

Five kinds of weather you’ll meet in America

August 1, 2017

K-means clustering, a workhorse of data science and machine learning. The USA is a large country. How different are people’s experiences of the weather depending on where they live? To look into this question, we downloaded high temperature data for over 1,300 airport weather stations in the contiguous USA for every day... The post Five kinds...

A Postcard from JSM

August 1, 2017

Baltimore has the reputation of being a tough town: hot in the summer and gritty, but the convention center hosting the Joint Statistical Meetings is a pretty cool place to be. There are thousands of people here and so many sessions (over 600) that it’s just impossible to get an overview of all that’s going on. So, here are...

Transfer Learning with Keras in R

August 1, 2017

In my last posts (http://flovv.github.io/Logo_detection_deep_learning/ and here), I described how one can detect logos in images with R. The first results were promising and achieved a classification accuracy of ~50%. In this post I will detail h...

A modern database interface for R

August 1, 2017

At the useR! conference last month, Jim Hester gave a talk about two packages that provide a modern database interface for R. Those packages are the odbc package (developed by Jim and other members of the RStudio team), and the DBI package (developed by Kirill Müller with support from the R Consortium). To communicate with databases, a common protocol...

HebRew (using Hebrew in R)

August 1, 2017

Adi Sarid (Tel Aviv University and Sarid Research Institute LTD.), July 2017. Background: A while back I participated in an R workshop at the annual convention of the Israeli Association for Statistics. I had the pleasure of talking with Tal Galili and Jonathan Rosenblatt, who indicated that a lot of Israeli R users run into difficulties …

R⁶ — Reticulating Parquet Files

August 1, 2017

The reticulate package provides a very clean & concise interface bridge between R and Python which makes it handy to work with modules that have yet to be ported to R (going native is always better when you can do it). This post shows how to use reticulate to create parquet files directly from R...

Let’s Talk Drawdowns (And Affiliates)

August 1, 2017

This post is directed towards those newer to investing, with an explanation of drawdowns: in my opinion, a simple and …

Showing Some Respect for Data Munging

August 1, 2017

In this post, I'd like to focus on data munging, i.e. the process of acquiring and arranging data (typically in a tidy manner) prior to data analysis. It's common knowledge that data scientists spend an enormous amount of time munging data, but data analysis, modeling, and visualization get most of the attention at presentations, on blogs and in the...

Hacking statistics or: How I Learned to Stop Worrying About Calculus and Love Stats Exercises (Part-5)

August 1, 2017

Statistics is often taught in school by and for people who like mathematics. As a consequence, in those classes the emphasis is put on learning equations, solving calculus problems and creating mathematical models instead of building an intuition for probabilistic problems. But if you are reading this, you know a bit of R programming and have access... Related exercise sets: Nonparametric Tests...
