April R Course Finder update: Logistics Regression, New platforms and Complete Machine Learning

April 9, 2017
By
April R Course Finder update: Logistics Regression, New platforms and Complete Machine Learning

Last year we launched R Course Finder, an online directory that helps you to find the right R course quickly. With so many R courses available online, we thought it was a good idea to offer a tool that helps people to compare these courses, before they decide where to spend their valuable time and Related exercise sets:

Read more »

Shiny server series part 1: setting up

April 9, 2017
By
Shiny server series part 1: setting up

This guide is part of a series on setting up your own private server running shiny apps. There are many guides out there with great advice on how to set up an R shiny server and related software. I try to make a comprehensive guide based in part on the...

Read more »

Web Scraping and Applied Clustering Global Happiness and Social Progress Index

April 9, 2017
By
Web Scraping and Applied Clustering Global Happiness and Social Progress Index

Increasing amount of data is available on the web. Web scraping is a technique developed to extract data from web pages automatically and transforming it into a data format for further data analysis and insights. Applied clustering is an unsupervised learning technique that refers to a family of pattern discovery and data mining tools with Related Post

Read more »

Data on tour: Plotting 3D maps and location tracks

April 8, 2017
By
Data on tour: Plotting 3D maps and location tracks

Recently, I was on Gran Canaria for a vacation. So, what better way to keep up the holiday spirit a while longer than to visualize all the places we went in R!? I am combining location data collected by our car GPS, Google location data from my ...

Read more »

A Python-Like walk() Function for R

April 8, 2017
By
A Python-Like walk() Function for R

A really nice function available in Python is walk(), which recursively descends a directory tree, calling a user-supplied function in each directory within the tree. It might be used, say, to count the number of files, or maybe to remove all small files and so on. I had students in my undergraduate class write such … Continue...

Read more »

#4: Simpler shoulders()

April 8, 2017
By
#4: Simpler shoulders()

Welcome to the fourth post in the repulsively random R ramblings series, or R4 for short. My twitter feed was buzzing about a nice (and as yet unpublished, ie not-on-CRAN) package https://github.com/dirkschumacher/thankr by Dirk Schumacher which compiles a a list of packages (ordered by maintainer count) for your current session (or installation or ...) with a...

Read more »

Correlation and Correlogram Exercises

April 8, 2017
By
Correlation and Correlogram Exercises

Correlation analysis is one of the most popular techniques for data exploration. This set of exercises is intended to help you to extend, speed up, and validate your correlation analysis. It allows to practice in: – calculating linear and nonlinear correlation coefficients, – testing those coefficients for statistical significance, – creating correlation matrices to study Related exercise sets:

Read more »

Exploring propensity score matching and weighting

April 8, 2017
By
Exploring propensity score matching and weighting

This post jots down some playing around with the pros, cons and limits of propensity score matching or weighting for causal social science research. Intro to propensity score matching One is often faced with an analytical question about causality and effect sizes when the only data around is from a quasi-experiment, not the random controlled trial one would hope for....

Read more »

R code to accompany Real-World Machine Learning (Chapter 5): Event Modeling

April 8, 2017
By
R code to accompany Real-World Machine Learning (Chapter 5): Event Modeling

Abstract The rwml-R Github repo is updated with R code for the event modeling examples from Chapter 5 of the book “Real-World Machine Learning” by Henrik Brink, Joseph W. Richards, and Mark Fetherolf. Examples given include reading large data files with the fread function from data.table, optimization of model parameters with caret, computing and plotting ROC curves...

Read more »

ReinforcementLearning: A package for replicating human behavior in R

April 8, 2017
By
ReinforcementLearning: A package for replicating human behavior in R

Nicolas Proellochs and Stefan Feuerriegel 2017-04-06 Introduction Reinforcement learning has recently gained a great deal of traction in studies that call for human-like learning. In settings where an explicit teacher is not available, this method teaches an agent via interaction with its environment without any supervision other than its own decision-making policy. In many cases, … Continue...

Read more »

Mapping waxwings annual migration without Twitter

April 7, 2017
By
Mapping waxwings annual migration without Twitter

Recently a reader left a comment on this blog mentioning his cool blog post in which he mapped the spread of a migratory bird using Twitter. His data source was the Waxwings UK account which reports sightings of Bohemian waxwings in the UK. I decided t...

Read more »

Data Visualization – Part 3

April 7, 2017
By
Data Visualization – Part 3

What Type of Data Visualization Do You Choose (if any)? Determining whether or not you need a visualization is step one. While it seems silly, this is probably something everyone (including myself) should be doing more often. A lot of times, it seems like a great way to showcase the amount of work you have

Read more »

slickR – the last carousel you’ll ever need

April 7, 2017
By
slickR – the last carousel you’ll ever need

We are happy to bring the slick JavaScript library to R. It is self-described as “the last carousel you’ll ever need”. This carousel is based on putting the elements of the carousel in a div HTML tag. This makes the carousel very versatile in what can be placed inside. Regular objects that are placed in … Continue...

Read more »

R Weekly Bulletin Vol – III

April 7, 2017
By
R Weekly Bulletin Vol – III

This week’s R bulletin will cover topics like how to read select columns, the difference between boolean operators, converting a factor to a numeric and changing memory available to R. We will also cover functions like data, format, tolower, toupper, and strsplit function. Hope you like this R weekly bulletin. Enjoy reading! Shortcut Keys 1. To change the working directory... The post R...

Read more »

The faces of R, analyzed with R

April 7, 2017
By
The faces of R, analyzed with R

Maëlle Salmon recently created a collage of profile pictures of people who use the #rstats hashtag in their Twitter bio to indicate their use of R. (I've included a detail below; click to see the complete version at Maëlle's blog.) Naturally, Maëlle created the collage using R itself. Matching Twitter bios were found using the search_users function in the...

Read more »

Bio7 2.5 for Windows and Linux Released

April 7, 2017
By
Bio7 2.5 for Windows and Linux Released

07.04.2017 A new release of Bio7 is available for Windows (64-bit) and Linux (64-bit). The MacOSX version will be released soon, too. This release comes with a plethora of new functions for R. General Bio7 is now based on Eclipse 4.6.3 Redesigned all Bio7 icons and created new icons, too Nearly all icons are now

Read more »

R⁶ — RStudio Server Client? Make An App For That!

April 7, 2017
By

RStudio is a great way to work through analyses tasks, and I suspect most folks use the “desktop” version of the product on their local workstations. The fine folks at RStudio also make a server version (the codebase for RStudio is able to generate server or desktop and they are generally in 100% feature parity... Continue reading...

Read more »

Data Structures Exercises (Part-1)

April 7, 2017
By
Data Structures Exercises (Part-1)

R Programming has various Data Structures for efficient manipulation of Data. Following are the list of data structures supported by R. 1. Vectors 2. Lists 3. Matrix 4. Data frame This exercise helps through various operations of R Data structures. Answers to the exercises are available here. Exercise 1 Create an atomic vector of Character Related exercise sets:

Read more »

R benchmark for High-Performance Analytics and Computing (II): GPU Packages

April 7, 2017
By
R benchmark for High-Performance Analytics and Computing (II): GPU Packages

Share This: 1. Overview In the previous post (here), we have analyzed the performance gain of R in the heterogeneous system by accelerators, including NVIDIA GPU and Intel Xeon Phi. Furthermore, GPU accelerated packages can greatly improve the performance of R. Figure 1 shows the download statistics of CRAN over the years. Obviously, GPU is

Read more »

packcircles version 0.2.0 released

April 7, 2017
By
packcircles version 0.2.0 released

Version 0.2.0 of the packcircles package has just been published on CRAN. This package provides functions to find non-overlapping arrangements of circles in bounded and unbounded areas. The package how has a new circleProgressiveLayout function. It uses an efficient deterministic algorithm to arrange circles by consecutively placing each one externally tangent to two previously placed circles while avoiding overlaps....

Read more »

#3: Follow R-devel

April 6, 2017
By
#3: Follow R-devel

Welcome to the third post in the rarely relevant R recommendation series, or R4 for short. Today will be brief, but of some importance. In order to know where R is going next, few places provide a better vantage point than the actual ongoing developm...

Read more »

Solving Data Science’s First Mile Problem

April 6, 2017
By
Solving Data Science’s First Mile Problem

At data.world, we are out to solve the “first mile problem of data science”: helping people obtain and understand the data sets they need. Everybody starts here whether they are analyzing their fantasy football league or working on the Zika pa...

Read more »

Solving Data Science’s First Mile Problem

April 6, 2017
By
Solving Data Science’s First Mile Problem

At data.world, we are out to solve the “first mile problem of data science”: helping people obtain and understand the data sets they need. Everybody starts here whether they are analyzing their fantasy football league or working on the Zika pa...

Read more »

Microsoft R Open 3.3.3 now available

April 6, 2017
By

Microsoft R Open (MRO), Microsoft's enhanced distribution of open source R, has been upgraded to version 3.3.3, and is now available for download for Windows, Mac, and Linux. This update upgrades the R language engine to R 3.3.3, upgrades the installer, and updates the bundled packages. R 3.3.3 makes just a few minor fixes compared to R 3.3.2 (see...

Read more »

Weighted Linear Support Vector Machine

April 6, 2017
By
Weighted Linear Support Vector Machine

Consider the spam vs ham data from this site. Let us do some basic analysis on the data with R version 3.3.3, 64 bits on qn windows machine setwd("your directory") sms_data<-read.csv("sms_spam.csv",stringsAsFactors = FALSE) Next, check the proportion of spam and ham in your data set prop.table(table(sms_data$type)) ham spam 0.8659849 0.1340151 As you can see, the Related Post

Read more »

Data Science for Operational Excellence (Part-1)

April 6, 2017
By
Data Science for Operational Excellence (Part-1)

 R has many powerful libraries to handle operations research. This exercise tries to demonstrate a few basic functionality of R while dealing with linear programming. Linear programming is a technique for the optimization of a linear objective function, subject to linear equality and linear inequality constraints. The lpsolve package in R provides a set Related exercise sets:

Read more »

Fitting a rational function in R using ordinary least-squares regression

April 6, 2017
By
Fitting a rational function in R using ordinary least-squares regression

by Srini Kumar, VP of Product Management and Data Science, LevaData; and Bob Horton, Senior Data Scientist, Microsoft A rational function is defined as the ratio of two functions. The (https://en.wikipedia.org/wiki/Pad%C3%A9_approximant) uses a ratio of polynomials to approximate functions: $$ R(x)= \frac{\sum_{j=0}^m a_j x^j}{1+\sum_{k=1}^n b_k x^k}=\frac{a_0+a_1x+a_2x^2+\cdots+a_mx^m}{1+b_1 x+b_2x^2+\cdots+b_nx^n} $$ Here we show a way to fit this type of...

Read more »

Statlearn17, Lyon

April 6, 2017
By
Statlearn17, Lyon

Today and tomorrow, I am attending the Statlearn17 conference in Lyon, France. Which is a workshop with one-hour talks on statistics and machine learning. And which makes for the second workshop on machine learning in two weeks! Yesterday there were two tutorials in R, but I only took the train to Lyon this morning: it

Read more »

Plotly charts in nteract notebooks using R

April 5, 2017
By
Plotly charts in nteract notebooks using R

The Plotly R package can now be used within nteract notebooks. Below are some examples. Visit below links for installation related information Visit Carson’s Plotly for R book for more details on plotly and its capabilities Visit nteract releases to download nteract Visit IRkernel to see details on how to install an R kernel for

Read more »

Sponsors

Mango solutions









Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



ODSC1

ODSC2

datasociety

http://www.eoda.de





CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.