July 2017

Text Mining of Stack Overflow Questions

July 5, 2017 | Rstats on Julia Silge

Note: Cross-posted with the Stack Overflow blog. This week, my fellow Stack Overflow data scientist David Robinson and I are happy to announce the publication of our book Text Mining with R with O’Reilly. We are so excited to see this project out in the world, and so relieved ...
[Read more...]

Control Systems Toolbox in R – a GSoC 2017 Project

July 5, 2017 | R Views

Introduction Control theory is an interdisciplinary branch of mathematics and engineering that has the objective of controlling physical systems. A control system is a device or a collection of devices that manage, command, direct or regulate the behavior of other devices or systems. Control systems engineering is a major application ...
[Read more...]

Announcing the rOpenSci Fellowships Program

July 5, 2017 | Karthik Ram

rOpenSci's mission is to promote a culture of open, transparent, and reproducible research across various research domains. Everything we do, from developing high-quality open-source software for data science and software review to building community through events like our community calls and annual unconference, is geared toward lowering barriers to ... [Read more...]

The Role of Small Data and Vacation Recap Example

July 5, 2017 | Gregory Kanevsky

Wikipedia defines small data as 'small' enough for human comprehension, but then it goes further by qualifying it as data in a volume and format that makes it accessible, informative and actionable. I am not certain the latter is always true: a smaller footprint doesn't automatically qualify data as informative and actionable without more ...
[Read more...]

Data wrangling : Transforming (1/3)

July 5, 2017 | Vasileios Tsakalos

Data wrangling is a task of great importance in data analysis. Data wrangling is the process of importing, cleaning and transforming raw data into actionable information for analysis. It is a time-consuming process which is estimated to take about 60-80% of an analyst’s time. In this series we will go ... [Read more...]

Finding my Dropbox in R

July 5, 2017 | Abhijit

I’ll often keep non-sensitive data on Dropbox so that I can access it on all my machines without gumming up git. I just wrote a small script to find the Dropbox location on each of my computers automatically. The crucial information is available here, from Dropbox. My small snippet ... [Read more...]

Teach the tidyverse to beginners

July 5, 2017 | David Robinson

A few years ago, I wrote a post Don’t teach built-in plotting to beginners (teach ggplot2). I argued that ggplot2 was not an advanced approach meant for experts, but rather a suitable introduction to data visualization. Many teachers suggest I’m overestimating their students: “No, see, my students are ... [Read more...]

How perceptions of R have changed

July 5, 2017 | David Smith

In the sponsor presentation for Microsoft at the useR!2017 conference in Brussels this morning, I thought I'd share how perceptions of R have changed over the years. Today, R is known as a popular, comprehensive, accepted, scalable, production-ready and supported software environment for data analysis, but that wasn't always the case. ... [Read more...]

Finding my Dropbox in R

July 4, 2017 | R on Abhijit Dasgupta

I’ll often keep non-sensitive data on Dropbox so that I can access it on all my machines without gumming up git. I just wrote a small script to find the Dropbox location on each of my computers automatically. The crucial information is available here, from Dropbox. My small snippet ... [Read more...]

Launching webrockets at runconf17

July 4, 2017 | Alicia Schep, Miles McBain

We, Alicia Schep and Miles McBain, drove the webrockets project at #runconf17. To make progress we solicited code, advice, and entertaining anecdotes from a host of other attendees, whom we humbly thank for helping to make our project possible. This post is divided into two sections: First up we'll relate ...
[Read more...]

Machine Learning Explained: Regularization

July 4, 2017 | EnhanceDataScience

Welcome to this new post of Machine Learning Explained. After dealing with overfitting, today we will study a way to correct overfitting with regularization. Regularization adds a penalty on the different parameters of the model to reduce the freedom of the model. Hence, the model will be less likely to ...
[Read more...]

Ordinary Least Squares (OLS) Linear Regression in R

July 4, 2017 | S. Richter-Walsh

Ordinary Least Squares (OLS) linear regression is a statistical technique used for the analysis and modelling of linear relationships between a response variable and one or more predictor variables. If the relationship between two variables appears to be linear, then a straight line can be fit to the data in ...
[Read more...]

iPlots exercises

July 4, 2017 | Euthymios Kasvikis

INTRODUCTION iPlots is a package which provides interactive statistical graphics, written in Java. You can find many interesting plots such as histograms, barcharts, scatterplots, boxplots, fluctuation diagrams, parallel coordinates plots and spineplots. The amazing part is that all of these plots support querying, linked highlighting, color brushing, and interactive changing ... [Read more...]

More Companies using R

July 4, 2017 | David Smith

Here's a quick roundup of some case studies published recently on the Microsoft Customer Stories portal, with examples of companies running R in production environments using the Microsoft stack. eToro, the social trading network, uses Microsoft R Server and Azure Machine Learning so that investors can automatically duplicate market strategies ... [Read more...]