Beyond R: Integration of R into the analysis software KNIME and in Docker environments

August 26, 2016

Speaker of the Kenntnis-Tage 2016: Eugene Harschack | Decadis AG. Eugene Harschack works in the Business Intelligence Solutions department at Decadis, where he implements BI solutions for customers. Decadis is an independent IT consulting firm that supports customers in planning, implementing and operating IT infrastructures. The Business Intelligence Solutions department deals with classic BI …

Read more »

RcppArmadillo 0.7.400.2.0

August 25, 2016

Another Armadillo 7.* release -- now at 7.400. We skipped the 7.300.* series release as it came too soon after our most recent CRAN release. Releasing RcppArmadillo 0.7.400.2.0 now keeps us at the (roughly monthly) cadence which works as a good compr...

Read more »

Simple Guide to S3 Methods

August 25, 2016

So I’ve just submitted my first article to the R Journal! The paper, “A Simple Guide to S3 Methods”, is my attempt at a quick and simple guide to using S3 methods in R. Although not my first publication, it is my first solo publication, whi...

Read more »
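
For readers new to the topic, a minimal sketch of the S3 mechanics may help. It is not taken from the article; the class name `temperature` and the generic `to_fahrenheit` are hypothetical names chosen for illustration. It shows a constructor, a new generic with `UseMethod()`, a class-specific method, a default method, and methods for the existing generics `format()` and `print()`.

```r
# Constructor for a hypothetical S3 class "temperature" (illustrative name)
new_temperature <- function(celsius) {
  stopifnot(is.numeric(celsius))
  structure(list(celsius = celsius), class = "temperature")
}

# A new generic: UseMethod() dispatches on the class of the first argument
to_fahrenheit <- function(x, ...) UseMethod("to_fahrenheit")

# Method used for objects of class "temperature"
to_fahrenheit.temperature <- function(x, ...) {
  x$celsius * 9 / 5 + 32
}

# Fallback used when no class-specific method exists
to_fahrenheit.default <- function(x, ...) {
  stop("no to_fahrenheit() method for class ", class(x)[1])
}

# Methods for existing generics are defined the same way: generic.class
format.temperature <- function(x, ...) {
  paste0(x$celsius, " C")
}

print.temperature <- function(x, ...) {
  cat(format(x), "\n")
  invisible(x)
}

t1 <- new_temperature(100)
print(t1)           # dispatches to print.temperature
to_fahrenheit(t1)   # dispatches to to_fahrenheit.temperature
```

Calling `to_fahrenheit("a")` would fall through to the default method and raise an error, which is often a sensible choice when a generic has no meaningful behaviour for unknown classes.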

Presidential Election Predictions 2016 (an ASA competition)

August 25, 2016

Guest post by Jo Hardin. ASA’s Prediction Competition: In this election year, the American Statistical Association (ASA) has put together a competition for students to predict the exact percentages for the winner of the 2016 presidential election. They are offering cash prizes for the entry that gets closest to the national vote percentage and that best …

Read more »

Tickets now available for EARL2016 Boston

August 25, 2016

We are pleased to announce that registration has now opened for this year’s Boston EARL Conference, which will be held at the Boston Science Museum on the 7th-9th November. The call for abstracts is still open, so it’s not too late …

Read more »

R with Power BI: Import, Transform, Visualize and Share

August 25, 2016

Power BI, Microsoft's data visualization and reporting platform, has made great strides in the past year integrating the R language. This Computerworld article describes the recent advances with Power BI and R. In short, you can: import data into Power BI by using an R script; cleanse and transform other data sources coming into Power BI using R functions...

Read more »

Multidimensional clustering with web analytics data

August 25, 2016

Speaker of the Kenntnis-Tage 2016: Alexander Kruse | etracker GmbH. Alexander Kruse works as a data analyst at etracker, a leading provider of products and services for optimizing websites and online marketing activities in Europe. More than 110,000 customers now use etracker solutions, among them companies such as Jochen Schweizer, Vorwerk, the …

Read more »

Working efficiently with R – faster to the data product

August 25, 2016

Speaker of the Kenntnis-Tage 2016: Julian Gimbel | Lufthansa Industry Solutions. Julian Gimbel works as a data scientist at Lufthansa Industry Solutions. The company supports customers in digitalizing and automating their processes in order to increase their turnover and efficiency while simultaneously reducing their costs. In his guest lecture working efficiently with R – …

Read more »

R Markdown: How to format tables and figures in .docx files

August 25, 2016

In research, we usually publish the most important findings in tables and figures. When writing research papers using R Markdown (*.Rmd), we have several options to format the output of the final MS Word document (.docx). Tables can be formatted using either the knitr package’s kable() function or several functions of the pander package. Figure sizes …

Read more »

Extending sparklyr to Compute Cost for K-means on YARN Cluster with Spark ML Library

August 24, 2016

Machine and statistical learning wizards are becoming more eager to perform analysis with the Spark ML library whenever possible. It’s trendy, posh, spicy and gives the feeling of doing state-of-the-art machine learning and being up to date with ...

Read more »

Python style logging in R

August 24, 2016

This entry is part 20 of 20 in the series Using R. We are increasingly using R in “operational” settings that require robust error handling and logging. In this post we describe a quick-and-dirty way to get Python-style multi-level log files …

Read more »

Using MANOVA to Analyse a Banking Crisis Exercises

August 24, 2016

In this set of exercises we will practice multivariate analysis of variance (MANOVA). We shall try to find out whether there is a difference in the combination of export and bank reserves, depending on the status of the banking sector (whether there is a crisis or not). The data set is fictitious and serves educational purposes.

Read more »
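
As a rough illustration of the kind of analysis these exercises cover, the sketch below runs a one-way MANOVA with base R's `manova()`. The data are simulated here purely for illustration and are not the exercise data set; the variable names `export`, `reserves` and `crisis` and all numeric values are assumptions.

```r
# Simulated (made-up) data: two outcomes and a two-level crisis indicator
set.seed(1)
n <- 40
crisis <- factor(rep(c("no", "yes"), each = n / 2))
export <- rnorm(n, mean = ifelse(crisis == "yes", 8, 10), sd = 1)
reserves <- rnorm(n, mean = ifelse(crisis == "yes", 4, 5), sd = 1)

# MANOVA: do the outcomes, taken jointly, differ between crisis states?
fit <- manova(cbind(export, reserves) ~ crisis)

# Multivariate test (Pillai's trace is the default test statistic)
summary(fit, test = "Pillai")

# Follow-up univariate ANOVAs, one per outcome variable
summary.aov(fit)
```

The multivariate test answers the joint question first; the univariate follow-ups then indicate which outcome contributes to any difference.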

stringr 1.1.0

August 24, 2016

I’m pleased to announce version 1.1.0 of stringr. stringr makes string manipulation easier by using consistent function and argument names, and eliminating options that you don’t need 95% of the time. To get started with stringr, check out the strings chapter in R for data science. Install it with: install.packages("stringr") This release is mostly bug fixes, but there

Read more »

How to evaluate R models in Azure Machine Learning Studio

August 24, 2016

Azure Machine Learning Studio is a GUI-based integrated development environment for constructing and operationalizing machine learning workflows. The basic computational unit of an Azure ML Studio workflow (or Experiment) is a module which implements machine learning algorithms, data conversion and transformation functions etc. Modules can be connected by data flows, thus implementing a machine learning pipeline. A typical pipeline...

Read more »

Edward Tufte Keynote Presenter at Data Science Summit, Sep 26-27

August 23, 2016

I'm excited to share that one of my data science heroes will be a presenter at the Microsoft Data Science Summit in Atlanta, September 26-27. Edward Tufte, the data visualization pioneer, will deliver a keynote address on the future of data analysis and how to make more credible conclusions based on data. If you're not familiar with Tufte,...

Read more »

useR and JSM 2016 conferences: a story in tweets

August 23, 2016

I was amused by a Guardian article last month that declared “I’m a serious academic, not a professional Instagrammer,” arguing that social media is a distraction for scientific research. This attitude was, to say the least, not popular on academic Twitter, which responded with the #seriousacademic hashtag. When someone tries to claim that a...

Read more »

Rborist version 0.1-3 is now available from CRAN

August 23, 2016

Version 0.1-3 of the Rborist Random Forest package can now be downloaded from CRAN. This version follows closely on the short-lived 0.1-2, which failed to install on Solaris. The new version features incremental performance improvements, as well as...

Read more »

10 Reasons why you can’t miss this years EARL Conference in London

August 23, 2016

On September 13th-15th Mango Solutions are running the EARL (Effective Applications of the R Language) Conference for all users, enthusiasts and beginners of the R programming language. It is an event not to be missed and here …

Read more »

GSoC 2016 Report – Rperform

August 23, 2016

Developer: Akash Tandon. Mentors: Joshua Ulrich, Toby Dylan Hocking. Official Project Link: Rperform: performance analysis of R package code. This project was meant to deal primarily with development of Rperform’s functionalities to allow developers to obtain potential performance impacts of a pull request (PR) without having to merge, extension of the package’s existing performance metric measurement and visualization functions, and...

Read more »

Visualising Residuals

August 23, 2016

Residuals. Now there’s something to get you out of bed in the morning! OK, maybe residuals aren’t the sexiest topic in the world. Still, they’re an essential element and means for identifying potential problems of any statistical model. For example, the residuals from a linear regression model should be homoscedastic. If not, this indicates an issue with the model...

Read more »
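
A minimal base-R sketch of the idea, not the approach from the post itself: fit a linear regression on the built-in `mtcars` data (the choice of `mpg ~ wt` is an assumption made for illustration) and plot residuals against fitted values. A roughly even band around zero is consistent with homoscedasticity; a funnel or curve suggests a problem with the model.

```r
# Fit a simple linear regression on a built-in data set
fit <- lm(mpg ~ wt, data = mtcars)

# Residuals: observed minus fitted values
res <- resid(fit)

# Residuals vs fitted values, with a reference line at zero
plot(fitted(fit), res,
     xlab = "Fitted values", ylab = "Residuals",
     main = "Residuals vs fitted")
abline(h = 0, lty = 2)
```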

New package tokenizers joins rOpenSci

August 23, 2016

The R package ecosystem for natural language processing has been flourishing in recent days. R packages for text analysis have usually been based on the classes provided by the NLP or tm packages. Many of them depend on Java. But recently there have been a number of new packages for text analysis in R, most notably

Read more »

Trends in daily R package downloads

August 23, 2016

This post was prompted by this blog about using the cranlogs package by Gabor Csardi. But my own interest as a long-time package developer dates back to this post by Ben Bolker. I like to see that my packages are being used. So I thought: why stop at counting downloads and plotting the past? Why not predict into the future?

Read more »

How to win a drone in 20 lines of R code

August 22, 2016

Or a less clickbaity title: Model based optimization of machine learning models with mlr and mlrMBO. I recently participated in the #TEFDataChallenge, a datathon organized by Wayra. The first prize was a drone for every team member, which is a pretty ...

Read more »

Integrate Free R Tutorials With Blackboard, Moodle, Canvas & More!

August 22, 2016

Are you teaching a course this semester that makes use of R? Now you can integrate DataCamp’s free interactive R courses and tutorials with all major learning management systems at no cost. Learn More. Why is this exciting? Well, DataCamp’s autogr...

Read more »

Matrix Operations Exercises

August 22, 2016

This set of exercises will help you to learn and test your skill in matrix operations, starting with basic ones like scalar multiplication all the way through eigenvalues and eigenvectors. Before proceeding, it might be helpful to look over the help pages for the diag, t, eigen, and crossprod functions. If you want further documentation

Read more »
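
The functions named in the teaser can be tried on a small example. The sketch below is not one of the exercises; the 2 x 2 matrix `A` is an arbitrary choice made for illustration.

```r
# A small symmetric matrix (filled column by column)
A <- matrix(c(2, 1,
              1, 3), nrow = 2)

# Scalar multiplication: every element is multiplied by 3
B <- 3 * A

# Transpose; A is symmetric, so t(A) equals A
At <- t(A)

# diag() extracts the main diagonal of a matrix ...
d <- diag(A)
# ... and builds an identity matrix when given a single integer
I2 <- diag(2)

# crossprod(A) computes t(A) %*% A
AtA <- crossprod(A)

# Eigen-decomposition: eigenvalues and eigenvectors (stored as columns)
e <- eigen(A)
e$values
e$vectors

# Check the defining property A v = lambda v for the first eigenpair
lhs <- as.vector(A %*% e$vectors[, 1])
rhs <- e$values[1] * e$vectors[, 1]
all.equal(lhs, rhs)
```

For a symmetric matrix such as this one, the eigenvalues are real and their sum equals the trace, which gives a quick sanity check on the result.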

Five great charts in 5 lines of R code each

August 22, 2016

Sharon Machlis is a journalist with Computerworld, and to show other journalists how great R is for data visualization she shows them these five data visualizations, each of which can be created in 5 lines of R code or less. I've reproduced Sharon's code and charts below. I did make a couple of tweaks to the code, though. I...

Read more »

A Shiny App for estimating Multiplier Exponent Curves from Manning’s Equation Inputs

August 22, 2016

WASP and other flow and water quality models ask users to input multiplier and exponent values relating velocity and depth to discharge. These relationships take the form of V=aQ^b and depth=cQ^d, where the values a, b, c, and d describe the curve that approximates paired points from other sources. When paired field measurements are lacking, Manning's equation provides an...

Read more »
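
To make the power-law form concrete: taking logarithms of V = aQ^b gives log(V) = log(a) + b log(Q), so a and b can be estimated with an ordinary linear regression on the log scale. The sketch below shows that step only; it is not the Shiny app's code and does not implement Manning's equation. The discharge values and the "true" coefficients are made-up numbers used to generate illustrative data.

```r
# Made-up discharge values and assumed coefficients for illustration
Q <- c(1, 2, 5, 10, 20, 50)
a_true <- 0.4
b_true <- 0.35

# Velocity generated exactly from V = a * Q^b
V <- a_true * Q^b_true

# log(V) = log(a) + b * log(Q): a straight line on the log-log scale
fit <- lm(log(V) ~ log(Q))

# Intercept is log(a), slope is b
a_hat <- exp(coef(fit)[[1]])
b_hat <- coef(fit)[[2]]

c(multiplier = a_hat, exponent = b_hat)
```

The same regression applied to paired depth and discharge values would give c and d. With real, noisy measurements the estimates would only approximate the underlying curve.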

Building a Data Science Platform for R&D, Part 3 – R, R Studio Server, SparkR & Sparklyr

August 22, 2016

Part 1 and Part 2 of this series dealt with setting up AWS, loading data into S3, deploying a Spark cluster and using it to access our data. In this part we will deploy R and R Studio Server to …

Read more »

Using cranlogs in R with Plotly

August 22, 2016

In this post we’ll use the cranlogs package to visualize the number of downloads for Plotly’s R API

Read more »
