Sentiment Analysis in R

February 21, 2017
By

Current research in finance and the social sciences utilizes sentiment analysis to understand human decisions in response to textual materials. While sentiment analysis has received great traction lately, the available tools are not yet living up to the needs of researchers. Especially R has not yet capabilities that most research desires. Our package “SentimentAnalysis” performs … Continue...

Read more »

coauthorship and citation networks

February 20, 2017
By
coauthorship and citation networks

As I discovered (!) the Annals of Applied Statistics in my mailbox just prior to taking the local train to Dauphine for the first time in 2017 (!), I...

Read more »

R Weekly

February 20, 2017
By
R Weekly

During my Monday morning ritual of avoiding work,  I found this publication that is written in R, for people who use R – R Weekly.  The authors do a pretty awesome...

Read more »

rxNeuralNet vs. xgBoost vs. H2O

February 20, 2017
By
rxNeuralNet vs. xgBoost vs. H2O

Recently, I did a session at local user group in Ljubljana, Slovenija, where I introduced the new algorithms that are available with MicrosoftML package for Microsoft R Server 9.0.3....

Read more »

strcode – structure your code better

strcode – structure your code better

I am pleased to announce my package strcode, a package that should make structuring code easier. You can install it from GitHub, a CRAN submission is planned at a...

Read more »

Data Science for Doctors – Part 4 : Inferential Statistics (1/5)

February 20, 2017
By
Data Science for Doctors – Part 4 : Inferential Statistics (1/5)

Data science enhances people’s decision making. Doctors and researchers are making critical decisions every day. Therefore, it is absolutely necessary for those people to have some basic knowledge of...

Read more »

Yes, you can run R in the cloud securely

February 20, 2017
By
Yes, you can run R in the cloud securely

Once thought of as the ‘little programming language that could’, R has fundamentally transformed the way data scientists and organisations use their data. It gives businesses the power to...

Read more »

PCA – hierarchical tree – partition: Why do we need to choose for visualizing data?

February 20, 2017
By
PCA – hierarchical tree – partition: Why do we need to choose for visualizing data?

Principal component methods such as PCA (principal component analysis) or MCA (multiple correspondence analysis) can be used as a pre-processing step before clustering. But principal component methods give also a framework...

Read more »

More tidyverse: using dplyr functions

February 20, 2017
By

This week, we return to our “Getting Started With R” series. Today we are going to look at some tools from the “dplyr” package. Hadley Wickham, the creator of dplyr,...

Read more »

Is my time series additive or multiplicative?

February 20, 2017
By
Is my time series additive or multiplicative?

Time series data is an important area of analysis, especially if you do a lot of web analytics. To be able to analyse time series effectively, it helps to...

Read more »

future: Reproducible RNGs, future_lapply() and more

February 19, 2017
By
future: Reproducible RNGs, future_lapply() and more

future 1.3.0 is available on CRAN. With futures, it is easy to write R code once, which the user can choose to evaluate in parallel using whatever resources...

Read more »

RcppMLPACK2 and the MLPACK Machine Learning Library

February 19, 2017
By
RcppMLPACK2 and the MLPACK Machine Learning Library

mlpack mlpack is, to quote, a scalable machine learning library, written in C++, that aims to provide fast, extensible implementations of cutting-edge machine learning algorithms. It has been written by Ryan Curtin...

Read more »

Animated Spirals

February 19, 2017
By
Animated Spirals

Ed Hawkins’ Global Temperature Spiral is my new favourite visualization. It’s powerful, compelling, and super tangible. I wanted to apply the spiral to my own data, so I got janky with...

Read more »

SatRday and visual inference of vine copulas

February 19, 2017
By
SatRday and visual inference of vine copulas

SatRday From the 16th to the 18th of February, satRday was held in the City of Cape Town in South Africa. The programme kicked off with two days of workshops...

Read more »

Building Shiny App exercises part 7

February 19, 2017
By
Building Shiny App exercises part 7

Connect widgets & plots In the seventh part of our journey we are ready to connect more of the widgets we created before with our k-means plot in order...

Read more »

Factoextra R Package: Easy Multivariate Data Analyses and Elegant Visualization

February 19, 2017
By
Factoextra R Package: Easy Multivariate Data Analyses and Elegant Visualization

factoextra is an R package making easy to extract and visualize the output...

Read more »

padr::pad does now do group padding

February 18, 2017
By

A few weeks ago padr was introduced on CRAN, allowing you to quickly get datetime data ready for analysis. If you have missed this, see the introduction blog or...

Read more »

Predicting food preferences with sparklyr (machine learning)

February 18, 2017
By
Predicting food preferences with sparklyr (machine learning)

This week I want to show how to run machine learning applications on a Spark cluster. I am using the sparklyr package, which provides a handy interface to access...

Read more »

Bar bar plots but not Babar plots

February 18, 2017
By
Bar bar plots but not Babar plots

You might have heard of the “bar bar plots” movement whose goal is to prevent using (let’s use ggplot2 language shall we) geom_bar when you could have used e.g....

Read more »

Using Rcpp with C++11, C++14 and C++17

February 18, 2017
By
Using Rcpp with C++11, C++14 and C++17

Background When we started the Rcpp Gallery in late 2012, a few of us spent the next four weeks diligently writing articles ensuring that at least one new article would be...

Read more »

Analytical and Numerical Solutions to Linear Regression Problems

February 18, 2017
By
Analytical and Numerical Solutions to Linear Regression Problems

This exercise focuses on linear regression with both analytical (normal equation) and numerical (gradient descent) methods. We will start with linear regression with one variable. From this part of...

Read more »

Putting It All Together

February 18, 2017
By
Putting It All Together

The kind folks over at @RStudio gave a nod to my recently CRAN-released epidata package in their January data package roundup and I thought it might be useful to...

Read more »

cricketr and yorkr books – Paperback now in Amazon

February 18, 2017
By
cricketr and yorkr books – Paperback now in Amazon

My books – Cricket Analytics with cricketr – Beaten by sheer pace!: Cricket analytics with yorkr are now available on Amazon in both Paperback and Kindle versions The cricketr...

Read more »

Accessing MSSQL Server with R (RSQLServer with dplyr)

February 18, 2017
By
Accessing MSSQL Server with R (RSQLServer with dplyr)

Recently I have been starting to use dplyr for handling my data in R. It makes everything a lot smoother! My previous workflow – running an SQL query, storing...

Read more »

RPushbullet 0.3.1

February 17, 2017
By
RPushbullet 0.3.1

A new release 0.3.1 of the RPushbullet package, following the recent 0.3.0 release is now on CRAN. RPushbullet is interfacing the neat Pushbullet service for inter-device messaging, communication,...

Read more »

Using Armadillo with SuperLU

February 17, 2017
By
Using Armadillo with SuperLU

Armadillo is very versatile C++ library for linear algebra, brough to R via the RcppArmadillo package. It has proven to be very useful and popular, and is (as of February...

Read more »

A plot against the CatterPlots complot

February 17, 2017
By
A plot against the CatterPlots complot

In these terrible times, we R people have more important subjects to debate/care about than ggplot2 vs. base R graphics (isn’t even worth discussing anyway, ggplot2 is clearly the...

Read more »

Optimization and Operations Research in R

February 17, 2017
By

Authors: Stefan Feuerriegel and Joscha Märkle-Huß R is widely taught in business courses and, hence, known by most data scientists with business background. However, when it comes to optimization...

Read more »

Catterplots: Plots with cats

February 17, 2017
By
Catterplots: Plots with cats

As a devotee of Tufte, I'm generally against chartjunk. Graphical elements that obscure interpretation of the data occasionally have a useful role to play, but more often than not...

Read more »

Sponsors

Mango solutions









Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.