Playing with R, Shiny Dashboard and Google Analytics Data

August 17, 2015

In this post, I want to share some examples of data visualization I was playing with recently. As on many other occasions, my field of application is digital analytics data, specifically data from Google Analytics. You might remember a previous post whe...

Read more »

What’s new in Revolution R Enterprise 7.4.1

August 17, 2015

by Richard Kittler, Revolution R Enterprise PM, Microsoft Advanced Analytics. In its latest release, Revolution has extended the platform support of Revolution R Enterprise (RRE) version 7.4. Released August 14, version 7.4.1 extends RRE 7.4 capabilities to the Teradata database, HPC Server cluster, and Windows 10 platforms. With RRE for Teradata, customers enjoy the advantage of bringing the...

Read more »

RSiteCatalyst Version 1.4.5 Release Notes

August 17, 2015

It’s only been a month since the last RSiteCatalyst update, and this one is also fairly minor in terms of functionality. Set Your Own Endpoint: For overseas users (or companies with weird setups), you can now use the endpoint argument in the SCAuth() function to specify your API endpoint. For the most part, this

Read more »

Secure HTTPS Connections for R

August 17, 2015

Traditionally, the mechanisms for obtaining R and related software have used standard HTTP connections. This isn’t ideal though, as without a secure (HTTPS) connection there is less assurance that you are downloading code from a legitimate source rather than from another server posing as one. Recently there have been a number of changes that make it easier to use HTTPS

Read more »
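
To make the HTTPS excerpt above concrete, here is a minimal sketch, assuming you simply want the current R session to fetch packages from CRAN over HTTPS. The mirror URL is one common choice and is an assumption here, not necessarily the setup the original post recommends.

```r
# Point the CRAN repository at an HTTPS URL for this session only.
# (The mirror below is an assumed, commonly used choice.)
options(repos = c(CRAN = "https://cran.r-project.org"))

# Inspect the setting that install.packages() will use by default.
print(getOption("repos"))
```

Placing the same `options()` call in a `.Rprofile` file would make the setting persist across sessions.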

3 new R jobs (from R-users.com ; 2015-08-17)

August 17, 2015

This is the bimonthly post (for 2015-08-17) for new R Jobs from R-users.com. Employers: you may visit this link to post a new R job to the R community (it’s free and quick). Job seekers: please follow the links below to learn more and apply for your job of interest (or go to R-users.com to see all the R jobs that are currently available). Full-Time Director, Political and Engagement Analytics (@...

Read more »

Slides of 10+ excellent tutorials at KDD 2015: Spark, graph mining and many more

August 17, 2015

by Yanchang Zhao, RDataMining.com. I attended the KDD 2015 conference in Sydney last week. At the conference, there were more than 10 tutorials, and I went to two of them: 1) Graph-Based User Behavior Modeling: From Prediction to …

Read more »

Constructing a network of politicians from newspaper data

The following is a guest post by Jana Blahak and Jan Dix (University of Konstanz), with support from Simon Munzert. In the last post, we introduced the rzeit package, an R binding to the Content API at ZEIT Online. This time, we give a little demonstration of what can be done with these media data. The question we...

Read more »

Some reflections on teaching frequentist statistics at ESSLLI 2015

August 17, 2015

I spent the last two weeks teaching frequentist and Bayesian statistics at the European Summer School in Logic, Language, and Information (ESSLLI) in Barcelona, at the beautiful and centrally located Pompeu Fabra University. The course web page for the first week is here, and the web page for the second course is here. (NOTE: Uni Potsdam...

Read more »

R, Python, and SAS: Getting Started with Linear Regression

August 16, 2015

Consider the linear regression model, $$ y_i=f_i(\boldsymbol{x}|\boldsymbol{\beta})+\varepsilon_i, $$ where $y_i$ is the response or the dependent variable at the $i$th case, $i=1,\cdots, N$ and the predictor or the independent variable is the $\boldsymbol{x}$ term defined in the mean function $f_i(\boldsymbol{x}|\boldsymbol{\beta})$. For simplicity, consider the following simple linear regression (SLR) model, $$ y_i=\beta_0+\beta_1x_i+\varepsilon_i. $$ To obtain the (best) estimate...

Read more »
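
As a rough illustration of the SLR model in the excerpt above, the sketch below simulates data and recovers the coefficients with base R's `lm()`, assuming the estimate in question is the ordinary least squares one. The simulated values, the seed, and all variable names are assumptions made for illustration and are not taken from the original post.

```r
# Simulated data for y_i = beta_0 + beta_1 * x_i + eps_i (illustrative values).
set.seed(42)
n <- 100
x <- runif(n, min = 0, max = 10)
y <- 2 + 0.5 * x + rnorm(n, sd = 1)

# Ordinary least squares fit via lm().
fit <- lm(y ~ x)

# Closed-form OLS estimates for comparison:
# slope = cov(x, y) / var(x), intercept = mean(y) - slope * mean(x).
beta1_hat <- cov(x, y) / var(x)
beta0_hat <- mean(y) - beta1_hat * mean(x)

print(coef(fit))
print(c(beta0_hat, beta1_hat))
```

Both printed vectors should agree up to floating-point error, since `lm()` solves the same least squares problem numerically.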

R 3.2.2 is released

August 16, 2015

R 3.2.2 (codename “Fire Safety”) was released last weekend. You can get the latest binary version from here (or the .tar.gz source code from here). The full list of new features and bug fixes is provided below. SOME OF THE CHANGES I personally found two things particularly interesting in this release: setInternet2(TRUE) is now the default for Windows (which will …

Read more »

Some Considerations of Modeling Severity in Operational Losses

August 16, 2015

In the Loss Distributional Approach (LDA) for Operational Risk models, multiple distributions, including Log Normal, Gamma, Burr, Pareto, and so on, can be considered candidates for the distribution of severity measures. However, the challenge remains in the stress testing exercise, e.g. CCAR, to relate operational losses to macro-economic scenarios denoted by a set of macro-economic

Read more »

Yet another post on google scholar data analysis

August 16, 2015

Inspired by this post, I wanted to use Google Scholar data to put nice images on my professional website (girly habit). This post explains how I combined the functions available in the R package scholar with additional analyses (partially inspired by the script available at this link, which in my case results in a cannot

Read more »

Managing longitudinal data: Conversion between the wide and the long

August 16, 2015

If you measure the same person twice, you have longitudinal data. We all love longitudinal data because it lets us see how a person's health outcomes change over time, which helps answer many interesting research questions. However, newer R users often struggle to manage longitudinal data because it often comes in two ‘shapes’: the

Read more »
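
The wide and long shapes mentioned in the excerpt above can be sketched with base R's `reshape()`. This is only one possible approach and may not be the one the original post uses; the data frame, its column names, and its values are hypothetical.

```r
# Hypothetical wide-format data: one row per person, one column per visit.
wide <- data.frame(
  id = 1:3,
  weight_1 = c(60, 72, 85),
  weight_2 = c(62, 70, 88)
)

# Wide -> long: one row per person per visit.
long <- reshape(
  wide,
  direction = "long",
  varying = c("weight_1", "weight_2"),
  v.names = "weight",
  timevar = "visit",
  times = 1:2,
  idvar = "id"
)

# Long -> wide again, rebuilding the weight_<visit> columns.
wide_again <- reshape(
  long,
  direction = "wide",
  idvar = "id",
  timevar = "visit",
  v.names = "weight",
  sep = "_"
)

print(long)
print(wide_again)
```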

Seattle histogram

August 16, 2015

Filed under: pictures, R, Statistics, Travel Tagged: histogram, sculpture, Seattle, Washington Convention Center

Read more »

RForcecom Demo Video

August 16, 2015

Recently, I created a demo video of an R package named RForcecom, which connects to Salesforce.com and Force.com from R. The video consists of 4 parts: install and load RForcecom; sign in to Salesforce.com; get opportunity list from…

Read more »

Time Series Analysis: Building a model on non-stationary time series

August 15, 2015

In this post I will give a brief introduction to time series analysis and its applications. We will be using the R package astsa which was developed by professor David Stoffer at the University of Pittsburgh. The textbook it accompanies, which is a good read for anyone interested in the topic, can be found in

Read more »

The Rise of the Robots (Advisors…)

August 15, 2015

The Asset Management industry is on the verge of a major change. Over the last couple of years, Robots Advisors (RA) have emerged as new players. The term itself is hard to define as it encompasses a large variety of services. Some are designed to help traditional advisers better allocate their clients’ money and some are

Read more »

Use box plots to assess the distribution and to identify the outliers in your dataset

August 14, 2015

After you check the distribution of the data by plotting the histogram, the second thing to do is to look for outliers. Identifying the outliers is important because an association you find in your analysis might be explained by the presence of outliers. The best tool to identify the outliers is

Read more »
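
A minimal sketch of the box plot idea in the excerpt above, using base R graphics. The data are made up for illustration (values 1 to 20 plus one extreme point) and are not from the original post.

```r
# Illustrative data: values 1 to 20 plus one extreme value (assumed).
values <- c(1:20, 100)

# boxplot.stats() flags points lying more than 1.5 times the box length
# beyond the hinges; the flagged points are returned in $out.
stats <- boxplot.stats(values)
outliers <- stats$out
print(outliers)

# Draw the box plot; flagged points are drawn individually beyond the whiskers.
boxplot(values, horizontal = TRUE, main = "Box plot with one extreme value")
```

The 1.5 multiplier is the default `coef` argument of `boxplot.stats()` and can be changed if a stricter or looser rule is wanted.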

Rblpapi: Connecting R to Bloomberg

August 14, 2015

Whit, John and I are thrilled to announce Rblpapi, a new CRAN package which connects R to the Bloomberg backends. Rebuilt from scratch using only the Bloomberg C++ API and the Rcpp and BH packages, it offers efficient and direct access from R to a truly vast...

Read more »

Mango Solutions co-Founder announced as inaugural President of newly launched R Consortium

August 14, 2015

In July 2015, Mango Solutions, the leading data science software and services company, announced their membership of the recently launched R Consortium. Mango are delighted to announce that their co-Founder and Chief Data Scientist, Richard Pugh, has been elected as …

Read more »

R 3.2.2 released

August 14, 2015

This just in from the R Core Team: R 3.2.2 has been released. With this update, data accessed over the Web, including files downloaded from URLs using download.file and packages installed using install.packages, will be transmitted using the secure HTTPS protocol. (This has always been an option with prior versions of R, but now it is the default...

Read more »

2015 Fantasy Football Auction Draft Optimizer using OpenCPU

August 14, 2015

We are releasing our 2015 update to the highly requested Auction Draft Optimizer in an OpenCPU app. The app allows you to calculate custom projections for your league based on

Read more »

STAN trailer [PG+53]

August 13, 2015

Filed under: Kids, R, Statistics, University life Tagged: Andrew Gelman, Hami...

Read more »

Bivariate Linear Regression

August 13, 2015

Regression is one of the most important fundamental tools, maybe even the single most important one, for statistical analysis in a large number of research areas. It forms the basis of many of the fancy statistical methods currently en vogue in the social sciences. Multilevel analysis and structural equation modeling are perhaps the most widespread and

Read more »

Survival Analysis – 2

August 13, 2015

In my previous post, I went over the basics of survival analysis, including computing the Kaplan-Meier estimate for given time-to-event data. In this post, I'm exploring Cox's proportional hazards model for survival data. KM estimator helps in figuri...

Read more »
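
For readers who want to try the two techniques named in the excerpt above, here is a small sketch. It assumes the survival package (a recommended package normally shipped with R) and its bundled `lung` dataset; the choice of covariates is purely illustrative and not taken from the original post.

```r
library(survival)

# Kaplan-Meier estimate of the overall survival curve.
km_fit <- survfit(Surv(time, status) ~ 1, data = lung)

# Cox proportional hazards model with two illustrative covariates.
cox_fit <- coxph(Surv(time, status) ~ age + sex, data = lung)

# Hazard ratios are the exponentiated model coefficients.
print(exp(coef(cox_fit)))
```

Calling `summary(cox_fit)` would additionally show confidence intervals and test statistics for each covariate.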

Using Google Analytics with R

August 13, 2015

For the most part, SMBs tend to utilize free analytics solutions like Google Analytics for their web and digital strategy. A powerful platform in its own right, it can be combined with R to create custom visualizations, deep dives into data, and statistical inferences. This article will focus on the usage of R and

Read more »

R News From JSM 2015

August 13, 2015

by Joseph Rickert. We can declare 2015 the year that R went mainstream at the JSM. There is no doubt about it: the calculations, visualizations and deep thinking of a great many of the world's statisticians are rendered or expressed in R, and the JSM is with the program. In 2013 I was happy to have stumbled into a...

Read more »

Mapping Historic US Presidential Election Results

August 13, 2015

Today I will demonstrate how to map historic US Presidential Election results in R. If you want to skip ahead and play with a Shiny app that lets you visualize this dataset, then click here. The dataset we will use comes from Wikipedia’s List of United States presidential election results by state and is packaged in the choroplethr

Read more »

Showing a distribution over time: how many summary stats?

August 13, 2015

I saw this nice graph today on Twitter, by Thomas Forth, but the more I looked at it, the more I felt it was hard to understand the changes over time across the income distribution from the Gini coefficient and …

Read more »