Update: Predictive Bookmaker Consensus Model for the UEFA Euro 2016

June 25, 2016

(By Achim Zeileis) Three weeks ago we published a Predictive Bookmaker Consensus Model for the UEFA Euro 2016 that provided probabilistic forecasts for all teams in the UEFA European Championship 2016. By combining bookmaker odds with simulation techniques and paired comparison models, we obtained not only a winning probability for each team but also estimates of the abilities (or...

Rcpp::algorithm

June 24, 2016

Introduction A while back I saw a post on StackOverflow where the user was trying to use Rcpp::sugar::sum() on an RcppParallel::RVector. Obviously this doesn’t work and it raised the question “Why doesn’t something more generic exist to provide functions with R semantics that can be used on arbitrary data structures?” As a result, I set out to create a set of...

Custom Templated as and wrap Functions within Rcpp.

June 24, 2016

Introduction Consider a need to be able to interface with a data type that is not presently supported by Rcpp. The data type might come from a new library or from within one's own program. In such cases, Rcpp is faced with an issue of consciousness as the new data type is not similar to known types so the autoconversion...

New Open Course: Exploratory Data Analysis with swirl

June 24, 2016

New Free swirl Course on DataCamp The DataCamp team is excited to announce another course using the swirl package. This free course adapts the Exploratory Data Analysis curriculum from swirl to the interactive DataCamp in-browser interface. No set-up needed! This course offers an introduction to exploratory data analysis which includes exercises in basic...

validate version 1.5 is out

June 24, 2016

A new version of the validate package for data validation was just accepted on CRAN and will be available on all mirrors in a few days. The most important addition is that you can now reference the data set as … Continue reading →

Amazon X-ray data provides insight into movie characters

June 24, 2016

I'm a regular user of Amazon Video: as someone who spends a fair bit of time on planes, it's great to be able to download some of my favourite shows (hello, Orphan Black and Vikings) and catch up on episodes during the trip. Amazon Video has a useful feature, too: if you forget the name of a character, or...

Text Mining with R: Top Keywords of the useR! 2016

June 24, 2016

From June 27-30 the international R user and developer community will meet in Stanford, California for the useR! 2016 Conference. Right in the heart of Silicon Valley, gripping presentations and talks will cover a broad range of topics from R-related computing issues to general statistical topics. In case you are wondering what the most popular … „Text Mining...

Microsoft Analytics in 2016

June 23, 2016

If you had asked me two years ago if Microsoft was a serious vendor for data science and analytics infrastructure and tools, I would have laughed. At the time their offering seemed to me to consist of Excel against SQL Server. There is nothing really wrong (or exciting) about SQL Server, but friends don’t let friends use Excel for...

Introducing QCSimulator: A 5-qubit quantum computing simulator in R

June 23, 2016

Introduction My 5-qubit Quantum Computing Simulator, QCSimulator, is finally ready, and here it is! I have been able to successfully complete this simulator by working through a fair amount of material. To a large extent, the simulator is easy, if one understands how to solve the quantum circuit. However, the theory behind quantum computing itself is

R, Stan and Bayesian Statistics

June 23, 2016

by Joseph Rickert Just about two and a half years ago I wrote about some resources for doing Bayesian statistics in R. Motivated by the tutorial Modern Bayesian Tools for Time Series Analysis by Harte and Weylandt that I attended at R/Finance last month, and the upcoming tutorial An Introduction to Bayesian Inference using R Interfaces to Stan that...

Use your .Rprofile to give you important notifications

June 23, 2016

In R, we can use a file called .Rprofile to do things in R based on a number of triggers. One thing I’ve done is give myself a DIY notification of how many data breaches I’ve been involved in! First of all, you need a file called .Rprofile that’s stored in your working directory. Some...

satRday conference in September 2016 at Budapest

June 23, 2016

As you probably already know, satRdays are go -- and the location of the first three events in the series is now also decided: Budapest, Hungary; San Juan, Puerto Rico; Cape Town, South Africa. To help to organize such community-driven, regional R conferences...

Working with Rcpp::StringVector

June 22, 2016

Vectors are fundamental containers in R. This makes them equally important in Rcpp. Vectors can be useful for storing multiple elements of a common class (e.g., integer, numeric, character). In Rcpp, vectors come in the form of NumericVector, CharacterVector, LogicalVector, StringVector and more. Look in the header file Rcpp/include/Rcpp/vector/instantiation.h for more types. Here we explore how to work with Rcpp::StringVector as a way to...

Case Study: Animation and Others Vizs

June 22, 2016

This post is about whether we can show some data in other ways to tell the Oh! Foo! is this rly happening? story more clearly. Some time ago a GIF appeared showing the change of global temperatures over time. Well, some sites like http://gizmodo.com/ referred to this animation as one-of-the-most-convincing-climate-change-visualization. Mmmm… ok! A kind of...

In search of an incredible posterior

June 22, 2016

What is credibility? For over one hundred years actuaries have been wrestling with the idea of “credibility”. This is the process whereby one may make a quantitative assessment of the predictive power of sample data. Where necessary, the researcher augments the sample with some exogenous information - usually more data - to arrive at a final conclusion. In...

What is reproducible research?

June 22, 2016

I was asked about my understanding of reproducible research, and how that applies to social research. Here is how I see it: Reproducible research is key to any scientific method, including applied social sciences. My minimalist understanding of reproduci...

R 3.3.1 is released

June 22, 2016

R 3.3.1 (codename “Bug in Your Hair”) was released yesterday. You can get the latest binaries version from here (or the .tar.gz source code from here). The full list of bug fixes is provided below (this release does not introduce new features). Upgrading to R 3.3.1 on Windows If you are using Windows you can easily upgrade to the latest … Continue...

R 3.3.1 now available

June 22, 2016
By

Peter Dalgaard announced yesterday on behalf of the R core team that R 3.3.1, the latest update to the R language, is now available for download from your local CRAN mirror. As of this writing, binaries of R 3.3.1 are available for Windows and Linux; the Mac version should appear very soon. This minor update, codenamed "Bug in Your...

y-aware scaling in context

June 22, 2016

Nina Zumel introduced y-aware scaling in her recent article Principal Components Regression, Pt. 2: Y-Aware Methods. I really encourage you to read the article and add the technique to your repertoire. The method combines well with other methods and can drive better predictive modeling results. From feedback I am not sure everybody noticed that in … Continue reading...

MonetDBLite because fast

June 21, 2016

MonetDBLite is a SQL database that runs inside the R environment for statistical computing and does not require the installation of any external software. MonetDBLite is based on free and open-source MonetDB, a product of the Centrum Wiskunde & Informatica. MonetDBLite is similar in functionality to RSQLite, but typically completes queries blazingly fast...

Clustering Mixed Data Types in R

June 21, 2016

Clustering allows us to better understand how a sample might be comprised of distinct subgroups given a set of variables. While many introductions to cluster analysis typically review a simple application using continuous variables, clustering data of mixed types (e.g., continuous, ordinal, and nominal) is often of interest. The following is an overview of one approach to clustering data...

Euro 2016 Squads Part Deux

June 21, 2016

Between some of the games I had a go at replicating a plot from liberation.fr on the connections between Euro 2016 players and the country of birth using the circlize package in R. As with the previous post, the colours are based on the home shirt of each team and data scraped from Wikipedia. The … Continue reading...

Working with Venn Diagrams

June 21, 2016

In this post, we will learn how to create Venn diagrams for gene lists and how to retrieve the genes present in each Venn compartment with R.

LDAvis Show Case on R-Bloggers

June 21, 2016

Text mining is a new challenge for machine learning practitioners. The increased interest in text mining is caused by a growing number of internet users and by the rapid growth of internet data, 80% of which is said to be text data. Extracting information from articles, news, posts and comments has become a desirable skill but what...

EARL London Agenda Announced

June 21, 2016

We are delighted to announce the full agenda and impressive line up of speakers for September’s EARL Conference. EARL 2016 will feature the largest number of talks yet with 3 streams and 48 presentations. We have a fantastic range of cross-industry representation … Continue reading →

Exploring Global Internet Performance Data Using R

June 21, 2016

by Lourdes O. Montenegro Lourdes O. Montenegro is a PhD candidate at the Lee Kuan Yew School of Public Policy, National University of Singapore. Her research interests cover the intersection of applied data science, technology, economics and public policy. Many of us now find it hard to live without a good quality internet connection. As a result, there is...
