Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

January 26, 2017
By
Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

As I wrote in the previous post, I will continue in describing regression methods, which are suitable for double seasonal (or multi-seasonal) time series. In the previous post about Multiple Linear Regression, I showed how to use “simple” OLS regression method to model double seasonal time series of electricity consumption and use it for accurate forecasting. Interactions...

Read more »

Cards on the table

January 26, 2017
By
Cards on the table

After the last post building on feedback from readers, the blog is back to the regular program of recycling old Github repos. Today’s project was waiting for its turn here and will involve a Catan card game. Nearly a year ago, I played Catan with my ...

Read more »

BiclustGUI 1.1.0

January 26, 2017
By
BiclustGUI 1.1.0

BiBitR enters the fray! Ewoud De Troyer, University of Hasselt (CenStat) Introduction The latest patch of the BiclustGUI includes the implementation of BiBitR, a simple R wrapper which directly calls the original Java code for applying the BiBit algo...

Read more »

Kung Fu R

January 26, 2017
By
Kung Fu R

A great way to hone your skills as a data scientist is to pick a topic you're passionate about, find some data related to it, and analyze the heck out of it. Jim Vallandingham is clearly passionate about old Kung Fu movies — particularly those from the Shaw Brothers Studio — and has used R to analyze data the...

Read more »

Image Compression with Principal Component Analysis

January 26, 2017
By
Image Compression with Principal Component Analysis

Image compression with principal component analysis is a frequently occurring application of the dimension reduction technique. Recall from a previous post that employed singular value decomposition to compress an image, that an image is a matrix of pixels represented by RGB color values. Thus, principal component analysis can be used... The post Image Compression with Principal Component Analysis...

Read more »

New Zealand bank replaces SAS server with R Server

January 26, 2017
By

Heartland Bank, a rapidly growing bank in New Zealand, has adopted a data-driven approach to analyzing risk, evaluating credit lines, and understanding cash flows. But they found their legacy SAS system to be labor-intensive and time consuming when it came to updating financial models, and it was expensive to boot. (Being licensed on a per-user basis, it was available...

Read more »

Multiple Regression (Part 2) – Diagnostics

January 26, 2017
By
Multiple Regression (Part 2) – Diagnostics

Multiple Regression is one of the most widely used methods in statistical modelling. However, despite its many benefits, it is oftentimes used without checking the underlying assumptions. This can lead to results which can be misleading or even completely wrong. Therefore, applying diagnostics to detect any strong violations of the assumptions is important. In the

Read more »

Using CPLEX in R: Installing cplexAPI in Windows 10

January 26, 2017
By
Using CPLEX in R: Installing cplexAPI in Windows 10

I have a very large mixed integer problem to solve.   COIN‘s solver accessed through Rsymphony did a decent job, but had trouble finding feasible solutions.  I had been told that the IBM cplex  solver was the best on the market for mixed integer programming. Installing cplex is a relatively easy task.  The difficulties began when I … Continue...

Read more »

Upcoming Win-Vector LLC public speaking engagements

January 26, 2017
By

I am happy to announce a couple of exciting upcoming Win-Vector LLC public speaking engagements. BARUG Meetup Tuesday, Tuesday February 7, 2017 ~7:50pm, Intuit, Building 20, 2600 Marine Way, Mountain View, CA. Win-Vector LLC’s John Mount will be giving a “lightning talk” (15 minutes) on R calling conventions (standard versus non-standard) and showing how to … Continue...

Read more »

Upgrading to macOS Sierra (nee OSX) for R users

January 26, 2017
By
Upgrading to macOS Sierra (nee OSX) for R users

A good fraction of R users use Apple computers. Apple machines historically have sat at a sweet spot of convenience, power, and utility: Convenience: Apple machines are available at retail stores, come with purchasable support, and can run a lot of common commercial software. Power: R packages such as parallel and Rcpp work better on … Continue...

Read more »

Predicting the Willingen 2017 men’s ski jumping competition

January 26, 2017
By
Predicting the Willingen 2017 men’s ski jumping competition

In an earlier post of mine, I carried out an analysis on ski jumping data for Zakopane, Poland and attempted to predict which athletes would end up on the podium. I also created a classification tree and tested it on the 2017 competition data with good results. For this side project of mine, I hope… Continue reading...

Read more »

One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)

January 26, 2017
By
One View of the Impact of the New Immigration Ban (+ freeing PDF data with tabulizer)

Dear Leader has made good on his campaign promise to “crack down” on immigration from “dangerous” countries. I wanted to both see one side of the impact of that decree — how many potential immigrants per year might this be impacting — and show toss up some code that shows how to free data from... Continue reading...

Read more »

Metro Systems Over Time: Part 2

January 26, 2017
By
Metro Systems Over Time: Part 2

Note, at the time of this writing using the packages ggplot2 and ggmap from CRAN will result in an error. To avoid the error be sure to install both packages from GitHub with the package devtools and restart R if the problem persists. devtools::install_github("dkahle/ggmap") devtools::install_github("hadley/ggplot2") Introduction In Part 1 of this series we collected geodata Related Post

Read more »

digest 0.6.12

January 26, 2017
By

A new release, now at version 0.6.12, of the digest package is now on CRAN and in Debian. The digest creates hash digests of arbitrary R objects (using the 'md5', 'sha-1', 'sha-256', 'crc32', 'xxhash' and 'murmurhash' algorithms) permitting easy comp...

Read more »

What is new in the intrval R package?

January 26, 2017
By

An update (v 0.1-1) of the intrval package was recently published on CRAN. The package simplifies interval related logical operations (read more about the motivation in this post). So what is new in this version? Some of the inconsistencies in the 1st CRAN release have been cleaned up, and I have been pushed hard (see GitHub issue to implement all...

Read more »

More water, a bit more about saints

January 25, 2017
By
More water, a bit more about saints

I was lucky enough to get some nice and interesting feedback on my last post. One comment was really useful and pretty embarrassing: I had written “see” instead of “sea” in the whole post… Thanks Steve Dempsey for the correction! I also got s...

Read more »

R Markdown for the Enterprise

January 25, 2017
By
R Markdown for the Enterprise

by Edgar Ruiz In the corporate world, spreadsheets and PowerPoint presentations still dominate as the tools used for analyzing and sharing information. So, it is not at all surprising that even when business analysts use R for the analytical heavy lifting, they frequently revert to using spreadsheets and slide decks to share their results. This

Read more »

Data Driven Cheatsheets

January 25, 2017
By
Data Driven Cheatsheets

Guest post by Jonathan Sidi Cheatsheets are currently built and used exclusivley as a teaching tool. We want to try and change this and produce a cheat sheet that gives a roadmap to build a known product, but also is built as a function so users can input data into it to make the cheatsheet … Continue...

Read more »

DataChats: An interview with Mine Çetinkaya-Rundel

January 25, 2017
By
DataChats: An interview with Mine Çetinkaya-Rundel

Hey R fans! We've just released a new episode of our DataChats video series. In this episode, we interview Mine Çetinkaya-Rundel, Ph.D.. Mine is Director of Undergraduate Studies and an Associate Professor of the Practice in the Department of St...

Read more »

Live Training: Using Email for your Data Science Portfolio

January 25, 2017
By
Live Training: Using Email for your Data Science Portfolio

This Friday at 9am PT I will be running a live training titled Using Email for your Data Science Portfolio. Within the world of online communication, email is the... The post Live Training: Using Email for your Data Science Portfolio appeared first on AriLamstein.com.

Read more »

Training courses: R, Stan and Scala

January 25, 2017
By
Training courses: R, Stan and Scala

Over the next few months we’re running a number of R, Stan and Scala courses around the UK. Feburary Mon 13 – Introduction to R (London) Tue 14 – Programming with R (London) Wed 15 – Advanced Graphics with R (London) Thur 16 (2-day course) – Predictive Analysis (London) March Tue Mar 21 – R for

Read more »

New Applied Finance Course: Bond Valuation and Analysis in R

January 25, 2017
By
New Applied Finance Course:  Bond Valuation and Analysis in R

This week we are launching Bond Valuation and Analysis in R taught by Clifford S. Ang, CFA, Vice President at Compass Lexecon. Bonds are securities issued by governments or corporations that pay interest over a fixed schedule and are the most well-kno...

Read more »

Extracting and Enriching Ocean Biogeographic Information System (OBIS) Data with R

January 25, 2017
By
Extracting and Enriching Ocean Biogeographic Information System (OBIS) Data with R

Programmatic access to biodiversity data is revolutionising large-scale, reproducible biodiversity research. In the marine realm, the largest global database of species occurrence records is the Ocean Biogeographic Information System, OBIS. As of January 2017, OBIS contains 47.78 million occurrences of 117,345 species, all openly available and accessible via the OBIS API. The number of questions to address...

Read more »

Modelling extremes using generalized additive models

January 25, 2017
By
Modelling extremes using generalized additive models

Quite some years ago, whilst working on the EU Sixth Framework project Euro-limpacs, I organized a workshop on statistical methods for analyzing time series data. One of the sessions was on the analysis of extremes, ably given by Paul Northrop (UCL Department of Statistical Science). That intro certainly whet my appetite but I never quite found the time to...

Read more »

A Glimpse into The Daily Life of a Data Scientist

January 24, 2017
By
A Glimpse into The Daily Life of a Data Scientist

A couple of weeks ago, I had a discussion with a co-worker regarding a project I was involved in, I felt that there was no clear understanding of the daily challenges data scientists face. Few days later, I was at Rstudio::Conf 2017 where I met lots of data scientists from academia and industry. Later on, I described one of...

Read more »

a typo that went under the radar

January 24, 2017
By
a typo that went under the radar

A chance occurrence on X validated: a question on an incomprehensible formula for Bayesian model choice: which, most unfortunately!, appeared in Bayesian Essentials with R! Eeech! It looks like one line in our LATEX file got erased and the likelihood part in the denominator altogether vanished. Apologies to all readers confused by this nonsensical formula!Filed

Read more »

Building a machine learning model with the MicrosoftML package

January 24, 2017
By
Building a machine learning model with the MicrosoftML package

Microsoft R Server 9 includes a new R package for machine learning: MicrosoftML. (So do the Data Science Virtual Machine and the free Microsoft R Client edition, incidentally.) This package includes a suite of fast predictive modeling functions implemented by Microsoft Research, including: Linear (rxFastLinear) and logistic (rxLogisticRegression) model functions based on the Stochastic Dual Coordinate Ascent method; Classification/regression...

Read more »

“smooth” package for R. es() function. Part IV. Model selection and combination of forecasts

January 24, 2017
By
“smooth” package for R. es() function. Part IV. Model selection and combination of forecasts

Mixed models In the previous posts we have discussed pure additive and pure multiplicative exponential smoothing models. The next logical step would be to discuss mixed models, where some components have additive and the others have multiplicative nature. But we won’t spend much time on them because I personally think that they do not make

Read more »

Descriptive Analysis of MLST Data for MRSA

January 24, 2017
By
Descriptive Analysis of MLST Data for MRSA

During one of my summers, I had the opportunity to conduct some research on the prevalence of methicillin-resistant Staphylococcus aureus (MRSA) in vulnerable populations and examining US emergency department data and I thought this would be a pretty interesting topic to expand on for my thesis in lieu of the increasing concerns of antimicrobial resistance, … Continue...

Read more »

Sponsors

Mango solutions











Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.