TTTAR2: My First Shiny App with Bootstrap – #RUGSMAPS

August 26, 2014
By
TTTAR2: My First Shiny App with Bootstrap – #RUGSMAPS

Thing To Try After useR! part 2 (TTTAR2) Originally, this post was supposed to be a sequel to TTTAR1 about h2o machine learning. Since TTTAR1 I have been carrying out more h2o tests both locally and on the cloud with the very kind support of Nick Elprin from Domino. The more...

Read more »

Statistics: Losing Ground to CS, Losing Image Among Students

August 26, 2014
By

by Norman Matloff The American Statistical Association (ASA) leadership, and many in Statistics academia. have been undergoing a period of angst the last few years, They worry that the field of Statistics is headed for a future of reduced national influence and importance, with the feeling that: The field is to a large extent being usurped by other disciplines,...

Read more »

Coursera course on computational finance with R

August 26, 2014
By
Coursera course on computational finance with R

As of today (Tuesday 26th of August), a new session of Professor Eric Zivot’s course on computational finance and financial econometrics starts on Coursera. Just like the previous run of the course, most R labs and R assignments will take place in DataCamp’s interactive learning environment. Designed by Professor Eric Zivot (University of Washington), Introduction to computational

Read more »

Making Your Code Citable

August 26, 2014
By
Making Your Code Citable

Original post from GitHub Guides:Digital Object Identifiers (DOI) are the backbone of the academic reference and metrics system. If you’re a researcher writing software, this guide will show you how to make the work you share on GitHub citable by archiving one of your GitHub repositories and assigning a DOI with the data...

Read more »

Statistics: Losing Ground to CS, Losing Image Among Students

August 26, 2014
By
Statistics:  Losing Ground to CS, Losing Image Among Students

The American Statistical Association (ASA)  leadership, and many in Statistics academia. have been undergoing a period of angst the last few years,  They worry that the field of Statistics is headed for a future of reduced national influence and importance, with the feeling that: The field is to a large extent being usurped by other

Read more »

ChainLadder 0.1.8 released

August 26, 2014
By
ChainLadder 0.1.8 released

Over the weekend we released version 0.1.8 of the ChainLadder package for claims reserving on CRAN. What is claim reserving?The insurance industry, unlike other industries, does not sell products as such but promises. An insurance policy is a promise by the insurer to the policyholder to pay for future claims for an upfront received premium. As a result...

Read more »

The Chi-Squared Test of Independence – An Example in Both R and SAS

The Chi-Squared Test of Independence – An Example in Both R and SAS

Introduction The chi-squared test of independence is one of the most basic and common hypothesis tests in the statistical analysis of categorical data.  Given 2 categorical random variables, and , the chi-squared test of independence determines whether or not there exists a statistical dependence between them.  Formally, it is a hypothesis test with the following null and

Read more »

Do Massive Workloads Equal Big Declines in Running Back Productivity?

August 25, 2014
By
Do Massive Workloads Equal Big Declines in Running Back Productivity?

The popular wisdom is that “massive workloads” result in a decline in running back productivity in the following season. For example, anecdotal evidence using Corey Dillon (2004) and Jamaal Anderson The post Do Massive Workloads Equal Big Declines in Running Back Productivity? appeared first on Fantasy Football Analytics.

Read more »

Continuous or Discrete Latent Structure? Correspondence Analysis vs. Nonnegative Matrix Factorization

August 25, 2014
By
Continuous or Discrete Latent Structure? Correspondence Analysis vs. Nonnegative Matrix Factorization

A map gives us the big picture, which is why mapping has become so important in marketing research. What is the perceptual structure underlying the European automotive market? All we need is a contingency table with cars as the rows, attributes as the ...

Read more »

Spline interpolation of temporal resolution for satellite images.

August 25, 2014
By
Spline interpolation of temporal resolution for satellite images.

This week, I had a discussion with a few of my colleagues on the possibility of utilizing remote sensing data or satellite images to improve our statistical estimation such as imputation. One source of interest is the

Read more »

David Smith, the useR! 2014 Interview

August 25, 2014
By
David Smith, the useR! 2014 Interview

“At one point, the only source for daily R news was David Smith’s blog.” —...

Read more »

Of Needles and Haystacks: Building an Accurate Statewide Dropout Early Warning System in Wisconsin

August 25, 2014
By
Of Needles and Haystacks: Building an Accurate Statewide Dropout Early Warning System in Wisconsin

For the past two years I have been working on the Wisconsin Dropout Early Warning System, a predictive model of on time high school graduation for students in grades 6-9 in Wisconsin. The goal of this project is to help schools and educators have an ea...

Read more »

8 new R jobs (for Augest 25th 2014)

August 25, 2014
By
r_jobs

This is the bimonthly R Jobs post (for 2014-08-25), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seekers, please follow the links below to learn more and apply for your job of interest (or visit previous...

Read more »

Some love for ggplot2

August 25, 2014
By
Some love for ggplot2

With all the recent buzz about ggvis (this, this, and this) it’s often easy to forget all that ggplot2 offers as a graphics package. True, ggplot is a static approach to graphing unlike ggvis but it has fundamentally changed the way we think about plots in R. I recently spent some time thinking about some

Read more »

New in OpenCPU 1.4.4: session namespaces

August 24, 2014
By
New in OpenCPU 1.4.4: session namespaces

The OpenCPU system exposes an HTTP API for embedded scientific computing with R. This provides reliable and scalable foundations for integrating R based analysis and visualization modules into pipelines, web applications or big data infrastruct...

Read more »

Forecasting with R in WA

August 24, 2014
By

On 23–25 September, I will be running a 3-day workshop in Perth on “Forecasting: principles and practice” mostly based on my book of the same name. Workshop participants will be assumed to be familiar with basic statistical tools such as multiple regression, but no knowledge of time series or forecasting will be assumed. Some prior experience in R is...

Read more »

R: Stem (Pre-Processed) Text Blocks

August 24, 2014
By
R: Stem (Pre-Processed) Text Blocks

Objective I recently needed to stem every word in a block of text i.e. reduce each word to a root form. Problem The stemmer I was using would only stem the last word in each block of text e.g. Solution I wrote a function which splits a block of text into individual words, stems each

Read more »

JAGS and Stan

August 24, 2014
By
JAGS and Stan

During the last year I have been running some estimations in both JAGS and Stan. In that period I have seen one example where JAGS could not get me decent samples (in the sense of low Rhat and high number of effective samples) but that was data which I...

Read more »

Computational Actuarial Science, with R

August 24, 2014
By

The book Computational Actuarial Science, with R is officially out. In the introduction of the book, and on the website of CRC, it is mentioned that the datasets can be found “in an R package on CRAN“, which is unfortunately incorrect. Some datasets are too large, so the package can not be uploaded on CRAN. Hopefully, Christophe host the package...

Read more »

VCI — The Value Charts Indicator

August 23, 2014
By
VCI — The Value Charts Indicator

So recently, I was made known of the Value Charts Indicator , which was supposed to be some form of … Continue reading →

Read more »

R OOP – a little privacy please?

August 23, 2014
By

As of late, I’ve been making heavy use of Reference Classes in R. They are easier for me to wrap my mind around since they adopt a usage style more like “traditional” OOP languages like Java. Primarily, object methods are part of the class definition and accessed via the instantiated object. For instance: With S3/S4 classes, you...

Read more »

An adverse consequence of fitting "maximal" linear mixed models

August 23, 2014
By
An adverse consequence of fitting "maximal" linear mixed models

Distribution of intercept-slope correlation estimates with 37 subjects, 15 itemsDistribution of intercept-slope correlation estimates with 50 subjects, 30 itemsShould one always fit a full variance covariance matrix (a "maximal" model) when one analyze...

Read more »

More ESA 2014 Program Text-Mining: Topics as Communities

August 22, 2014
By
More ESA 2014 Program Text-Mining: Topics as Communities

In my first pass at text analysis of the ESA program, I looked at how the frequency of words used in the ESA program differed from last year to this year. There are much more sophisticated ways at looking at word use in text, though, and I began to dive into the text-mining literature to find...

Read more »

Because it’s Friday: A 3-minute movie in 4095 bytes

August 22, 2014
By

This entire movie — images, music, everything — is generated from a Windows PC executable of just 4,095 bytes. That's not a typo: we're not talking bytes not megabytes or gigabytes here. Less than 4kb total creates this entire scene. For comparison, a medium-quality video file of this exact same scene in AVI format comes in at over 64Mb:...

Read more »

Entering the field as a data scientist with certification

August 22, 2014
By

By Neera Talbert, VP Services and Ben Wiley, R Programmer at Revolution Analytics By now, everyone should be familiar with the data scientist boom. Simply logging onto LinkedIn reveals a seemingly infinite number of people with words and phrases like “Data Scientist”, “Big Data Specialist”, and “Analytics” in their title. A few weeks ago, an article floated around the...

Read more »

Pro Grammar and Devel Hoper

August 22, 2014
By
Pro Grammar and Devel Hoper

I've been teasing about this post for some time now. My next blog post is "Pro Grammar and Devel Hoper". And this not just an empty pun. Stay tuned.— Romain François (@romain_francois) August 3, 2014 @stefanbache another teaser. https://t.co/i2ubfOyjIO iris >> filter( Sepal.Length > 7 ) iris |> filter( Sepal.Length > 7 )— Romain François (@romain_francois)

Read more »

Running “Native” Data Wrangling Applications in the Browser – IPython Notebooks (and R?) in Chrome

August 22, 2014
By
Running “Native” Data Wrangling Applications in the Browser – IPython Notebooks (and R?) in Chrome

Using browser based data analysis toolkits such as pandas in IPython notebooks, or R in RStudio, means you need to have access to python or R and the corresponding application server either on your own computer, or running on a remote server that you have access to. When running occasional training sessions or workshops, this

Read more »

stringdist 0.8: now with soundex

August 22, 2014
By

An update to the stringdist package was released earlier this month. Thanks to a contribution of Jan van der Laan the package now includes a method to compute soundex codes as defined here. Briefly, soundex encoding aims to translate words … Continue reading →

Read more »

RcppEigen 0.3.2.2.0

August 21, 2014
By

A new upstream release of the Eigen C++ template library for linear algebra was released a few days ago. And Yixuan Qiu did some really nice work rolling this into a new RcppEigen released and then sent me a nice pull requent. The new version is now...

Read more »