My talks @ Universitat de Girona

April 18, 2014

Just after Easter, I'll go for a very quick trip to lovely Girona, where Marc Saez has invited me to give two talks. The first one will be a re-run of the short course on INLA that I did at Bayes Pharma last year. It's scheduled (and prepared) as a 3-ho...

Date formatting in R

April 18, 2014

As I often manipulate time series from different sources, I rarely come across the same date format twice. Having to reformat the dates every time is a real waste of time because I never remember the syntax of the as.Date function. Below are a few examples that turn strings into R's standard date format.
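
A few typical cases might look like the following minimal sketch. The input strings are made up for illustration and are not taken from the original post; only base R's as.Date() and format() are used, with numeric format codes so the result does not depend on the locale.

```r
# Each call parses a differently formatted string into a Date object.
d1 <- as.Date("2014-04-18")                       # ISO 8601, no format needed
d2 <- as.Date("18/04/2014", format = "%d/%m/%Y")  # day/month/4-digit year
d3 <- as.Date("04.18.14", format = "%m.%d.%y")    # month.day.2-digit year
d4 <- as.Date("20140418", format = "%Y%m%d")      # compact, no separators

# All four strings describe the same day.
stopifnot(d1 == d2, d2 == d3, d3 == d4)

# format() goes the other way, from a Date back to a string.
format(d1, "%d/%m/%Y")
```

The format codes are documented under ?strptime, which is usually the quickest place to look when the syntax slips your mind.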

Reot: Empirical Orthogonal Teleconnections in R

April 18, 2014

We are happy to introduce Reot, an R package designed for empirical orthogonal teleconnection (EOT) analysis of gridded geo-scientific space-time data based on the method by van den Dool et al. (2000). EOT denotes a regression-based approach to decompose spatio-temporal …

Using Gitbook with R Markdown

April 18, 2014

Gitbook has been getting some (deserved) attention. For those who haven’t seen it, Gitbook is a system to create really beautiful interactive web (or PDF and ebook) books. For me, the timing of discovering this framework could not be better as I am preparing documentation for propensity score analysis for an upcoming workshop I am giving. Of...

Publishing in GitHub

April 17, 2014

I struggled to create my first repository on GitHub, and finally worked out the steps. Create a folder on your local machine and add the required files to it (SRCFOLDER). One of the required files is README.md, which will contain an overview of the project. Add...

Database for chronic malignancies

April 17, 2014

Dear all, I recently designed a database for Multiple Myeloma, which can easily be extended to include other malignancies and chronic diseases. The database and its guide can be downloaded from the GitHub repository. Bye.

Physignal Input is 3D Array Only

April 17, 2014

Geomorph users, the function physignal() in version 2.0 has a mistake in the manual regarding the input matrix: only a 3D array can be used at this time. The next version will generalize this to accept both 2D and 3D arrays. In the meantime,...

Diving into H2O

April 17, 2014

by Joseph Rickert. One of the remarkable features of the R language is its adaptability. Motivated by R’s popularity and helped by R’s expressive power and transparency, developers working on other platforms display what looks like inexhaustible creativity in providing seamless interfaces to software that complements R’s strengths. The H2O R package that connects to 0xdata’s H2O software (Apache...

dplyr Example #1

April 17, 2014

Hadley Wickham released the dplyr package in January 2014. Since then I have been itching to give it a try, as it has been suggested to speed up some data manipulations and, more important to me, provide a singular framework …

R Interface to Myfxbook

April 17, 2014

Myfxbook provides an interface to your FOREX trading accounts as well as an active trading community. It has a broad range of functionality including a responsive interface to the FOREX market; tools for performing statistical analyses on your trades; the facility to mirror trades from other traders or systems; and provides a platform for publicising

Make your ggplots shareable, collaborative, and with D3

April 17, 2014

Editor's note: This is a guest post from Matt Sundquist of the Plot.ly team. You can access the source code for this post at https://gist.github.com/sckott/10991885

Ggplotly and Plotly's R API let you make ggplot2 plots, add py$ggplotly(), and make your plots interactive, online, and drawn with D3. Let's make some.

1. Getting Started and Examples

Here is Fisher's iris...

How Fast the Fastest Human Would Run 100m?

April 16, 2014

Ethan Siegel wrote a post entitled The Math of the Fastest Human Alive five years ago, using regressions. An alternative is to use extreme value models (a few years ago I wrote a post on the maximum length of a tennis match using extreme value theory). In 2009, John Einmahl and Sander Smeets wrote a great article...

Bitsanity

April 16, 2014

The awesome folks at Quandl (an amazing data collection and distribution service) have been so kind as to allow me to write for their blog. In my first post for them I demonstrate (with detailed R code) how a user of their free data services co...

Why writing vectorized code in R is a good idea

April 16, 2014

As a language for statistical computing, R has always had a bias towards linear algebra, and is optimized for operations on complete vectors and matrices. This can be surprising to programmers coming to R from lower-level languages, where iterative programming (looping over the elements of a vector or matrix) is more natural and often more efficient. That's not...
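
To make the contrast concrete, here is a minimal sketch of the two styles in base R. The vector x and the squaring operation are illustrative assumptions, not code from the original post.

```r
x <- c(1.5, 2, 3.25, 4)

# Element-by-element loop, as one might write in a lower-level language.
squares_loop <- numeric(length(x))
for (i in seq_along(x)) {
  squares_loop[i] <- x[i]^2
}

# Vectorized form: the operator is applied to the whole vector at once.
squares_vec <- x^2

# Both approaches give the same result.
stopifnot(isTRUE(all.equal(squares_loop, squares_vec)))
```

On a vector this small the difference is only stylistic; the vectorized form tends to pay off in speed as the vector grows, because the per-element work happens in compiled code rather than in the R interpreter.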

Using R — Working with Geospatial Data (and ggplot2)

April 16, 2014

This is a follow-up blog-post to an earlier introductory post by Steven Brey: Using R: Working with Geospatial Data. In this post, we’ll learn how to plot geospatial data in ggplot2. Why might we want to do this? Well, it’s really …

Modeling Data With Functional Programming In R – Chapter 1

April 16, 2014

I’m pleased to announce that a rewrite of my first chapter of “Modeling Data With Functional Programming In R” is …

Vectorization in R: Why?

April 16, 2014

Here are my notes from a recent talk I gave on vectorization at a Davis R Users’ Group meeting. Thanks to Vince Buffalo, John Myles White, and Hadley Wickham for their input as I was preparing this. Feedback welcome! Beginning R users are often told to “vectorize” their code. Here, I try to explain...

Geomorph 3D Visualization

April 16, 2014

Dear geomorph users, version 2.0 of geomorph brings new developments in how shape deformations from 3D coordinate shape data can be viewed. We have implemented warping of 3D surface files (e.g., .ply files), which allows the user to visualize the shape deformations along Principal Component axes, Multivariate Regression slopes, Partial Least Squares axes and group differences, to name a few. The new function warpRefMesh() reads in a .ply...

Geomorph Version 2.0 Now Available!

April 16, 2014

Geomorph users, we have uploaded version 2.0 to CRAN. The Windows and Mac binaries have been compiled and the tarball is available. Version 2.0 comes with substantial changes and new features: New function phylo.pls() for assessing the multivari...

R Courses in Milan (Italy): May 2014

April 16, 2014

Dear R users, the May 2014 public training course schedule for Milano (Italy) based courses is as follows:
Web Applications with R and Shiny: May 15, 2014
Reports in R with RStudio: May 16, 2014
Basic R Programming: May 22, …

Errors on percentage errors

April 16, 2014

The MAPE (mean absolute percentage error) is a popular measure for forecast accuracy and is defined as $\text{MAPE} = 100\,\text{mean}(|y_t - \hat{y}_t| / |y_t|)$, where $y_t$ denotes an observation and $\hat{y}_t$ denotes its forecast, and the mean is taken over $t$. Armstrong (1985, p.348) was the first (to my knowledge) to point out the asymmetry of the MAPE, saying that “it has a bias favoring estimates...
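
As a rough sketch, the MAPE can be computed in a few lines of R, assuming the usual definition (100 times the mean of the absolute error divided by the absolute observation). The function name mape and the numbers below are illustrative assumptions, not taken from the original post.

```r
# MAPE in percent; assumes equal-length inputs and no zero observations.
mape <- function(actual, forecast) {
  stopifnot(length(actual) == length(forecast), all(actual != 0))
  100 * mean(abs(actual - forecast) / abs(actual))
}

actual   <- c(100, 200, 400)
forecast <- c(110, 180, 400)
mape(actual, forecast)

# One commonly cited source of asymmetry: for a positive observation and a
# non-negative forecast, an under-forecast can cost at most 100%, while an
# over-forecast has no upper bound.
mape(100, 0)    # forecast far too low:  100
mape(100, 300)  # forecast far too high: 200
```

Whether this bounded-versus-unbounded behaviour is exactly the bias Armstrong describes is not settled by the truncated quotation, so treat the last two calls as an illustration of one asymmetry rather than a restatement of his argument.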

Checking (G)LM model assumptions in R

April 16, 2014

(Generalized) linear models make some strong assumptions concerning the data structure:
1) Independence of each data point
2) Correct distribution of the residuals
3) Correct specification of the variance structure
4) Linear relationship between the response and the linear predictor
For a simple lm, 2-4) mean that the residuals should be normally distributed, the variance should be homogeneous across the
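
For a simple lm, two standard graphical checks cover most of these points. The sketch below uses the built-in cars data set purely as an illustrative assumption; it is not the example from the original post, and it relies only on base R functions.

```r
# Fit a simple linear model on the built-in `cars` data (illustrative choice).
fit <- lm(dist ~ speed, data = cars)
res <- residuals(fit)

# Residuals vs fitted values: look for curvature (non-linearity) or a
# funnel shape (non-constant variance).
plot(fitted(fit), res, xlab = "Fitted values", ylab = "Residuals")
abline(h = 0, lty = 2)

# Normal Q-Q plot: points far from the line suggest non-normal residuals.
qqnorm(res)
qqline(res)
```

Independence of the data points usually cannot be read off a plot like these; it has to be judged from how the data were collected.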

Impact of Dimensionality on Data in Pictures

April 16, 2014

I am excited to announce that this is supposed to be my first article also published on r-bloggers.com :) The processing of data needs to take dimensionality into account, as the usual metrics change their behaviour in subtle ways, which impacts the …

Topic Modeling In R

April 16, 2014

Editor's note: This is the first in a series of posts from rOpenSci's recent hackathon. I recently had the pleasure of participating in rOpenSci's hackathon. To be honest, I was quite nervous to work among such notables, but I immediately felt welcome thanks to a warm and personable group. Alyssa Frazee has a great post summarizing...

Accessing Open Data Portal (India) using APIs

April 15, 2014

As I mentioned in my previous blog post, the Government of India has started an Open Data Portal to make various data public. Most of the data-sets on the portal are available for manual download. Some of the data-sets, though, are also available to be a...

Visualizing Age-Length Keys — More

April 15, 2014

I recently posted on the addition of ageKeyPlot() to the FSA package for visualizing age-length keys. Since that post, I have added one more type of plot as illustrated below. This post should be considered as an extension of the …

Oracle’s Strategy for Advanced Analytics

April 15, 2014

At Oracle our goal is to enable you to get timely insight from all of your data. We continuously enhance Oracle Database to allow workloads that have traditionally required extracting data from the database to run in-place. We do this to narrow the ga...

Investigating the relationship between gold and bitcoin prices with R.

April 15, 2014

Image by Ennio Pozzetti

In this post I will explore some of the movements in markets in recent years. These movements have caught many by surprise, resulting in some people unexpectedly striking it rich while others have lost a great deal. I am no financial advisor, nor do I have a background in financial

Wealth and Skill | A Talk to Students

April 15, 2014

I enjoyed talking to University of Alabama students this morning about wealth, skill, and luck. I tried to synthesize a whole lot of research into something meaningful. Of course, it would not have been possible without rCharts + Slidify. Thanks ...
