Upcoming Rcpp talk in Sydney

June 20, 2013
By

The Sydney Users of R Forum (SURF) will be hosting me for a talk on July 10. The focus will be Rcpp for R and C++ integration, and the intent is to have this be really applied with lots of motivating examples. Organizers Louise and Eugene were able...

Read more »

Quickly read Excel (xlsx) worksheets into R on any platform

June 20, 2013
By

I wrote a couple days about about importing Excel files into R. There are lots of ways to do this, but all the ways that use only R have drawbacks (as I outlined in my last post), and all the other ways require installation of programs other than R. I’m not opposed to using programs

Read more »

Huge interest in next LondonR user group meeting

June 20, 2013
By

The next LondonR meeting takes place on the 16 July and registrations have already exceeded 200. Presentations at the meeting will be made by Rich Pugh of Mango Solutions, Andrie de Vries of Revolution Analytics and Hadley Wickham of RStudio. All places for a  pre-meeting workshop with Hadley Wickham were snapped up within 2 days of announcing the details. More information...

Read more »

How American Century revolutionized their investment platform with R

June 20, 2013
By
How American Century revolutionized their investment platform with R

American Century Investments is a top-20 mutual fund company with more than 125 billion dollars of assets under management. The quantitative investment group manages 22 funds, and takes an objective, systematic and disciplined approach to determine which stocks to buy and sell. Real-time data and carefully calibrated statistical models are the foundation of this quantitative approach. This group formerly...

Read more »

Datagrabbing Commonly Formatted Sheets from a Google Spreadsheet – Guardian 2014 University Guide Data

June 20, 2013
By
Datagrabbing Commonly Formatted Sheets from a Google Spreadsheet – Guardian 2014 University Guide Data

So it seems like it’s that time of year when the Guardian publish their university rankings data (Datablog: University guide 2014), which means another opportunity to have a tinker and see what I’ve learned since last year… (Last year’s hack was a Filtering Guardian University Data Every Which Way You Can…, where I had a

Read more »

Bayesian Modeling of Anscombe’s Quartet

June 20, 2013
By
Bayesian Modeling of Anscombe’s Quartet

Anscombe’s quartet is a collection of four datasets that look radically different yet result in the same regression line when using ordinary least square regression. The graph below shows Anscombe’s quartet with imposed regression lines (taken from the Wikipedia article). While least square regression is a good choice for dataset 1 (upper left plot) it...

Read more »

Data Science Labs: Predictive Models to Improve Vaccine Quality and Production

June 20, 2013
By
Data Science Labs: Predictive Models to Improve Vaccine Quality and Production

The age of "blockbuster drugs" is coming to an end, as personalized medicine becomes a reality. Data science will be a major driver of innovation in these and other areas of the pharmaceutical industry. This was demonstrated during a project the Data Science Labs team executed on with a major pharmaceuticals company.

Read more »

Installing the RGoogleAnalytics package

June 20, 2013
By
Installing the RGoogleAnalytics package

In this blog post, I would walk you through the steps from downloading to installing the RGoogleAnalytics package on your machine. The RGoogleAnalytics package currently resides at https://code.google.com/p/r-google-analytics/ and this page lists the latest developments around the package. The zip and tarball archives for the package can be obtained from the Downloads Section. Once you download the

Read more »

Update to curves2d()

June 20, 2013
By

(This article was first published on geomorph, and kindly contributed to R-bloggers) Dear morphometricians, Below you will find an update to our function for digitizing curves in 2d: curves2d(). This solves a problem with the function plotting landmarks and semilandmarks out of sequence. To use it, you can "source()" the code from a directory, or copy and paste it...

Read more »

Using the Windows Clipboard, or Passing Data Quickly From Excel to R and Back Again

June 19, 2013
By
Using the Windows Clipboard, or Passing Data Quickly From Excel to R and Back Again

Two of my favorite functions are copy.table() and paste.table(). I’m going to turn this story on its head and give you the ending first. The first allows you to copy a data frame to the clipboard in a format that … Continue reading →

Read more »

literacy rates using semantics and R

June 19, 2013
By
literacy rates using semantics and R

(This article was first published on - R, and kindly contributed to R-bloggers) Somehow I stumbled into the world of linked open data trying to pull information easily off of a wikipedia page without having to write a customer scrapper. Enter in dbpedia, semantic technologies and some wonderful R packages take care of the back-end coding. The Research Group...

Read more »

A Toy Instrumental Variable Application

June 19, 2013
By
A Toy Instrumental Variable Application

Draw nicer Classification and Regression Trees with the rpart.plot package

June 19, 2013
By
Draw nicer Classification and Regression Trees with the rpart.plot package

by Joseph Rickert The basic way to plot a classification or regression tree built with R’s rpart() function is just to call plot. However, in general, the results just aren’t pretty. As it turns out, for some time now there has been a better way to plot rpart() trees: the prp() function in Stephen Milborrow’s rpart.plot package. This function...

Read more »

Spatial Overlays with R – Retrieving Polygon Attributes for a Set of Points

June 19, 2013
By

A short tutorial for spatial overlays using R-GIS..library(sp)library(dismo)# spatial dataalt gadm # viewplot(alt)plot(gadm, add=T)# some addressespts # make it spatialcoords spdf_pts # assign CRS/projectionproj4string(spdf_pts) # check datastr(spdf_pts)# plot it on topplot(spdf_pts, cex = 2, col = 2, add = T)# do an intersection (points in polygon)# yielding the polygon's attribute dataover(spdf_pts, gadm)

Read more »

Compiling R 3.0.1 with MKL support

June 19, 2013
By
Compiling R 3.0.1 with MKL support

Before you begin, be aware that there is others excellent posts about the issue, as: 1. Compiling 64-bit R 2.10.1 with MKL in Linux 2. Speeding up R with Intel’s Math Kernel Library (MKL) 3. Performance benefits of linking R to multithreaded math libraries 4. Using MKL-Linked R in Eclipse But, as recently i faced some problems to The post Compiling...

Read more »

Visualizing transitions with the transitionPlot function

June 19, 2013
By
Visualizing transitions with the transitionPlot function

A transition between states - the above is a simulation of before and after surgery where I've highlighted the large proportion that...

Read more »

An R package for Smith-Wilson yield curves

June 19, 2013
By
An R package for Smith-Wilson yield curves

Yield Curve fitting - the Smith-Wilson method Yield Curve fitting - the Smith-Wilson method This article illustrates the R package SmithWilsonYieldCurve, and provides some additional background on yield curve fitting. The method implemented in the package fits a curve to interest rate market...

Read more »

R Plotting Financial Time Series

June 19, 2013
By

In my little world of finance, data almost always is a time series.  Through both quiet iteration and significant revolutions, the volunteers of R have made analyzing and charting time series pleasant.  As a mini-tribute to all those who have...

Read more »

Generating Tables Using Pander, knitr, and Rmarkdown

June 19, 2013
By

Generating Tables Using Pander, knitr, and Rmarkdown I use a pretty common workflow (I think) for producing reports on a day to day basis. I write them in rmarkdown using RStudio, knit them into .html and .md documents using knitr, then convert the resulting .md file to a .docx file using pander, which is really just a way of communicating with Pandoc via my R terminal. This workflow...

Read more »

highlight 0.4.2

June 19, 2013
By

highlight 0.4.2 is on CRAN. This fixes a few bugs reported by users. The main improvement is that we can now use highlight as a vignette engine, using the functionality introduced in R 3.0.0. In the Rcpp universe, we use … Continue reading →

Read more »

Dallas R Users: Creating R Packages this Saturday, 6/29

June 18, 2013
By

I’ll be presenting at the Dallas R Users Group next Saturday at 10:00AM at the University of Dallas on how to reproduce your R code. We’ll review how to use R scripts, how to embed R code in reproducible documents, and then introduce how to create your own R packages based on your R code.

Read more »

My R package’s worldmap of downloads!

June 18, 2013
By
My R package’s worldmap of downloads!

Last week, a colleague draw my attention on this new log files from the Rstudio cloud CRAN mirror, through a post from Tal Galili. This CRAN mirror is a little different, as it uses Amazon CloudFront to deliver the downloads … Continue reading →

Read more »

The Fallacy of 1/N and Static Weight Allocation

June 18, 2013
By
The Fallacy of 1/N and Static Weight Allocation

In the last few years there has been a increasing tendency to ignore the value of a disciplined quantitative approach to the portfolio allocation process in favor of simple and static weighting schemes such as equal weighting or some type of adjusted volatility weighting. The former simply ignores the underlying security dynamics, assuming equal risk-return,

Read more »

Create SAS Code from R ‘tree’ Objects

June 18, 2013
By
Create SAS Code from R ‘tree’ Objects

I recently was faced with the desire to port some tree models developed in R to SAS so I could score a large database. To me this makes sense as SAS is better with large files (or at least so as not to offend anyone, I am better with large files in SAS). I started

Read more »

Printing R help files in the console or in knitr documents

June 18, 2013
By

Yesterday, I was creating a knitr document based on a script, and was looking for a way to include content from an R help file. The script, which was a teaching document, had a help() command for when the author wanted to refer readers to R documentation. I wanted that text in my final document, though. There’s no...

Read more »

Resources for getting started with R

June 18, 2013
By
Resources for getting started with R

As we believe you may know, we are having a webinar tomorrow (June 19th, 2013) on Predictive Analytics. During this webinar, you are going to be introduced to R, learn how to build a predictive model and also how to carry insightful analysis through visualization. As learning a new language can be a really difficult

Read more »

BCEA 1.3.0

June 18, 2013
By
BCEA 1.3.0

After months of work (although to be fair, we haven't worked 100% full time on this), Andrea and I are nearly ready to publish the next release of BCEA. Andrea has done a brilliant job and is responsible for most of the good new features (NB: see ...

Read more »

PivotalR Improves the Scalability and Performance of In-Database Analytics

June 18, 2013
By
PivotalR Improves the Scalability and Performance of In-Database Analytics

One of the greatest challenges while working with big datasets concerns the need to move information out of storage for analysis. To this end, the recent announcement of PivotalR 0.1 extends Pivotal HD's capabilities, allowing users of the statistical programming language R to perform in-database analytics without leaving the command line.

Read more »

R GIS: Terrain Analysis for Polygons as Simple as it Gets!

June 18, 2013
By
R GIS: Terrain Analysis for Polygons as Simple as it Gets!

library(rgdal)library(raster)alt gadm gadm_sub plot(alt)plot(gadm_sub, add=T)asp slo > extract(slo, gadm_sub, fun = mean, na.rm = T, small = T, df = T) ID slope1 1 9.9590532 2 1.0474433 3 7.4561654 4 1.6737865 5 11.946553> extract(asp, gadm_sub, fun = mean, na.rm = T, small...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.