Graphic Parameters (symbols, line types, and colors) for ggplot2

April 27, 2012
By
Graphic Parameters (symbols, line types, and colors) for ggplot2

Following up on John Mount’s post on remembering symbol parameters in ggplot2, I decided to give it a try and included symbols, line types, and colors (based upon Earl Glynn’s wonderful color chart).  Code follows below. require(ggplot2) ...

Read more »

Graphic Parameters (symbols, line types, and colors) for ggplot2

April 27, 2012
By
Graphic Parameters (symbols, line types, and colors) for ggplot2

Following up on John Mount’s post on remembering symbol parameters in ggplot2, I decided to give it a try and included symbols, line types, and colors (based upon Earl Glynn’s wonderful color chart).  Code follows below. require(ggplot2) ...

Read more »

Randomization thoughts

April 27, 2012
By
Randomization thoughts

Le Grand Casino of Monte CarloOn Monday I’m going to be leading a little stats workshop on randomization tests and null models. In preparation for this I wrote up code for null model examples I wanted to write a post that introduced the basics of these models (Null models, bootstrapping,...

Read more »

soilDB Demo: Processing SSURGO Attribute Data with SDA_query()

April 26, 2012
By
soilDB Demo: Processing SSURGO Attribute Data with SDA_query()

Mapping near Paloma, CA This image has nothing to do with the following content. A quick example of how to use the USDA-NRCS soil data access query facility (SDA), via the soilDB package for R. The following code describes how to get component-level so...

Read more »

phyloseq: Reproducible interactive analysis of microbiome census data using R

April 26, 2012
By
phyloseq: Reproducible interactive analysis of microbiome census data using R

Collaborative development of phyloseq on GitHub. Official stable release of phyloseq on Bioconductor. Advances in DNA sequencing technology have dramatically improved the scope and scale of culture-independent investigations into microbial communities. There are effective software tools available to process raw DNA … Continue reading →

Read more »

AdfTest Function Enhanced With Rcpp Armadillo

April 26, 2012
By

In my previous post about rewriting my code to run in parallel part one I mentioned that we will make a small change to adfTest() function as well. In this post we will perform this small but performance-dramatic change. When you take a closer look at the source code of this particular function from fUnitRoots package

Read more »

Structural Breaks (Bull or Bear?)

April 26, 2012
By
Structural Breaks (Bull or Bear?)

When I spotted the bfast R package, I could not resist attempting to apply it to identify bull and bear markets.  For all the details that I do not understand, please see the references: Jan Verbesselt, Rob Hyndman, Glenn Newnham, Darius Culvenor...

Read more »

Graphic Parameters (symbols, line types, and colors) for ggplot2

April 26, 2012
By
Graphic Parameters (symbols, line types, and colors) for ggplot2

Following up on John Mount’s post on remembering symbol parameters in ggplot2, I decided to give it a try and included symbols, line types, and colors (based upon Earl Glynn’s wonderful color chart).  Code follows below.    

Read more »

Big Data statistics in the search for a cure for MS

April 26, 2012
By

Multiple Sclerosis (MS) is a debilitating and complex disease with an unknown cause — and for which there is currently no cure. The SUNY Buffalo is home to one of the leading multiple sclerosis (MS) research centers in the world, and as reported in Healthcare IT News, the research team is using IBM Netezza and Revolution R Enterprise to...

Read more »

spam evolution

April 26, 2012
By
spam evolution

Despite some rather modest protection (like a simple captcha), I still receive spammy comments on this blog every now and again. They’re easily spotted and actually never appear on the website. There’s obviously an incentive for the spammer to post … Continue reading →

Read more »

R Tips: lots of tips for R programming

April 26, 2012
By
R Tips: lots of tips for R programming

by Yanchang Zhao, RDataMining.com There are more than 100 R tips at http://pj.freefaculty.org/R/Rtips.html, which provide quick examples to small challenges in everyday R programming, especially for users switching from other languages to R. There is also a .PDF version for … Continue reading →

Read more »

More PubMed data mining: looking at top 20 CBT journals

April 26, 2012
By
More PubMed data mining: looking at top 20 CBT journals

In this short article I present some data of the top 20 Cognitive Behavior Therapy (CBT) journals with the most PubMed publications, and compare that to data from 2010 and 2011.

Read more »

Installing R packages without admin rights on MS Windows

April 26, 2012
By
Installing R packages without admin rights on MS Windows

Is there a life outside the office?Photo: Markus GesmannIt is not unusual that you will not have admin rights in an IT controlled office environment. But then again the limitations set by the IT department can spark of some creativity. And I have to ad...

Read more »

Graphing Predicted Legislative Violence with Zelig & ggplot2

April 25, 2012
By
Graphing Predicted Legislative Violence with Zelig & ggplot2

In my previous post I briefly mentioned an early draft of a working paper (HERE) I've written that looks into the possible causes of violence between legislators (like the violence shown in this picture from the Turkish Parliament).  From The GuardianIn this...

Read more »

Late-April flotsam

April 25, 2012
By
Late-April flotsam

It has been month and a half since I compiled a list of statistical/programming internet flotsam and jetsam. Via Lambda The Ultimate: Evaluating the Design of the R Language: Objects and Functions For Data Analysis (PDF). A very detailed evaluation … Continue reading →

Read more »

LeaRning R

April 25, 2012
By

If, like me, you're still an R novice, you'll no doubt find this post on Pairach Piboonrungroj's blog extremely helpful. Among other things, Pairach provides links to 20 40 "R tutorials". It's a really nice resource!H.T. to David Smith for posting about this on the Revolutions blog.© 2012, David E. Giles

Read more »

Big Data, R and HANA: Analyze 200 Million Data Points and Later Visualize in HTML5 Using D3 – Part II

April 25, 2012
By
Big Data, R and HANA: Analyze 200 Million Data Points and Later Visualize in HTML5 Using D3 – Part II

In my last blog, Big Data, R and SAP HANA: Analyze 200 Million Data Points and Later Visualize Using Google Maps, I analyzed historical airlines performance data set using R and SAP HANA and put the aggregated analysis on Google Maps.  Undoub...

Read more »

Live Longer – Choose Your Country Wisely (if you can)

April 25, 2012
By
Live Longer – Choose Your Country Wisely (if you can)

Full democracy countries are the ones in which to live. This week's story could start and end with the above graph with almost no further explanation. But that wouldn't do it justice. So, like so many of the past articles on "Graph o...

Read more »

useR! 2012 Conference

April 25, 2012
By

“Spatial data is, quite literally, everwhere” (Barry Rowlingson) this is so true! And because of that you guys will have the chance to take part in a great tutroial on using R for managing geospatial data, transforming, making maps and working with OGC standards. So visit this years useR! conference at Vanderbilt University; Nashville, Tennessee,

Read more »

20 free R tutorials (and one reference card)

April 25, 2012
By

If you're just getting started with the R language, R user Pairach Piboonrungroj has published a handly list of 20 free R tutorials published by university departments. Included on the list: Getting Started with the R Data Analysis Package (by Norm Matloff), Getting Started with R (from York University), and An Introduction to R (by Phil Spector). Paraich also...

Read more »

Reproducible Research: Running odfWeave with 7-zip

April 25, 2012
By

odfWeave is an R-package that is used for making dynamic reports by Sweave processing of Open Document Format (ODF) files. For anyone new to report generation and lacking knowledge of markup languages this might be a good starting point or even a true ...

Read more »

Short versus long papers, in academic journals

April 24, 2012
By
Short versus long papers, in academic journals

This Monday, during my talk on quantile regressions (at the Montreal R-meeting), we've seen how those nice graphs could be interpreted, with the evolution of the slope of the linear regression, as a function of the probability level. One illustrati...

Read more »

Projects in RStudio

April 24, 2012
By
Projects in RStudio

Now that I have one enormous project on the go and one smaller one, I find it’s helping me considerably to have each project stored in separate RStudio projects.  So, each project has its own scripting that I’ve been working … Continue reading →

Read more »

R, Julia and genome wide selection

April 24, 2012
By
R, Julia and genome wide selection

— “You are a pussy” emailed my friend. — “Sensu cat?” I replied. — “No. Sensu chicken” blurbed my now ex-friend. What was this about? He read my post on R, Julia and the shiny new thing, which prompted him … Continue reading →

Read more »

Insights into Quantile Regression from Arthur Charpentier

April 24, 2012
By
Insights into Quantile Regression from Arthur Charpentier

At this Monday’s Montreal R User Group meeting, Arthur Charpentier gave an interesting talk on the subject of quantile regression. One of the main messages I took away from the workshop was that quantile regression can be used to determine if extreme events are becoming more extreme. The example given was hurricane intensity since 1978.

Read more »

Varying Window Length for Linear Models on Stocks

April 24, 2012
By
Varying Window Length for Linear Models on Stocks

In a previous post, we discussed ideas generated by a Timely Portfolio post about Linear Models on Stock. I wanted to see if there was a relationship between the window length of the running mean of the linear regression slope estimate and the running mean of the correlation between fitted and observed values. The parameters

Read more »

How to remember point shape codes in R

April 24, 2012
By
How to remember point shape codes in R

I suspect I am not unique in not being able to remember how to control the point shapes in R. Part of this is a documentation problem: no package ever seems to write the shapes down. All packages just use the “usual set” that derives from S-Plus and was carried through base-graphics, to grid, lattice Related posts:

Read more »

Heat map visualization of sick day trends in Finland with R, ggplot2 and Google Correlate

April 24, 2012
By
Heat map visualization of sick day trends in Finland with R, ggplot2 and Google Correlate

Inspired by Margintale’s post “ggplot2 Time Series Heatmaps” and Google Flu Trends I decided to use a heat map to visualize sick days logged by HeiaHeia.com Finnish users. I got the data from our database, filtering results by country (Finnish users only) in a tab separated form with the first line as the header. Three columns

Read more »

Rmetrics financial engineering workshop

April 24, 2012
By

For those looking for an in-depth workshop on financial engineering with R, look no further than the R/Rmetrics Workshop and Summer School held annually in beautiful Meielisalp, Switzerland. This is an intimate workshop limited to around 50 participants, and features tutorials from leading practitioners in finance with R. This year's workshop takes plase June 24-28. You can find the...

Read more »