Making Color Ramps in Matlab

October 9, 2012
By

When visualizing an array of data in a heatmap, a good color map makes a world of difference.Thanks to my work in 'omics (i.e. transcriptomics - microarrays and RNASeq) I've looked at a lot of heatmaps over the past couple of years, and generated quite...

Read more »

A brief script on Geographical data analysis in R

A brief script on Geographical data analysis in R

I saw this post and I decided to replicated that good example but with data closer to me, particulary data of my country. So, I've got the shape data of the capital of my country (You can download the data from here). The data comes from the 2002 CENSO...

Read more »

Permanent Portfolio – Transaction Cost and better Risk Parity

October 9, 2012
By
Permanent Portfolio – Transaction Cost and better Risk Parity

I want to address comments that were asked in my last post, Permanent Portfolio – Simple Tools, about Permanent Portfolio strategy. Specifically: The impact of transaction costs on the perfromance and Create a modified version of risk allocation portfolio that distributes weights across 3 asset classes: stocks(SPY), gold(GLD), and treasuries(TLT), and only invests into cash(SHY)

Read more »

Revised graph for BAC 1% cash rewards

October 9, 2012
By
Revised graph for BAC 1% cash rewards

Thanks to Anthony Damico who alerted me to an error in the Bank of America's 1% cash rewards figure in my last post. The plot shows the true reward percentage as a function of purchase amount. A key "feature" of the BoA reward program is that the full 1% is only awarded for purchases in

Read more »

Age-Period-Cohort models and the decline of violence

October 9, 2012
By
Age-Period-Cohort models and the decline of violence

Ever since the end of the Mexican Revolution and the Cristero War violence in Mexico inched down in fits and starts from a high of about 60 homicides per 100,000 people to its lowest level sometime during the middle of the last decade (there's some uncertainty about the number of homicides in...

Read more »

R User Group Sponsorship: Applications open for 2013

October 9, 2012
By

Here at Revolution Analytics, we're proud to have been a part of fostering local user groups for R users around the world. Last year we sponsored more than 40 groups, with representation from every continent around the world. And we're thrilled to see the continued growth in user groups: this time last year there about 50 R user groups;...

Read more »

Git Error when pushing with a large file

October 9, 2012
By

Quick Note: I had an error recently where RStudio nor EGit nor the command line would push my repo to github.  I can't remember the exact error, although after some googling I found this SO answer that solved it git config http.postBuffer 52428800...

Read more »

Error Handling in R

October 9, 2012
By
Error Handling in R

It’s often the case that I want to write an R script that loops over multiple datasets, or different subsets of a large dataset, running the same procedure over them: generating plots, or fitting a model, perhaps. I set the script running and turn to another task, only to come back later and find the Related posts:

Read more »

Debugging parameter mismatch across RAC database instances with R, dba_hist, and gv$parameter

October 9, 2012
By
Debugging parameter mismatch across RAC database instances with R, dba_hist, and gv$parameter

Did you find this post useful?  Does your organization need Oracle services?  We can help.   Much of this morning went into investigating strange ADDM reports on a two-node Oracle RAC database.  For some reason, there were statistically improbable differences…Read more ›

Read more »

Conditional Colors and Shapes in plot() with ifelse()

October 9, 2012
By
Conditional Colors and Shapes in plot() with ifelse()

Here’s an example of how to color your plot shapes and pch using an ifelse() statement. Ifelse() is handy as it creates an easy way to branch a function. The syntax is: ifelse(logical.condition, option1, option2). Meaning, if the logical condi...

Read more »

What’s in My Pocket? Read it now! (or Read It Later)

October 9, 2012
By
What’s in My Pocket? Read it now! (or Read It Later)

IntroductionYou know what's awesome? Pocket.I mean, sure, it's not the first. I think Instapaper existed a little before (perhaps). And there are alternatives, like Google Reader. But Pocket is still my favorite. It's pretty awesome at what it does.Pocket (or Read It Later, as it used to be known) has fundamentally changed the...

Read more »

New season of Grey’s Anatomy with Exponential Random Graph Models

October 8, 2012
By

In a previous post we used the web of sexual contacts among characters on the Grey’s Anatomy television show to look at some social network analysis using R. To celebrate the beginning of the new season, Ben Lind has put … Continue reading →

Read more »

Laplace’s liberation army

October 8, 2012
By
Laplace’s liberation army

Great as it is, Google does not always give the "best", or "right" (ie "most appropriate") results on a given search. For example, if you google "jags" (using google.co.uk), the first results are a girls' independent school and a sports club. The real JAGS (OK: real in a geeky, nerdy, statistical sense) only comes...

Read more »

lubridate 1.2.0 now on CRAN

October 8, 2012
By
lubridate 1.2.0 now on CRAN

The latest version of lubridate offers some powerful new features and huge speed improvements. Some areas, such as date parsing are more than 50 times faster. lubridate 1.2.0 also fixes those pesky NA bugs in 1.1.0. Here’s some of what you’ll find: Parsers can now handle a wider variety date formats, even within the same

Read more »

In case you missed it: September 2012 Roundup

October 8, 2012
By

In case you missed them, here are some articles from September of particular interest to R users. You can now browse the R-devel sources and changelogs at GitHub. R is used to create a 3-D animation of the Antarctic ice cap. At the DataWeek SF conference, R users from eBay, Intuit, Minted and other companies describe how R is...

Read more »

Another R + iButton script

October 8, 2012
By

Previously I’ve detailed R scripts that automate the launching and downloading Maxim iButton thermochron data loggers. I’m typically doing the launching and downloading at separate times in my workflow, since I have duplicate iButtons to swap out, so separate scripts work for me. Ryan Knowles recently contributed a combined version of these scripts that downloads

Read more »

Product revenue prediction with R – part 2

October 8, 2012
By
Product revenue prediction with R – part 2

After development of predictive model for transactional product revenue -(Product revenue prediction with R – part 1), we can further improvise the model prediction by modifications in the model. In this post, we will see what are the steps required for model improvement. With the help of a set of model summary parameters, the data

Read more »

Product revenue prediction with R – part 3

October 8, 2012
By
Product revenue prediction with R – part 3

After development and improvement  of predictive model with R (as in the previous blog), I have focused here about making a prediction with the R model ( linear regression model ) and comparison with the Google prediction API model. In statistical modeling, R will calculate intercept and variable coefficients to describe the relationship between a

Read more »

Product revenue prediction with R – part 1

October 8, 2012
By
Product revenue prediction with R – part 1

In my upcoming three blogs, I am going to discuss about how Product managers, Data analyst and Data scientists can develop model for the prediction of the transactional product revenue on the basis of user actions like total numbers of time product added to the cart, total numbers of time product added to the cart,

Read more »

Two ways that correlation and stepwise regression can give different results

October 8, 2012
By

In general, a correlation test is used to test the association between two variables (y and z). However, if there is a third variable (x) that might be related to z or y, it makes...

Read more »

Summarizing Data

October 8, 2012
By
Summarizing Data

In this post, I'll go over four functions that you can use to nicely summarize your data.  Before any regression analysis, a descriptive analysis is key to understanding your variables and the relationships between them.  Next week, I'll have...

Read more »

Example 10.5: Convert a character-valued categorical variable to numeric

October 8, 2012
By
Example 10.5: Convert a character-valued categorical variable to numeric

In some settings it may be necessary to recode a categorical variable with character values into a variable with numeric values. For example, the matching macro we discussed in example 7.35 will only match on numeric variables. One way to conve...

Read more »

DIY ZeroAccess GeoIP Analysis : So What?

October 8, 2012
By
DIY ZeroAccess GeoIP Analysis : So What?

NOTE: A great deal of this post comes from @jayjacobs as he took a conversation we were having about thoughts on ways to look at the data and just ran like the Flash with it. Did you know that – if you’re a US citizen – you have approximately a 1 in 5 chance of getting the

Read more »

CrowdANALYTIX – Ideation Contest – Warranty Pricing

October 8, 2012
By

I recently completed an ideation contest on CrowdANALYTIX where the participants had to build an approach towards warranty pricing and fraud detection.Ideation contests are quite different from the usual data mining contests where the objective is...

Read more »

Functions for plotting and getting Greek in labels

October 8, 2012
By
Functions for plotting and getting Greek in labels

The problem: We often want to plot data and assign plot attributes based on characteristics of the data. For example, if we have a group of students with the following IQs, we might want to indicate who is an outlier in the statistical sense. I like...

Read more »

S&P 500 correlations up to date

October 8, 2012
By
S&P 500 correlations up to date

I haven’t heard much about correlation lately.  I was curious about what it’s been doing. Data The dataset is daily log returns on 464 large cap US stocks from the start of 2006 to 2012 October 5. The sector data were taken from Wikipedia. The correlation calculated here is the mean correlation of stocks among … Continue reading...

Read more »

GBIF biodiversity data from R – more functions

October 8, 2012
By
GBIF biodiversity data from R – more functions

We have been working on an R package to get GBIF data from R, with the stable version available through CRAN here, and the development version available on GitHub here. We had a Google Summer of code stuent work on the package this summer - you can se...

Read more »

Presidential Candidate Sentiment Analysis

October 7, 2012
By
Presidential Candidate Sentiment Analysis

After watching the Presidential debates and hearing all the opinions on how the candidates performed, I got the hair brained idea of creating a simple function that would do automate the pulling down of tweets for each candidate, analyze the positivity or negativity of tweets, and then graph them out. This project turned out to

Read more »

SPIDER makes the top 10 barcoding publications of 2012

October 7, 2012
By
SPIDER makes the top 10 barcoding publications of 2012

In the recent Barcode Bulletin published by iBoL, our humble paper announcing the R package spider: Species identity and evolution made second on their list of the top 10 publications of 2012. Not bad for a side project! Spider is available for downl...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.