R is Hot: Part 5

November 4, 2010
By

This the final installment of a five-part article series. You can download the complete article from the Revolution Analytics website. Building a Business The value of R to business is borne out by the experiences of John Lucker and his team of advanced analytics professionals at Deloitte Consulting LLP. John is a Deloitte Consulting Principal and leads the firm’s...

Read more »

The dead of Juarez

November 4, 2010
By
The dead of Juarez

Ciudad Juárez became the most violent city in Mexico as a consequence of the vicious war between the Juárez Cartel and the Sinaloa Cartel. So far more than 6,500 people have been murdered in this city since the war started. With the Juárez Cartel significantly weakened, 2010 is set to be the most violent year yet... Here are some statistics...

Read more »

The dead of Juarez

November 4, 2010
By
The dead of Juarez

Ciudad Juárez became the most violent city in Mexico as a consequence of the vicious war between the Juárez Cartel and the Sinaloa Cartel. So far more than 6,500 people have been murdered in this city since the war started. With the Juárez Cartel significantly weakened, 2010 is set to be the most violent year yet... Here are some statistics...

Read more »

The Answer Depends on the Question

November 3, 2010
By

To quote from the preface to the first edition in Jeffreys (1961): ‘It is sometimes considered a paradox that the answer depends not only on the observations but on the question; it should be a platitude.’1 Generalized Linear Models : P. ...

Read more »

iPhone App Store Acceptance Time / Download Results

November 3, 2010
By
iPhone App Store Acceptance Time / Download Results

   Complaints about the iPhone App Store are not uncommon among developers.  The submission process is frustrating at best, you can expect arbitrary rejections, and Apple's policies have not always been particularly open or welcoming. &...

Read more »

iPhone App Store Acceptance Time / Download Results

November 3, 2010
By
iPhone App Store Acceptance Time / Download Results

   Complaints about the iPhone App Store are not uncommon among developers.  The submission process is frustrating at best, you can expect arbitrary rejections, and Apple's policies have not always been particularly open or welcoming. &...

Read more »

RMongo: Accessing MongoDB in R

November 3, 2010
By

I recently created RMongo, a database access layer to MongoDB in R as an R package. To install RMongo, download it from https://github.com/quid/RMongo/downloads Run: R CMD install RMongo_0.0.17.tar.gz I tried to mimic the RMySQL commands in RMongo. Bel...

Read more »

Keeping up with election results, with R

November 3, 2010
By
Keeping up with election results, with R

Yesterday's US election is pretty much over now: most of the results are in, the pundits have offered their political analysis, and there's even been a bit of mathematical analysis of the results, too. But last night as the results were flowing in, R user Brock Tibert just wanted to track the results of the Massachusetts governor's race. The...

Read more »

My residuals look weird… aren’t they ?

November 3, 2010
By
My residuals look weird… aren’t they ?

Since I got the same question twice, let us look at it quickly....  Some students show me a graph (from a Poisson regression) which looks like that, and they asked "isn't it weird ?", i.e."residuals are null or positive... this is not what we...

Read more »

Looping through a set of graphics in odfWeave

November 3, 2010
By

At proMENTE social research we often use the odfWeave and Sweave packages for the amazing statistics program R for automating the production of graphics and reports. odfWeave and Sweave are for the OpenOffice and lyx (www.lyx.org) word processors respectively. One problem with this approach arises when you have to produce and insert into your document

Read more »

Looping through a set of graphics in odfWeave

November 3, 2010
By

At proMENTE social research we often use the odfWeave and Sweave packages for the amazing statistics program R for automating the production of graphics and reports. odfWeave and Sweave are for the OpenOffice and lyx (www.lyx.org) word proce...

Read more »

inline 0.3.7

November 3, 2010
By

A bug-fix release 0.3.7 of inline is now on CRAN and at Debian. It fixes a minor bug: when package.skeleton() was called to convert one or more functions created with this package into a package, the corner case of just a single submitted function ...

Read more »

Rcpp 0.8.8

November 2, 2010
By

A bug-fix release 0.8.8 of Rcpp is now available. It is awaiting processing at CRAN, and will be uploaded to Debian once processed at CRAN. In the meantime, sources are available from my local directory here. This release follows on the heels o...

Read more »

Another lottery coincidence

November 2, 2010
By

Last week, the Freakonomics blog in the NYT reported that the Israeli lottery had drawn the same six numbers twice in a month. The seventh "bonus ball" was different, but still: quite a coincidence, right? Cue the quote from an expert to explain just how remarkable this is: Yitzhak Melechson, a statistics professor at the University of Tel Aviv,...

Read more »

SAS vs Open Source, ctd

November 2, 2010
By

Following up on the story from last week, where SAS CEO Jim Goodnight said he "hadn't noticed" competition from open-source alternatives, open-source BI vendor Pentaho's "Chief Geek" James Dixon responds: What this means is that SAS has moved from the Igorance phase to the Ridicule phase of battling open source, they only have Fighting and Losing to go. There...

Read more »

Comments on probabilities

November 2, 2010
By
Comments on probabilities

The only thing I remember from courses I had in probability a few years ago is that we also have to clearly defined the event we want to calculate the probability. On the Freakonomics blog, last week, the Israeli lottery was mentioned (here, see a...

Read more »

Installing rgdal on Mac OS X

November 2, 2010
By
Installing rgdal on Mac OS X

After running a spatial data analysis with R session today, it became apparent that there are one or two teething problems installing the important rgdal package on Mac OS X operating systems. The usual install.packages(“rgdal”) won’t work. My colleague Jon Reades did some digging around to find this solution. I have tested it and it

Read more »

Names of villages, in France

November 2, 2010
By
Names of villages, in France

Keith Briggs published a post here on names of English place name element distribution, which contains almost twenty maps like the one where names ends by -bourn,bourne,burn (here) or -head (there). Actually, it is possible (Robin mentioned that a...

Read more »

intergraph+network: no hacking necessary

November 2, 2010
By

A short update on network+intergraph R packages story: Couple of days ago Carter Butts released a new version of the ‘network’ package (ver. 1.5-1). It has a namespace now. Consequently, the ‘intergraph’ package should work out-of-the-box. There is no need to install my hacked version of the ‘network’ package anymore.

Read more »

How to simulate wind speed time series with R

November 2, 2010
By
How to simulate wind speed time series with R

If you need to generate synthetic wind speed time series, you may find useful the procedure described in  “A Markov method for simulating non-gaussian wind speed time series” by G.M. McNerney and P.S. Veers (Sandia Laboratories, 1985), and “Estimation of extreme wind speeds with very long return periods” by M.D.G Dukes and J.P. Palutikof (Journal

Read more »

ROC – plot

November 1, 2010
By
ROC – plot

There are many implantation in R already of ROC plots (e.g. in the packages PresenceAbsence, ROCR). I just wrote my own very simple script just to get a better understanding of it.

Read more »

Programming with R – Returning Information as a List

November 1, 2010
By

In previous posts (here and here) we created a simple function that returns a single numeric value. In some situations it may be more useful to return a more flexible data type, such as a list object, to provide more information about the calculations that have been performed. We can extend our previous function by changing

Read more »

Google TechTalk on integrating R

November 1, 2010
By

As noted on the Google Open Source Blog last week, R package authors Dirk Eddelbuettel and Romain Francois recently gave a presentation on R at the Googleplex, on various topic related to "bridging" R into other systems. Their 90-minute talk is available for replay on YouTube (as part of the Google TechTalks series), and you can download the slides...

Read more »

An analysis of the Stackoverflow Beta sites

November 1, 2010
By
An analysis of the Stackoverflow Beta sites

In the last six months or so, the behemoth of Q & A sites stackoverflow, decided to change tack and launch a number of other non-computing-language sites. To launch a site in the stackoverflow family, sites have to spend time gathering followers in Area51. Once a site has gained a critical mass, a new StackExchange

Read more »

Interesting packages of new BioC2.7

November 1, 2010
By

NuPoP Nucleosome positioning predictionMulcom Differential expression and false discovery rate calculation through multiple comparisonontoCAT Ontology parsingBHC Bayesian Hierarchical ClusteringiSeq Bayesian Hierarchical Modeling of ChIP-seq...

Read more »

Example 8.12: Bike ride plot, part 1

November 1, 2010
By
Example 8.12: Bike ride plot, part 1

The iPhone app Cyclemeter uses the phone's GPS capability to record location and other data, and infer speed, while you ride. I took a ride near my house recently, and downloaded the data. I'd like to examine my route and my speed. A simple plot of ...

Read more »

ggplot2 change axis labels

November 1, 2010
By
ggplot2 change axis labels

Today, I will try to change the labels of the x and y axis.

Read more »

Modeling sound pressure level of a rifle shot

November 1, 2010
By
Modeling sound pressure level of a rifle shot

Noise can be classified as pollution and lawmakers often (always?) treat it as such. Noise can have different origin points, point source being among the simplest to model. Because noise has broader health implications, being able to understand its propagation, a simple model can further our understanding in toning down or preventing excessive noise burden

Read more »

Choropleth Maps of Presidential Voting

November 1, 2010
By
Choropleth Maps of Presidential Voting

Having always appreciated the red and blue cartograms and cartographs of geographic electoral preferences, such as those made available by Mark Newman, I sought to produce similar maps, but include information about support for non-“state-sponsored” parties, and to extend the coverage back in time. I was able to find county-level presidential election returns going as … Continue reading →

Read more »