Monthly Archives: September 2011

KDNuggest: R most commonly used software for data mining & analytics

September 5, 2011
By
KDNuggest: R most commonly used software for data mining & analytics

In a poll with 570 respondents conducted last month at KDNuggets, the R software was the most frequent response to the question, "What programming languages you used for data mining / data analysis in the past 12 months?". The results are tabled below (respondents could select more than one response): In another poll conducted earlier this year, KDNuggets also...

Read more »

Review of “Risk and Meaning” by Nicolas Bouleau

September 5, 2011
By
Review of “Risk and Meaning” by Nicolas Bouleau

The subtitle is: Adversaries in Art, Science and Philosophy. Executive Summary Genius or madness? I haven’t decided. Irreversibility of interpretation The book drives home that once we decide how something is we can’t go back to our state of innocence. Figures 1 through 3 exhibit this idea via a randomly generated polygon.  Look at Figure … Continue reading...

Read more »

A misleading title…

September 4, 2011
By
A misleading title…

When I received this book, Handbook of fitting statistical distributions with R, by Z. Karian and E.J. Dudewicz,  from/for the Short Book Reviews section of the International Statistical Review, I was obviously impressed by its size (around 1700 pages and 3 kilos…). From briefly glancing at the table of contents, and the list of standard

Read more »

googleVis 0.2.9

September 4, 2011
By
googleVis 0.2.9

We have published googleVis 0.2.9 on CRAN. The new version updates the package for the new features of the Google Visualisation API and brings an new in-page editor option. Here is a simple example, displaying the participants of the R user Conference...

Read more »

Ladies and Gents: GDP has finally gotten its long awaited forecast

September 4, 2011
By
Ladies and Gents: GDP has finally gotten its long awaited forecast

Today we will be finally creating our long awaited GDP forecast.  In order to create this forecast we have to combine both the forecast from our deterministic trend model and the forecast from our de-trended GDP model. Our model for the trend is:t...

Read more »

Scatter plots with images

September 4, 2011
By

Edward Tufte has written extensively on the presentation of data covering good and bad practice. He has made a number of suggestions for adaptations of regularly used graph types to assist with the interpretation and understanding of data. One idea for enhancing scatter plots covered in Tufte’s book Beautiful Evidence is the use of images

Read more »

Microfinance in India: Getting a sense of the geographic distribution

September 3, 2011
By
Microfinance in India: Getting a sense of the geographic distribution

I am working on a review paper on microfinance in India and use data from the MIX market. Today, I was amazed by how quick I conjured a map of India with the headquarters of the microfinance institutions that report data to the MIX market depicted on that map. Ideally, I would have more geolocation

Read more »

The Problems with Pairing R + Java

A core focus of the RTextTools project has been to make the package as accessible and user-friendly as possible. In its early iterations, the package contained dependencies such as RWeka, openNLP, and

Read more »

An example of ROC curves plotting with ROCR

September 3, 2011
By
An example of ROC curves plotting with ROCR

Decided to start githib with ROC curve plotting example. There is not a one ROC curve but several - according to the number of comparisons (classifications), also legend with maximal and minimal ROC AUC are added to the plot. ROC curves and ROC AU...

Read more »

rmongodb – R Driver for MongoDB

September 3, 2011
By

The source code to rmongodb (home page at http://cnub.org/rmongodb.ashx), a driver to MongoDB for the R language, has been released as open source at GitHub: https://github.com/gerald-lindsly/rmongodb.  This portable full-featured package was developed on top of the mongodb.org supported C driver. It runs almost entirely in native code so you can expect high performance.  Plans are to submit rmongodb to CRAN soon for pre-built binary distribution, but first I would...

Read more »