hacking .gov shortened links

July 30, 2011

This past Friday, the web portal to the US Federal government, USA.gov, organized hackathons across the US for programmers and data scientists to work with and analyze the data from their link-shortening service. It turns out that if you shorten a .gov web link with bit.ly, the shortened link looks like 1.usa.gov/V6NpL (that one goes to

Read more »

RStudio 0.94.92 visited

July 30, 2011

I just updated my RStudio version to the latest, v.0.94.92 (will this asymptotically approach 1, or actually get to 1?). It was nice to see the number of improvements the development team has implemented, based, I’m sure, on community feedback. The team has, in my experience, been extraordinarily responsive to user feedback, and I’m sure

Read more »

Forking Myself

July 29, 2011

I’ve spent some time forking myself. Over the past few days, when I could steal away an hour here or there, I decided to make a big change to the package. But it’s a good change. First, some book-keeping. The Romantest.R file has a minor bug in it. Not really a bug, I just pulled

Read more »

Splitting Vectors of Uneven Strings

July 29, 2011

Suppose you have a vector of names such that the first three words of each element contain relevant information, but they are followed by a bunch of extraneous stuff. Our goal is to collapse the first three words into one contiguous string (without the ...

Read more »

Text Editors in The Lord of the Rings

July 29, 2011

Prompted by a passing thought about TextMate, I thought I'd make a comprehensive, accurate, unbiased, and irrefutable survey of text editors by way of comparison to locations in The Lord of the Rings. TextMate: Minas Tirith. A once-great but now decaying city. Only the King has the power to renew it, but he is long absent, indeed...

Read more »

Text Editors in The Lord of the Rings

July 29, 2011

Prompted by a passing thought about TextMate, I thought I’d make a comprehensive, accurate, unbiased, and irrefutable survey of text editors by way of comparison to locations in The Lord of the Rings. TextMate: Minas Tirith A quiet, long-overlooked land populated by simple folk who keep mostly to themselves. They are somewhat set in their

Read more »

NppToR 2.6.0 beta 2

July 29, 2011

http://sourceforge.net/projects/npptor/files/npptor%20installer/NppToR-2.6.0.beta2.exe/download I’ve released beta 2 of NppToR 2.6.0. Please take a look and report any problems. This release improves the installer and the uninstaller and fixes a few bugs that popped up from the transition to Unicode.

Read more »

The Road to Default: Whaa???

July 29, 2011

Okay, so here is what has been happening: The yield curve has been going through a mad flattening, indicating that investors are "flying to safety" and that a recession may be looming around the corner. Why has it been flattening? Well, a string of bad n...

Read more »

multi-platform real-time ‘intro’ in R using rdyncall

July 29, 2011

Guest post by Daniel Adler. Below is a real-time audio-visual multimedia demonstration, or in short ‘an intro’, written in 100% pure R. It requires no compilation and runs across major platforms via the package rdyncall and preinstalled, precompiled standard libraries such as OpenGL and SDL. This ‘happy-birthday’ production runs about 3 minutes

Read more »

Financial Engineering with R

July 29, 2011

At the InformationManagement blog, Steve Miller talks about the applications of R to financial engineering, and reviews David Ruppert's book Statistics and Data Analysis for Financial Engineering. InformationManagement: Statistics and Financial Engineering

Read more »

Infovis vs. statgraphics: A clear example of their different goals

July 29, 2011

I recently came across a data visualization that perfectly demonstrates the difference between the “infovis” and “statgraphics” perspectives. Here’s the image (link from Tyler Cowen): That’s the infovis. The statgraphic version would simply be a dotplot, something like this: (I purposely used the default settings in R with only minor modifications here to demonstrate what ...

Read more »

[R][ggplot2][R-bloggers]RcmdrPlugin.KMggplot2_0.0-3 is on CRAN now

July 28, 2011

RcmdrPlugin.KMggplot2 (CRAN) I posted an Rcmdr plug-in for a “ggplot2” GUI front-end on CRAN. This version supports Kaplan-Meier plots and other plots, as follows: Kaplan-Meier plot Show no. at risk on inside Show no. at risk table on outside Histogram Colo

Read more »

Text Editors in The Lord of the Rings

July 28, 2011

Prompted by a passing thought about TextMate, I thought I’d make a comprehensive, accurate, unbiased, and irrefutable survey of text editors by way of comparison to locations in The Lord of the Rings. TextMate: Minas Tirith. A once-great but now decaying city. Only the King has the power to renew it, but he is long absent, indeed...

Read more »

Challenge alert — material identification

July 28, 2011

We start yet another series of posts: challenge alerts. This series is intended to share news about machine learning or data mining challenges which may be interesting to the members of our community, possibly with some brief introduction to the problem. So if you hear about some contest, notify us on Skewed distribution. Today

Read more »

Le Monde puzzle [#29]

July 28, 2011

This week, the puzzle from the weekend edition of Le Monde was easy to state: in the sequence (8+17n), is there a 6th power? a 7th? an 8th? If so, give the first occurrence. So I first wrote R code for a function testing whether an integer is any power: (The function returns the

Read more »
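The puzzle in this excerpt can also be attacked with modular arithmetic rather than by testing integers one at a time. The following is a minimal Python sketch, not the post's R function: the name `first_power_in_sequence` is hypothetical, and it assumes the sequence runs over n ≥ 0. It relies on the fact that k^p mod 17 depends only on k mod 17, so scanning k = 1..17 either finds the smallest base or shows that no p-th power lies in the sequence.

```python
MODULUS = 17  # step of the sequence 8 + 17n
OFFSET = 8    # first term of the sequence (n = 0)


def first_power_in_sequence(p):
    """Return (k, n) for the smallest k with k**p == 8 + 17*n, or None.

    Hypothetical helper for illustration; assumes n >= 0.
    """
    # k**p mod 17 depends only on k mod 17, so one full cycle of
    # residues is enough to decide whether any solution exists.
    for k in range(1, MODULUS + 1):
        value = k ** p
        if value % MODULUS == OFFSET:
            return k, (value - OFFSET) // MODULUS
    return None


if __name__ == "__main__":
    for p in (6, 7, 8):
        print(p, first_power_in_sequence(p))
```

A result of `None` means that no residue class mod 17 works for that exponent, so the sequence contains no such power at all.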

Pattern Recognition: forward Boxplot Trajectories using R

July 28, 2011

Although the following discussion can apply to the Quantitative Candlestick Pattern Recognition series, it is addressing the same issue as any basic conditional type system -- how and when to exit.  The following is one way to visualize and think ...

Read more »

Program for useR! 2011 available

July 28, 2011

The final program for the worldwide user conference, useR! 2011, is now available as a downloadable booklet (PDF, 7 MB). Revolution Analytics is very proud to sponsor this annual gathering of R users from around the world, and the program includes an outstanding lineup of speakers from the R Core Group, package developers, users in industry and academia, and the...

Read more »

Tweets vs. Likes: What gets shared on Twitter vs. Facebook?

July 28, 2011

It always strikes me as curious that some posts get a lot of love on Twitter, while others get many more shares on Facebook: What accounts for this difference? Some of it is surely site-dependent: maybe one blogger has a Facebook page but not a Twitter account, while another has these roles reversed. But even

Read more »

Getting rid of white space at the beginning and end of a string

July 28, 2011

There are situations where we are working with character strings extracted from various sources, and it can be annoying when there is white space at the beginning and/or end of the strings. This white space can cause problems when attempting to sort, subset, or perform various other common operations. The stringr package has a handy function str_trim

Read more »
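To make the sorting problem from this excerpt concrete, here is a small Python sketch. It is an analogue of the idea, not the stringr code the post describes: it uses Python's built-in `str.strip()` in place of `str_trim`, and the helper name `trim_all` and the sample data are made up for illustration.

```python
def trim_all(strings):
    """Strip leading and trailing white space from every string.

    Hypothetical helper; str.strip() stands in for stringr's str_trim.
    """
    return [s.strip() for s in strings]


# Made-up sample data with stray white space at either end.
raw = ["  banana", "apple ", " cherry  "]

if __name__ == "__main__":
    # Untrimmed strings sort by their leading spaces, not by their letters.
    print(sorted(raw))
    # After trimming, the sort order follows the visible text.
    print(sorted(trim_all(raw)))
```

Trimming before sorting or subsetting keeps entries that look identical on screen from being treated as different values.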

I can’t resist a word cloud: now using R!

July 28, 2011

The wordcloud package is word clouds for R with a difference: they look great. Of course, having just analysed online coverage of the ISMB conference, I had to run all 6 906 comments from the 2008-2011 meetings through some code. If you followed along via the Sweave code, I went as far as generating the

Read more »

More S&P 500 correlation

July 28, 2011

Here are some additions to the previous post on S&P 500 correlation. Correlation distribution: Before, we only looked at mean correlations. However, it is possible to see more of the distribution than just the mean. Figures 1 and 2 show several quantiles: 10%, 25%, 50%, 75%, 90%. Figure 1: Quantiles of 50-day rolling correlation of …

Read more »

Displaying Missouri sex offender/child day care facility proximity map using batchgeo.com

July 28, 2011

Computer Assisted Reporting This is the last of four articles about analyzing distances between sex offenders and child daycare centers in Missouri as part of a joint project with KSHB NBC Action News in Kansas City. The previous two articles gave deta...

Read more »

Marketing optimization using the nonlinear minimization function nlm

July 28, 2011

Guest post by Bob Agnew. Introduction: Marketing optimization consists of assigning offers to prospects in order to maximize total expected profit subject to a few general linear constraints and the requirement that a prospect receives at most one offer. What distinguishes these problems is their sheer size. With millions of prospects, brute-force linear solvers are unsuitable. ...

Read more »

Computing distance matrix between Missouri sex offenders and child daycare facilities

July 28, 2011

Computer Assisted Reporting This is the third of four articles about analyzing distances between sex offenders and child daycare centers in Missouri as part of a joint project with KSHB NBC Action News in Kansas City. The previous article explained how...

Read more »

Core not in CiRM

July 27, 2011

Despite not enjoying this year the optimal environment of CiRM, we are still making good progress on the revision (or the R vision) of Bayesian Core. In the past two days, we went over Chapters 1 (Introduction), 2 (Normal Models), 5 (Capture-Recapture Experiments), and 6 (Mixture Models), with Chapters 3 (Regression), 4 (Generalised Linear Models)

Read more »

Creating Financial Instrument metadata in R

July 27, 2011

(This is a guest post by Ilya Kipnis.) When trading stocks in a single currency, instrument metadata can be safely ignored because the multiplier is 1 and the currencies are all the same. When doing analysis on fixed income products, options, futures, or other complex derivative instruments, the data defining the properties of these instruments becomes critical to tasks...

Read more »

Join the Reserves

July 27, 2011

Most forget that the tremendous macro imbalances caused by the 10 trillion in foreign reserves are a phenomenon just 14 years old, but the results have been and will be profound. The buying started after the Asia Pacific collapse of 1997, and the As...

Read more »

Analysis of ISMB coverage at FriendFeed: 2008 – 2011

July 27, 2011

ISMB/ECCB 2011 was held July 15-19 this year and, as in previous years, FriendFeed was used to cover the meeting. Last year, I wrote a post about how to use R to analyse the coverage. I was planning something similar for 2011 when I thought: we have 4 years of ISMB at FriendFeed now

Read more »