Two browsers for R help documentation

June 29, 2011
By
Two browsers for R help documentation

The same excellent documentation for R commands is available through two different help browsers: text and HTML, and let’s see how how each looks, works, and how to switch the default. Look and feel Here is how both look for … Continue reading →

Read more »

roll calls, ideal points, 112th Congress

June 29, 2011
By
roll calls, ideal points, 112th Congress

Now that classes are over, I took a little time to update my scripts that update the analysis of Congressional roll calls in close to real time.   Links appear at the top of the blog.   As of about 15 minutes ago, we’re up to 77 non-unanimous roll calls in the 112th Senate.  

Read more »

A simple ggplot2 scatterplot

June 29, 2011
By

Here’s a bit of code used to produce one of the figures in my recent paper dealing with modeling rocky intertidal snail body temperatures. This was my first foray into ggplot2, and it only involved a few hours of head-scratching. The plot is a co...

Read more »

Putting together multinomial discrete regressions by combining simple logits

June 29, 2011
By

When predicting 0/1 data we can use logit (or probit or robit or some other robust model such as invlogit (0.01 + 0.98*X*beta)). Logit is simple enough and we can use bayesglm to regularize and avoid the problem of separation. What if there are more than 2 categories? If they’re ordered (1, 2, 3, etc),

Read more »

Stata 12 embraces structural equation models

June 28, 2011
By
Stata 12 embraces structural equation models

Stata 12 has just been announced. The software will start shipping by the end of July.  A key new feature introduced in the new version is the module for structural equation models (SEM), a staple tool in marketing, psychology, and several other research disciplines.LISREL and AMOS have...

Read more »

Saving Chunks of SSURGO Data in SoilWeb for Google Earth

June 28, 2011
By
Saving Chunks of SSURGO Data in SoilWeb for Google Earth

SoilWeb is an interactive, multifaceted interface to USDA-NCSS soil survey information. Our SoilWeb application for Google Earth streams soil map units and point data as you navigate across the lower '48 states. Currently, our system imposes a 30,000 ...

Read more »

Synctex with Sweave/pgfSweave in TeXShop/TeXWorks

June 28, 2011
By

Ever been editing an .Rnw (Sweave) file and tried to sync a pdf with the source in TeXShop (or TeXWorks) and had it open the .tex file? This is because the synctex information (in the .synctex.gz file) is messed up. Both TeXShop and TeXWorks support synctex, that means that if everything is groovy, we should

Read more »

Benchmarking Revolution R for data mining

June 28, 2011
By
Benchmarking Revolution R for data mining

The blog Heuristically Andrew puts Revolution R through its paces by running some benchmarks versus open-source R for data mining applications. The benchmarks set out to answer the following question: I recently upgraded my notebook (where I often use R for data mining) and was faced with two questions: for the fastest speed for building models, do I use...

Read more »

p-Values for Cointegration Tests With Breaks in the Data

June 28, 2011
By
p-Values for Cointegration Tests With Breaks in the Data

In an earlier post I went through some econometrics that involved the problem of testing for multivariate cointegration in the case where there are one or more trend-breaks or level-breaks in the time-series data.  Specifically, I talked about the modified Trace tests introduced by Johansen et al. (2000), and I mentioned the really nice discussion of the application of these tests...

Read more »

Visualizing Periodic Data

June 28, 2011
By

Yesterday the Princeton machine learning reading group went through a paper by Tukey on “Some graphic and semigraphic displays”. One issue we talked about at length was Tukey’s idiosyncratic approach to visualizing periodic data in a circular format to emphasize the connections between the “start” and the “end” of the data set. Allison Chaney pointed

Read more »

Slideshow of Graphs since TimelyPortfolio’s November Inception

June 28, 2011
By
Slideshow of Graphs since TimelyPortfolio’s November Inception

I have had a lot of fun blogging at Timely Portfolio over the last 7 months.  Here are all the graphs that I have shown.  Thanks especially to R.

Read more »

Monitoring Sources of Bond Returns with ML/BAC Corporate OAS and CPI

June 28, 2011
By
Monitoring Sources of Bond Returns with ML/BAC Corporate OAS and CPI

In response to the nice comment requesting an update to Monitoring Sources of Bond Return and also longer history, I thought I would update the original and then rerun with CPI to give a longer time series.  For even longer history back to 1919, s...

Read more »

rbold: An R Interface for Bold Systems barcode repository

June 28, 2011
By
rbold: An R Interface for Bold Systems barcode repository

Have you ever wanted to search and fetch barcode data from Bold Systems?I am developing functions to interface with Bold from R. I just started, but hopefully folks will find it useful.The code is at Github here. The two functions are still very buggy,...

Read more »

Topic Modeling the Sarah Palin Emails

June 27, 2011
By
Topic Modeling the Sarah Palin Emails

tl;dr Browse through Sarah Palin’s emails, automagically organized by topic, here. LDA-based Email Browser Earlier this month, several thousand emails from Sarah Palin’s time as governor of Alaska were released. The emails weren’t organized in any fashion, though, so to make them easier to browse, I did some topic modeling (in particular, using latent Dirichlet

Read more »

density()

June 27, 2011
By
density()

Following my earlier posts on the revision of Lack of confidence, here is an interesting outcome from the derivation of the exact marginal likelihood in the Laplace case. Computing the posterior probability of a normal model versus a Laplace model in the normal (gold) and the Laplace (chocolate) settings leads to the above histogram(s), which

Read more »

Benchmarking R, Revolution R, and HyperThreading for data mining

June 27, 2011
By
Benchmarking R, Revolution R, and HyperThreading for data mining

Usually data mining benchmarks measure lift, precision, etc., but wasting analyst time hurts the ROI of any project. I recently upgraded my notebook (where I often use R for data mining) and was faced with two questions: for the fastest … Continue reading →

Read more »

RghcnV3 version 1.1

June 27, 2011
By
RghcnV3 version 1.1

I’ve just uploaded version 1.1 of  the package RghcnV3 to Cran. I’ve made a few changes that should make it easier for some folks to use. First I removed the requirement for rgdal. At the present time “rgdal” is not required. On the MAC installing it can be a little trouble, but if you RTFM

Read more »

Bonds Risk and Return by Rating

June 27, 2011
By
Bonds Risk and Return by Rating

As an extension to the Bond Market as a Casino Game series and Historical Sources of Bond Returns-Comparison of Daily to Monthly, I thought a ggplot of risk and return by decade and Moody’s Rating might be helpful.  Anyone who has read those oth...

Read more »

New cloudnumbers.com release

June 27, 2011
By

We are very proud to announce our cloudnumbers.com release number 5! In the last days we rolled out several releases and bug fixes. Cloudnumbers.com now supports many more features and has an optimized startup process. This is a list of our main and very important new features: Bioconductor packages for the R application can be

Read more »

The R Programming Wikibook

June 27, 2011
By

Visit “The R Programming wikibook” to extend your knowledge about R and to get a lot of introductions how to use it. If you are an R expert and wish to contribute your knowledge and editing skills to the project, then you can learn how to write in wiki-markup and how to edit a wikibook.

Read more »

Example 8.42: skewness and kurtosis and more moments (oh my!)

June 27, 2011
By
Example 8.42: skewness and kurtosis and more moments (oh my!)

While skewness and kurtosis are not as often calculated and reported as mean and standard deviation, they can be useful at times. Skewness is the 3rd moment around the mean, and characterizes whether the distribution is symmetric (skewness=0). Kurtos...

Read more »

A Winking Pink Elephant

June 27, 2011
By
A Winking Pink Elephant

The title of chapter 5 in my Guerrilla Capacity Planning book is, "Evaluating Scalability Parameters," and underneath it you'll see this quote:"With four parameters I can fit an elephant. With five I can make his trunk wiggle." —John von NeumannIn that vein, Guerrilla alumnus Stephen O'C. pointed me at a recent blog post and paper (PDF)...

Read more »

googleVis library on use

June 26, 2011
By
googleVis library on use

Data on the map While surfing around the Internet I accidentally found the googleVis library for R and especially the gvisGeoMap-function which creates a map based on country data.  In a table hockey scene we have a great World Ranking … Continu...

Read more »

NHL Statistics – Goals scored by age

June 26, 2011
By
NHL Statistics – Goals scored by age

NHL Statistics, part 1 Goals scored by age Data Twirling blog gave instructions to how to get NHL statistics data from the website and I saw an opportunity to learn R and statistics with help of data I know and … Continue reading →

Read more »

RObjectTables are AWESOME

June 26, 2011
By

Why isn't everyone using the RObjectTables package? This is the best thing ever! Here's the basic idea of RObjectTables: An environment is an object where you can lookup names and associate them with values. And in particular its where you look up variables used in an expression. But there's no reason you can't take any other object that associates names...

Read more »

Bayesian Fall school in La Rochelle

June 26, 2011
By
Bayesian Fall school in La Rochelle

The French agronomy research institute INRA is organising a Fall school in La Rochelle, Nov. 28 – Dec. 02, on Bayesian methods, oriented towards the applications in food sciences, environmental sciences, and biology. The provisional program (in French) is ■ Initiation aux outils informatiques R et WinBUGS (TP et réalisation de projets sur ordinateur) ■

Read more »

Next Steps: Drafting the R Help Files

With RTextTools now released and the feedback rolling in, the development team is getting the ball rolling on the help documentation for the library. Currently, you cannot access help files about the library or its functions from within R. However, we do offer a draft of a quick start guide in PDF format under the Documentation section of the...

Read more »

NHL Statistics – Goals scored by age

June 26, 2011
By
NHL Statistics – Goals scored by age

NHL Statistics, part 1 Goals scored by age Data Twirling blog gave instructions to how to get NHL statistics data from the website and I saw an opportunity to learn R and statistics with help of data I know and … Continue reading →

Read more »

googleVis library on use

June 26, 2011
By
googleVis library on use

Data on the map While surfing around the Internet I accidentally found the googleVis library for R and especially the gvisGeoMap-function which creates a map based on country data.  In a table hockey scene we have a great World Ranking … Continue reading →

Read more »