2729 search results for "GIS"

Importing google news data to R

July 6, 2011

I've been playing around lately with the stock market data available from Google Finance, through quantmod in R. Here's a function I've written (which depends on the R Data Science Toolkit) to pull news stories related to a stock from Google, parse t...

Read more »

Early stopping and penalized likelihood

July 6, 2011

Maximum likelihood gives the best fit to the training data but in general overfits, yielding overly noisy parameter estimates that don't perform so well when predicting new data. A popular solution to this overfitting problem takes advantage of the iterative nature of most maximum likelihood algorithms by stopping early. In general, an iterative optimization algorithm goes from a...

Read more »

In 4 Steps your Application (including R) is running on a Cloud Computing Cluster

Today, cloud computing is used in many application areas, from academic research to industry. Commercial cloud providers such as Amazon Web Services (AWS) advertise simple and fast access to cloud computing resources. Posts in different blogs prove that you can get your application running in the cloud, but it will cost you more than 15

Read more »

R performance optimization

July 3, 2011

The blog "The Average Investors Blog" posted a nice report about accelerating a default Debian R installation, and its author added some details about his benchmarks in the comment section.

Read more »

Wikipedia for Kaggle Participants

July 1, 2011

Kaggle has released a new data-mining challenge: use data from 10 years of Wikipedia edits to predict future edit rates. The dataset has been anonymized to obscure editor identity and article identity, simultaneously adding focus to the challenge and robbing the dataset of considerable richness. I have some experience with Wikipedia…

Read more »

Weighting and prediction in sample surveys

July 1, 2011

A couple of years ago Rod Little was invited to write an article for the diamond jubilee of the Calcutta Statistical Association Bulletin. His article was published with discussions from Danny Pfefferman, J. N. K. Rao, Don Rubin, and myself. Here it all is. I'll paste my discussion below, but it's worth reading the others' perspectives too. Especially...

Read more »

roll calls, ideal points, 112th Congress

June 29, 2011

Now that classes are over, I took a little time to update my scripts that refresh the analysis of Congressional roll calls in close to real time. Links appear at the top of the blog. As of about 15 minutes ago, we’re up to 77 non-unanimous roll calls in the 112th Senate.

Read more »

Monitoring Sources of Bond Returns with ML/BAC Corporate OAS and CPI

June 28, 2011

In response to the nice comment requesting an update to Monitoring Sources of Bond Return and a longer history, I thought I would update the original and then rerun it with CPI to give a longer time series. For even longer history back to 1919, s...

Read more »

RghcnV3 version 1.1

June 27, 2011

I’ve just uploaded version 1.1 of the package RghcnV3 to CRAN. I’ve made a few changes that should make it easier for some folks to use. First, I removed the requirement for rgdal; at present “rgdal” is not required. On the Mac, installing it can be a little trouble, but if you RTFM

Read more »

New cloudnumbers.com release

June 27, 2011

We are very proud to announce cloudnumbers.com release number 5! Over the last few days we rolled out several releases and bug fixes. Cloudnumbers.com now supports many more features and has an optimized startup process. This is a list of our main new features: Bioconductor packages for the R application can be

Read more »