A word cloud (or tag cloud) can be a handy tool when you need to highlight the most commonly cited words in a text using a quick visualization. Of course, you can use one of the several online services, such as Wordle or Tagxedo, ...
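A word cloud is, underneath, a table of word frequencies drawn with font sizes proportional to the counts. The following is a minimal sketch of that counting step only, not of any particular service or package; the function name `word_frequencies`, the stopword set, and the sample sentence are all hypothetical.

```python
import re
from collections import Counter

def word_frequencies(text, stopwords=frozenset(), top_n=10):
    """Return the top_n most frequent words as (word, count) pairs."""
    # Lowercase the text and split it into runs of letters/apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    # Count every word that is not in the (hypothetical) stopword set.
    counts = Counter(w for w in words if w not in stopwords)
    return counts.most_common(top_n)

# Made-up sample text, for illustration only.
sample = "the cat sat on the mat and the dog sat too"
top_words = word_frequencies(sample, stopwords={"the", "on", "and"}, top_n=2)
print(top_words)
```

A plotting layer would then map each count to a font size; that part is left out here.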
Sooner or later, every useR will manage to exhaust R’s built-in capabilities and land on CRAN looking for that dream needle in a haystack of 3k+ contributed packages. Probably most of you already know stuff like Task Views or rseek, which make finding something relevant a bit easier than digging through the full list or googling; however, all methods
As a matter of convention, we usually run 3 or 4 chains in JAGS. By default, this gives rise to chains that draw samples from 3 or 4 distinct pseudorandom number generators. I didn’t go and check whether it does things 111,222,333 or 123,123,123, but in any event the “parallel chains” in JAGS are samples
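To make the idea of several chains with separate random streams concrete, here is a small sketch. It is not JAGS and does not reproduce how JAGS chooses its generators: it assumes a toy Metropolis sampler targeting a standard normal, and it gives each chain its own `random.Random` instance with a hypothetical seed (111, 222, 333). All chains here use the same generator algorithm with different seeds, which is a simplification.

```python
import math
import random

def metropolis_chain(seed, n_samples=2000, step=1.0):
    """Toy Metropolis sampler for a standard normal target."""
    rng = random.Random(seed)  # private generator: no state shared across chains
    x = 0.0
    samples = []
    for _ in range(n_samples):
        proposal = x + rng.uniform(-step, step)
        # Log acceptance ratio for the N(0, 1) density (symmetric proposal).
        log_ratio = 0.5 * (x * x - proposal * proposal)
        if rng.random() < math.exp(min(0.0, log_ratio)):
            x = proposal
        samples.append(x)
    return samples

# Three chains, each seeded separately (seeds are illustrative assumptions).
chains = [metropolis_chain(seed) for seed in (111, 222, 333)]
for seed, chain in zip((111, 222, 333), chains):
    print(seed, sum(chain) / len(chain))
```

Re-running a chain with the same seed reproduces it exactly, while different seeds give different sample paths.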
Friday, July 22, is the last day on which you can register for UseR! 2011 at the University of Warwick. The conference runs August 16-18, 2011. You can peruse the book of abstracts and view the draft schedule. I am scheduled to give a talk on “Random input testing with R”. The abstract is: …
The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full July edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Beta Test Revolution R Enterprise 5.0. Are you running R in a Microsoft environment? Revolution...
Today, the AG Statistical Computing released the “Call for a Special Topic on Grid and Cloud Computing” in the journal “Methods of Information in Medicine”. We are inviting submissions for a special topic of Methods of Information in Medicine on “Grid and Cloud Computing Methods in Biomedical Research”. This special topic call originates from a
Maximum likelihood gives the best fit to the training data but in general overfits, yielding overly noisy parameter estimates that don't perform so well when predicting new data. A popular solution to this overfitting problem takes advantage of the iterative nature of most maximum likelihood algorithms by stopping early. In general, an iterative optimization algorithm goes from a...
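A rough sketch of early stopping, under assumptions that are not from the post: the model is a one-variable linear regression fit by gradient descent on squared error (which coincides with maximum likelihood if the noise is assumed Gaussian), and the stopping rule watches a held-out validation set with a hypothetical `patience` parameter. The data are simulated, and a two-parameter model like this barely overfits, so the example shows the mechanics rather than a dramatic gain.

```python
import random

def mse(w, b, data):
    """Mean squared error of the line y = w * x + b on (x, y) pairs."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)

def fit_with_early_stopping(train, valid, lr=0.05, max_iters=5000, patience=20):
    """Gradient descent that stops once validation error stops improving."""
    w, b = 0.0, 0.0
    best_loss, best_w, best_b, best_iter = mse(w, b, valid), w, b, 0
    n = len(train)
    for it in range(1, max_iters + 1):
        # Gradient of the training MSE with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in train) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in train) / n
        w -= lr * grad_w
        b -= lr * grad_b
        valid_loss = mse(w, b, valid)
        if valid_loss < best_loss:
            # Remember the parameters with the lowest validation error so far.
            best_loss, best_w, best_b, best_iter = valid_loss, w, b, it
        elif it - best_iter >= patience:
            break  # no improvement for `patience` iterations: stop early
    return best_w, best_b, best_loss, best_iter

# Simulated data (assumption): y = 2x + 1 plus Gaussian noise.
rng = random.Random(0)
points = []
for _ in range(60):
    x = rng.uniform(-1, 1)
    points.append((x, 2 * x + 1 + rng.gauss(0, 0.5)))
train, valid = points[:40], points[40:]

w, b, best_loss, best_iter = fit_with_early_stopping(train, valid)
print(w, b, best_loss, best_iter)
```

The returned parameters are the ones from the iteration with the lowest validation error, not necessarily the last iterate.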
Today, cloud computing is used in many application areas, from academic research to industry. Commercial cloud providers such as Amazon Web Services (AWS) advertise simple and fast access to cloud computing resources. Posts on various blogs prove that you can get your application running in the cloud, but it will cost you more than 15
The Average Investors Blog posted a nice report about accelerating a default Debian R installation, and the author added some details about his benchmarks in the comment section.
Kaggle has released a new data-mining challenge: use data from 10 years of Wikipedia edits in order to predict future edit rates. The dataset has been anonymized in order to obscure editor identity and article identity, simultaneously adding focus to the challenge and robbing the dataset of considerable richness. I have some experience with Wikipedia…