
This looks interesting: http://infochimps.org/search?query=soil
A few months back I gave a presentation to the NYC R Meetup. (R is a statistical programming language. If this means nothing to you, feel free to stop reading now.) The presentation was on ggplot2, a popular package for generating graphs of data and statistics. In the talk (which you can see here, including
Patrick Burns just posted to the mailing list the following massage: There is now a document called “Some hints for the R beginner” whose purpose is to get people up and running with R as quickly as possible. Direct access to it is: http://www.burns-stat.com/pages/Tutor/hints_R_begin.html JRR Tolkien wrote a story (sans hobbits) called ‘Leaf by Niggle’ that has always resonated with me. I...
Here i present some Books and Articles about Ecological Modelling and “R”. Since “R” is integrated in Bio7 all the presented methods in the Books and Articles can also be useful together with Bio7. Books: Ellner, Stephen P. & Guckenheimer, John (2006). Dynamic Models in Biology. Princeton University Press Bolker B (2008) Ecological Models and
In connection with the Le Monde puzzle of last week, I was looking for an R function that would give me the prime factor decomposition of any integer. Such a function exists within the package schoolmath, developped by Joerg Schlarmann and Josef Wienand. It is called prime.factor and it returns the prime factors of any 
The impacts of drought depend on time-scale. On short time-scales, drought means dry soil. On long time-scales, it means dry rivers and empty reservoirs. A region may simultaneously experience dry conditions on one time-scale and wet conditions on another e.g. wet soil but low streamflow or visa versa. Standardized Precipitation Index (SPI) is a widely
A contingency table is used in statistics to provide a tabular summary of categorical data and the cells in the table are the number of occassions that a particular combination of variables occur together in a set of data. The relationship between variables in a contingency table are often investigated using Chi-squared tests. The simplest contingency
At the Edinburgh mixture estimation workshop, Murray Aitkin presented his proposal to compare models via the posterior distribution of the likelihood ratio. As already commented in a post last July, the positive aspect of looking at this quantity rather than at the Bayes factor is that the priors are then allowed to be improper if 
The R package oro.nifti has been released. Medical imaging data, in NIfTI or Analyze formats, may be input, created from scratch, converted from DICOM (using oro.dicom) and output to a file.
The R package oro.nifti has been released. Medical imaging data, in NIfTI or Analyze formats, may be input, created from scratch, converted from DICOM (using oro.dicom) and output to a file.
InformationWeek published today a profile of Zack Urlocker, the former MySQL executive who recently joined REvolution's board: Former MySQL staffer Zack Urlocker is going to try to do for predictive analytics what he once did for relational database systems: bring open source code to a user population that hasn't necessarily had access to the technology before. REvolution Computing of...
In the US, at least. Via The Consumerist: Incidentally, the US FDA doesn't publish pyramids like this any more: it's now a garish personalized 2-d triangle with stripes. But at least it doesn't make the error of dimension committed by the left-hand pyramid: that orange section is a hell of a lot larger than 74% of the volume. The...
On a lark, and to kill a bit of time, I was running the R fortune command looking for references to SAS. Here’s what two successive random fortunes turned up. Can there be two more antipodal opinions about the same product? I laughed out loud. > fortune(‘SAS’) There are companies whose yearly license fees to 
Hi there! This blog will be dedicated to data visualization in R. Why? Two reasons. First, when it comes to statistics, I am always starting by some exploratory analyses, mostly with plots. And when I handle large quantities of data, it’s nice to make some graphs to get a grasp about what is going on. 
GCS Model Grids
Recently finished some collaborative work with Vishal, related to visualizing climate change data for the SEI. This project was funded in part by the California Energy Commission, with additional technical support from the Google Earth Team. One of the final products was an...
As a coincidence, or not, I received the following email just before starting our mixture estimation workshop (the above is Ben Nevis on Monday, whose skyline really looks like a three component mixture!) and giving a discussion on label switching: I am implementing a Markov-Chain Monte Carlo method for Gibbs sampling from a simple mixture 
another plyr example quantiles (0.05, 0.25, 0.5, 0.75, 0.95) of DSC by temperature bin
There are plenty of good examples on how to use functions from the plyr package. Here is one more, demonstrating how to use ddply with a custom function. Note that there...
I totally missed this the other day, but there's much more to that special report on the data deluge in The Economist. (Thanks to readers SB and DN for pointing this out.) There's an total of nine articles in the report (you can find them all in the Related Items box on this page), including a section on business...
I wrote an R function to do soft-trimming, right clipping FastQ reads based on quality.
This function has the option of leaving out sequences trimmed to extinction and will do left-side fixed trimming as well.
#softTrim
#trim first position lower than minQuality and all subsequent positions
#omit sequences that after trimming are shorter than minLength
#left trim to firstBase, (1 implies no left trim)
#input:...
I wrote an R function to do soft-trimming, right clipping FastQ reads based on quality.
This function has the option of leaving out sequences trimmed to extinction and will do left-side fixed trimming as well.
#softTrim
#trim first position lower than minQuality and all subsequent positions
#omit sequences that after trimming are shorter than minLength
#left trim to firstBase, (1 implies no left trim)
#input:...
Analyst David Stodder at Intelligent Enterprise also noted the activity around R at the recent Predictive Analytics World conference in San Francisco, and he reviews his impressions in a column today. In fact, he attributes the increasing prominence of predictive analytics to R: Possibly the most important factor influencing the spread of predictive analytics is the growing popularity of...
In a previous tutorial I showed you how to create plots faceted by the level of a third variable using ggplot2. A commenter asked about using faceted plots and viewports and reminded me of this function I found in the ggplot2 Google group. The arrange function below is similar to using par(mfrow=c(r,c)) in base graphics to put more than...