212 search results for "evaluation"

CFP: the 11th Australasian Data Mining Conference (AusDM 2013), submission due 15 July

April 3, 2013

The 11th Australasian Data Mining Conference (AusDM 2013), Canberra, Australia, 13-15 November 2013, http://ausdm13.togaware.com. Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891. Data mining, the art and science of intelligent analysis of (usually large) data sets for meaningful (and previously unknown) …

Lots of data != "Big Data"

March 28, 2013

by Joseph Rickert. When talking with data scientists and analysts who work with large-scale data analytics platforms such as Hadoop about the best way to do some sophisticated modeling task, it is not uncommon for someone to say, "We have all of the data. Why not just use it all?" This sort of comment often...

Writing a MS-Word document using R (with as little overhead as possible)

March 24, 2013

The problem: producing a Word (.docx) file of a statistical report created in R, with as little overhead as possible. The solution: combining R+knitr+rmarkdown+pander+pandoc (it is easier than it is spelled). If you get what this post is about, just …

Review of Mathematica 9 and R-link

March 18, 2013

VIDEO TRANSCRIPT: Hello, this is Matt Asher from StatisticsBlog.com. I’m going to be reviewing Mathematica 9, from Wolfram Research. In particular, I’ll be focusing on using it with R and to do Monte Carlo simulations and other statistical work. You can find a full transcript of this video at my blog, including the source code

Evaluation of Orthogonal Signal Correction for PLS modeling (OSC-PLS and OPLS)

March 15, 2013

Partial least squares projection to latent structures, or PLS, is one of my favorite modeling algorithms. PLS is an optimal algorithm for predictive modeling using wide data, or data with rows << variables. While there is a wealth of literature regarding the application of PLS to various tasks, I find it especially useful for biological

Job advert

March 12, 2013

We finally got around to preparing everything we needed to advertise the position that will be available in the MRC grant we were awarded last year. The project will run for 30 months and we're looking for a post-doctoral candidate to work on the Rese...

Call for participation: DMApps 2013 – an International Workshop on Data Mining Applications in Industry and Government

March 10, 2013

DMApps 2013 will be held in conjunction with PAKDD 2013, Gold Coast, Australia, April 14, 2013. http://dmapps2013.rdatamining.com To attend the workshop, you need to register for PAKDD 2013 …

Veterinary Epidemiologic Research: Linear Regression Part 2 – Checking assumptions

March 6, 2013

We continue with the linear regression chapter of the book Veterinary Epidemiologic Research. Using the same data as in the last post, we run example 14.12. Now we can create some plots to assess the major assumptions of linear regression. First, let’s have a look at homoscedasticity, or constant variance of residuals. You can run a statistical test, the

Times per second benchmark

March 5, 2013

In GNU R the simplest way to measure the execution time of a piece of code is to use system.time. However, sometimes I want to find out how many times some function can be executed in one second. This is especially useful when we want to compare function...
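
The teaser is cut off before any of the post's own code, so the following is not the author's R implementation. It is a minimal Python sketch of the same "calls per second" idea, under the assumption that it is enough to call a function repeatedly until a time budget runs out and then divide the call count by the elapsed time. The names `times_per_second`, `sum_loop`, and `sum_builtin` are hypothetical and chosen only for illustration.

```python
import time


def times_per_second(func, duration=1.0):
    """Estimate how many times func() can run per second.

    Calls func() repeatedly until roughly `duration` seconds have
    passed, then divides the number of completed calls by the
    measured elapsed time.
    """
    calls = 0
    start = time.perf_counter()
    deadline = start + duration
    while time.perf_counter() < deadline:
        func()
        calls += 1
    elapsed = time.perf_counter() - start
    return calls / elapsed


# Two hypothetical functions that compute the same result in different ways.
def sum_loop(n=1000):
    total = 0
    for i in range(n):
        total += i
    return total


def sum_builtin(n=1000):
    return sum(range(n))


if __name__ == "__main__":
    for f in (sum_loop, sum_builtin):
        rate = times_per_second(f, duration=0.2)
        print(f"{f.__name__}: {rate:,.0f} calls per second")
```

The reported rate includes the overhead of the timing loop itself, so it is best used to compare functions against each other rather than as an absolute measurement.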

PCA to PLS modeling analysis strategy for WIDE DATA

March 2, 2013

Working with wide data is already hard enough; add row outliers to this and things can get murky fast. Here is an example of an analysis of a wide data set, 24 rows x 84 columns, using imDEV, written in R, to calculate and visualize a principal components analysis (PCA) on this data set. We find that