March 2012

Levenshtein distance in C++ and code profiling in R

March 25, 2012 | Dzidorius Martinaitis

At work, the client requested, if existing search engine could accept singular and plural forms equally, e. g. “partner” and “partners” would lead to the same result. The first option – stemming. In that case, search engine would use root of a word, e. g. “partn”. However, stemming has many weaknesses: ... [Read more...]

Disproportionality Data

March 25, 2012 | Christopher Gandrud

So I was hunting around for some data on disproportional electoral outcomes (when the proportion of voters cast for political parties is not close to the proportion of legislative seats that they win).Michael Gallagher keeps an updated version of his L...
[Read more...]

Citations in markdown using knitr

March 24, 2012 | Carl

I am finding myself more and more drawn to markdown rather then tex/Rnw as my standard format (not least of which is the ease of displaying the files on github, particularly now that we have automatic image uploading). One thing I miss from latex is the citation commands. (I ... [Read more...]

Linking apple liking to sensory

March 24, 2012 | Wingfeet

Previously it was seen that apple liking was related to consumers scores for juiciness and sweetness. It would be most nice if these scores can be linked to sensory scores. Thus a three block model would result:A block with sensory data describing how ...
[Read more...]

Video: R at Work and at Home

March 24, 2012 | jeromyanglim

The following video was filmed at Melbourne R Users. The description of the talk from the meetup site: Eu Jin is a Senior Analyst with Deloitte Analytics in Melbourne. He has over four years experience in data mining and statistical … Continue reading →
[Read more...]

Gini Efficient Frontier

March 23, 2012 | systematicinvestor

David Varadi have recently wrote two posts about Gini Coefficient: I Dream of Gini, and Mean-Gini Optimization. I want to show how to use Gini risk measure to construct efficient frontier and compare it with alternative risk measures I discussed previously. I will use Gini mean difference risk measure – the ... [Read more...]

Serious stats – free statistics resources

March 23, 2012 | Thom Baguley

The companion web site for Serious Stats is now live:http://www.palgrave.com/psychology/baguley/The web site includes:- a free sample chapter (Chapter 15: Contrasts)- data sets- R scripts- 5 online supplements (for meta-analysis, multiple imputation, r... [Read more...]

Dissimilarity Between Soil Profiles: A Closer Look

March 23, 2012 | dylan

Continuing the previous discussion of pair-wise dissimilarity between soil profiles, the following demonstration (code, comments, and figures) further elaborates on the method. A more in-depth discussion of this example will be included as a vignette w... [Read more...]

R in Google Summer of Code 2012

March 23, 2012 | Joshua Ulrich

This post is a slightly revised (and "blogified") version of the message Brian Peterson has sent to various R mailing lists.Once again, R has been accepted as a mentoring organization for the Google Summer of Code (2012).  We invite students interested in this program to learn more about it.  A ... [Read more...]

RStudio Development Environment

March 23, 2012 | bryan

Compared to many other languages of equal popularity, there are realtively few development environments for R. In fact, the total number of production ready R IDEs could probably be counted on one hand. That deficiency is a small price to pay to use R ... [Read more...]

R, Twitter and McDonald’s

March 23, 2012 | David Smith

Ed Chen is a data scientist at Twitter, so he's accustomed to working with big data and complex models. In an interview with MIT Technology Review, he describes his data science toolbox: A common pattern for me is that I'll code a MapReduce job in Scala, do some simple command-line ... [Read more...]
1 2 3 4 5 6 16

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)