My mi presentation at the useR! 2009

August 4, 2009
By

I have been to many conferences. Most of them are of Political Science. The common feature of these conferences is that the audience size is always small. If you get 10 audiences in your panel, your must be a big name.My presentation at the useR! 20...

Read more »

My mi presentation at the useR! 2009

August 4, 2009
By

I have been to many conferences. Most of them are of Political Science. The common feature of these conferences is that the audience size is always small. If you get 10 audiences in your panel, your must be a big name.My presentation at the useR! 20...

Read more »

Non-parametric methods for the study of the correlation: Spearman’s rank correlation coefficient and Kendall tau rank correlation coefficient

August 3, 2009
By

We saw in the previous post, how to study the correlation between variables that follow a Gaussian distribution with the Pearson product-moment correlation coefficient. If it is not possible to assume that the values follow gaussian distributions, we have two non-parametric methods: the Spearman's rho test and Kendall's tau test.For example, you want to study the productivity...

Read more »

Non-parametric methods for the study of the correlation: Spearman’s rank correlation coefficient and Kendall tau rank correlation coefficient

August 3, 2009
By

We saw in the previous post, how to study the correlation between variables that follow a Gaussian distribution with the Pearson product-moment correlation coefficient. If it is not possible to assume that the values follow gaussian distributions, we have two non-parametric methods: the Spearman's rho test and Kendall's tau test.For example, you want to study the productivity...

Read more »

Rcpp 0.6.6

August 3, 2009
By

Version 0.6.6 of Rcpp went off to CRAN and Debian earlier as another small maintenance release. While the diffstat output that will appear in cranberries may look impressive, it ste...

Read more »

Rcpp 0.6.6

August 3, 2009
By

Version 0.6.6 of Rcpp went off to CRAN and Debian earlier as another small maintenance release. While the diffstat output that will appear in cranberries may look impressive, it stems c...

Read more »

Rcpp 0.6.6

August 3, 2009
By

Version 0.6.6 of Rcpp went off to CRAN and Debian earlier as another small maintenance release. While the diffstat output that will appear in cranberries may look impressive, it ste...

Read more »

R GUI page on the R wiki

August 3, 2009
By

I've started the process of moving the content of this page to the R wiki. The motivation is that the content will become dynamic and updated much more often, people can add their own project, we can have use cases of each gui, tutorials, feature c...

Read more »

Parametric method for the study of the correlation: the Pearson r-test

August 3, 2009
By

Suppose you want to study whether there is a correlation between 2 sets of data. To do this we compute the Pearson product-moment correlation coefficient, which is a measure of the correlation (linear dependence) between two variables X and Y; then we ...

Read more »

Parametric method for the study of the correlation: the Pearson r-test

August 3, 2009
By

Suppose you want to study whether there is a correlation between 2 sets of data. To do this we compute the Pearson product-moment correlation coefficient, which is a measure of the correlation (linear dependence) between two variables X and Y; then we ...

Read more »

useR! 2009 conference, Rennes, France

August 2, 2009
By

Since my last attendance of the useR conference in Vienna, Austria, I have always wanted to go back. However, the high registration fee was always the factor that chills me off.This year, I am taking advantage of my last year as the student (the regis...

Read more »

useR! 2009 conference, Rennes, France

August 2, 2009
By

Since my last attendance of the useR conference in Vienna, Austria, I have always wanted to go back. However, the high registration fee was always the factor that chills me off.This year, I am taking advantage of my last year as the student (the regis...

Read more »

Example 7.8: Plot two empirical cumulative density functions using available tools

August 1, 2009
By
Example 7.8: Plot two empirical cumulative density functions using available tools

The empirical cumulative density function (CDF) (section 5.1.16) is a useful way to compare distributions between populations. The Kolmogorov-Smirnov (section 2.4.2) statistic D is the value of x with the maximum distance between the two curves. As an...

Read more »

Simple visualization of a 11X5 table (for WordPress 2.9 Features Vote Results)

July 31, 2009
By
Simple visualization of a 11X5 table (for WordPress 2.9 Features Vote Results)

I guess this is not the number one post I would like to start with on this blog, but I feel the time is right for it (community-wise). I’ll move on to the subject matter in a moment, but first a short intro: This blog is written by Tal Galili. I am an aspiring statistician who

Read more »

Simple visualization of a 11X5 table (for WordPress 2.9 Features Vote Results)

July 31, 2009
By
Simple visualization of a 11X5 table (for WordPress 2.9 Features Vote Results)

I guess this is not the number one post I would like to start with on this blog, but I feel the time is right for it (community-wise). I’ll move on to the subject matter in a moment, but first a short intro: This blog is written by Tal Galili. I am ...

Read more »

Kruskal-Wallis one-way analysis of variance

July 31, 2009
By

If you have to perform the comparison between multiple groups, but you can not run a ANOVA for multiple comparisons because the groups do not follow a normal distribution, you can use the Kruskal-Wallis test, which can be applied when you can not make ...

Read more »

Kruskal-Wallis one-way analysis of variance

July 31, 2009
By

If you have to perform the comparison between multiple groups, but you can not run a ANOVA for multiple comparisons because the groups do not follow a normal distribution, you can use the Kruskal-Wallis test, which can be applied when you can not make ...

Read more »

Analysis of variance: ANOVA, for multiple comparisons

July 30, 2009
By

Analysis of variance: ANOVA, for multiple comparisonsThe ANOVA model can be used to compare the mean of several groups with each other, using a parametric method (assuming that the groups follow a Gaussian distribution).Proceed with the following example:The manager of a supermarket chain wants to see if the consumption in kilowatts of 4 stores between them are equal. He...

Read more »

Analysis of variance: ANOVA, for multiple comparisons

July 30, 2009
By

Analysis of variance: ANOVA, for multiple comparisonsThe ANOVA model can be used to compare the mean of several groups with each other, using a parametric method (assuming that the groups follow a Gaussian distribution).Proceed with the following example:The manager of a supermarket chain wants to see if the consumption in kilowatts of 4 stores between them are equal. He...

Read more »

Comparison of two proportions: parametric (Z-test) and non-parametric (chi-squared) methods

July 29, 2009
By

Consider for example the following problem.The owner of a betting company wants to verify whether a customer is cheating or not. To do this want to compare the number of successes of one player with the number of successes of one of his employees, of w...

Read more »

Comparison of two proportions: parametric (Z-test) and non-parametric (chi-squared) methods

July 29, 2009
By

Consider for example the following problem.The owner of a betting company wants to verify whether a customer is cheating or not. To do this want to compare the number of successes of one player with the number of successes of one of his employees, of w...

Read more »

Wilcoxon signed rank test

July 29, 2009
By

Non-parametric statistical hypothesis test, for the comparison of the means between 2 paired samplesThe mayor of a city wants to see if pollution levels are reduced by closing the streets to the car traffic. This is measured by the rate of pollution every 60 minutes (8am 22pm: total of 15 measurements) in a day when traffic is open, and...

Read more »

Wilcoxon signed rank test

July 29, 2009
By

Non-parametric statistical hypothesis test, for the comparison of the means between 2 paired samplesThe mayor of a city wants to see if pollution levels are reduced by closing the streets to the car traffic. This is measured by the rate of pollution every 60 minutes (8am 22pm: total of 15 measurements) in a day when traffic is open, and...

Read more »

Beta Verson of tikzDevice Released!

July 28, 2009
By

The tikzDevice package provides a new graphics device for R which enables direct output of graphics in a LaTeX-friendly way. The device output consists of files containing instructions for the TikZ graphics language and may be imported directly into LaTeX documents using the \input{} command. The beta version of tikzDevice is now available here. An

Read more »

I know it’s been so long…

July 28, 2009
By

Hey,I know it's been so long since last time I posted something in here, but I was really busy with my thesis and some other stuff, but now that I have more time I promise I'll post some interesting stuff in here, by the way, I found such an Interestin...

Read more »

I know it’s been so long…

July 28, 2009
By

Hey,I know it's been so long since last time I posted something in here, but I was really busy with my thesis and some other stuff, but now that I have more time I promise I'll post some interesting stuff in here, by the way, I found such an Interestin...

Read more »

Corpus Linguistics with R, Day 2

July 28, 2009
By

R Lesson 2 text gsub ("second", "third", text) SEARCH-REPLACE-SUBJECT "This is a first example sentence." "And this is a third example sentence." > gsub ("n", "X", text) "This is a first example seXteXce." "AXd this is a secoXd example seXteXce." > gsub ("is", "was", text) "Thwas was a first example

Read more »

Corpus Linguistics with R, Day 1

July 28, 2009
By

(This post documents the first day of a class on R that I took at ESU C&T. I is posted here purely for my own use.) R Lesson 1 > 2+3; 2/3; 2^3 5 0.6666667 8 --- Fundamentals - Functions > log(x=1000, base=10) 3 --- (Formals describes the syntax of other

Read more »

Wilcoxon-Mann-Whitney rank sum test (or test U)

July 27, 2009
By

Comparison of the averages of two independent groups of samples, of which we can not assume a distribution of Gaussian type; is also known as Mann-Whitney U-test.You want to see if the mean of goals suffered by two football teams over the years is the same. Are below the number of goals suffered by each team in 6 games...

Read more »