I would argue that your Wilcoxon test is meaningless. For all four datasets, the first data column has no overlap whatsoever with the second data column. All Wilcoxon Ws are 0. The "BIZARRE" behavior may be that the test tries to interpolate what the p value for W of 0 would be given your sample size. With your sample size, the Wilcoxon test may use a normal approximation, which might explain this behavior (http://en.wikipedia.org/wiki/Mann–Whitney_U#Normal_approximation) you would have to check it yourself.
If you create overlap by, say, adding the mean of the first data column to all observations in the second data column, you have overlap between the data columns. Only then it makes sense to run a Wilcoxon test and you get what you would expect. In your instance, it is nonsensical to perform a Wilcoxon test. That is the likely reason for the p-value issue you observe. w <- wilcox.test(c(1:40),(c(1:40)+20.5)) w$p.value w <- wilcox.test(c(1:50),(c(1:50)+25.5)) w$p.value w <- wilcox.test(c(1:60),(c(1:60)+30.5)) w$p.value w <- wilcox.test(c(1:70),(c(1:70)+35.5)) w$p.value -- View this message in context: http://r.789695.n4.nabble.com/BIZARRE-results-from-wilcox-test-tp3597818p3597936.html Sent from the R help mailing list archive at Nabble.com. ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.