Frequent 'nonparametric' Questions

140 votes

8 answers

121k views

How to choose between t-test or non-parametric test e.g. Wilcoxon in small samples

Certain hypotheses can be tested using Student's t-test (maybe using Welch's correction for unequal variances in the two-sample case), or by a non-parametric test like the Wilcoxon paired signed rank ...

Silverfish

23.8k

asked Oct 29, 2014 at 3:02

103 votes

6 answers

99k views

Kendall Tau or Spearman's rho?

In which cases should one prefer one over the other? I found someone who claims an advantage for Kendall for pedagogical reasons; are there other reasons?

Tal Galili

21.8k

asked Oct 24, 2010 at 13:15

24 votes

5 answers

13k views

What exactly does a non-parametric test accomplish & What do you do with the results?

I have a feeling this may have been asked elsewhere, but not really with the type of basic description I need. I know non-parametric relies on the median instead of the mean to compare... something. ...

Taal

315

asked Aug 12, 2013 at 21:46

77 votes

15 answers

12k views

Why would parametric statistics ever be preferred over nonparametric?

Can someone explain to me why anyone would choose a parametric over a nonparametric statistical method for hypothesis testing or regression analysis? In my mind, it's like going rafting and ...

en1

947

asked Jul 30, 2015 at 11:48

29 votes

1 answer

57k views

What is the non-parametric equivalent of a two-way ANOVA that can include interactions?

Hi I am trying to find the non-parametric equivalent of a two-way ANOVA (3x4 design) which is capable of including interactions. From my reading in Zar 1984 "Biostatistical analysis" this is possible ...

user35595

291

asked Dec 2, 2013 at 23:46

63 votes

7 answers

54k views

Which permutation test implementation in R to use instead of t-tests (paired and non-paired)?

I have data from an experiment that I analyzed using t-tests. The dependent variable is interval scaled and the data are either unpaired (i.e., 2 groups) or paired (i.e., within-subjects). E.g. (...

Henrik

14.3k

asked Jan 10, 2011 at 12:10

28 votes

4 answers

21k views

Is there an equivalent to Kruskal Wallis one-way test for a two-way model?

If the model does not satisfy ANOVA assumptions (normality in particular), if one-way, Kruskal-Wallis non-parametric test is recommended. But, what if you have multiple factors?

user4267

351

asked Jun 21, 2011 at 2:46

12 votes

3 answers

923 views

Determine if a heavy tailed distributed process has improved significantly

I observe the processing times of a process before and after a change in order to find out whether the change has improved the process. The process has improved if the processing time is reduced. The ...

Christian

233

asked Jun 9, 2012 at 14:50

52 votes

3 answers

51k views

Bootstrap vs. permutation hypothesis testing

There are several popular resampling techniques that are often used in practice, such as bootstrapping, permutation tests, the jackknife, etc. Numerous articles & books discuss these ...

Tu.2

2,957

asked Dec 25, 2011 at 1:03

7 votes

4 answers

9k views

Should I use t-test on highly skewed and discrete data?

I have samples from a highly skewed dataset about users' participation (e.g.: number of posts), that have different sizes (but not less than 200) and I want to compare their mean. For that, I'm using ...

Milena Araujo

551

asked Aug 10, 2014 at 0:15

42 votes

2 answers

6k views

Is there a reliable nonparametric confidence interval for the mean of a skewed distribution?

Very skewed distributions such as the log-normal do not result in accurate bootstrap confidence intervals. Here is an example showing that the left and right tail areas are far from the ideal 0.025 ...

Frank Harrell

95.8k

asked Dec 15, 2015 at 23:56

16 votes

1 answer

4k views

Why is the Mann–Whitney U test significant when the medians are equal?

I've received results from a Mann-Whitney rank test that I don't understand. The median of the 2 populations is identical (6.9). The upper and lower quantiles of each population are: 6.64 & 7....

Mog

1,241

asked May 21, 2011 at 16:36

15 votes

3 answers

5k views

Why is the asymptotic relative efficiency of the Wilcoxon test $3/\pi$ compared to Student's t-test for normally distributed data?

It is well-known that the asymptotic relative efficiency (ARE) of the Wilcoxon signed rank test is $\frac{3}{\pi} \approx 0.955$ compared to Student's t-test, if the data are drawn from a normally ...

Silverfish

23.8k

asked Dec 28, 2014 at 23:39

14 votes

3 answers

6k views

Non-parametric measure of strength of association between an ordinal and a continuous random variable

I'm posting the problem here as I received it. I have two random variables. One is continuous (Y) and the other is discrete and will be treated as ordinal (X). I put below the ...

user603

22.9k

asked Jun 13, 2014 at 11:55

9 votes

1 answer

5k views

Relative efficiency of Wilcoxon signed rank in small samples

I have seen in published literature (and posted on here) that the asymptotic relative efficiency of the Wilcoxon signed rank test is at least 0.864 when compared to the t test. I have also heard that ...

Jimj

1,183

asked Oct 5, 2013 at 0:25

44 votes

4 answers

69k views

What exactly is the difference between a parametric and non-parametric model?

I am confused about the definition of a non-parametric model after reading the link Parametric vs Nonparametric Models and the answer comments on another question of mine. Originally I thought "parametric vs ...

Haitao Du

37.2k

asked Mar 20, 2017 at 13:54

20 votes

3 answers

3k views

When to check model assumptions

Statistical methods are based on model assumptions. For example, an independent one-way ANOVA makes the following assumptions: normally distributed residuals; homogeneity of variance; independence of ...

Michael McCarthy

301

asked Nov 7, 2021 at 3:49

17 votes

2 answers

10k views

Applicability of chi-square test if many cells have frequencies less than 5

To find an association between peer support (independent variable) and work satisfaction (dependent variable), I wish to apply a chi-square test. Peer support is categorized into four groups according to ...

Braj-Stat

621

asked Sep 4, 2012 at 8:49

2 votes

1 answer

2k views

Which statistical analysis should I perform if the data sets are not normally distributed?

I am doing an experiment where there are two independent groups; one is the group of "infected" patients and the other is the group of "sepsis" patients. I am comparing "platelet monocyte aggregates (PMA)" ...

Saurabh Goswami

23

asked Jun 5, 2020 at 7:43

19 votes

3 answers

18k views

A non-parametric repeated-measures multi-way Anova in R?

The following question has been one of those holy grails for me for some time now; I hope someone might be able to offer good advice. I wish to perform a non-parametric repeated measures multiway anova ...

Tal Galili

21.8k

asked Aug 4, 2010 at 20:01

40 votes

3 answers

16k views

Why are Gaussian process models called non-parametric?

I am a bit confused. Why are Gaussian processes called non parametric models? They do assume that the functional values, or a subset of them, have a Gaussian prior with mean 0 and covariance function ...

user34790

6,837

asked Dec 27, 2012 at 5:00

22 votes

1 answer

43k views

Should I use t-test on highly skewed data? Scientific proof, please?

I have samples from a highly skewed (looking like an exponential distribution) dataset about users' participation (e.g.: number of posts), that have different sizes (but not less than 200) and I want ...

Milena Araujo

551

asked Aug 5, 2014 at 22:56

18 votes

5 answers

5k views

Checking ANOVA assumptions

A few months ago I posted a question about homoscedasticity tests in R on SO, and Ian Fellows answered that (I'll paraphrase his answer very loosely): Homoscedasticity tests are not a good tool ...

aL3xa

2,211

asked Sep 18, 2010 at 17:42

10 votes

1 answer

616 views

Why are all the permutations of i.i.d. samples from a continuous distribution equally likely?

Suppose $X$ is i.i.d. from a continuous distribution. Why is $$P(X_{i_1}<X_{i_2}<\cdots<X_{i_n})=P(X_{j_1}<X_{j_2}<\cdots<X_{j_n})=\frac{1}{n!}$$ for all $i,j$? I think we can reason ...

ZHU

565

asked Jan 16, 2017 at 4:25

9 votes

5 answers

8k views

Wilcoxon Signed Rank Symmetry Assumption

The symmetry assumption for the signed rank test (and its relevance) is becoming extremely confusing for me. I am hypothesizing that sub-population A (before treatment) and sub-population B (after ...

Ash

93

asked May 24, 2018 at 18:09

6 votes

1 answer

2k views

Plotting non-parametric (E)CDF confidence envelopes for comparison

I have previously asked about a way to test whether two samples are drawn from the same distribution (Non-parametric test if two samples are drawn from the same distribution). I was very glad to learn ...

Luke Gorrie

467

asked Aug 16, 2017 at 18:46

23 votes

2 answers

22k views

Power analysis for Kruskal-Wallis or Mann-Whitney U test using R?

Is it possible to perform a power analysis for the Kruskal-Wallis and Mann-Whitney U test? If yes, are there any R packages/functions that perform it?

Giorgio Spedicato

3,692

asked Sep 21, 2013 at 7:34

20 votes

4 answers

9k views

Is there any statistical test that is parametric and non-parametric?

Is there any statistical test that is parametric and non-parametric? This question was asked by an interview panel. Is it a valid question?

Biostat

1,989

asked Nov 16, 2011 at 0:00

20 votes

3 answers

24k views

Is there a multiple-sample version or alternative to the Kolmogorov-Smirnov Test?

I am comparing the size distribution of trees in six pairs of plots where one plot received a treatment and the other a control. Using a Kolmogorov-Smirnov test on each pair of plots I find that $p$ ...

N Brouwer

2,173

asked Aug 31, 2012 at 18:52

19 votes

1 answer

5k views

What is "Targeted Maximum Likelihood Expectation"?

I'm trying to understand some papers by Mark van der Laan. He's a theoretical statistician at Berkeley working on problems that overlap significantly with machine learning. One problem for me (besides ...

Nathan Kurz

311

asked Jan 22, 2015 at 21:37

15 votes

1 answer

10k views

Is there an alternative to the Kolmogorov-Smirnov test for tied data with correction?

I've got a bunch of data from two samples (control and treated), each containing several thousand values which are to undergo significance testing in R. Theoretically, the values should be continuous, ...

AnjaM

275

asked Sep 3, 2012 at 14:01

13 votes

1 answer

12k views

Friedman test vs Wilcoxon test

I'm trying to assess performance of a supervised machine learning classification algorithm. The observations fall into nominal classes (2 for the time being, however I'd like to generalize this to ...

AdrianoKF

233

asked Jan 30, 2014 at 19:06

9 votes

2 answers

7k views

Permutation test in R

I have the following data for 10 subjects based on before and after measurements: ...

user1453477

93

asked Nov 19, 2012 at 18:56

7 votes

2 answers

28k views

Nonparametric equivalent of ANCOVA for continuous dependent variables

I have an independent categorical variable ($X$ with two categories, $x_{1}$ and $x_{2}$) and two continuous dependent variables ($y$ and $z$). Using a Mann Whitney test, I know that $y$ is ...

jetistat001

417

asked Oct 26, 2012 at 19:00

6 votes

2 answers

2k views

Estimate population quantiles from subpopulations' quantiles

Suppose there is a population partitioned arbitrarily into a set of subpopulations that completely cover the original population. Assume that for some variable, we know each subpopulation's quintiles ...

J. Miller

205

asked Feb 7, 2014 at 20:39

6 votes

4 answers

3k views

Is there a non-parametric form of a 3-way ANOVA?

I am currently in the process of writing a publication about the home range of cat shark species in South Africa. However, I am currently struggling with how to create an interaction model of shark ...

Tom Johnson

61

asked Jun 6, 2022 at 16:12

0 votes

2 answers

256 views

Is there an assumption-free ANOVA?

ANOVA presupposes a normal distribution and equal variance. Kruskal–Wallis (non-parametric ANOVA) assumes that all population distributions are the same (except their parameters). I'd like to know if ...

Davi Américo

1,220

asked Dec 1, 2021 at 4:07

37 votes

4 answers

49k views

What is the weak side of decision trees?

Decision trees seem to be a very understandable machine learning method. Once created, a tree can be easily inspected by a human, which is a great advantage in some applications. What are the practical ...

Łukasz Lew

1,412

asked Aug 5, 2010 at 10:42

28 votes

2 answers

19k views

Non-parametric test if two samples are drawn from the same distribution

I would like to test the hypothesis that two samples are drawn from the same population, without making any assumptions about the distributions of the samples or the population. How should I do this? ...

Luke Gorrie

467

asked Jul 2, 2017 at 11:14

19 votes

2 answers

40k views

How to run two-way ANOVA on data with neither normality nor equality of variance in R?

I am working on my master's thesis at the moment and planned on running the statistics with SigmaPlot. However, after spending some time with my data I came to the conclusion that SigmaPlot might not be ...

Sabine

191

asked May 16, 2012 at 13:21

19 votes

3 answers

2k views

Statistical test for two distributions where only 5-number summary is known?

I have two distributions where only the 5-number summary (minimum, 1st quartile, median, 3rd quartile, maximum) and sample size are known. Contrary to the question here, not all data points are ...

bonifaz

1,095

asked Feb 17, 2014 at 21:59

11 votes

2 answers

10k views

Is ordinal or interval data required for the Wilcoxon signed rank test?

Having looked at multiple online sources, I can't seem to get a straight answer. Could someone please clarify for me if ordinal data is sufficient to use for the WSRT and if not, is the sign test an ...

Ay-Jay

111

asked Jan 7, 2013 at 17:34

11 votes

2 answers

760 views

Propensity score matching vs non-parametric regression

I am trying to understand the benefit of propensity matching over non-parametric regression for causal inference from non-experimental data. As background: the way I understand it, parametric ...

Shade

113

asked Oct 29, 2020 at 23:38

8 votes

2 answers

2k views

What inferential method produces the empirical CDF?

The empirical cdf is an estimate of the cdf. What kind of estimation method (such as method of moments, MLE, ...) constructs the empirical cdf? Is the empirical cdf a nonparametric estimate? Do ...

Tim

19.6k

asked Mar 31, 2014 at 2:31

7 votes

2 answers

1k views

Is the inference from a parametric test valid when the population distribution is not normal?

This question arose from reading this post: T-test for non normal when N>50? In the response to this post, the author outlines really well that the assumption of normality with regard to the t-...

jimbo

71

asked Dec 24, 2016 at 18:43

7 votes

1 answer

3k views

Bootstrapping a linear mixed model with R's lmeresampler or lme4 or a robust regression?

Considering that I have a very small sample and that my residuals are non-normally distributed, I've decided to perform a lmer() with bootstrapping. This is my very ...

Larissa Cury

781

asked Oct 2, 2022 at 18:13

7 votes

2 answers

5k views

Jonckheere-Terpstra interpretation

I am running the Jonckheere-Terpstra test in place of the Kruskal-Wallis test, as my factor is on an ordinal scale (i.e. groups can be ordered). The Asymptotic significance (2-tailed) is 0.000, so it seems there ...

MMaster

193

asked Sep 12, 2012 at 19:28

6 votes

2 answers

3k views

How to compare two non-normally distributed samples with very different sizes? (Mann-Whitney vs Randomization/Bootstrap)

Perhaps this is a very basic question, but I haven't yet found a simple solution to this simple problem: I want to compare two samples (say X and Y) for a continuous variable which is non-normally ...

gufranca

75

asked Mar 19, 2014 at 23:24

6 votes

3 answers

4k views

Mann Whitney Test: Clearing Up Confusion

I have been reading many statistical websites stating that the Mann Whitney test is a test of medians. However, I believe that this is not really true? It is a test of the difference in the ranks. The ...

Neal

162

asked Jun 2, 2020 at 22:55

6 votes

1 answer

3k views

Difference of 'centers' of 2 non-normal samples with Mann-Whitney test

I have 2 non-normally distributed samples of different sizes (N1~=N2). To evaluate whether there is a significant difference between these samples, I used the Mann Whitney U test (...

DankMasterDan

1,486

asked Nov 19, 2013 at 20:10
