Questions tagged [small-sample]

Refers to statistical complications or problems due to having few data. If your question is about a small sample relative to the number of variables, please use the [underdetermined] tag instead.

0 votes
1 answer
46 views

Getting insignificant p-values but a large effect size: what should I do?

I used Qualtrics to conduct a survey and received 700+ responses. I also collected demographic data so I could disaggregate the responses and look for differences between groups. A couple of questions used display ...
neybeh
  • 1
0 votes
0 answers
5 views

Pooling Participant Data to Resolve Limited Trials per Condition Issues

Problem: In an experimental task, there are 4 conditions with only 2 trials each, resulting in 8 observations per participant. Typically, we use Signal Detection Theory (SDT) to calculate descriptive ...
StupefiedByYou
1 vote
1 answer
31 views

Considering 96 observations for estimating the intercept (rule of thumb)

I remember Prof. Frank Harrell stated that in order to calculate the sample size using the rule of thumb, we must include 96 observations for just computing the intercept, hence the estimated sample ...
elisa
  • 55
8 votes
5 answers
774 views

Correlation for Small Dataset?

I have an $x$ and a $y$ that I would like to find the correlation of to learn more about their relationship. Unfortunately, I only have $10$ points. Can I in good faith use the Pearson correlation ...
Camellia99
0 votes
0 answers
15 views

Investigating seasonality and trend in short time series

I am interested in investigating the presence or absence of seasonality or trend in very short time series (typically at most 12 observations). Case in point: suppose one is looking at average number of ...
Astral
  • 133
15 votes
4 answers
1k views

Train-validation-test split for small and unbalanced dataset?

I have a dataset of around 100 rows, each with around 400 features. 93 of them are class 0, and 7 are class 1. I want to be able to split my 100 examples into a train set, a validation set, and a test ...
Thao Nguyen
1 vote
1 answer
82 views

Distribution of the spread of observations in a triplicate sample taken from a Gaussian distribution

Suppose random triplicate samples are taken from a Gaussian distribution with known mean and SD. What is the distribution of the maximum absolute difference among the 3 possible pairs of ...
Maciej Tomczak
3 votes
1 answer
45 views

How to deal with extremely small training dataset in machine learning? [duplicate]

I've around 100 rows of data with labels ...
zZzZ
  • 79
1 vote
1 answer
142 views

How to deal with extremely small training data? [closed]

I've around 100 rows of data with labels ...
zZzZ
  • 79
3 votes
2 answers
66 views

Mixed-effect logistic regression with small sample size: is it possible or do you have alternative solutions?

I want to run a mixed-effect regression model on a few data points. I have 24 participants and 4 trials per participant. I want to include two fixed effects and their interaction in the model, as well ...
chiaras15
2 votes
0 answers
33 views

Regression with small sample size - LASSO or remove variables?

I'm trying to run a regression, but I only have 14 observations, each being a different city in the US. My dependent variable is the total number of trips per capita, and my explanatory variables are ...
BeyondConfused
5 votes
2 answers
413 views

ANOVA vs Kruskal Wallis - Small sample size

I have data from an experiment comparing plant weights for 4 independent treatment groups. The data seem to be normally distributed (I have been warned about using statistical tests for normality). ...
user411569
4 votes
1 answer
270 views

Small Sample size

I have collected primary data from 100 tech startups. However, there are 23k active tech startups. Given that, how can I justify the small sample size? I am collecting the data from an emerging ...
Anu
  • 65
1 vote
0 answers
90 views

Hyperparameter tuning for small datasets

I have about 10 small imbalanced datasets (some of them only have about 150 samples). I want to try a bunch of balancing techniques on some models. For that, I'm using the repeated stratified cross-...
beautifularmy
0 votes
0 answers
12 views

Engle-Granger cointegration test critical values

I am conducting the Engle-Granger cointegration test on a system of three time series: logged spot exchange rates, logged domestic price index, and logged foreign price index. I would like to use ...
Pavel Filip
