Questions tagged [categorical-data]
Categorical (also called nominal) data can take on a limited number of possible values called categories. Categorical values "label"; they do not "measure". Please use the [ordinal-data] tag for discrete but ordered data types.
3,590
questions
-1
votes
0
answers
7
views
categorical independent variables [closed]
I am conducting a retrospective study and have some categorical independent variables, and my outcome variable is also categorical. I am wondering if the chi-square test with multiple categorical ...
0
votes
0
answers
7
views
How to find the marginal distribution of $p_k$ in multinomial regression
Given a multinomial regression, the probability of a certain class $k$ is a function of predictors. How would we find analytically the marginal probability of $p_k$ given we know the distribution of ${...
1
vote
0
answers
16
views
What type of test should I use for paired categorical data?
Somewhat similar questions have been asked here and here, but none of the other answers are really applicable in my case.
I have a group of people, where in Year 1 everybody is sick. Here I have only ...
1
vote
1
answer
21
views
Why does the contribution of a categorical value in SHAP trained on CatBoost differ from observation to observation?
Context
Let's imagine I am interested in predicting sepal length in the iris dataset using catboost.
Objective
My main objective is understanding the effect of each categorical value for ...
-2
votes
0
answers
40
views
Is a factor with two levels set as a random effect okay? [closed]
a. Can covariates that are factors/categories with 2 levels, such as yes/no or presence/absence, and that also change spatial location from year to year, have a smooth basis of “re” (bs=re)? Should it ...
1
vote
0
answers
22
views
Modeling for a data set that has a different number of factors for each row (not binomial) [closed]
The modeling issue I'm having is that the categorical variable for each row has a different number of factors. If I can reshape the data by products (a,b,c,.....~cost, hoursum, numPod, numDate), so that ...
3
votes
1
answer
65
views
Factor-smooth interactions in generalized additive models
I have a very simple question regarding the continuous factor-smooth interactions implemented in the mgcv package in R: https://stat.ethz.ch/R-manual/R-patched/library/mgcv/html/factor.smooth.html
...
0
votes
0
answers
31
views
Correct test for regression with more than two factor levels?
Is it correct to use a "post hoc test" (e.g. a Tukey test) to analyse differences among factor levels in a linear regression or is there another method that is preferred?
In my case, I have ...
3
votes
1
answer
58
views
Testing dependence of two categorical variables with data separated by test subject
This relates to a real-world experiment I'm working on. Let's say in my experiment, test subjects look at pictures of sunglasses that are either red or blue, and then label them "cool" or ...
1
vote
1
answer
14
views
Handling Composite Variables in Latent Class Analysis
I am working with secondary survey data that includes 13 dichotomous variables. According to my theoretical model, I need to group these variables into composite variables. However, I understand that ...
2
votes
1
answer
28
views
Exponential Regression dependent variable with dummy variables or numerical average of each category?
My dataset includes toxin concentrations (continuous, dependent variable) for different size classes (5mm increments) of juvenile fish (categorical, independent variable). The smallest size class is ...
1
vote
2
answers
59
views
What is the difference between a) multilevel modelling and b) adding a categorical IV to a multiple regression?
The examples of multilevel modelling I have seen are equivalent to treating the group as an extra categorical IV in a multiple regression. For example, if children are grouped into N classes, you ...
3
votes
1
answer
144
views
Are dichotomous categorical variables technically interval/continuous measures?
In my university statistics book it says "Dichotomous categorical variables are easily handled in MRA. This is because they are by definition, an interval (continuous) measure." However, I ...
0
votes
0
answers
24
views
Statistical test to show differences between two categorical datasets
I have two categorical datasets that I would like to compare against each other, but I'm not exactly sure which test to choose and how to go about it. Both datasets show the number of cases for 8 ...
2
votes
1
answer
40
views
How to model the interaction between two continuous variables and two factors using GAMs with mgcv? [closed]
I was wondering how I could model the effect of two continuous variables X1 and X2 and two factors F1 and F2 and their interaction on an outcome Y using GAMs. What I primarily want to know is (in ...