Questions tagged [binary-data]
A binary variable takes one of two values, typically coded as "0" and "1".
1,459
questions
0
votes
0
answers
11
views
Coefficient of determination with binary prediction
The coefficient of determination $R^2$ is a popular measure of regression performance that compares the mean squared-deviation of predictions to the variance of the actual data. If instead of ...
0
votes
0
answers
9
views
Extremely high logloss in binary classification problem [duplicate]
I have a binary classification problem that I am currently trying to tackle with xgboost. This is a low signal-to-noise ratio situation dealing with time series. Per this answer "Dumb" log-...
1
vote
1
answer
73
views
Logloss worse than random guessing with xgboost
I have a binary classification problem that I am currently trying to tackle with xgboost. This is a low signal-to-noise ratio situation dealing with time series. My out of sample AUC is 0.65, which is ...
0
votes
0
answers
10
views
How does conditional logit deal with binary predictors in choice sets of three or more alternatives?
I am interested in clogit, not multinomial logit, and the overall effect of the predictor on the choice of n alternatives that can only be identified by their attributes (i.e., the choice sets are ...
1
vote
1
answer
27
views
Conducting a network meta-analysis in R - inputting dichotomous data
I am looking to conduct a meta-analysis in R using {netmeta}. However, my outcome of interest is a little bit complex and would appreciate advice in how best to enter it into the dataset.
I am looking ...
2
votes
1
answer
71
views
How do I measure the regularity of the distribution in a list of binary data?
Suppose I have a list list = [0, 1, 0, 1, 0, 1, 1, 0, 0, 0, 1, 1, 1, 0, 1], which gives information about whether a person was sick on a day (1) or not (0), since ...
0
votes
0
answers
18
views
What is the appropriate statistical test in this scenario?
I work with a particular genus of plants where there have been many instances where what was once considered a species is now considered to be a synonym of another species. I am interested in ...
0
votes
0
answers
8
views
Similarity measure for sparse, ordered, binary vectors, with more weighting to True values
I have two sparse, ordered, binary vectors. The size of the vectors is around ~100.
I am under the impression that cosine similarity is useful for sparse, ordered, binary vectors. For my purposes, it ...
0
votes
0
answers
11
views
Can you use a binary outcome variable in a longitudinal latent-growth mediation model
I am wanting to create a LGCM in which I am assessing whether two latent factors (the intercept and slope) mediate the relationship between a continuous predictor and binary outcome variable but am ...
3
votes
1
answer
49
views
Testing for difference in trends of binary-parameter proportion/fraction versus continuous parameter between two (or more) different populations
I have three samples of objects: sample X (total = 138), sample Y (same basic class as X, but located in a different environment; total = 186), and sample Z (total = 996). For each object, I have a ...
1
vote
1
answer
14
views
Handling Composite Variables in Latent Class Analysis
I am working with secondary survey data that includes 13 dichotomous variables. According to my theoretical model, I need to group these variables into composite variables. However, I understand that ...
2
votes
0
answers
23
views
Quantifying a Sequence of Binary Outcomes
I'm compiling a dataset on animal color patterns. Blotches flank the body of the animal (Agkistrodon contortrix) and meet along its midline (i.e., spine). At midline, the blotches can join ...
0
votes
0
answers
20
views
Correlation Coefficent is higher when likelihood of an event is lower, how does this occur?
I have different variables that I am interested in if they influence pass/fail rates.
To see what variables I might use as a leading indicator, I've pulled different variables such as "tutoring&...
0
votes
0
answers
7
views
How to draw a ROC curve given estimated probability that a unit is positive and actual observed class? [duplicate]
Assume that a classification model fitted to data available to you has provided for each statistical unit a probability $P(+|x)$ that the unit is positive. The following table shows all available ...
0
votes
0
answers
33
views
Panel Data logistic regression
I am working with a panel dataset that describes company code as an id variable and fiscal year as a time variable (1500 firms from 2013-2022). The dependent variable is binary. First I checked for ...