Questions tagged [binary-data]

A binary variable takes one of two values, typically coded as "0" and "1".

0 votes
0 answers
11 views

Coefficient of determination with binary prediction

The coefficient of determination $R^2$ is a popular measure of regression performance that compares the mean squared deviation of the predictions to the variance of the actual data. If instead of ...
AnotherShruggingPhysicist
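
As a rough illustration of the $R^2$ definition quoted in the excerpt above, the sketch below computes $1 - \text{MSE}/\text{Var}(y)$ for 0/1 outcomes. The data and the 0.5 cutoff are hypothetical, and the truncated question may be asking about a different variant, so this is only one reading of it.

```python
def r_squared(y_true, y_pred):
    """R^2 as 1 - (mean squared error) / (variance of the observed data)."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    mse = sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / n
    var = sum((y - mean_y) ** 2 for y in y_true) / n
    return 1.0 - mse / var


# Hypothetical binary outcomes and predicted probabilities (not from the question).
y = [0, 1, 1, 0, 1, 0, 1, 1]
p_hat = [0.2, 0.7, 0.9, 0.6, 0.6, 0.1, 0.8, 0.4]

# Hard 0/1 predictions from an assumed 0.5 cutoff.
hard = [1 if p >= 0.5 else 0 for p in p_hat]

r2_prob = r_squared(y, p_hat)                     # probabilities as predictions
r2_hard = r_squared(y, hard)                      # thresholded predictions
r2_base = r_squared(y, [sum(y) / len(y)] * len(y))  # always predict the base rate

print(r2_prob, r2_hard, r2_base)
```

With these made-up numbers, the thresholded predictions score below the constant base-rate prediction even though only two of eight labels are wrong, which shows how harshly squared error treats a hard 0/1 miss.
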
0 votes
0 answers
9 views

Extremely high logloss in binary classification problem [duplicate]

I have a binary classification problem that I am currently trying to tackle with xgboost. This is a low signal-to-noise ratio situation dealing with time series. Per this answer "Dumb" log-...
Baron Yugovich
1 vote
1 answer
73 views

Logloss worse than random guessing with xgboost

I have a binary classification problem that I am currently trying to tackle with xgboost. This is a low signal-to-noise ratio situation dealing with time series. My out of sample AUC is 0.65, which is ...
Baron Yugovich
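
One way a model can rank cases reasonably well (AUC above 0.5) and still have a log loss worse than a constant guess is overconfidence on a few wrong cases. The sketch below uses hypothetical labels and scores, not the asker's data or xgboost output, and is not a diagnosis of their model.

```python
import math


def log_loss(y_true, p_pred, eps=1e-15):
    """Mean negative log-likelihood of binary labels under predicted probabilities."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(y_true)


def pairwise_auc(y_true, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical labels with a 30% base rate.
y = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0]
base_rate = sum(y) / len(y)

# "Dumb" baseline: predict the base rate for every case.
baseline_loss = log_loss(y, [base_rate] * len(y))

# Hypothetical overconfident scores: mostly right, but two confident misses.
scores = [0.99, 0.01, 0.98, 0.97, 0.02, 0.01, 0.03, 0.02, 0.01, 0.02]
model_loss = log_loss(y, scores)
model_auc = pairwise_auc(y, scores)

print(baseline_loss, model_loss, model_auc)
```

Here the two confident misses dominate the average loss, so the scores lose to the base-rate baseline on log loss while still ranking most positives above most negatives.
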
0 votes
0 answers
10 views

How does conditional logit deal with binary predictors in choice sets of three or more alternatives?

I am interested in clogit, not multinomial logit, and the overall effect of the predictor on the choice of n alternatives that can only be identified by their attributes (i.e., the choice sets are ...
Olifa
  • 1
1 vote
1 answer
27 views

Conducting a network meta-analysis in R - inputting dichotomous data

I am looking to conduct a meta-analysis in R using {netmeta}. However, my outcome of interest is somewhat complex, and I would appreciate advice on how best to enter it into the dataset. I am looking ...
emmz
  • 11
2 votes
1 answer
71 views

How do I measure the regularity of the distribution in a list of binary data?

Suppose I have a list list = [0, 1, 0, 1, 0, 1, 1, 0, 0, 0, 1, 1, 1, 0, 1], which gives information about whether a person was sick on a day (1) or not (0), since ...
marvelfab12
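
One possible way to summarize how evenly the 1s are spread in the list from the question above is to look at run lengths. This is a minimal sketch, assuming runs are a useful notion of "regularity" here. The comparison value is the expected number of runs from the Wald-Wolfowitz runs test, which is a swapped-in technique and not something the excerpt mentions.

```python
from itertools import groupby

# The list quoted in the question: 1 = sick on that day, 0 = not sick.
days = [0, 1, 0, 1, 0, 1, 1, 0, 0, 0, 1, 1, 1, 0, 1]

# Collapse consecutive equal values into (value, run_length) pairs.
runs = [(value, len(list(group))) for value, group in groupby(days)]
sick_runs = [length for value, length in runs if value == 1]
well_runs = [length for value, length in runs if value == 0]

# Expected number of runs if the same 0s and 1s were arranged at random
# (mean of the Wald-Wolfowitz runs statistic).
n1 = sum(days)
n0 = len(days) - n1
expected_runs = 2 * n1 * n0 / (n1 + n0) + 1

print(len(runs), sick_runs, well_runs, round(expected_runs, 2))
```

Many more runs than expected would suggest regular alternation, and many fewer would suggest clustering of sick days.
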
0 votes
0 answers
18 views

What is the appropriate statistical test in this scenario?

I work with a particular genus of plants in which many taxa once considered species are now treated as synonyms of other species. I am interested in ...
Serge
  • 11
0 votes
0 answers
8 views

Similarity measure for sparse, ordered, binary vectors, with more weighting to True values

I have two sparse, ordered, binary vectors, each with about 100 elements. I am under the impression that cosine similarity is useful for sparse, ordered, binary vectors. For my purposes, it ...
feonyte
  • 69
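
For the similarity question above, the sketch below compares three standard measures on hypothetical sparse 0/1 vectors. Cosine and Jaccard ignore positions where both vectors are 0, so matching True values carry the weight, while the simple matching coefficient counts shared zeros as agreement. None of these uses the ordering of the elements, so this covers only part of what the question describes.

```python
import math


def cosine(a, b):
    """Cosine similarity; for 0/1 vectors this is shared 1s over sqrt(|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def jaccard(a, b):
    """Shared 1s divided by positions where at least one vector is 1."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 1.0


def simple_matching(a, b):
    """Share of positions where the two vectors agree, zeros included."""
    return sum(1 for x, y in zip(a, b) if x == y) / len(a)


def make_vector(length, ones):
    """Hypothetical helper: a 0/1 list with 1s at the given positions."""
    return [1 if i in ones else 0 for i in range(length)]


a = make_vector(20, {2, 7, 11})
b = make_vector(20, {2, 7, 15})

print(cosine(a, b), jaccard(a, b), simple_matching(a, b))
```
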
0 votes
0 answers
11 views

Can you use a binary outcome variable in a longitudinal latent-growth mediation model?

I want to create an LGCM in which I assess whether two latent factors (the intercept and slope) mediate the relationship between a continuous predictor and binary outcome variable but am ...
user417995
3 votes
1 answer
49 views

Testing for difference in trends of binary-parameter proportion/fraction versus continuous parameter between two (or more) different populations

I have three samples of objects: sample X (total = 138), sample Y (same basic class as X, but located in a different environment; total = 186), and sample Z (total = 996). For each object, I have a ...
Peter Erwin
1 vote
1 answer
14 views

Handling Composite Variables in Latent Class Analysis

I am working with secondary survey data that includes 13 dichotomous variables. According to my theoretical model, I need to group these variables into composite variables. However, I understand that ...
Irina
  • 11
2 votes
0 answers
23 views

Quantifying a Sequence of Binary Outcomes

I'm compiling a dataset on animal color patterns. Blotches flank the body of the animal (Agkistrodon contortrix) and meet along its midline (i.e., spine). At midline, the blotches can join ...
Ross Couvillon
0 votes
0 answers
20 views

Correlation Coefficient is higher when likelihood of an event is lower, how does this occur?

I have several variables and want to know whether they influence pass/fail rates. To see which variables I might use as a leading indicator, I've pulled different variables such as "tutoring&...
helloyellobird
0 votes
0 answers
7 views

How to draw a ROC curve given estimated probability that a unit is positive and actual observed class? [duplicate]

Assume that a classification model fitted to data available to you has provided for each statistical unit a probability $P(+|x)$ that the unit is positive. The following table shows all available ...
evans5
  • 3
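
For the ROC question above, a minimal sketch of the usual threshold sweep: each distinct predicted probability is used as a cutoff, and the false-positive and true-positive rates at that cutoff give one point on the curve. The table from the question is not shown in the excerpt, so the probabilities and classes below are hypothetical.

```python
def roc_points(probs, labels):
    """Return (FPR, TPR) points, one per distinct threshold, starting at (0, 0)."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]
    for threshold in sorted(set(probs), reverse=True):
        # Classify as positive when the probability is at least the threshold.
        tp = sum(1 for p, y in zip(probs, labels) if p >= threshold and y == 1)
        fp = sum(1 for p, y in zip(probs, labels) if p >= threshold and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def area_under_curve(points):
    """Trapezoidal area under the piecewise-linear ROC curve."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical estimated P(+|x) and observed classes (1 = positive).
probs = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0, 0]

points = roc_points(probs, labels)
auc = area_under_curve(points)

print(points, auc)
```

Plotting the points in order, with FPR on the horizontal axis and TPR on the vertical axis, gives the ROC curve.
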
0 votes
0 answers
33 views

Panel Data logistic regression

I am working with a panel dataset that describes company code as an id variable and fiscal year as a time variable (1500 firms from 2013-2022). The dependent variable is binary. First I checked for ...
Lou
  • 31