Questions tagged [logistic-regression]
Refers generally to statistical procedures that utilize the logistic function, most commonly various forms of logistic regression
705
questions
0
votes
0
answers
23
views
How to choose thresholds to discretize target for binary classification
My group is using logistic regression to investigate the most predictive features in a dataset. Our target variable is actually a continuous variable that we discretized using two cutoff thresholds (...
0
votes
0
answers
16
views
Can I train a logistic regression model for combining ML models to form an ensemble?
I have 3 ML models trained to perform classification on a dataset. I want to combine them into an ensemble model. I understand that there are multiple ways to do this - voting classifier, stacking, ...
0
votes
0
answers
18
views
Logistic regression on unknown features
Consider the following problem: I want to classify data into classes. The features have names, and for example, assume that the names are lowercase words in English. We assume that not all elements of ...
0
votes
0
answers
20
views
How to train Logistic regression model with multiple inputs for 1 target value?
My data looks similar to this (the picture below is not mine, but it describes my situation perfectly):
where the IDs are not unique but for each ID value I have a unique target value
The following ...
0
votes
0
answers
26
views
Comparing ROC curve and AUC score of different models for binary classification
I am doing a binary classification problem. The dataset has around 100K records with 40 variables.
I have tried different ML models. First, I used a logistic regression model and ended up getting a ...
0
votes
1
answer
33
views
Why does precision decrease with increasing threshold?
I've trained a logistic regression model using scikit-learn's LogisticRegression class. I'm dealing with stock data so it's quite noisy and difficult to predict ...
0
votes
1
answer
71
views
ROC curve manual calculation vs. pROC package R
I want to recreate the ROC curve manually on my dataset and compare it to the roc function from the pROC package in R. I'm using the customer churn dataset telco.csv from Kaggle....
0
votes
0
answers
34
views
How to Determine the Minimum Value of a Continuous Variable for Predicting a Categorical Variable using Logistic Regression?
I am using logistic regression to predict df['MortSubiteCardiaque'], which contains 0 and 1, based on my continuous variable df['NTProBNP']. I would like to determine the threshold for df['NTProBNP'], ...
1
vote
1
answer
32
views
Multinomial Logistic Regression sensitive to choice of Encoding
I am using the following LogisticRegression model from sklearn. The task requires selecting one label from multiple labels, so if I provide a, b the output could be <...
0
votes
0
answers
26
views
How many different confusion matrices?
We want to determine whether a data point belongs to the class + or −. We train a classifier using binary logistic regression. Each data point has two features, f1 and f2, which can take the values 1 ...
0
votes
1
answer
39
views
Selecting optimal regression model using cross validation
I have a logistic mixed model (lme4 package in R). I want to assess whether participants' scores on the measures 'sumspq', 'sumpdi', and 'sumcaps' significantly affect the difference in performance ...
2
votes
0
answers
53
views
Why is cross-entropy increasing with accuracy?
I'm implementing softmax regression and I'm struggling to understand why the value of the cross-entropy keeps increasing: $H(y, p)=-\sum_{i=1}^C y_i \log(p_i)$, ...
1
vote
0
answers
40
views
Analysis of relationship between accuracy and total loss (or cost) during training with logistic loss function and threshold 0.5
I'm trying to understand the relationship between training accuracy and training loss in classification tasks, specifically using logistic regression. When using logistic loss as the loss function ...
0
votes
0
answers
56
views
Help with multinomial logistic regression
I am a data science student and have the opportunity to work on an article regarding cardiac arrests in our country. So far I have fitted a multinomial regression model and I also plan on doing a ...
0
votes
1
answer
205
views
Random Forest overfitting to unbalanced data set
I am working on an unbalanced classification problem. I have 2000 positive points and 6000 negative points (chosen randomly from a universe of 100k negative points). Although I have ~...