Questions tagged [xgboost]
For questions related to the eXtreme Gradient Boosting algorithm.
709
questions
0
votes
1
answer
24
views
How can I improve xgboost classifier if overfitting start from the initial epochs?
I am training a XGBoost multi-class classifier, but got very bad result. The train/val leaning multi-class logloss curve showed that overfitting started from the early epochs. What directions can I ...
1
vote
0
answers
18
views
Using a prediction from a First Principles Model in a second, statistical model to improve accuracy
I am trying to figure out what this is called, so I can do some reading on it and see which types of statistical models excel in this framework (and known pitfalls to avoid). I have been calling it a ...
1
vote
0
answers
9
views
Use a metric that is not available in the list of metric for xgboost
Working in R.
I am following this post on stack overflow.
I am train an xgboost model and I want to use another metric that is not in the list of metric we can whoose for the eval_metric parameter.
I ...
0
votes
0
answers
22
views
Improving Recall and Precision of the Minority Class with XGBoost to Maximize Profits in Unbalanced Data
The company is interested in identifying profitable customers who are likely to purchase a ticket when given a promotional offer. My goal is to build a model to predict whether a customer will buy a ...
1
vote
0
answers
30
views
Why do I get missing value error in XGBoost when it is supposed to be supported?
I read that in the latest versions of XGBoost, the model can handle missing values. I am using the model on some data that contains for example, the BMI, bloodpressure, age, binary data (yes/no) and ...
0
votes
0
answers
11
views
training with a grouped data and having conditions on the group [closed]
i have a data of students, with various features of the students. Now students are are divided in multiple groups. each group having 3-6 students. I have to predict the marks of individual students ...
0
votes
0
answers
13
views
can we build sequential model?
Like is it possible to train a model on X1_i Inputs Y1_i Output and then the second one is running on X1_i + Y1_i to give output Y2_i ??
Context : (I am just getting my hand into ML and trying to ...
2
votes
0
answers
31
views
XGBoost GPU Support Regression vs Classification
I am having trouble getting an XGBRegressor run on my GPU when it works fine with XGBClassifier. Here is the code and the error:
Classification:
...
2
votes
1
answer
29
views
Does it make sense that the performance of XG Boost varies dramatically from two machines holding all hyperparameters fixed?
I am hyperparameter tuning an xgboost model and I am finding that depending if I train the model locally on my machine vs on AWS sagemaker, I get quite different results.
Running cross-validation ...
1
vote
1
answer
37
views
Making cpp function from xgboost dump_model() output
I'd like to use the output of an xgboost BDT model in a code base without having to rely explicitly on xgboost or otherwise. Using a modified version of this script, xgb2cpp I am able to generate a ...
0
votes
0
answers
26
views
Comparing ROC curve and AUC score of different models for binary classification
I am doing a binary classification problem. The dataset has around 100K records with 40 variables.
I have tried different ML models. First, I used a logistic regression model and ended up getting a ...
1
vote
0
answers
28
views
Xgboost custom objective function. How to modify the weights?
I have a custom objective function to xgboost:
...
1
vote
0
answers
57
views
Multiclass matrix loss function in scikit-learn / xgboost / lightgbm
I have data with 4 classes: $c_1, c_2, c_3, c_4$.
I'd like to create a classifier which has different scaling for the loss function per class combination:
$$ \begin{bmatrix} 0 & l \left( \hat{c}_{...
1
vote
1
answer
46
views
XGB predict_proba estimates don't match sum of leaves
When using an XGB model in the context of binary classification, I observed that the test estimates given by predict_proba were close but not equal to the results I ...
0
votes
0
answers
13
views
a few records for training, and one record after training
I try to make some credit score task. I stuck in conceputal problem.
There is:
train_data (62 columns, 10339239 rows, 1250000 unique ID values [0 - 1249999]([min-max] ID values))
test_data (62 ...