Questions tagged [classification]
An instance of supervised learning that identifies the category or categories to which a new instance in a dataset belongs.
911
questions with no upvoted or accepted answers
7
votes
2
answers
133
views
Data transformations in hierarchical classification
I am building a hierarchical text classifier using the Local Classifier Per Parent Node (LCPN) approach with the 'siblings' policy
as described in the A survey of hierarchical classification across ...
6
votes
2
answers
3k
views
How to deal with missing data for Bernoulli Naive Bayes?
I am dealing with a dataset of categorical data that looks like this:
...
6
votes
0
answers
99
views
Classify driver based on time-series sensor data
I want to build a model that can detect which driver is currently driving the car, based on a dataset that contains records for 20 drivers, 3600 s per driver (the dataset contains all the car sensor values ...
5
votes
1
answer
507
views
Illustrating the dimensionality reduction done by a classification or regression model
TL;DR: You can predict something, but how do you explain the prediction?
EDIT: I have built a website that tries to answer this question by means of embedding / visually clustering data according ...
4
votes
0
answers
85
views
Non-Gaussian like distributions - Classifier of source data fails on target data
I am asking for help with a classification problem (classes are represented by the numbers 0, 1 and 2). All features are extracted from time-series data (the fundamental is sinusoidal).
I have a source ...
4
votes
4
answers
279
views
Decision boundary in a classification task
I have 1000 data points from the bivariate normal distribution $\mathcal{N}$ with mean $(0,0)$ and variance $\sigma_1^2=\sigma_2^2=10$ with the covariances being $0$. Also there are 20 more points ...
4
votes
2
answers
617
views
How to choose the layer from which to unfreeze an image classification model
I'm wondering what steps you take to decide which part of the model to unfreeze. Do you run multiple experiments? Since GPU use is expensive, you must have some guidelines.
Note: I know ...
4
votes
0
answers
2k
views
How to train continuous/soft classification model?
The classic classification problem is like finding the function $F:\mathbb{R}^n\to \{0,1\}$. The label set will be [Apple,Banana,Banana,...,Apple].
What if I want to train a function $F:\mathbb{R}...
4
votes
1
answer
836
views
Product classification in hierarchical categories based on multiple parameters and non-standard descriptions
I want to start a machine learning project at my company. A really big pain point for spend analysts is classifying the products that buyers order for maintenance, tooling, raw material and such, as the ...
3
votes
0
answers
50
views
Weird behaviour when using RoBERTa for text classification
I have a dataset with around 70 classes, and the dataset is largely balanced (~150 samples per class). I am fine-tuning RoBERTa-base for 4 epochs with a ...
3
votes
1
answer
73
views
Selecting a subset of the population such that the distribution of a certain variable in the subset matches a reference distribution?
I have a large population ($P$) of entries and I am given the distribution of a certain variable ($x_0$) for a specific subset ($S$) of the population. The subset $S$ is not a random sample and hence ...
3
votes
2
answers
62
views
How to detect whether an entire series is an outlier relative to others?
I have multiple price series of the same asset as follows.
Visually, it is obvious that series "A" (the flat line) is an outlier, and series "E" (the line with the zig-zag pattern)...
3
votes
3
answers
131
views
Classification when the classification of the previous items matters
I have a classification problem to solve that seems to be common, but I am struggling to find the name of this task and the best way to model it.
Suppose I have a series of events that are ...
3
votes
2
answers
1k
views
Why does my random forest classifier predict one class more often?
I have a random forest classifier that predicts class 0 about twice as often as class 1. It also predicts class 0 with higher probabilities than class 1.
It is not an imbalanced dataset.
I tried ...
3
votes
0
answers
543
views
How can I plot the probability distribution of my classes in the way below?
All,
I would like to plot the following:
I have a binary classification problem where I am using xgboost as my 'model' below:
...