Skip to main content

Questions tagged [classification]

An instance of supervised learning that identifies the category or categories which a new instance of dataset belongs.

911 questions with no upvoted or accepted answers
7 votes
2 answers
133 views

Data transformations in hierarchical classification

I am building a hierarchical text classifier using the Local Classifier Per Parent Node (LCPN) approach with the 'siblings' policy as described in the A survey of hierarchical classification across ...
matentzn's user avatar
  • 171
6 votes
2 answers
3k views

How to deal with missing data for Bernoulli Naive Bayes?

I am dealing with a dataset of categorical data that looks like this: ...
Chuck's user avatar
  • 161
6 votes
0 answers
99 views

Classify driver based on time-series sensor data

I want to build a model that can detect which driver is driving now the car based on a dataset that contains 20 driver records for 3600s each driver ( the dataset contains all the car sensors values ...
John Karimov's user avatar
5 votes
1 answer
507 views

Illustrating the dimensionality reduction done by a classification or regression model

Tl;DR: You can predict something, but how do you explain the prediction? EDIT: I have built a website that tries to answer this question with means of embedding / visually clustering data according ...
BenoitParis's user avatar
4 votes
0 answers
85 views

Non-Gaussian like distributions - Classifier of source data fails on target data

I ask you for help on a classification problem (classes are represented by the numbers 0,1 and 2). All features are extracted from time series data (fundamental is sinus shape). I have a source ...
deniz's user avatar
  • 51
4 votes
4 answers
279 views

Decision boundary in a classification task

I have 1000 data points from the bivariate normal distribution $\mathcal{N}$ with mean $(0,0)$ and variance $\sigma_1^2=\sigma_2^2=10$ with the covariances being $0$. Also there are 20 more points ...
thenac's user avatar
  • 141
4 votes
2 answers
617 views

How to choose layer from which to unfreeze image classification model

I'm wondering what steps do you take to decide on the part of the model to unfreeze. Do you do multiple experiments? Since the use of GPU is expensive, you must have some guidelines. Note: I know ...
user's user avatar
  • 71
4 votes
0 answers
2k views

How to train continuous/soft classification model?

The classic classification problem is like finding the function $F:\mathbb{R}^n\mapsto \{0,1\}$. The label set will be [Apple,Banana,Banana,...,Apple]. What if I want to train a function $F:\mathbb{R}...
Icyblade's user avatar
  • 4,346
4 votes
1 answer
836 views

Product classification in hierarchical categories based on multiple parameters and non-standard descriptions

I want to start a machine learning project in my company and a really big pain for spend analysts is to classify the products that buyers order for maintenance, tooling, raw material and such, as the ...
mad_dash2's user avatar
3 votes
0 answers
50 views

Weird behaviour when using RobERTA for text classification

I have a dataset with around 70 classes and the dataset is largely balanced ~150 samples per class. I am finetuning RoBERTA-base for 4 epochs with a ...
user1274878's user avatar
3 votes
1 answer
73 views

Selecting a subset of the population such that the distribution of a certain variable in the subset matches a reference distribution?

I have a large population ($P$) of entries and I am given the distribution of a certain variable ($x_0$) for a specific subset ($S$) of the population. The subset $S$ is not a random sample and hence ...
dan's user avatar
  • 31
3 votes
2 answers
62 views

How to detect whether an entire series is an outlier relative to others?

I have multiple price series of the same asset as follows. Visually, it is obvious that series "A" (the flat line) is an outlier, and series "E" (the line with the zig-zag pattern)...
finstats's user avatar
  • 131
3 votes
3 answers
131 views

Classification when the classification of the previous itens matter

I have a classification problem to solve, that seems to be common but I am struggling to find the name of this task and the best way to model this problem. Suppose I have a series of events that are ...
bratao's user avatar
  • 31
3 votes
2 answers
1k views

Why does my random forest classifier predicts one class more often?

I have a random forest classifier that predicts 0 class about twice as often as class 1. It also predicts class 0 with higher probabilities than class 1. It is not a imbalanced dataset. I tried ...
Ondřej Vitík's user avatar
3 votes
0 answers
543 views

how can i plot probability distribution of my classes in the way below?

All, I would like to plot the following: I have a binary classification problem where I am using xgboost as my 'model' below: ...
Maths12's user avatar
  • 526

15 30 50 per page
1
2 3 4 5
61