
Questions tagged [neural-network]

Artificial neural networks (ANNs) are composed of 'neurons': programming constructs that mimic the properties of biological neurons. A set of weighted connections between the neurons allows information to propagate through the network to solve artificial intelligence problems without the network designer having a model of a real system.

0 votes
0 answers
12 views

How can I select subsets of features using a neural network?

This listing selects the best features from the 1000 available columns in a given dataset. The first three columns are dropped because they are useless data. The dataset is huge. So, they were read in ...
user366312
0 votes
0 answers
22 views

How does the weight vector behave when we initialize the weights to 0 in a perceptron?

While reading a book, I encountered this statement: Now, the reason we don't initialize the weights to zero is that the learning rate (eta) only has an effect on the classification outcome if the ...
Vipin Dubey
0 votes
1 answer
17 views

Everything is classified as background by segmentation model

I am training a U-Net model for medical image segmentation. The problem is that the binary masks that I'm using to train the model mostly consist of background pixels and a very small region of the whole ...
Ashwin Singh
0 votes
0 answers
12 views

Is it common for an LM (hundreds of millions of parameters) to beat an LLM (billions of parameters) on a binary classification task?

Preface: I am trying to fine-tune transformer-based models (an LM and an LLM). The LM that I used is DeBERTa, and the LLM is LLaMA 3. The task is to classify whether a text contains condescending language ...
sempraEdic
0 votes
0 answers
13 views

How to increase the optimal cutoff point (Youden index) after training a model?

So I trained a model on a medical dataset and got an AUROC of about 0.96 for detecting cancer in brain images. I noticed that the Youden index is 0.1, but I want to increase it to 0.5, ...
mutli-arm-bandit
-1 votes
1 answer
8 views

WGAN generating images from the training data

Is it possible for a GAN to somehow remember the training data distribution? Or maybe something leaks out when I calculate gradients? ...
Тима
0 votes
0 answers
24 views

Is it legit to normalize time series with respect to the x-axis?

I have a data set consisting of multivariate time series, e.g. a batch of my data has the shape (batch_size, timesteps, number_input_features) and I want to train a neural network on it to predict ...
ZenDen
  • 13
1 vote
1 answer
37 views

How does seeing training batches only once influence the generalization of a neural network?

I am referring to this question/scenario, "Train neural network with unlimited training data", but unfortunately I cannot comment. As I am not seeing any training batch multiple times I would guess that ...
ZenDen
  • 13
0 votes
0 answers
9 views

How to handle sequences with CrossEntropyLoss

First of all, I am new to the whole thing, so sorry if this is super dumb. I'm currently training a Transformer model for a sequence classification task using CrossEntropyLoss. My input tensor has the ...
Tobias
  • 101
0 votes
0 answers
22 views

What is the most accurate way of computing the evaluation time of a neural network model?

I am training some neural networks in PyTorch to use as an embedded surrogate model. Since I am testing various architectures, I want to compare the accuracy of each one, but I am also interested in ...
HWIK
  • 1
0 votes
0 answers
9 views

MobileNet vs ResNet

Q1: Why don't we remove the ReLU after the skip-connection addition in ResNet50, as we do in MobileNetV2, for better performance? Q2: And why don't we have a convolution layer in the skip connection for dimension ...
Tarun Saxena
2 votes
1 answer
22 views

Benchmark Neural Networks on High-Dimensional Functions

For a personal project, I am interested in benchmarking certain neural network architectures in the context of high-dimensional function approximation. Specifically, I am interested in continuous, ...
user82261
  • 121
0 votes
1 answer
18 views

What is the "fast version" of ZFNet referenced in SPPNet and Faster R-CNN papers?

I'm reading old papers (SPPNet and Faster R-CNN). In both cases, the authors refer to a "fast version of Zeiler and Fergus (ZF) Net"; specifically: In SPPNet: ZF-5: this ...
Papemax89
1 vote
0 answers
45 views

Why can't I replicate the results from this paper?

I'm trying to train a model to evaluate chess positions, following the methodology from this paper (note that the author presents several different architectures, but I'm only looking at the ANN with ...
William Markley
1 vote
1 answer
53 views

Weird neural network approach

I'm working on a problem where I need to create a neural network to optimize the seating arrangement for 24 unique individuals in a 6x4 grid, minimizing conflicts between adjacent (up, down, left, right) ...
Moein
  • 101
