Questions tagged [neural-network]
Artificial neural networks (ANN), are composed of 'neurons' - programming constructs that mimic the properties of biological neurons. A set of weighted connections between the neurons allows information to propagate through the network to solve artificial intelligence problems without the network designer having had a model of a real system.
4,383
questions
1
vote
1
answer
55
views
wierd neural network approache
I'm working on a problem where I need to create a neural network to optimize the seating arrangement for 24 unique individuals in a 6x4 grid, minimizing conflicts between adjacent (up,down,left,right) ...
2
votes
0
answers
13
views
What's the best way to incorporate momentum and regularization when training a neural network?
I want to implement the momentum algorithm to train a neural network, but I'm uncertain about where the regularization term should be incorporated. For ridge regularization, one option is to have:
$$
...
1
vote
0
answers
9
views
Residual Network Skip Connection Clarification
In ResNets do skip connections get utilised at every step? If not what causes a layer to be skipped vs not skipped?
Thank you,
1
vote
1
answer
34
views
Predicted output is only 0s
I am developing a neural network using Home credit Default Risk Dataset.
The prediction should be between 0.0 and 1.0 but my algorithm's outcome is just 0.0 for every row.
My Code
...
0
votes
0
answers
14
views
Semantics Building In LSTM-Based Models - How does a LSTM is able to extract and represent long data using just one value (long-memory)
How does a LSTM is able to extract and represent long sequences with data while using just one value (long-memory / LM) to maintain all this information?
If multiple value were used, it could be ...
0
votes
0
answers
14
views
Impact of Adding Imbalanced Data on Model Performance for Different Groups
Suppose I initially have a dataset with 50 samples of type A and 50 samples of type B, each with several features. I built a neural network model using this data and recorded the prediction accuracy ...
3
votes
1
answer
233
views
What ML model for regression given tabular AND image data?
I'd like to predict the power production of a windfarm given the wind speed, its direction and other variables related to the specific wind turbines. However, due to wake effects (wind speed decreases ...
1
vote
0
answers
38
views
Class imbalance for binary classification tasks
I am looking to train a binary classifier. Most of my experience so far has been with generative models, not classifiers, so I am wondering with respect to training data, what is a good ratio of 0 and ...
0
votes
1
answer
27
views
How to update first layer weights?
I’m trying to make a neural network without using any deep learning library that recognizes numbers in the mnist database. Its structure is: 784 input neurons (for the 784 pixels in the number images),...
3
votes
1
answer
45
views
Is it legal to use a model found on github for a personal project and uploading the personal project onto github? [closed]
I found a great model I would like to use and make improvements upon for a personal project. It doesn't contain any liscenses nor does it mention anything about restrictions of use.
Are AI models like ...
3
votes
1
answer
29
views
Outputting handwritten digits with a Neural Network
I know that you can use a neural Network to recognize handwritten digits. How would you then use that same neural network to output handwritten digits in the unique style of that network? In other ...
0
votes
0
answers
23
views
Theoretical Limitations of Achieving 100% Accuracy in Modeling Non-linear Relationships with Neural Networks
I am working on a project where I need to model a specific non-linear relationship using a neural network. The relationship is given by $y = 3x_1^2x_2^3 $. The approach involves:
Preprocessing the ...
6
votes
1
answer
180
views
Changing output size from a model
So I am currently training some deep learning models for some basic classification problems, and I am trying to figure out if it is possible to change the output size of the model in case I want to ...
0
votes
1
answer
30
views
How to explain missing dates to a model?
I have this dataset that I'm trying to train a neural network on.
The problem is that since weekend dates are not available, I am not confident in whether the model is able to account for that. ...
1
vote
1
answer
62
views
Improving GPU Utilization in LLM Inference System
I´m trying to build a distributed LLM inference platform with Huggingface support. The implementation involves utilizing Python for model processing and Java for interfacing with external systems. ...