
Questions tagged [neural-network]

Artificial neural networks (ANNs) are composed of 'neurons': programming constructs that mimic the properties of biological neurons. A set of weighted connections between the neurons allows information to propagate through the network to solve artificial intelligence problems without the network designer needing a model of a real system.

283 votes
12 answers
278k views

What are deconvolutional layers?

I recently read Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, Trevor Darrell. I don't understand what "deconvolutional layers" do / how they work. The ...
asked by Martin Thoma
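A minimal sketch of the idea behind a "deconvolutional" (transposed-convolution) layer, assuming PyTorch purely for illustration: a strided convolution halves the spatial size, and a transposed convolution with matching parameters learns to map it back up.

```python
import torch
import torch.nn as nn

x = torch.randn(1, 3, 8, 8)                      # (batch, channels, H, W)
down = nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1)
up = nn.ConvTranspose2d(16, 3, kernel_size=3, stride=2,
                        padding=1, output_padding=1)

y = down(x)   # shape (1, 16, 4, 4) -- downsampled by the stride
z = up(y)     # shape (1, 3, 8, 8)  -- learned upsampling back to 8x8
```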
199 votes
5 answers
155k views

What is the "dying ReLU" problem in neural networks?

Referring to the Stanford course notes on Convolutional Neural Networks for Visual Recognition, a paragraph says: "Unfortunately, ReLU units can be fragile during training and can "die". For ...
asked by tejaskhot (4,085)
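A minimal NumPy sketch of why a ReLU unit can "die": once its pre-activation is negative for every input, the gradient flowing back through it is zero, so its weights stop updating; a leaky ReLU keeps a small gradient instead.

```python
import numpy as np

z = np.array([-3.0, -0.5, 1.2, 4.0])        # pre-activations of one unit

relu = np.maximum(z, 0.0)
relu_grad = (z > 0).astype(float)            # [0, 0, 1, 1] -- zero gradient where z <= 0

alpha = 0.01                                 # leaky ReLU keeps a small slope for z <= 0
leaky = np.where(z > 0, z, alpha * z)
leaky_grad = np.where(z > 0, 1.0, alpha)     # never exactly zero
```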
198 votes
6 answers
373k views

How to draw Deep learning network architecture diagrams?

I have built my model. Now I want to draw the network architecture diagram for my research paper. Example is shown below:
asked by Muhammad Ali (2,487)
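One commonly used option (an assumption here, not necessarily what the asker wants) is Keras's plot_model, which renders the layer graph to an image when pydot and Graphviz are installed.

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Requires pydot and Graphviz; writes a block diagram of the model to model.png.
tf.keras.utils.plot_model(model, to_file="model.png", show_shapes=True)
```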
182 votes
21 answers
260k views

How do you visualize neural network architectures?

When writing a paper / making a presentation about a topic involving neural networks, one usually visualizes the network's architecture. What are good / simple ways to visualize common ...
asked by Martin Thoma
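For a plain-text alternative (again assuming Keras only for illustration), model.summary() prints each layer with its output shape and parameter count, which is often enough for a quick figure or table.

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(784,)),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])

model.summary()   # text table: layer name, output shape, number of parameters
```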
181 votes
6 answers
185k views

When to use GRU over LSTM?

The key difference between a GRU and an LSTM is that a GRU has two gates (reset and update gates) whereas an LSTM has three gates (namely input, output and forget gates). Why do we make use of GRU ...
asked by Sayali Sonawane
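A quick way to see the practical consequence of the extra gate (a sketch assuming PyTorch): an LSTM keeps four weight blocks per layer versus three for a GRU (reset, update, candidate), so for the same hidden size the LSTM has roughly 4/3 as many parameters.

```python
import torch.nn as nn

lstm = nn.LSTM(input_size=10, hidden_size=20)
gru = nn.GRU(input_size=10, hidden_size=20)

n_lstm = sum(p.numel() for p in lstm.parameters())   # 2560: 4 weight blocks per layer
n_gru = sum(p.numel() for p in gru.parameters())     # 1920: 3 weight blocks per layer
print(n_lstm / n_gru)                                # ~1.33
```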
150 votes
17 answers
126k views

Best python library for neural networks

I'm using neural networks to solve different machine learning problems. I'm using Python and pybrain, but this library is almost discontinued. Are there other good alternatives in Python?
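As a hedged illustration of what a modern alternative looks like (Keras here, chosen only as an example; other libraries are similar), defining and training a small network takes a few lines.

```python
import numpy as np
import tensorflow as tf

X = np.random.rand(200, 10).astype("float32")
y = (X.sum(axis=1) > 5).astype("float32")          # toy binary labels

model = tf.keras.Sequential([
    tf.keras.Input(shape=(10,)),
    tf.keras.layers.Dense(32, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X, y, epochs=5, batch_size=32, verbose=0)
```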
114 votes
11 answers
127k views

Choosing a learning rate

I'm currently working on implementing Stochastic Gradient Descent (SGD) for neural nets using back-propagation, and while I understand its purpose I have some ...
asked by ragingSloth (1,824)
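A minimal NumPy sketch of the role the learning rate plays in an SGD update (the toy problem and names are illustrative, not from the question): each step moves the weights against the gradient, scaled by the learning rate, so too small a value crawls and too large a value can diverge.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)

w = np.zeros(3)
learning_rate = 0.05                              # the hyperparameter being chosen

for epoch in range(20):
    for i in rng.permutation(len(X)):             # one sample at a time (stochastic)
        err = X[i] @ w - y[i]
        grad = err * X[i]                         # gradient of 0.5 * err**2 w.r.t. w
        w -= learning_rate * grad                 # step size controlled by learning_rate
```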
111 votes
5 answers
85k views

Backprop Through Max-Pooling Layers?

This is a small conceptual question that's been nagging me for a while: How can we back-propagate through a max-pooling layer in a neural network? I came across max-pooling layers while going through ...
asked by shinvu (1,240)
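A NumPy sketch of the standard answer (stated here as background, not taken from the thread): the gradient of a max-pooling window is routed entirely to the element that was the maximum in the forward pass and is zero everywhere else.

```python
import numpy as np

window = np.array([[1.0, 3.0],
                   [2.0, 0.5]])        # one 2x2 pooling window
out = window.max()                     # forward pass: 3.0

grad_out = 1.0                         # gradient arriving from the next layer
mask = (window == out).astype(float)   # 1 only at the argmax position
grad_window = mask * grad_out          # [[0, 1], [0, 0]] -- all other inputs get zero gradient
```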
90 votes
1 answer
89k views

When to use (He or Glorot) normal initialization over uniform init? And what are its effects with Batch Normalization?

I know that the Residual Network (ResNet) made He normal initialization popular. In ResNet, He normal initialization is used, while the first layer uses He uniform initialization. I've looked through ...
asked by Rizky Luthfianto
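For reference, a NumPy sketch of the two He variants (the Glorot versions just replace fan_in with the average of fan_in and fan_out): the normal form draws from N(0, 2/fan_in), the uniform form from U(-sqrt(6/fan_in), sqrt(6/fan_in)), and both end up with the same variance.

```python
import numpy as np

rng = np.random.default_rng(0)
fan_in, fan_out = 256, 128

std = np.sqrt(2.0 / fan_in)                                 # He normal
W_he_normal = rng.normal(0.0, std, size=(fan_in, fan_out))

limit = np.sqrt(6.0 / fan_in)                               # He uniform
W_he_uniform = rng.uniform(-limit, limit, size=(fan_in, fan_out))

# Both have variance 2 / fan_in: the uniform limit satisfies limit**2 / 3 == 2 / fan_in.
```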
87 votes
4 answers
53k views

How are 1x1 convolutions the same as a fully connected layer?

I recently read Yann LeCun's comment on 1x1 convolutions: In Convolutional Nets, there is no such thing as "fully-connected layers". There are only convolution layers with 1x1 convolution ...
asked by Martin Thoma
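A sketch (assuming PyTorch) of the equivalence LeCun is pointing at: a 1x1 convolution applied to a 1x1 feature map computes exactly what a fully connected layer with the same weights computes.

```python
import torch
import torch.nn as nn

conv = nn.Conv2d(64, 10, kernel_size=1)      # 1x1 convolution
fc = nn.Linear(64, 10)

with torch.no_grad():                         # copy the conv weights into the FC layer
    fc.weight.copy_(conv.weight.view(10, 64))
    fc.bias.copy_(conv.bias)

x = torch.randn(1, 64, 1, 1)                  # a 1x1 spatial feature map
same = torch.allclose(conv(x).view(1, 10), fc(x.view(1, 64)), atol=1e-6)
print(same)                                   # True
```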
83 votes
5 answers
50k views

What is the difference between "equivariant to translation" and "invariant to translation"

I'm having trouble understanding the difference between equivariant to translation and invariant to translation. In the book Deep Learning (I. Goodfellow, A. Courville, and Y. Bengio, MIT Press, 2016) ...
asked by Aamir (993)
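A NumPy sketch of the distinction (a constructed example, not from the book): convolution is equivariant, since shifting the input shifts the output by the same amount, while a global max-pool on top is invariant, since shifting the input leaves the pooled value unchanged.

```python
import numpy as np

x = np.zeros(10)
x[2] = 1.0                                    # impulse at position 2
k = np.array([1.0, 2.0, 1.0])                 # small convolution kernel

y = np.convolve(x, k, mode="same")
y_shifted = np.convolve(np.roll(x, 3), k, mode="same")

print(np.allclose(np.roll(y, 3), y_shifted))  # True: the output shifts with the input (equivariance)
print(y.max() == y_shifted.max())             # True: a global max-pool ignores the shift (invariance)
```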
78 votes
6 answers
165k views

What is the difference between Gradient Descent and Stochastic Gradient Descent?

What is the difference between Gradient Descent and Stochastic Gradient Descent? I am not very familiar with these; can you describe the difference with a short example?
asked by Developer (1,099)
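A minimal NumPy sketch of the difference on a toy linear-regression problem (illustrative, not from the question): batch gradient descent computes the gradient over the whole dataset per update, while stochastic gradient descent updates after each randomly chosen example.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.0, -2.0, 0.5])
lr = 0.1

# Batch gradient descent: one update per step, averaging over all 100 examples.
w_gd = np.zeros(3)
for _ in range(100):
    grad = X.T @ (X @ w_gd - y) / len(X)
    w_gd -= lr * grad

# Stochastic gradient descent: one update per (randomly drawn) example.
w_sgd = np.zeros(3)
for _ in range(100):
    i = rng.integers(len(X))
    grad = (X[i] @ w_sgd - y[i]) * X[i]
    w_sgd -= lr * grad
```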
76 votes
6 answers
152k views

Cross-entropy loss explanation

Suppose I build a neural network for classification. The last layer is a dense layer with Softmax activation. I have five different classes to classify. Suppose for a single training example, the ...
asked by enterML (3,051)
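A NumPy sketch of the computation for a single example with five classes (matching the setup in the question, with made-up numbers): softmax turns the logits into probabilities, and the cross-entropy loss is the negative log-probability assigned to the true class.

```python
import numpy as np

logits = np.array([2.0, 1.0, 0.1, -1.0, 0.5])   # outputs of the last dense layer
probs = np.exp(logits - logits.max())
probs /= probs.sum()                             # softmax, numerically stabilised

true_class = 0                                   # one-hot target [1, 0, 0, 0, 0]
loss = -np.log(probs[true_class])                # cross-entropy for this example
```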
70 votes
5 answers
52k views

Adding Features To Time Series Model LSTM

I have been reading up a bit on LSTMs and their use for time series, and it's been interesting but difficult at the same time. One thing I have had difficulties understanding is the approach to ...
asked by Rjay155 (1,225)
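A shape-only sketch (assuming Keras for illustration) of the usual approach: extra features go on the last axis, so the input is (samples, timesteps, n_features) and only the LSTM's input dimension grows.

```python
import numpy as np
import tensorflow as tf

n_samples, timesteps, n_features = 32, 10, 4       # e.g. a price series plus 3 extra features
X = np.random.rand(n_samples, timesteps, n_features).astype("float32")
y = np.random.rand(n_samples, 1).astype("float32")

model = tf.keras.Sequential([
    tf.keras.Input(shape=(timesteps, n_features)),  # features stacked on the last axis
    tf.keras.layers.LSTM(16),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.fit(X, y, epochs=2, verbose=0)
```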
69 votes
11 answers
103k views

Why should the data be shuffled for machine learning tasks?

In machine learning tasks it is common to shuffle the data and normalize it. The purpose of normalization is clear (to put features on the same range). But, after struggling a lot, I did not find ...
asked by Green Falcon (14.1k)
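Mechanically (a sketch of the "how" only, not the "why" the question asks about), shuffling means applying one random permutation to features and labels together before forming minibatches, so the batches are not biased by the original ordering.

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.arange(20).reshape(10, 2)       # toy features
y = np.arange(10)                      # toy labels, in the original (ordered) sequence

perm = rng.permutation(len(X))         # one permutation shared by both arrays
X_shuffled, y_shuffled = X[perm], y[perm]   # rows stay paired with their labels
```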
