Skip to main content

Questions tagged [rnn]

A recurrent neural network (RNN) is a class of artificial neural network where connections between units form a directed cycle.

1 vote
1 answer
31 views

using a 2d matrix as a feature input to LSTM / RNN models

i am building an LSTM model to predict the combination of items that will be sold at a store level on a daily basis. Please note, this is an exploratory model and i have a good idea about the ...
Vikram Murthy's user avatar
0 votes
0 answers
14 views

How to combine Embedding layer with 3D input and 2D input in Pytorch

This familiar with my ideas. How to use Embedding() with 3D tensor in Keras? I'm re-implementing some table-to-text papers using RNN-based seq2seq (like this one https://arxiv.org/pdf/1603.07771v3) ...
jupyter's user avatar
  • 101
0 votes
0 answers
6 views

How BPTT updating the weights while input are varing

how the RNN gets trained(BPTT) when the input size is varying because to update the weights the network has to be stable right please reply on this Thanks in advance
ali nawaz's user avatar
0 votes
0 answers
14 views

Semantics Building In LSTM-Based Models - How does a LSTM is able to extract and represent long data using just one value (long-memory)

How does a LSTM is able to extract and represent long sequences with data while using just one value (long-memory / LM) to maintain all this information? If multiple value were used, it could be ...
Linces games's user avatar
0 votes
0 answers
33 views

Fuzzy Name Matching with Machine Learning. Input data encoding

I have a huge dataset: Last name, first name, date of birth of Indian residents and I need to match them for similarity. The matching is fuzzy, the data looks like this (names are fictitious for the ...
ккк ккк's user avatar
0 votes
0 answers
19 views

What is the shape of the hidden/cell state of convLSTM2D?

I am new to convLSTM2D and I understand how it works, however, I am confused about the shape of the hidden states at different epochs ...
user43280's user avatar
  • 111
1 vote
0 answers
28 views

recognition of names, surnames and patronymics

is there an example of neural networks on Github or Kaggle that perform the task of recognizing identical surnames, first names and patronymics? I'm just learning neural networks so it's interesting ...
nanana's user avatar
  • 11
-2 votes
2 answers
50 views

Determining the threshold value for the neural network

I have a dataset with last name, first name, middle name of people participating in sporting events. I need to train a neural network that will match similar surnames, first names and patronymics. But ...
nanana's user avatar
  • 11
0 votes
0 answers
11 views

Deep neural network is plateauing on a regression task

I'm training a deep neural network on temporal graph data. Currently, I'm trying to get a feel for how large / complex of a model I should aim for, so I'm trying to overfit to my smallest dataset. ...
aadithyaa's user avatar
1 vote
1 answer
15 views

Call volume prediction using LSTM and GRU

Machine Learning call volume prediction using LSTM and GRU I am trying to predict the number of incoming calls using LSTM and GRU I have done all the data preprocessing but upon training the model I ...
Kuda Kulrider's user avatar
1 vote
0 answers
25 views

Are formulas in the article incorrect?

I am learning about backpropagation in LSTM. I have been studying an article and watching two videos on the topic. The videos 1 and 2 repeat all the information from the article, but with additional ...
Тима 's user avatar
0 votes
0 answers
18 views

Drum sound classification using RNN issues - help needed

I am new to the field of machine learning, even tho I have solid background in semi-related fields (am control system engineer by trade) and as a hobby project I wanted to work a bit with sound ...
APasagic's user avatar
0 votes
0 answers
9 views

Adding sliding window dimension to data causes error: "Expected 3D or 4D (batch mode) tensor ..."

I wrote a pytorch data loader which used to return data of shape (4,1,192,320) representing the 4 samples of single channel image, each of size ...
Mahesha999's user avatar
0 votes
0 answers
18 views

Model Architecture for Time-Series Forecasting with Categorical and Multivariate Data

Context: I was looking at using an LSTM model to forecast the amount of gold gained for each of 10 heroes in a game of Dota 2, a MOBA game, as a base model in some type of model architecture. The game ...
DCRA's user avatar
  • 1
0 votes
1 answer
23 views

Connecting Flatten layer to Dense layer

I'm struggling with my neural network. In short, I need to recreate a model from anywhere on the internet, I've found a model that combines BiLSTM, LSTM and GRU. However, based on the error I got when ...
Tatiana Budanova's user avatar

15 30 50 per page
1
2 3 4 5
50