How to handle sequences with CrossEntropyLoss

Asked 30 days ago

Modified 30 days ago

Viewed 9 times

First of all, I am new to the whole thing, so sorry if this is super dumb.

I'm currently training a Transformer model for a sequence classification task using CrossEntropyLoss. My input tensor has the shape (batch_size, classes, seq_len) and my target tensor has the shape (batch_size, seq_len).

ChatGPT advised me to do the following:

yHatReshaped = yHat.view(-1, 512)
yReshaped = y.view(-1)
error = lossFunction(yHatReshaped, yReshaped)

Is that correct, and is it the best way to handle a sequence? The documentation also confuses me, since it says (N,C,d1,d2,...,dK) for the K-dimensional loss. Is my sequence basically a d1? I don't understand the whole thing.
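For reference, here is a minimal sketch of the two ways these shapes can be fed to the loss. It assumes that 512 is the number of classes (taken from the `view(-1, 512)` call) and uses made-up sizes and random data; the names `y_hat`, `y`, and `loss_fn` are placeholders rather than the original variables. As far as the documented shapes go, an input of (N, C, d1) with a target of (N, d1) can be passed directly, so the sequence dimension plays the role of d1. If the tensors are flattened instead, the class dimension has to be moved to the end first.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical sizes for illustration; 512 classes is an assumption.
batch_size, num_classes, seq_len = 4, 512, 10

# Logits in the layout from the question: (batch_size, classes, seq_len)
y_hat = torch.randn(batch_size, num_classes, seq_len)
# Integer class targets: (batch_size, seq_len)
y = torch.randint(0, num_classes, (batch_size, seq_len))

loss_fn = nn.CrossEntropyLoss()

# Option 1: pass the tensors as they are, i.e. (N, C, d1) with target (N, d1).
loss_direct = loss_fn(y_hat, y)

# Option 2: flatten to (N * seq_len, C). The class dimension must be last
# before reshaping, otherwise class and position entries get mixed up.
y_hat_flat = y_hat.permute(0, 2, 1).reshape(-1, num_classes)
y_flat = y.reshape(-1)
loss_flat = loss_fn(y_hat_flat, y_flat)

# With the default mean reduction, both options give the same value.
print(loss_direct.item(), loss_flat.item())
```

Under these assumptions, calling `view(-1, 512)` directly on a (batch_size, classes, seq_len) tensor would still run without an error, but each row would no longer hold the class scores of a single position, so the loss would be computed against mismatched targets.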

Thanks in advance for your help!

asked Jun 21 at 15:12

Tobias

Browse other questions tagged
neural-network
pytorch
loss-function
or ask your own question.
