Skip to main content

All Questions

6 votes
3 answers
758 views

Application of the chain rule to $3$-layers neural network

Consider the differentiable functions $L^1(x,\theta^1),L^2(x^2,\theta^2),L^3(x^3,\theta^3)$, where every $x_k,\theta^k$ are real vectors, for $k=1,2,3$. Also define $\theta=(\theta^1,\theta^2,\theta^3)...
Lilla's user avatar
  • 2,109
3 votes
1 answer
197 views

Implementing binary logistic regression from scratch

Background knowledge: To train a logistic regression model for a classification problem with two classes (called class $0$ and class $1$), we are given a training dataset consisting of feature vectors ...
littleO's user avatar
  • 52.5k