Linear Regression
Logistic Regression For Classification
Find a linear hyperplane to separate the data, better that output the probability of class
- Linear model:
z(w,x)=w⋅x
- Link Function:
ˆp(z)=11+exp(−z)
- Cross entropy loss:
l(y,ˆp)=ylogˆp+(1−y)log(1−ˆp)
- Cost Function:
L(w,{xi,yi}mi=1)=m∑i=1log(1+exp(w⋅xi))−yiw⋅xi
- Gradient:
∇wL(w,{xi,yi}mi=1)=(11+exp(−w⋅xi)−yi)xi
The backpropagation algorithm works through the layers of deeper neural networks to calculate error gradients w.r.t to weights