Hinge loss in Support Vector Machines
From our SVM model, we know that hinge .
Looking at the graph for SVM in Fig 4, we can see that for , hinge loss is ‘0’. However, when , then hinge loss increases massively. As increases with every misclassified point (very wrong points in Fig 5), the upper bound of hinge loss also increases exponentially.
Hence, the points that are farther away from the decision margins have a greater loss value, thus penalising those points.
Conclusion: This is just a basic understanding of what loss functions are and how hinge loss works. I will be posting other articles with greater understanding of ‘Hinge loss’ shortly