Why are most Lagrange multipliers zero in the SVM solution?

Question

I read everywhere that a non-zero Lagrange multiplier $\lambda_i$ signifies that the corresponding point $x_i$ is a support vector, but I can't see how a support vector and a non-support vector have a different value for the Lagrange multiplier.

Can you please explain how the process of optimizing the Lagrangian leads to some Lagrange multipliers being zero and some non-zero?

I think you'll find this helpful: engr.mun.ca/~baxter/Publications/LagrangeForSVMs.pdf — Alex R., Commented May 27, 2016 at 21:52
@AlexR. In 4.1 (Example 4): First it is stated that $x^2-1 \geq 0$ but later on I see that, after deriving w.r.t. $\lambda$, we get $x^2-1=0$. How is this possible? Can't this value be greater than zero? — Arnomoonens, Commented Jun 1, 2016 at 7:22
I think you can find this answer helpful. stats.stackexchange.com/questions/54976/… — iRestMyCaseYourHonor, Commented Jul 3, 2020 at 20:16

Alex R. · Accepted Answer · 2016-05-27 22:04:17Z

When solving your SVM problem, you'll be optimizing a Lagrangian subject to KKT conditions. Specifically, something like:

$$L(x)=f(x)-\sum_k \lambda_k c_k(x),$$

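where the constraints satisfy $c_k(x)\geq 0$ and the multipliers satisfy $\lambda_k\geq 0$. The optimum is achieved when the gradient of the above Lagrangian with respect to $x$ is equal to 0, $\lambda_k\geq 0$, and $\lambda_k c_k(x)=0$ for all $k$ (complementary slackness). Specifically, when $\lambda_k\neq 0$, the constraint must hold with equality, $c_k(x)=0$, and is said to be active, whereas if $\lambda_k=0$, the constraint plays no role at the optimum: you could drop it and the optimum would stay where it is. In the SVM, the constraints with $\lambda_k>0$ belong to the points lying exactly on the margin, which is why those points are the support vectors and all other points get $\lambda_k=0$.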
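To see complementary slackness numerically, here is a minimal Python sketch. It assumes a toy, linearly separable 2D data set and a hard-margin SVM with no bias term (so the separating line passes through the origin), which keeps the dual free of the equality constraint. The data, the helper names `dot` and `fit_dual`, and the use of projected coordinate ascent on the dual are all illustrative assumptions, not the method any particular SVM library uses.

```python
# Toy hard-margin SVM without a bias term (an assumption made for simplicity):
#   minimize 0.5 * ||w||^2
#   subject to c_i(w) = y_i * (w . x_i) - 1 >= 0 for every point i.
X = [(2.0, 1.0), (1.0, 0.0), (-1.0, -2.0), (0.0, -1.0), (2.0, 2.0), (-3.0, -1.0)]
y = [1, 1, -1, -1, 1, -1]


def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(ai * bi for ai, bi in zip(a, b))


def fit_dual(X, y, passes=100):
    """Projected coordinate ascent on the dual; returns (lam, w)."""
    lam = [0.0] * len(X)
    w = [0.0] * len(X[0])
    for _ in range(passes):
        for i, (xi, yi) in enumerate(zip(X, y)):
            # Partial derivative of the dual objective with respect to lam[i].
            grad = 1.0 - yi * dot(w, xi)
            # Exact one-dimensional maximizer, clipped so that lam[i] >= 0.
            new = max(0.0, lam[i] + grad / dot(xi, xi))
            step = new - lam[i]
            lam[i] = new
            # Keep w = sum_i lam[i] * y[i] * x[i] up to date.
            w = [wj + step * yi * xj for wj, xj in zip(w, xi)]
    return lam, w


lam, w = fit_dual(X, y)
for xi, yi, li in zip(X, y, lam):
    c = yi * dot(w, xi) - 1.0  # constraint value c_i(w)
    print(f"x={xi}, lambda={li:.3f}, c={c:.3f}, lambda*c={li * c:.3f}")
```

On this data only the two points that sit exactly on the margin, $(1,0)$ and $(0,-1)$, end up with a non-zero multiplier. Every other point satisfies its constraint strictly ($c_i>0$), so its multiplier is clipped to zero, and the product $\lambda_i c_i$ vanishes for all points.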

I see that it can move freely around when $c_i(x)=0$, because then the constraint $\lambda_i c_i(x)=0$ is already satisfied. But why would it become greater than zero? Is it because then the Lagrangian is minimized?
– Arnomoonens
Commented May 28, 2016 at 8:51
