Multiple Constraints

Introduction

I have a previous post on Lagrange multipliers for a single constraint, however, as you have seen in some of my other posts. I have done constrained optimization on multiple constraints, in these cases I just add multiple Lagrange multipliers. So why do/can we do that?

Informal / Intuitive Proof

Consider we have a function f:RnRf:\mathbb{R}^n \rightarrow \mathbb{R}, and mm constraint equations gi:RnRg_i:\mathbb{R}^n \rightarrow \mathbb{R}. Lets consider a point xRnx \in \mathbb{R}^n such that all the constraints are satisfied. Therefore, if x is a local min/max on this constraints set, if we inspect all the gradients, {xgi}\{\nabla_xg_i\} you will notice that they form a subspace and xf\nabla_xf lies in this subspace which is why we can add more Lagrange multipliers since they are the coefficients for the linear combinations. But why does xf\nabla_xf lie in the constraint gradients’ subspace? Well imagine it doesn’t, therefore it will have a component that is orthogonal to all the constraint gradients’. Because the gradient is normal to the tangent hyperplane of the contour set then a vector that is orthogonal to all the gradients, must lie in all the tangent planes. Therefore, that vector points in the direction of the intersection of all the constraints, therefore, xf\nabla_xf can’t have any component along that direction since that will imply that xx is not a local min/max.