Change of variables and necessary conditions for optimality
12 Aug 2017
In this post, we consider how removal and addition of degrees of freedom
through change of variables can help in searching for a minimum of a function.
Removing degrees of freedom
Consider a function . The necessary condition for optimality
says that its differential should be equal to zero,
However, we can introduce a new variable and consider a function
, which obviously has no critical point. What happened here is
that we lost one degree of freedom. Initially, we could change and
independently, but after introducing , we can only change their
product .
Sometimes, though, we don’t need to use all degrees of freedom to find
a critical point of a function.
For example, if and , then
describes the same set of solutions as the system of equations
obtained from the partial derivatives of .
So, we incur no loss of information by removing some degrees of freedom
in this case.
One should be careful, nevertheless, to ensure that the solution of
the equation lies in the range of the function .
For example, if , then we could write
with . Provided , we could haste to
declare a critical point of despite its lying
outside the image of .
To summarize,
- removing degrees of freedom can help in finding critical points;
- not all critical points are guaranteed to be found in this way;
- found points may be unreachable through original variables.
Adding degrees of freedom
Adding degrees of freedom can only hurt. Consider the function
that has a minimum at . Assuming we are studying its restriction
on , we can introduce the function of two variables
with and . Equating its differential to zero,
leads to contradiction, since it implies and at the same time.
We ran into such troubles because two variables give more flexibility than
we can actually afford with one. Expanding and in the differential,
we see that
and even though equation suggests setting ,
multiplying by results in a finite quantity.
Introducing extra variables just obscures the problem. Most importantly,
critical points of do not tell us anything about critical points of .
Therefore, adding degrees of freedom in its pure form is not helpful
for optimization.