minβmaxi|ri|y−Xβ=r
As in [1], β are coefficients to estimate and X,y are data. r are the residuals.
Some obvious and less obvious formulations are:
- Variable splitting:minzz≥r+i+r−ir+i−r−i=yi–∑jXi,jβjr+i,r−i≥0 With variable splitting we use two non-negative variables r+i−r−i to describe a value ri that can be positive or negative. We need to make sure that one of them is zero in order for r+i+r−i to be equal to the absolute value |ri|. Here we have an interesting case, as we are only sure of this for the particular index i that has the largest absolute deviation (i.e. where |ri|=z). In cases where |ri|<z we actually do not enforce r+i⋅r−i=0. Indeed, when looking at the solution I see lots of cases where r+i>0,r−i>0. Effectively those r+i,r−i have no clear physical interpretation. This is very different from the LAD regression formulation [1] where we require all r+i⋅r−i=0.
- Bounding:minz–z≤yi–∑jXi,jβj≤zHere z can be left free or you can make it a non-negative variable (it will be non-negative automatically). Note that there are actually two constraints here: –z≤yi–∑jXi,jβj and yi–∑jXi,jβj≤z. This model contains the data twice, leading to a large number of nonzero elements in the LP matrix.
- Sparse bounding: This model tries to remedy the disadvantage of the standard bounding model by introducing extra free variables d and extra equations:minz–z≤di≤zdi=yi–∑jXi,jβj Note again that –z≤di≤z is actually two constraints: –z≤di and di≤z.
- Dual:max∑iyi(di+ei)∑iXi,j(di+ei)=0⊥βj∑i(di−ei)=1di≥0,ei≤0The estimates β can be found to be the duals of equation XT(d+e)=0.
Opposed to the LAD regression example in [1], the bounding formulation is very fast here. The dual formulation remains doing very good.
Historical Note
The use of this minimax principle goes back to Euler (1749) [3,4].![]() |
Leonhard Euler (1707-1783) |
![]() |
Pafnuty Lvovich Chebyshev (1821-1894) |
References
- Linear Programming and LAD Regression, http://yetanothermathprogrammingconsultant.blogspot.com/2017/11/lp-and-lad-regression.html
- A.Giloni, M.Padberg, Alternative Methods of Linear Regression, Mathematical and Computer Modeling 35 (2002), pp.361-374.
- H.L. Harter, The method of least squares and some alternatives, Technical Report, Aerospace Research Laboratories, 1972.
- L. Euler, Pièce qui a Remporté le Prix de l'Academie Royale des Sciences en 1748, sur les Inegalités de Mouvement de Saturn et de Jupiter. Paris (1749).
No comments:
Post a Comment