| Objective | CVXPY code | Result | Notes |
|---|---|---|---|
| \[\color{darkred}x^T \color{darkred}x\] | x.T@x | DCP error | print shows minimize x@x, i.e. transpose is dropped |
| \[\color{darkred}x^T \color{darkred}x\] | x@x | DCP error | |
| \[\color{darkred}x^T \color{darkred}x\] | cp.sum_squares(x) | transformed into quad_over_lin(x, 1.0) | |
| \[\color{darkred}x^T \color{darkblue}Q \color{darkred}x\] | x.T@Q@x | transformed into QuadForm(x,Q) | |
| \[\color{darkred}y:=\color{darkred}x-\color{darkblue}p\]\[\color{darkred}x^T \color{darkblue}Q \color{darkred}y\] | y=x-p; x.T@Q@y | DCP error | |
| \[\color{darkred}x^T \color{darkblue}Q \color{darkred}x - \color{darkred}x^T \color{darkblue}Q \color{darkblue}p\] | x.T@Q@x - x.T@Q@p | first term transformed into QuadForm(x,Q) | |
The script used for these experiments:

    import cvxpy as cp
    import numpy as np

    n = 5
    x = cp.Variable(n, "x", nonneg=True)
    Q = np.eye(n)
    p = np.ones(n)

    obj = x.T @ x                           # not DCP ??? Print says x @ x
    #obj = x @ x                            # not DCP
    #obj = cp.sum_squares(x)                # transformed into quad_over_lin(x, 1.0)
    #obj = x.T @ Q @ x                      # transformed into QuadForm(x,Q)
    y = x-p
    #obj = x.T @ Q @ y                      # not DCP
    #obj = x.T @ Q @ x - x.T @ Q @ p        # accepted

    prob = cp.Problem(cp.Minimize(obj), [sum(x)==1])
    print(prob)
    result = prob.solve(verbose=True)
We can instead declare x = cp.Variable((n,1), "x", nonneg=True), which creates an \((n\times 1)\) matrix. The printed model now shows the correct objective, including the transpose: minimize x.T @ x. But we still get:

    DCPError: Problem does not follow DCP rules. Specifically:
    The objective is not DCP. Its following subexpressions are not:
    x.T @ x
I suspect this is interpreted as \(\color{darkred}x^T\color{darkred}y\), i.e., as a product of two possibly different vectors. Just a guess. Of course, the workarounds are obvious: use cp.sum_squares(x), or use \(\color{darkred}x^T \color{darkblue}I \color{darkred}x\). I think that failing to recognize \(\color{darkred}x^T\color{darkred}x\) as convex can be classified as a bug.
I think the rule is that in DCP you are not allowed to multiply two variables together. You are right that x^Tx is just a quadform with an identity matrix as Q. Maybe this particular case could be added to be DCP, but I am afraid users are going to request many other similar "evidently" convex operations.