Abstract
The problem of minimizing a polynomial over the standard simplex is one of the basic NP-hard nonlinear optimization problems—it contains the maximum clique problem in graphs as a special case. It is known that the problem allows a polynomial-time approximation scheme (PTAS) for polynomials of fixed degree, which is based on polynomial evaluations at the points of a sequence of regular grids. In this paper, we provide an alternative proof of the PTAS property. The proof relies on the properties of Bernstein approximation on the simplex. We also refine a known error bound for the scheme for polynomials of degree three. The main contribution of the paper is to provide new insight into the PTAS by establishing precise links with Bernstein approximation and the multinomial distribution.
Similar content being viewed by others
References
Altomare, F., Campiti, M.: Korovkin-type approximation theory and its applications. De Guyter studies in mathematics, vol. 17. Walter de Guyter publishers, Berlin (1994)
Bellare, M., Rogaway, P.: The complexity of approximating a nonlinear program. Math. Program. 69, 429–441 (1995)
Bomze, I.M., de Klerk, E.: Solving standard quadratic optimization problems via semidefinite and copositive programming. J. Global Optim. 24(2), 163–185 (2002)
Bomze, I.M., Gollowitzer, S., Yildirim, E.A.: Rounding on the standard simplex: regular grids for global optimization. J. Global Optim. 59(2–3), 243–258 (2014)
Bos, L.P.: Bounding the Lebesque function for Lagrange interpolation in a simplex. J. Approx. Theory 38, 43–59 (1983)
Ditzian, Z.: Inverse theorems for multidimensional Bernstein operators. Pac. J. Math. 121(2), 293–319 (1986)
Ditzian, Z.: Best polynomial approximation and Bernstein polynomial approximation on a simplex. Indag. Math. 92, 243–256 (1989)
Ditzian, Z., Zhou, X.: Optimal approximation class for multivariate Bernstein operators. Pac. J. Math. 158, 93–120 (1993)
Johnson, N.L., Kotz, S., Balakrishnan, N.: Discrete Multivariate Distributions. Wiley, New York (1997)
de Klerk, E., den Hertog, D., Elabwabi, G.: On the complexity of optimization over the standard simplex. Eur. J. Oper. Res. 191(3), 773–785 (2008)
de Klerk, E., Laurent, M., Parrilo, P.: A PTAS for the minimization of polynomials of fixed degree over the simplex. Theor. Comput. Sci. 361(2–3), 210–225 (2006)
de Klerk, E., Laurent, M.: Error bounds for some semidefinite programming approaches to polynomial minimization over the hypercube. SIAM J. Optim. 20(6), 3104–3120 (2010)
Knoblauch, A.: Closed-form expressions for the moments of the binomial probability distribution. SIAM J. Appl. Math. 69(1), 197–204 (2008)
Lasserre, J.B.: Global optimization with polynomials and the problem of moments. SIAM J. Optim. 11, 796–817 (2001)
Motzkin, T.S., Straus, E.G.: Maxima for graphs and a new proof of a theorem of Túran. Can. J. Math. 17, 533–540 (1965)
Nesterov, Y.: Random Walk in a Simplex and Quadratic Optimization over Convex Polytopes. CORE Discussion Paper 2003/71, CORE-UCL, Louvain-La-Neuve (2003)
Nesterov, Y., Wolkowicz, H., Ye, Y.: Semidefinite programming relaxations of nonconvex quadratic optimization. In: Wolkowicz, H., Saigal, R., Vandenberghe, L. (eds.) Handbook of Semidefinite Programming, pp. 361–419. Kluwer Academic Publishers, Norwell (2000)
Sagol, G., Yildirim, E.A.: Analysis of Copositive Optimization Based Bounds on Standard Quadratic Optimization. Technical Report. Department of Industrial Engineering, Koc University, Sariyer, Istanbul, Turkey (2013)
Vavasis, S.: Approximation algorithms for concave quadratic programming. In: Floudas, C.A., Pardalos, P. (eds.) Recent Advances in Global Optimization, pp. 3–18. Princeton University Press, Princeton (1998)
Yildirim, E.A.: On the accuracy of uniform polyhedral approximations of the copositive cone. Optim. Methods Softw. 27(1), 155–173 (2012)
Author information
Authors and Affiliations
Corresponding author
Appendix: Proof of Theorem 7
Appendix: Proof of Theorem 7
In this Appendix, we give a self-contained proof for Theorem 7, which provides an explicit description of the moments of the multinomial distribution in terms of the Stirling numbers of the second kind (and as a direct application the explicit formulation for the Bernstein approximation on the simplex from Corollary 4).
Given \(x\in \varDelta _n\), we consider the multinomial distribution with \(n\) categories and \(r\) independent trials, where the probability of drawing the \(i\)-th category is given by \(x_i\). Hence, given \(\alpha \in I(n,r)\), the probability of drawing \(\alpha _i\) times the \(i\)-th category for each \(i\in [n]\) is equal to \({r!\over \alpha !}x^\alpha \) and, for any \(\beta \in {\mathbb N}^n\), the \(\beta \)-th moment of the multinomial distribution is given by
Our objective is to show the following reformulaton of the \(\beta \)-th moment in terms of the Stirling numbers of the second kind:
Our proof is elementary in the sense that we will obtain the moments of the multinomial distribution using its moment generating function. One of the ingredients which we will use is the fact that the identity (17) holds for the case \(n=2\) of the binomial distribution when \(\beta \in {\mathbb N}^2\) is of the form \(\beta =(\beta _1,0)\). Namely, the following identity is shown in [13] (see Theorem 2.2 and relation (3.1) there).
Lemma 5
[13] Given \(\beta _1\in {\mathbb N}\) and \(x_1\in {\mathbb R}\) such that \(0\le x_1\le 1\), one has
This implies that the identity (17) holds for the moments of the multinomial distribution when the order \(\beta \) has a single non-zero coordinate, i.e., \(\beta \) is of the form \(\beta =\beta _ie_i\) with \(\beta _i\in {\mathbb N}\).
Corollary 6
Given \(\beta _i\in {\mathbb N}\) and \(x\in \varDelta _n\), one has
Proof
By (16), we have
which is equal to \( \sum _{\alpha _i=0}^{\beta _i} r^{\underline{\alpha _i}} x_i^{\alpha _i} S(\beta _i,\alpha _i)\) by Lemma 5.\(\square \)
In order to determine the moments of the multinomial distribution we use its moment generating function
Then, for \(\beta \in {\mathbb N}^n\), the \(\beta \)-th moment of the multinomial distribution is equal to the \(\beta \)-th derivative of the moment generating function evaluated at \(t=0\). Namely,
By Corollary 6 we know that, for any \(\beta _i\in {\mathbb N}\),
Next we show an analogue of the above relation (19) for the evaluation of the \(\beta _ie_i\)-th derivative of the moment generating function at any point \(t\in {\mathbb R}^n\).
Lemma 6
For \(x\in \varDelta _n, \beta _i\in {\mathbb N}\) and \(t\in {\mathbb R}^n\), one has
For the proof we will use the following recursive relation for the Stirling numbers of the second kind.
Lemma 7
For any integers \(\beta \ge 0\) and \(\alpha \ge 1\), one has
Proof
This well known fact can be easily checked as follows. By definition, \(S(\beta +1,\alpha )\) counts the number of ways of partitioning the set \(\{1,\ldots ,\beta ,\beta +1\}\) into \(\alpha \) nonempty subsets. Considering the last element \(\beta +1\), one can either put it in a singleton subset (so that there are \(S(\beta ,\alpha -1)\) such partitions), or partition \(\{1,\ldots ,\beta \}\) into \(\alpha \) nonempty subsets and then assign the last element \(\beta +1\) to one of them (so that there are \(\alpha S(\beta ,\alpha )\) such partitions). This shows the result.\(\square \)
Proof
(of Lemma 6) To simplify notation we set \(M^r=M_{x}^{r}(t)\). We show the result using induction on \(\beta _i\ge 0\). The result holds clearly for \(\beta _i=0\) and also for \(\beta _i=1\) in which case we have
We now assume that the result holds for \(\beta _i\) and we show that it also holds for \(\beta _i+1\). For this, using the induction assumption, we obtain
Now, using (21), we can compute the last term as follows:
Plugging this into relation (22), we deduce
which concludes the proof.\(\square \)
We now extend the result of Lemma 6 to an arbitrary derivative of the moment generating function.
Theorem 9
For any \(x\in \varDelta _n, \beta \in {\mathbb N}^n\) and \(t\in {\mathbb R}^n\), one has
Proof
We show the result using induction on the size \(k\) of the support of \(\beta \), i.e., \(k=|\{i\in [n]:\beta _i\ne 0\}|.\) The result holds clearly for \(k=0\) and, for \(k=1\), the result holds by Lemma 6. We now assume that the result holds for \(k\) and we show that it also holds for \(k+1\). For this, consider the sequences \(\beta '=(\beta _1,\ldots ,\beta _k,\beta _{k+1},0,\ldots ,0)\) and \(\beta =(\beta _1,\ldots ,\beta _k,0,0,\ldots ,0)\in {\mathbb N}^n\), where \(\beta _1,\ldots ,\beta _{k+1}\ge 1\). By the induction assumption we know that
setting again \(M^r=M^r_x(t)\) for simplicity. Using (23), we obtain
Note that \(\alpha _{k+1}=0\) since \(\alpha _{k+1}\le \beta _{k+1}\) and \(\beta _{k+1}=0\). Hence, \(M^{r-|\alpha |}\) is the only term containing the variable \(t_{k+1}\) and thus (24) implies
We now use Lemma 6 to compute the last term:
Plugging (26) into (25) we obtain
after setting \(\alpha '=\alpha +e_{k+1}\theta _{k+1}\). This concludes the proof of Theorem 9. \(\square \)
Rights and permissions
About this article
Cite this article
de Klerk, E., Laurent, M. & Sun, Z. An alternative proof of a PTAS for fixed-degree polynomial optimization over the simplex. Math. Program. 151, 433–457 (2015). https://doi.org/10.1007/s10107-014-0825-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10107-014-0825-6