- Research
- Open access
- Published:
The modified accelerated Bregman method for regularized basis pursuit problem
Journal of Inequalities and Applications volume 2014, Article number: 130 (2014)
Abstract
In this paper, a modified accelerated Bregman method (MABM) for solving the regularized basis pursuit problem is considered and analyzed in detail. This idea is based on the fact that the linearized Bregman method (LBM) proposed by Osher et al. (Multiscale Model. Simul. 4(2):460-489, 2005) is equivalent to a gradient descent method applied to a certain dual formulation which converges to the solution of the regularized basis pursuit problem. The proposed method is based on an extrapolation technique which is used in accelerated proximal gradient methods presented by Nesterov (Dokl. Akad. Nauk SSSR 269:543-547, 1983). It is verified that the modified accelerated Bregman method (MABM) is equivalent to the corresponding accelerated augmented Lagrangian method (AALM). The theoretical results confirm that the method has a rapid convergence rate of .
1 Introduction
Compressed sensing, an interesting research field involving how to obtain information, acts as a crucial role in signal processing, image restoration, etc. The origin of its name is based on the idea of encoding a large sparse signal exploiting a relatively small number of linear measurements, then decoding the signal either through minimizing the 1-norm or utilizing a combinational algorithm, a greedy algorithm, etc. It resulted from elementary analysis and approximation theory by Kashin [1], but was brought into the forefront by the work of Candés et al. [2–4] and Donoho [5] who constructed specific methods and showed their application prospects. The concrete model considered in compressed sensing is the so-called basis pursuit problem [6, 7]
where , . It can be deemed a classical transform in the compressed sensing field into the following NP-hard discrete optimization problem:
where , , denotes the number of nonzero elements.
In many practical circumstances, the data or signals are frequently represented by a few matrices, which, in fact, is convenient for data processing and analysis. However, the data are often given with damage, loss or noise pollution, etc. In this case, how to restore the original data is among the practical difficulties faced when solving the matrix reconstruction problem. Being similar to handling the original signal (vector form) in the compressed sensing field, matrix remodeling refers to an original compressible or sparse representation matrix which can be exactly or approximately reconstructed with a proper model. The matrix reconstruction problem can be classified with matrix completion (MC) [8, 9] and matrix recovery (MR) [10, 11], which is met in various areas, e.g., a famous application is the Netflix system [12] in matrix completion; while the matrix recovery problem originates from face image processing, background modeling and so forth, it can be applied to image alignment as by Peng et al. [13]. Generally speaking, the matrix reconstruction problem is written as
where and C is a convex set. Normally, the following affine constrained minimization problem can be considered:
where the linear map , is a given vector.
The model emerges in many fields, such as determining a low-order controller for a plant [14] and a minimum order linear system realization [15], and solving low-dimensional Euclidean embedding problems [16]. A natural generalization in the sense of compressed sensing, (1.4) can be formulated as the nuclear norm minimization problem
where denotes the sum of singular values of the corresponding matrix. The matrix completion problem
where , , Ω is a certain index set, is considered as a special case of (1.3). The idea is inspired by the method presented in [17, 18]etc. Now, we consider the so-called regularized basis pursuit problem
Adding the penalized term of the constraint condition to the objection function of the regularized basis pursuit problem, we obtain
Particularly in [18], putting in the term in (1.7) yields the tractable object function, which is a strictly convex function. Thus, the linearly constrained basis pursuit problem has a sole solution and its dual problem is smooth. When μ is sufficiently small, the solution to (1.7) is also the solution of (1.1). This exact regularization property of (1.7) was studied in [19, 20]. Problem (1.7) has a 2-norm in the regularizer term, thus it is considered less sensitive to noise than the basis pursuit problem (1.1).
To solve (1.1), a linearized Bregman iteration method was presented in [21], which was motivated by [22]. The idea of the linearized Bregman method is to combine a fixed point iteration with the Bregman method in [23, 24], but the linearized Bregman method is a bit slow in the convergence result. Hence, many accelerated schemes have been brought up in various theses. For example, in [21] the kicking technique was introduced. Yin [19] verified that the linearized Bregman method is equivalent to a gradient descent method applied to the Lagrangian dual problem of (1.7). They improved the linearized Bregman method, utilizing the Barzilai-Borwein line of searching [25], nonlinear conjugate gradient methods, and the method of limited memory BFGS [26]. Huang et al. [18] proposed an accelerated linearized Bregman method which is based on the fact that the linearized Bregman method is equivalent to the gradient descent method applied to a Lagrangian dual of problem (1.7) and the extrapolation technique, which is adopted in the accelerated proximal gradient methods [27] proposed by Nesterov et al. To solve problem (1.7), Goldstein et al. [28] used an alternating split technique and its Lagrange dual problem.
Based on these studies, we extend the accelerated Bregman method to solve (1.7) in which the object function might be not differentiable but have the ‘good’ performance (convex and continuous). We put forward a new improvement formula based on the accelerated Bregman method. It can be proved to have the property that the modified Bregman method is equivalent to the corresponding accelerated augmented Lagrangian method, and the latter has a rapid convergence rate which can be deemed to be an improvement of [17].
The rest of this article is organized as follows. In Section 2, we sketch the original Bregman method and the linearized Bregman method which are useful for the subsequent analysis. In Section 3, we introduce the accelerated augmented Lagrangian method (AALM), and we present our modified accelerated Bregman method (MABM). Section 4 is devoted to the convergence of the regularized basis pursuit problem and here we analyze the error bound of the MABM in detail. In Section 5, we give some conclusions and discuss the research plans for our future work.
2 The original Bregman method and its linearized form
The Bregman method was introduced into image processing by Osher et al. in [23] for solving the total-variation (TV)-based image restoration problems. Let be a Bregman function, the Bregman distance [29] of the point u to the point v is defined as
where , the subdifferential of φ at v. Note that updating the formula for (1.7), this Bregman iterative regularization procedure recursively solves
for  , starting with , . Since (2.2) is a convex programming problem, the optimality conditions can be given by
from which we obtain the updated formula
Hence, the Bregman iterative scheme is given by
By the following lemma, we get the equivalent form of (2.5)
starting with , .
For the sake of the requirement for the whole theoretical analysis, in the following, we propose some significant equivalence results, and we give the detailed proofs.
Lemma 2.1 The Bregman iterative scheme, which is given by (2.5), will obtain the same optimum point in the first term as (2.6) if
holds, where λ is a certain positive constant.
Proof From the first formula of (2.5) and the definition of Bregman distance (2.1), we get
therefore, we complete the proof. □
From the discussion above, we can give a crucial conclusion.
Theorem 2.2 The original Bregman iterative scheme (2.5) is equivalent to its variant (2.6).
Proof By induction, in fact, we only need to verify that (2.7) holds.
If , , (2.5) holds by the initial conditions and .
Now, we suppose that (2.7) holds for . Then
where the first equality is from the second term of (2.5), and the third equality is from the second term of (2.6). Therefore, the original Bregman iterative scheme (2.5) is equivalent to its variant (2.6). □
Noting that [24, 30, 31] and the references therein, it was argued that the Bregman iterative method is equivalent to the augmented Lagrangian method. The significance of this statement will be demonstrated in our later analysis in Section 4.
For the following analysis, we define the Lagrangian function of (1.7),
where is the Lagrangian multiplier. The corresponding augmented Lagrangian function can be expressed as
where and is the penalty term.
The augmented Lagrangian iterative scheme is
starting from .
Lemma 2.3 The first item of the iterative sequence of (2.9) and that of (2.6) are equal if
holds.
Proof From the first formula of (2.6) and (2.10), we get
The constant of the objective function does not affect the optimum point, so by comparing with the above equations, we get the conclusion. □
Theorem 2.4 The augmented Lagrangian iterative scheme (2.9) is equivalent to the Bregman iterative method variant (2.6).
Proof It is not difficult to see that the proof is the same as that of Theorem 2.2. Similarly, by the mathematical induction, we simply show that (2.7) holds.
If , , (2.7) holds by the initial conditions and . Now, we suppose that (2.7) holds for . Then, when ,
where the first equality is from the second term of (2.6), the second equality has its roots in induction, and the third equality is derived from the second term of (2.9). Therefore, the augmented Lagrangian iterative scheme (2.9) is equivalent to the Bregman iterative method variant (2.6). Moreover, we can get the equivalence of (2.5) and (2.9) from the two theorems above. □
For solving the subproblem of Bregman iterative method (2.2), different algorithms were conceived in [27, 32, 33], however, these methods have to perform significant iterations for solving the subproblem. In order to overcome the difficulties, the linearized Bregman method is obtained by linearizing the last term in (2.2) into (where ) and adding the -proximity term . The concrete scheme is
By the optimality conditions, we obtain the following formula:
That is, for the second term of (2.11) . In [7, 19, 21], the linearized Bregman method was analyzed; as (where denotes the spectral norm of the corresponding matrix) the iterates of the linearized Bregman method converge to the solution of the regularized basis pursuit problem
It has appeared in the accelerated Bregman algorithm in recent years, such as in [18]etc. But we shall argue that the accelerated Bregman method has large room to advance. We give some valid improvements on these accelerated Bregman methods. It can be verified that the theoretical result as regards the proposed method has a rapid convergence rate of .
3 The modified accelerated Bregman method
In [24], the linearized Bregman algorithm could be written as follows.
Algorithm 3.1 (Linearized Bregman method (LBM))
Step 0. Input: ; initial point: , .
Step 1. Initialize: , let and .
Step 2. Compute .
Step 3. Set .
Step 4. Set , go to step 2.
For the iterative scheme above, . Next, we give its equivalence form from the following lemma.
Lemma 3.2 The linearized Bregman method (LBM) in Algorithm 3.1 is equivalent to the iterative scheme
starting from , where .
Proof From step 3 of Algorithm 3.1, we have
Thus, we denote
then we simplify step 2 of Algorithm 3.1,
On the other hand,
where the second equality is from (3.2). So, step 2 and step 3 in Algorithm 3.1 can be rewritten as (3.1). □
Yin et al. presented all kinds of techniques, for example, line search, and L-BFGS and BB steps, to accelerate the linearized Bregman method. It is interesting that for the latter in [18] the accelerated linearized Bregman algorithm is argued for as follows.
Algorithm 3.3 (Accelerated linearized Bregman method (ALBM))
Step 0. Initialize: , , , .
Step 1. Compute .
Step 2. Set .
Step 3. Set .
Step 4. Set .
Step 5. Set , go to step 1.
These methods motivated us to consider the following algorithm, and its acceleration idea is based on the extrapolation technique proposed by Nesterov in [34, 35]; see also the references therein.
Algorithm 3.4 (Modified accelerated Bregman method (MABM))
Step 0. Initialize: , , , .
Step 1. Compute .
Step 2. Set .
Step 3. Set .
Step 4. Set .
Step 5. Set , go to step 2.
The basic idea of the equivalence between MABA and the corresponding AALM can be traced back to [31]. Especially, we can see that our updated iterative for is obviously better than the pre-iterative [17], since we consider a sufficient amount of information about the former iterative. In this way, most of the better iterative efficiency could be expected, which is just our purpose in improving the method. Then we will be dependent on a series of transformations in preparation for the convergence proof in Section 4.
Lemma 3.5 The MABM in Algorithm 3.4 is equal to the following iterative scheme:
starting with .
Proof Recalling (2.7) and noting Theorem 2.2, we can prove that (2.7) holds for all k by induction.
If , , (2.7) holds by the initial conditions and . Now, we assume that (2.7) holds for , then
where the first equality is directly derived from step 2 of Algorithm 3.4, the first equality is derived from the induction hypothesis, the fourth equality utilizes the second step of (3.3).
Moreover, when , we have
where the first equality is from step 3 of Algorithm 3.4, the second term is from (3.4) and the induction hypothesis, the fourth equality is from the third term of (3.3), so (2.7) holds for all k. Namely, the MABM in Algorithm 3.4 is equal to (3.3). □
Lemma 3.6 Iterative scheme (3.3) is equal to the following AALM iterative scheme:
starting from , where is the Lagrangian multiplier.
Proof It is not difficult to see that the idea has likeness to Theorem 2.4, we are just required to verify that (2.7) holds. To this end, we proceed by mathematical induction.
If , , (2.7) holds by the initial conditions and . Suppose that (2.7) holds for , then, when , we have
where the first equality is from the second term of (3.3), the second equality stems from induction, and the third equality is derived from the second term of (3.5).
Thus, we get
Therefore, AALM (3.5) is equivalent to iterative scheme (3.3). Moreover, we can get the equivalence of the MABM in Algorithm 3.4 to AALM from the above two lemmas. □
Theorem 3.7 The MABM in Algorithm 3.4 is equivalent to the corresponding AALM (3.5).
4 The convergence analysis
A practical challenge for the regularized basis pursuit problem is to offer an efficient method to solve the non-smooth optimization problems. Many algorithms have been proposed in recent years [36]. In these methods, some schemes of approximation that have to do with the non-smooth norm term are usually employed. However, a fast global convergence is difficult to guarantee. Due to the non-smooth nature of the 1-norm, a simple method to solve these problems is the subgradient approach [37], which converges only as , where k is the iteration counter.
In this paper, we present an efficient method with fast global convergence rate to solve the regularized basis pursuit problem. Particularly, we verify that this result is an extended gradient algorithm with the convergence rate of , like that for smooth problems. Following the Nesterov method for accelerating the gradient method [34, 35], we show that the MABM can be further accelerated to converge as .
A series of lemmas in the following are to ensure the convergence rate of the MABM.
Lemma 4.1 Let be generated by the augmented Lagrangian iterative scheme (2.9), and let be a globally optimal solution of the problem
then the inequality
for any and holds.
Proof By the optimality conditions, we get
From the second term of (2.9), we obtain
By the definition of subdifferential, we have
So (4.1) holds for . Owing to the fact that of (4.1) satisfies the KKT condition, it is easy to obtain
Compared with (4.3), inequality (4.2) holds for , which completes the proof. □
Lemma 4.2 Let be generated by the augmented Lagrangian iterative scheme (2.9). For any , the inequality
holds.
Proof By the definition of the Lagrangian function and Lemma 4.1, we get the following result:
The second inequality is derived from (4.2), and the third equality is from (2.9). □
From the fact that , , and the definition of subdifferential, we have
then
thus
where we exploit the fact that .
Lemma 4.3 Let be generated by the augmented Lagrangian iterative scheme (2.9), then
Proof Considering (4.4) and replacing with , we get
which completes the proof. □
Moreover, holds due to (4.5), this implies the global convergence of (2.9). By summing the above inequality over , we get
Hence,
Theorem 4.4 Let be generated by the augmented Lagrangian iterative scheme (2.9), then .
Proof Noting (4.4) of Lemma 4.2 and substituting for , we get
By multiplying with and summing it over , we obtain
From the above inequality, we have
On the other hand, it follows from Lemma 4.3 that
Summing the inequality over , we obtain
Combining (4.9) with (4.10), we get
Consequently, we obtain
This completes the proof. □
Comparing (2.9) with (3.5), we have the following two lemmas by replacing with .
Lemma 4.5 Let be generated by the augmented Lagrangian iterative scheme (3.5). For any , we have the inequality
Lemma 4.6 Let be generated by the augmented Lagrangian iterative scheme (3.5), then
Theorem 4.7 Let be generated by the augmented Lagrangian iterative scheme (3.5), then
Proof We work from step 4 of Algorithm 3.4, i.e., , where . By a simple calculation, we have
and
Based on Lemma 4.5, by replacing with and , setting , we get the two inequalities
where
By multiplying both sides of (4.15), (4.16) with and 2, respectively, and adding the two sides, we obtain
For the sake of convenience, we denote
and
By multiplying both sides of (4.17) with , we have
where the second equality is derived from the fact that .
Then we have
where
and the third equality is from the third iterative scheme of (3.5) and we make a transposition to the third term. Thus,
Summing the inequality over , we get
Then, combining (4.7), (4.14) with (4.18), we have
as k tends to infinity, where ε is an arbitrarily small positive number. Hence,
Thus, we complete the convergence proof. □
Remark The right formula of (4.18) exploits the fact that the choice of the penalty factor λ can be seen as a monotonically increasing sequence (such as ) that depends on the selection of in Algorithm 3.4. In this way, we are not only able to guarantee the convergence in the formula of divided by , but also to play a critical role of punishment to the constraint condition.
5 Conclusion
In this paper, we put forward the modified accelerated Bregman method (MABM) for solving the regularized basis pursuit problem. We give some beneficial improvement tasks on the basis of some recent literature on the accelerated Bregman method, and we perform the theoretical feasibility analysis in detail. It can be showed that the proposed MABM has a rapid convergence rate of . We will devote our future study to combining the advantages of LBM with our MABM as regards theory and numerical results.
References
Kashin B: The widths of certain finite dimensional sets and classes of smooth functions. Izv. Akad. Nauk SSSR, Ser. Mat. 1977, 41: 334-351.
Candés E, Romberg J, Tao T: Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory 2006, 52: 489-509.
Candés E, Romberg J, Tao T: Stable signal recovery from incomplete and inaccurate measurements. Commun. Pure Appl. Math. 2006, 59: 1207-1223. 10.1002/cpa.20124
Candés E, Tao T: Decoding by linear programming. IEEE Trans. Inf. Theory 2005, 51: 4203-4215. 10.1109/TIT.2005.858979
Donoho D: Compressed sensing. IEEE Trans. Inf. Theory 2006, 52: 1289-1306.
Candés E, Wakin M: An introduction to compressive sampling. IEEE Signal Process. Mag. 2008, 21: 21-30.
Cai J, Osher S, Shen Z: Linearized Bregman iterations for compressed sensing. Math. Comput. 2009, 78: 1515-1536. 10.1090/S0025-5718-08-02189-3
Cai J, Candés E, Shen Z: A singular value thresholding algorithm for matrix completion. SIAM J. Optim. 2010, 20: 1956-1982. 10.1137/080738970
Candés E, Recht B: Exact matrix completion via convex optimization. Found. Comput. Math. 2009, 9: 717-772. 10.1007/s10208-009-9045-5
Candés, E, Li, X, Ma, Y, Wright, J: Robust principal component analysis. Preprint (2009)
Figueiredo MAT, Nowak RD, Wright SJ: Gradient projection for sparse reconstruction: application to compressed sensing and other inverse problems. IEEE J. Sel. Top. Signal Process. 2007,1(4):586-597.
Beck A, Teboulle M: Fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2009, 2: 183-202. 10.1137/080716542
Peng Y, Ganesh A, Wright J, Xu W, Ma Y: RASL: robust alignment by sparse and low-rank decomposition for linearly correlated images. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2010.
Ghaoui LE, Gahinet P: Rank minimization under LMI constraints: a framework for output feedback problems. Proceedings of the European Control Conference 1993.
Fazel M, Hindi H, Boyd SP: A rank minimization heuristic with application to minimum order system approximation. 6. Proceedings of the American Control Conference 2001, 4734-4739.
Linial N, London E, Rabinovich Y: The geometry of graphs and some of its algorithmic applications. Combinatorica 1995, 15: 215-245. 10.1007/BF01200757
Kang M, Yun S, Woo H, Kang M:Accelerated Bregman method for linearly constrained minimization. J. Sci. Comput. 2012, 56: 515-534.
Huang B, Ma SQ, Goldfarb D: Accelerated linearized Bregman method. J. Sci. Comput. 2013, 54: 428-453. 10.1007/s10915-012-9592-9
Yin W: Analysis and generalizations of the linearized Bregman method. SIAM J. Imaging Sci. 2010, 3: 856-877. 10.1137/090760350
Friedlander M, Tseng P: Exact regularization of convex programs. SIAM J. Optim. 2007, 18: 1326-1350.
Osher S, Mao Y, Dong B, Yin W: Fast linearized Bregman iteration for compressive sensing and sparse denoising. Commun. Math. Sci. 2010, 8: 93-111. 10.4310/CMS.2010.v8.n1.a6
Darbon, J, Osher, S: Fast discrete optimization for sparse approximations and deconvolutions. Preprint (2007)
Osher S, Burger M, Goldfarb D, Xu J, Yin W: An iterative regularization method for total variation-based image restoration. Multiscale Model. Simul. 2005,4(2):460-489. (electronic). MR2162864 (2006c:49051) 10.1137/040605412
Yin W, Osher S, Goldfarb D, Darbon J:Bregman iterative algorithms for -minimization with applications to compressed sensing. SIAM J. Imaging Sci. 2008,1(1):143-168. 10.1137/070703983
Barzilai J, Borwein J: Two point step size gradient methods. IMA J. Numer. Anal. 1988, 8: 141-148. 10.1093/imanum/8.1.141
Liu D, Nocedal J: On the limited memory method for large scale optimization. Math. Program., Ser. B 1989, 45: 503-528. 10.1007/BF01589116
Bennett J, Lanning S: The Netflix prize. Proceedings of KDD Cup and Workshop 2007.
Goldstein T, Osher S: The split Bregman method for L1-regularized problems. SIAM J. Imaging Sci. 2009,2(2):323-343. 10.1137/080725891
Bregman L: The relaxation method of finding the common points of convex sets and its application to the solution of problems in convex programming. USSR Comput. Math. Math. Phys. 1967, 7: 200-217.
Powell MJD: A method for nonlinear constraints in minimization problems. In Optimization Edited by: Fletcher R. 1972, 283-298.
Rockafellar RT: Augmented Lagrangians and applications of the proximal point algorithm in convex programming. Math. Oper. Res. 1976, 1: 97-116. 10.1287/moor.1.2.97
Hale ET, Yin W, Zhang Y:Fixed-point continuation for minimization: methodology and convergence. SIAM J. Optim. 2008, 19: 1107-1130. 10.1137/070698920
Berg EVD, Friedlander MP: Probing the Pareto frontier for basis pursuit solutions. SIAM J. Sci. Comput. 2008, 31: 890-912.
Nesterov YE:A method for unconstrained convex minimization problem with the rate of convergence . Dokl. Akad. Nauk SSSR 1983, 269: 543-547.
Nesterov YE 87. Introductory Lectures on Convex Optimization 2004, 220-236.
Rennie JDM, Srebro N: Fast maximum margin matrix factorization for collaborative prediction. Proceedings of the International Conference on Machine Learning 2005, 713-719.
Bertsekas D: Nonlinear Programming. Athena Scientific, Nashua; 1999.
Acknowledgements
The project is supported by the Scientific Research Special Fund Project of Fujian University (Grant No. JK2013060), Fujian Natural Science Foundation (Grant No. 2013J01006) and the National Natural Science Foundation of China (Grant No. 11071041).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
All authors contributed equally to the writing of this paper. All authors read and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Xie, Y., Ke, Y. & Ma, C. The modified accelerated Bregman method for regularized basis pursuit problem. J Inequal Appl 2014, 130 (2014). https://doi.org/10.1186/1029-242X-2014-130
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1029-242X-2014-130