Alternating Direction Method of Multipliers (ADMM)
July 18, 2015
Consider minimizing subject to affine constraints
The augmented Lagrangian
Idea: perform block descent on and and then update multiplier vector
Example: fused lasso
Fused lasso problem minimizes
Define , where
Then we minimize subject to
Augmented Lagrangian is
ADMM
- Update is a smooth quadratic problem
- Update is a separated lasso problem (elementwise thresholding)
- Update multipliers
Same algorithm applies to a general regularization matrix (generalized lasso)
Remarks on ADMM
Related algorithms
- Split Bregman iteration 11 Goldstein, T. and Osher, S. (2009). The split Bregman method for l1-regularized problems. SIAM J. Img. Sci., 2:323-343.
- Dykstra’s alternating projection algorithm 22 Dykstra, R. L. (1983). An algorithm for restricted least squares regression. J. Amer. Statist. Assoc., 78(384):837-842.
- Proximal point algorithm applied to the dual
- Numerous applications in statistics and machine learning: lasso, gen. lasso, graphical lasso, (overlapping) group lasso, …
- Embraces distributed computing for big data 33 Boyd, S., Parikh, N., Chu, E., Peleato, B., and Eckstein, J. (2011). Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. learn., 3(1):1-122.