一类具有充分下降性的混合型谱共轭梯度法

王森森; 张俊容; 韩信; 王逸云

doi:10.13718/j.cnki.xdzk.2017.05.021

摘要: 首先基于共轭梯度法的下降性条件，提出了一类结合了FR法、WYL法、PRP法优点的充分下降的混合型谱共轭梯度法.在Wolfe线搜索下用反证法证明了新的混合型谱共轭梯度法的全局收敛性.最后通过数值算例，将本文算法与WYL法、FR法进行比较，结果表明新算法在迭代次数与迭代总时间上均优于其他另外两种算法.算法的全局收敛性和数值效果的优越性表明新算法是有效的.

Abstract: First, a mixed sufficiently descent spectral conjugate gradient method is put forward, which satisfies the descent condition. Besides, the method possesses the advantages of FR, WYL and PPR. Then, the global convergence of the new hybrid spectral conjugate gradient method is proved with the reduction to absurdity under the Wolfe line search. Finally, the new algorithm and the existing WYL and FR algorithms are compared in their iterative times and computing time. The comparison results show that the new algorithm is superior to the other two algorithms. The global convergence and the numerical superiority of the new algorithm indicate that it is an effective algorithm which is worth studying.

Key words:

全文HTML

考虑如下的无约束优化问题

其中f(x)是连续可微函数，其梯度向量$\nabla $f(x)记为g(x).共轭梯度法是求解上述无约束优化问题的一种十分有效的算法.经典共轭梯度法的基本迭代格式为：

其中：d_k为搜索方向；α_k为步长因子，通常可以由Wolfe非精确的一维线搜索来确定.

δ和σ为满足0＜δ＜σ＜1的常数；β_k为标量参数.关于β_k的选择有许多著名的公式，如：

其中y_k-1=g_k-g_k-1.上述4个公式分别对应4种不同的共轭梯度方法.每种方法又有很多变形，例如Wei等^[1]给出了PRP的一个变形，参数β_k取值为：

WYL法继承了PRP法的一些优良性质，例如良好的数值表现，并且Huang等^[2]证明了此方法在强Wolfe线搜索下具有全局收敛性和下降性.

2001年Birgin和Martinez^[3]提出了一种谱共轭梯度法，其搜索方向定义如下：

其中${\theta _k} = \frac{{{\boldsymbol{s}_{k-1}}^{\text{T}}{\boldsymbol{s}_{k-1}}}}{{{\boldsymbol{s}_{k-1}}^{\text{T}}{\boldsymbol{y}_{k - 1}}}} $为谱系数，$ {\beta _k} = \frac{{{{\left( {{\theta _k}{\boldsymbol{y}_{k-1}}-{\boldsymbol{s}_{k-1}}} \right)}^{\text{T}}}{\boldsymbol{y}_{k - 1}}}}{{{\boldsymbol{d}_{k - 1}}^{\text{T}}{\boldsymbol{y}_{k - 1}}}}$，s_k-1=x_k-x_k-1=α_k-1d_k-1.但是Birgin和Martinez提出的谱共轭梯度法的搜索方向d_k不满足下降性，当然也不具有全局收敛性.为了获得全局收敛性，Zhang等^[4]在此基础上构造出了FR型谱共轭梯度法，其搜索方向d_k中${\theta _k} = \frac{{{\boldsymbol{d}_{k-1}}^{\text{T}}{\boldsymbol{y}_{k-1}}}}{{{{\left\| {{\boldsymbol{g}_{k-1}}} \right\|}^2}}} $，β_k=β_k^FR.此方法的一个重要特点是满足充分下降条件

其中t＞0为常数.此后Wang，Cao等^[5-6]提出了CD型谱共轭梯度法；Du和Liu提出了HS型谱共轭梯度法^[7]；Wan，Huang等^[8-9]提出了PRP型的谱共轭梯度法.上述方法都满足充分下降条件(8).

本文在上述文献的基础上，提出了一类具有充分下降性的混合型谱共轭梯度法，参数β_k为：

谱系数${\theta _k} = c + \beta _k^{{\text{WS}}}\frac{{\boldsymbol{g}_k^{\text{T}}{\boldsymbol{d}_{k-1}}}}{{{{\left\| {{\boldsymbol{g}_k}} \right\|}^2}}} $，c为大于0的参数，然后基于(9) 式给出混合型谱共轭梯度法的算法，证明了算法的收敛性，并进行了数值实验.

1. 混合型谱共轭梯度法的算法及其性质

本节中，首先给出混合型谱共轭梯度法(算法1)，然后说明它所具有的一些性质.

算法1 混合型谱共轭梯度法.

步骤1 给定初始点x₁及精度ε，计算g₁，若‖g₁‖≤ε，停止.否则，转步骤2.

步骤2 计算搜索方向d_k

步骤3 由(4) 式计算步长因子α_k.

步骤4 迭代计算x_k+1=x_k+α_kd_k，g_k+1=g(x_k+1).若‖g_k+1‖≤ε，则停止.

步骤5 k：=k+1，转步骤2.

命题1 由算法1产生的序列{g_k}和{d_k}满足下降性，即对任意k≥1，都有g_k^Td_k＜0.

证由算法1得

命题2 对任意k≥1，参数β_k^WS满足$ 0 \leqslant \beta _k^{{\text{WS}}} \leqslant \frac{{{{\left\| {{\boldsymbol{g}_k}} \right\|}^2}}}{{{{\left\| {{\boldsymbol{g}_{k-1}}} \right\|}^2}}}$.

证由参数β_k^WS的定义可知命题成立.

分析(9) 式有：参数β_k^WS为FR，WYL，PRP 3者的混合.下面分3种情况对此说明：

1) 当g_k^Tg_k-1≤0时，

2) 当g_k^Tg_k-1＞0且$\frac{{\left\| {{\boldsymbol{g}_k}} \right\|}}{{\left\| {{\boldsymbol{g}_{k-1}}} \right\|}} \leqslant 1$时，

3) 当g_k^Tg_k-1＞0且$\frac{{\left\| {{\boldsymbol{g}_k}} \right\|}}{{\left\| {{\boldsymbol{g}_{k-1}}} \right\|}} > 1 $时，

此外由算法1的谱系数的选取可知，可通过选取不同的参数c来优化算法1的数值效果.

2. 算法收敛性

为了研究算法的收敛性，下面给出一些基本假设：

(A1) 设f(x)的水平集$\mathit{\Omega } = \left\{ {\boldsymbol{x} \in {\mathbb{R}^n}|f\left( \boldsymbol{x} \right) \leqslant f\left( {{\boldsymbol{x}_1}} \right)} \right\} $有界，其中x₁为初始迭代点.

(A2) 存在Ω的某个邻域Λ，使得f(x)在该邻域上连续可微且梯度函数g(x)满足Lipschitz条件，即存在常数L＞0，使得

引理1^[10] 假设(A1)，(A2) 成立，如果搜索方向d_k满足下降性，步长α_k由Wolfe非精确的一维线搜索确定，那么$\sum {\frac{{{{\left( {\boldsymbol{g}_k^{\text{T}}{\boldsymbol{d}_k}} \right)}^2}}}{{{{\left\| {{\boldsymbol{d}_k}} \right\|}^2}}}} < \infty $.

定理1 如果条件(A1)，(A2) 成立，算法1产生的序列{g_k}有

证假设结论不成立，则存在常数γ＞0使得

由(10) 式可得

从而有

将(19) 式展开得

移项可得

由命题1的证明，(21) 式可变为

(22) 式两边同时除以(g_k^Td_k)²得

进而有

综上所述可得

与引理1矛盾.

姓名
	姓名不能为空！
邮箱
	邮箱不能为空！非法的邮箱地址。
手机号码
	电话不能为空！请输入有效手机号!
标题
	标题不能为空！
留言内容
	内容不能为空！
验证码
	验证码不能为空！验证码错误！

函数名称	算法	维数/维	迭代次数/次	迭代时间/s	‖g_k‖	每迭代一次的平均时间/s	i
Example1	1	60	19	314	3.16×10^-5	16.5	0
	WYL	60	34	1190	5.89×10^-5	35	0
	FR	60	110	3123	8.57×10^-5	28.4	0

Example2	1	100	78	387	9.31×10^-5	4.96	0
	WYL	100	179	2866	7.83×10^-5	16	0
	FR	100	19	1898	4.81×10³⁰	99.9	1

POWER	1	50	426	1752	8.82×10^-5	4.11	0
	WYL	50	279	3601	3.45×10^-1	12.9	1
	FR	50	434	3605	9.35×10^-1	8.31	1

Diagonal 4	1	200	69	610	5.51×10^-5	8.84	0
	WYL	200	45	2127	1.44×10¹	47.3	1
	FR	200	69	1488	7.3×10^-5	21.6	0

Extended Penalty	1	20	28	474	8.5×10^-5	16.9	0
	WYL	20	28	679	5.27×10^-5	24.3	0
	FR	20	36	609	6.45×10^-5	16.9	0

Extended Rosenbrock	1	10	144	217	9.39×10^-5	1.51	0
	WYL	10	644	3606	1.37×10^-1	5.6	1
	FR	10	151	861	6.72×10¹	5.7	1

Extended White Holst	1	100	42	1535	2.97×10^-5	36.6	0
	WYL	100	57	3699	0.36×10¹	64.9	1
	FR	100	51	3685	2.27×10^-2	52.6	1

Perturbed Quadrtic	1	20	31	283	6.35×10^-5	9.11	0
	WYL	20	41	874	2.06×10^-3	21.3	1
	FR	20	41	441	1.87×10^-2	21.3	1

ENGVAL 1	1	50	91	1182	9.43×10^-5	13	0
	WYL	50	35	2137	3.02×10^-5	61	0
	FR	50	83	2682	7.61×10^-5	32.3	0

[1]	WEI Z X, YAO S W, LIU Y X. The Convergence Properties of Some New Conjugate Gradient Methods [J]. Applied Mathematics and Computation, 2006, 183(2): 1341-1350. doi: 10.1016/j.amc.2006.05.150
[2]	HUANG H, WEI Z X, YAO S W. The Proof of the Sufficient Descent Condtion of Wei-Yao-Liu Conjugate Gradient Method under the Strong Wolfe-Powell Line Search [J]. Applied Mathematics and Computation, 2007, 189(2): 1241-1245. doi: 10.1016/j.amc.2006.12.006
[3]	BIRGIN E G, MARTINEZ J M. A Spectral Conjugate Gradient Method for Unconstraiend Optimization [J]. Applied Mathematics and Optimization, 2001, 43(2): 117-128. doi: 10.1007/s00245-001-0003-0
[4]	ZHANG L, ZHOU W J, Li D H. Global Convergence of a Modified Fletcher-Reeves Conjugate Gradient Method with Armijo-Type Line Search [J]. Numerical Mathematics, 2006, 104(4): 561-572. doi: 10.1007/s00211-006-0028-z
[5]	王开荣, 曹伟, 王银河. Armijo型线搜素下的谱CD共轭梯度法[J].山东大学学报(理学版), 2011, 45(11): 104-108. doi: http://www.cnki.com.cn/Article/CJFDTOTAL-SDDX201011023.htm
[6]	doi: https://www.researchgate.net/publication/268017897_Global_convergence_of_a_modified_spectral_CD_conjugate_gradient_method CAO W, WANG K R, WANG Y H. Global Convergence of a Modified Spectral CD Conjugate Gradient Method [J]. Journal of Mathematical Research and Exposition, 2011, 31(2): 261-268.
[7]	DU X L, LIU J K. Global Convergence of a Spectral HS Conjugate Gradient Method [J]. Procedia Engineering, 2011, 15; 1487-1492. doi: 10.1016/j.proeng.2011.08.276
[8]	WAN Z, YANG Z L, WANG Y L. New Spectral PRP Conjugdte Gradient Method for Unconstrained Optimization [J]. Applied Mathematics Letters, 2011, 24(1): 16-22. doi: 10.1016/j.aml.2010.08.002
[9]	黄海, 林穗华.一个PRP型共轭梯度法的收敛性[J].西南大学学报(自然科学版), 2012, 34(3): 22-29. doi: http://xbgjxt.swu.edu.cn/jsuns/jsuns/ch/reader/view_abstract.aspx?file_no=z20120306&flag=1
[10]	戴彧虹, 袁亚湘.非线性共轭梯度法[M].上海:上海科学出版社, 2000: 10-13.
[11]	doi: https://www.researchgate.net/publication/228737339_An_unconstrained_optimization_test_functions_collection ANDREI N. An Uconstrained Optimization Test Fuctions Collection [J]. Advanced Modelling and Optimzation, 2008, 10(1): 147-161.

留言板