共役勾配法

共役勾配法（きょうやくこうばいほう、テンプレート:Lang-en-short、CG法とも呼ばれる）は対称正定値行列を係数とする連立一次方程式を解くためのアルゴリズムである^[1]^[2]^[3]^[4]。反復法として利用され^[1]^[2]^[3]^[4]、コレスキー分解のような直接法では大きすぎて取り扱えない、大規模な疎行列を解くために利用される。そのような問題は偏微分方程式などを数値的に解く際に常に現れる^[1]^[5]^[6]^[7]。

共役勾配法は、エネルギー最小化などの最適化問題を解くために用いることもできる^[8]^[9]^[10]。

テンプレート:仮リンクは、共役勾配法の非対称問題への拡張である^[11]。

また、非線形問題を解くために、さまざまな非線形共役勾配法が提案されている^[12]^[13]^[14]^[15]。

詳説

対称正定値行列Aを係数とするn元連立一次方程式

テンプレート:Indent

の解をx_*とする。

直接法としての共役勾配法

非零ベクトルu、vがテンプレート:Indent を満たすとき、u、vはAに関して共役であるという^[2]^[3]^[4]。Aは対称正定値なので、左辺から内積テンプレート:Indent を定義することができる。この内積に関して2つのベクトルが直交するなら、それらのベクトルは互いに共役である。この関係は対称で、uがvに対して共役なら、vもuに対して共役である（この場合の「共役」は複素共役と無関係であることに注意）。

{p_k}をn個の互いに共役なベクトル列とする。p_kは基底Rⁿを構成するので、Ax = bの解x_*をこの基底で展開すると、テンプレート:Indent と書ける。ただし係数はテンプレート:Indent で与えられる。

この結果は、上で定義した内積を考えるのが最も分かりやすいと思われる。

以上から、Ax = bを解くための方法が得られる。すなわち、まずn個の共役な方向を見つけ、それから係数α_kを計算すればよい。

反復法としての共役勾配法

共役なベクトル列p_kを注意深く選ぶことにより、一部のベクトルからx_*の良い近似を得られる可能性がある。そこで、共役勾配法を反復法として利用することを考える^[2]^[3]^[4]。こうすることで、nが非常に大きく、直接法では解くのに時間がかかりすぎるような問題にも適用することができる。

x_*の初期値をx₀ = 0 とする。x_*が二次形式テンプレート:Indent を最小化する一意な解であることに注意し、最初の基底ベクトルp₁をx = x₀でのfの勾配Ax₀−b=−bとなるように取る。このとき、基底の他のベクトルは勾配に共役である。そこで、この方法を共役勾配法と呼ぶ^[2]^[3]^[4]。

r_kをkステップ目での残差テンプレート:Indent とする。r_kはx = x_kでのfの負の勾配であることに注意されたい。最急降下法はr_kの方向に進む解法である。p_kは互いに共役でなければならないので、r_kに最も近い方向を共役性を満たすように取る。これはテンプレート:Indent のように表すことができる（記事冒頭の図を参照）。

アルゴリズム

以上の方法を簡素化することにより、Aが実対称正定値である場合にAx = bを解くための以下のアルゴリズムを得る^[4]。初期ベクトルx₀は近似解もしくは0とする。

 $r_{0} = b - A x_{0}$ 
 $p_{0} = r_{0}$ 

for (k = 0; ; k++) 
     $α_{k} = \frac{r_{k}^{T} p_{k}}{p_{k}^{T} A p_{k}}$ 
     $x_{k + 1} = x_{k} + α_{k} p_{k}$ 
     $r_{k + 1} = r_{k} - α_{k} A p_{k}$ 

    if  $r_{k + 1}$  が十分に小さい then
        break

     $β_{k} = \frac{r_{k + 1}^{T} r_{k + 1}}{r_{k}^{T} r_{k}}$ 
     $p_{k + 1} = r_{k + 1} + β_{k} p_{k}$ 
結果は  $x_{k + 1}$

Octaveでの共役勾配法の記述例

Gnu Octaveで書くと以下のようになる。

 function [x] = conjgrad(A,b,x0)

    r = b - A*x0;
    w = -r;
    z = A*w;
    a = (r'*w)/(w'*z);
    x = x0 + a*w;
    B = 0;

    for i = 1:size(A)(1);
       r = r - a*z;
       if( norm(r) < 1e-10 )
            break;
       endif
       B = (r'*z)/(w'*z);
       w = -r + B*w;
       z = A*w;
       a = (r'*w)/(w'*z);
       x = x + a*w;
    end
 end

前処理

前処理行列とは、Aと同値なP^-1A (P^T)^-1の条件数がAより小さく、Ax=bよりP^-1A (P^T)^-1 x′ =P^-1b′の方が容易に解けるような正定値行列 P.P^Tを指す^[4]。前処理行列の生成には、ヤコビ法、ガウス・ザイデル法、対称SOR法などが用いられる^[16]^[17]。

最も単純な前処理行列は、Aの対角要素のみからなる対角行列である。これはヤコビ前処理または対角スケーリングとして知られている。対角行列は逆行列の計算が容易かつメモリも消費しない点で、入門用として優れた方法である。より洗練された方法では、κ(A)の減少による収束の高速化とP^-1の計算に要する時間とのトレードオフを考えることになる。

正規方程式に対する共役勾配法

任意の実行列Aに対してA^TAは対称（半）正定値となるので、係数行列をA^TA、右辺をA^Tbとする正規方程式を解くことにより、共役勾配法を任意のn×m行列に対して適用することができる（CGNR法^[18]）。

テンプレート:Indent

反復法としては、A^TAを明示的に保持する必要がなく、行列ベクトル積、転置行列ベクトル積を計算すればよいので、Aが疎行列である場合にはCGNR法は特に有効である。ただし、条件数κ(A^TA)がκ(A²)に等しいことから収束は遅くなる傾向があり、前処理行列を使用するCGLS (Conjugate Gradient Least Squares^[19])、LSQRなどの解法が提案されている。LSQRはAが悪条件である場合に最も数値的に安定な解法である^[20]^[21]。

脚注

テンプレート:脚注ヘルプ

出典

テンプレート:Reflist

参考文献

テンプレート:Refbegin

テンプレート:Refend

外部リンク

Conjugate Gradient Method by Nadir Soualem.
Preconditioned conjugate gradient method by Nadir Soualem.
An Introduction to the Conjugate Gradient Method Without the Agonizing Pain by Jonathan Richard Shewchuk.
Iterative methods for sparse linear systems by Yousef Saad
LSQR: Sparse Equations and Least Squares by Christopher Paige and Michael Saunders.

テンプレート:Linear algebra テンプレート:最適化アルゴリズムテンプレート:Authority control

↑ ^1.0 ^1.1 ^1.2 テンプレート:Cite book
↑ ^2.0 ^2.1 ^2.2 ^2.3 ^2.4 森正武. 数値解析第2版. 共立出版.
↑ ^3.0 ^3.1 ^3.2 ^3.3 ^3.4 数値線形代数の数理とHPC, 櫻井鉄也, 松尾宇泰, 片桐孝洋編（シリーズ応用数理 / 日本応用数理学会監修, 第6巻）共立出版, 2018.8
↑ ^4.0 ^4.1 ^4.2 ^4.3 ^4.4 ^4.5 ^4.6 皆本晃弥. (2005). UNIX & Informatioin Science-5 C 言語による数値計算入門.
↑ 田端正久; 偏微分方程式の数値解析, 2010. 岩波書店.
↑ 登坂宣好, & 大西和榮. (2003). 偏微分方程式の数値シミュレーション. 東京大学出版会.
↑ Zworski, M. (2002). Numerical linear algebra and solvability of partial differential equations. Communications in mathematical physics, 229(2), 293-307.
↑ Gill, P. E., Murray, W., & Wright, M. H. (1991). Numerical linear algebra and optimization (Vol. 1, p. 74). Redwood City, CA: Addison-Wesley.
↑ Gilbert, J. C., & Nocedal, J. (1992). Global convergence properties of conjugate gradient methods for optimization. SIAM Journal on optimization, 2(1), 21-42.
↑ Steihaug, T. (1983). The conjugate gradient method and trust regions in large scale optimization. SIAM Journal on Numerical Analysis, 20(3), 626-637.
↑ Black, Noel and Moore, Shirley. "Biconjugate Gradient Method." From MathWorld--A Wolfram Web Resource, created by Eric W. Weisstein. テンプレート:URL
↑ Loyce Adams, and J. L. Nazareth (Eds.): Linear and Nonlinear Conjugate Gradients-Related Methods, SIAM, ISBN 0-89871-376-5 (1996).
↑ Dai, Y. H. (2010). Nonlinear conjugate gradient methods. Wiley Encyclopedia of Operations Research and Management Science.
↑ Hager, W. W., & Zhang, H. (2006). A survey of nonlinear conjugate gradient methods. Pacific journal of Optimization, 2(1), 35-58.
↑ Dai, Y., Han, J., Liu, G., Sun, D., Yin, H., & Yuan, Y. X. (2000). Convergence properties of nonlinear conjugate gradient methods. SIAM Journal on Optimization, 10(2), 345-358.
↑ Eisenstat, S. C. (1981). Efficient implementation of a class of preconditioned conjugate gradient methods. SIAM Journal on Scientific and Statistical Computing, 2(1), 1-4.
↑ Kaasschieter, E. F. (1988). Preconditioned conjugate gradients for solving singular systems. Journal of Computational and Applied Mathematics, 24(1-2), 265-275.
↑ Black, Noel and Moore, Shirley. "Conjugate Gradient Method on the Normal Equations." From MathWorld--A Wolfram Web Resource, created by Eric W. Weisstein. テンプレート:URL
↑ Bjorck, A. (1996). Numerical methods for least squares problems (Vol. 51). SIAM.
↑ Paige, C. and Saunders, M. "LSQR: An Algorithm for Sparse Linear Equations and Sparse Least Squares." ACM Trans. Math. Soft. 8, 43-71, 1982.
↑ Paige, C. C., & Saunders, M. A. (1982). Algorithm 583: LSQR: Sparse linear equations and least squares problems. ACM Transactions on Mathematical Software (TOMS), 8(2), 195-209.

[Yamamoto1-1] 1.0 ^1.1 ^1.2 テンプレート:Cite book

[mori-2] 2.0 ^2.1 ^2.2 ^2.3 ^2.4 森正武. 数値解析第2版. 共立出版.

[hpc-3] 3.0 ^3.1 ^3.2 ^3.3 ^3.4 数値線形代数の数理とHPC, 櫻井鉄也, 松尾宇泰, 片桐孝洋編（シリーズ応用数理 / 日本応用数理学会監修, 第6巻）共立出版, 2018.8

[clang-4] 4.0 ^4.1 ^4.2 ^4.3 ^4.4 ^4.5 ^4.6 皆本晃弥. (2005). UNIX & Informatioin Science-5 C 言語による数値計算入門.

[tabata-5] 田端正久; 偏微分方程式の数値解析, 2010. 岩波書店.

[to-6] 登坂宣好, & 大西和榮. (2003). 偏微分方程式の数値シミュレーション. 東京大学出版会.

[7] Zworski, M. (2002). Numerical linear algebra and solvability of partial differential equations. Communications in mathematical physics, 229(2), 293-307.

[8] Gill, P. E., Murray, W., & Wright, M. H. (1991). Numerical linear algebra and optimization (Vol. 1, p. 74). Redwood City, CA: Addison-Wesley.

[9] Gilbert, J. C., & Nocedal, J. (1992). Global convergence properties of conjugate gradient methods for optimization. SIAM Journal on optimization, 2(1), 21-42.

[10] Steihaug, T. (1983). The conjugate gradient method and trust regions in large scale optimization. SIAM Journal on Numerical Analysis, 20(3), 626-637.

[11] Black, Noel and Moore, Shirley. "Biconjugate Gradient Method." From MathWorld--A Wolfram Web Resource, created by Eric W. Weisstein. テンプレート:URL

[12] Loyce Adams, and J. L. Nazareth (Eds.): Linear and Nonlinear Conjugate Gradients-Related Methods, SIAM, ISBN 0-89871-376-5 (1996).

[13] Dai, Y. H. (2010). Nonlinear conjugate gradient methods. Wiley Encyclopedia of Operations Research and Management Science.

[14] Hager, W. W., & Zhang, H. (2006). A survey of nonlinear conjugate gradient methods. Pacific journal of Optimization, 2(1), 35-58.

[15] Dai, Y., Han, J., Liu, G., Sun, D., Yin, H., & Yuan, Y. X. (2000). Convergence properties of nonlinear conjugate gradient methods. SIAM Journal on Optimization, 10(2), 345-358.

[16] Eisenstat, S. C. (1981). Efficient implementation of a class of preconditioned conjugate gradient methods. SIAM Journal on Scientific and Statistical Computing, 2(1), 1-4.

[17] Kaasschieter, E. F. (1988). Preconditioned conjugate gradients for solving singular systems. Journal of Computational and Applied Mathematics, 24(1-2), 265-275.

[18] Black, Noel and Moore, Shirley. "Conjugate Gradient Method on the Normal Equations." From MathWorld--A Wolfram Web Resource, created by Eric W. Weisstein. テンプレート:URL

[19] Bjorck, A. (1996). Numerical methods for least squares problems (Vol. 51). SIAM.

[20] Paige, C. and Saunders, M. "LSQR: An Algorithm for Sparse Linear Equations and Sparse Least Squares." ACM Trans. Math. Soft. 8, 43-71, 1982.

[21] Paige, C. C., & Saunders, M. A. (1982). Algorithm 583: LSQR: Sparse linear equations and least squares problems. ACM Transactions on Mathematical Software (TOMS), 8(2), 195-209.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

共役勾配法

目次

詳説

直接法としての共役勾配法

反復法としての共役勾配法

アルゴリズム

Octaveでの共役勾配法の記述例

前処理

正規方程式に対する共役勾配法

脚注

出典

参考文献

関連項目

外部リンク

ナビゲーションメニュー

共役勾配法

詳説

直接法としての共役勾配法

反復法としての共役勾配法

アルゴリズム

Octaveでの共役勾配法の記述例

前処理

正規方程式に対する共役勾配法

脚注

出典

参考文献

関連項目

外部リンク

ナビゲーション メニュー

検索

ナビゲーションメニュー