標題: A second-order learning algorithm for multilayer networks based on block Hessian matrix
作者: Wang, YJ
Lin, CT
電控工程研究所
Institute of Electrical and Control Engineering
關鍵字: multilayer perceptrons;Hessian matrix;forward-backward propagation;Newton's method;least squares estimation
公開日期: 1-十二月-1998
摘要: This article proposes a new second-order learning algorithm for training the multilayer perceptron (MLP) networks. The proposed algorithm is a revised Newton's method. A forward-backward propagation scheme is first proposed for network computation of the Hessian matrix, H, of the output error function of the MLP. A block Hessian matrix, H-b, is then defined to approximate and simplify H. Several lemmas and theorems are proved to uncover the important properties of H and H-b, and verify the good approximation of H-b to H; H-b preserves the major properties of H. The theoretic analysis leads to the development of an efficient way for computing the inverse of H-b recursively. In the proposed second-order learning algorithm, the least squares estimation technique is adopted to further lessen the local minimum problems. The proposed algorithm overcomes not only the drawbacks of the standard backpropagation algorithm (i.e. slow asymptotic convergence rate, bad controllability of convergence accuracy, local minimum problems, and high sensitivity to learning constant), but also the shortcomings of normal Newton's method used on the MLP, such as the lack of network implementation of H, ill representability of the diagonal terms of H, the heavy computation load of the inverse of H, and the requirement of a good initial estimate of the solution (weights). Several example problems are used to demonstrate the efficiency of the proposed learning algorithm. Extensive performance (convergence rate and accuracy) comparisons of the proposed algorithm with other learning schemes (including the standard backpropagation algorithm) are also made. (C) 1998 Elsevier Science Ltd. All rights reserved.
URI: http://dx.doi.org/10.1016/S0893-6080(98)00091-4
http://hdl.handle.net/11536/31733
ISSN: 0893-6080
DOI: 10.1016/S0893-6080(98)00091-4
期刊: NEURAL NETWORKS
Volume: 11
Issue: 9
起始頁: 1607
結束頁: 1622
顯示於類別:期刊論文


文件中的檔案:

  1. 000077631700004.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。