Fast Newton-CG Method for Batch Learning of Conditional Random Fields

Yuta Tsuboi; Yuya Unno; Hisashi Kashima; Naoaki Okazaki

doi:10.1609/aaai.v25i1.7894

Fast Newton-CG Method for Batch Learning of Conditional Random Fields

Authors

Yuta Tsuboi IBM Research - Tokyo
Yuya Unno Preferred Infrastructure, Inc.
Hisashi Kashima The University of Tokyo
Naoaki Okazaki Tohoku University

DOI:

https://doi.org/10.1609/aaai.v25i1.7894

Abstract

We propose a fast batch learning method for linear-chain Conditional Random Fields (CRFs) based on Newton-CG methods. Newton-CG methods are a variant of Newton method for high-dimensional problems. They only require the Hessian-vector products instead of the full Hessian matrices. To speed up Newton-CG methods for the CRF learning, we derive a novel dynamic programming procedure for the Hessian-vector products of the CRF objective function. The proposed procedure can reuse the byproducts of the time-consuming gradient computation for the Hessian-vector products to drastically reduce the total computation time of the Newton-CG methods. In experiments with tasks in natural language processing, the proposed method outperforms a conventional quasi-Newton method. Remarkably, the proposed method is competitive with online learning algorithms that are fast but unstable.

Downloads

Published

2011-08-04

How to Cite

Tsuboi, Y., Unno, Y., Kashima, H., & Okazaki, N. (2011). Fast Newton-CG Method for Batch Learning of Conditional Random Fields. Proceedings of the AAAI Conference on Artificial Intelligence, 25(1), 489–494. https://doi.org/10.1609/aaai.v25i1.7894

Download Citation

Issue

Vol. 25 No. 1 (2011): Twenty-Fifth AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: Machine Learning

Fast Newton-CG Method for Batch Learning of Conditional Random Fields

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information