Table of Contents
Fetching ...

Approximating Higher-Order Derivative Tensors Using Secant Updates

Karl Welzel, Raphael A. Hauser

TL;DR

Higher-order secant updates are proposed which generalize the idea of quasi-Newton updates to higher-order derivatives, approximating for example third derivatives from given Hessian evaluations.

Abstract

Quasi-Newton methods employ an update rule that gradually improves the Hessian approximation using the already available gradient evaluations. We propose higher-order secant updates which generalize this idea to higher-order derivatives, approximating for example third derivatives (which are tensors) from given Hessian evaluations. Our generalization is based on the observation that quasi-Newton updates are least-change updates satisfying the secant equation, with different methods using different norms to measure the size of the change. We present a full characterization for least-change updates in weighted Frobenius norms (satisfying an analogue of the secant equation) for derivatives of arbitrary order. Moreover, we establish convergence of the approximations to the true derivative under standard assumptions and explore the quality of the generated approximations in numerical experiments.

Approximating Higher-Order Derivative Tensors Using Secant Updates

TL;DR

Higher-order secant updates are proposed which generalize the idea of quasi-Newton updates to higher-order derivatives, approximating for example third derivatives from given Hessian evaluations.

Abstract

Quasi-Newton methods employ an update rule that gradually improves the Hessian approximation using the already available gradient evaluations. We propose higher-order secant updates which generalize this idea to higher-order derivatives, approximating for example third derivatives (which are tensors) from given Hessian evaluations. Our generalization is based on the observation that quasi-Newton updates are least-change updates satisfying the secant equation, with different methods using different norms to measure the size of the change. We present a full characterization for least-change updates in weighted Frobenius norms (satisfying an analogue of the secant equation) for derivatives of arbitrary order. Moreover, we establish convergence of the approximations to the true derivative under standard assumptions and explore the quality of the generated approximations in numerical experiments.
Paper Structure