In this paper we propose a Deep Learning architecture to approximate diffeomorphisms diffeotopic to the identity. We consider a control system of the form (x)over dot = Sigma(l)(i=1) F-i(x)u(i), with linear dependence in the controls, and we use the corresponding flow to approximate the action of a diffeomorphism on a compact ensemble of points. Despite the simplicity of the control system, it has been recently shown that a Universal Approximation Property holds. The problem of minimizing the sum of the training error and of a regularizing term induces a gradient flow in the space of admissible controls. A possible training procedure for the discrete-time neural network consists in projecting the gradient flow onto a finite-dimensional subspace of the admissible controls. An alternative approach relies on an iterative method based on Pontryagin Maximum Principle for the numerical resolution of Optimal Control problems. Here the maximization of the Hamiltonian can be carried out with an extremely low computational effort, owing to the linear dependence of the system in the control variables. Finally, we use tools from Gamma-convergence to provide an estimate of the expected generalization error.
DEEP LEARNING APPROXIMATION OF DIFFEOMORPHISMS VIA LINEAR-CONTROL SYSTEMS / Scagliotti, A. - In: MATHEMATICAL CONTROL AND RELATED FIELDS. - ISSN 2156-8472. - (2022), pp. 1-32. [10.3934/mcrf.2022036]
DEEP LEARNING APPROXIMATION OF DIFFEOMORPHISMS VIA LINEAR-CONTROL SYSTEMS
Scagliotti, A
2022-01-01
Abstract
In this paper we propose a Deep Learning architecture to approximate diffeomorphisms diffeotopic to the identity. We consider a control system of the form (x)over dot = Sigma(l)(i=1) F-i(x)u(i), with linear dependence in the controls, and we use the corresponding flow to approximate the action of a diffeomorphism on a compact ensemble of points. Despite the simplicity of the control system, it has been recently shown that a Universal Approximation Property holds. The problem of minimizing the sum of the training error and of a regularizing term induces a gradient flow in the space of admissible controls. A possible training procedure for the discrete-time neural network consists in projecting the gradient flow onto a finite-dimensional subspace of the admissible controls. An alternative approach relies on an iterative method based on Pontryagin Maximum Principle for the numerical resolution of Optimal Control problems. Here the maximization of the Hamiltonian can be carried out with an extremely low computational effort, owing to the linear dependence of the system in the control variables. Finally, we use tools from Gamma-convergence to provide an estimate of the expected generalization error.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.