Learning curves of generic features maps for realistic datasets with a teacher-student model / Loureiro, B.; Gerbelot, C.; Cui, H.; Goldt, S.; Krzakala, F.; Mezard, M.; Zdeborova, L. - 22:(2021), pp. 18137-18151. (Paper presented at the conference Advances in Neural Information Processing Systems 34 (NeurIPS 2021)).

Learning curves of generic features maps for realistic datasets with a teacher-student model

Goldt, S.; Krzakala, F.; Zdeborova, L.
2021-01-01

Abstract

Teacher-student models provide a framework in which the typical-case performance of high-dimensional supervised learning can be described in closed form. The assumptions of Gaussian i.i.d. input data underlying the canonical teacher-student model may, however, be perceived as too restrictive to capture the behaviour of realistic data sets. In this paper, we introduce a Gaussian covariate generalisation of the model where the teacher and student can act on different spaces, generated with fixed but generic feature maps. While still solvable in closed form, this generalisation is able to capture the learning curves for a broad range of realistic data sets, thus redeeming the potential of the teacher-student framework. Our contribution is twofold: first, we prove a rigorous formula for the asymptotic training loss and generalisation error. Second, we present a number of situations where the learning curve of the model captures that of a realistic data set learned with kernel regression and classification, with out-of-the-box feature maps such as random projections or scattering transforms, or with pre-learned ones, such as the features learned by training multi-layer neural networks. We discuss both the power and the limitations of the framework.
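The setup described in the abstract lends itself to a quick numerical illustration. Below is a minimal sketch (not the authors' code) assuming Gaussian i.i.d. inputs passed through fixed random-projection feature maps with a tanh nonlinearity for both teacher and student, and ridge regression for the student; all dimensions, the noise level and the ridge penalty are illustrative choices, not values from the paper. It traces an empirical learning curve by sweeping the number of training samples.

```python
# Minimal sketch of the teacher-student setup with generic feature maps:
# the teacher generates labels from one fixed feature map, the student
# learns with ridge regression on a different fixed feature map.
# All dimensions and hyperparameters below are illustrative assumptions.
import numpy as np
from numpy.random import default_rng

rng = default_rng(0)
d, p_teacher, p_student = 100, 150, 120                  # input and feature dimensions (arbitrary)
F_t = rng.standard_normal((p_teacher, d)) / np.sqrt(d)   # fixed teacher feature map (random projection)
F_s = rng.standard_normal((p_student, d)) / np.sqrt(d)   # fixed student feature map (random projection)
theta = rng.standard_normal(p_teacher)                   # teacher weights

def sample(n):
    X = rng.standard_normal((n, d))                      # Gaussian i.i.d. inputs
    U = np.tanh(X @ F_t.T)                               # teacher features
    V = np.tanh(X @ F_s.T)                               # student features
    y = U @ theta / np.sqrt(p_teacher) + 0.1 * rng.standard_normal(n)  # noisy teacher labels
    return V, y

lam = 1e-2                                               # ridge penalty (arbitrary)
for n in [50, 100, 200, 400, 800]:
    V, y = sample(n)
    w = np.linalg.solve(V.T @ V + lam * np.eye(p_student), V.T @ y)    # ridge regression fit
    V_test, y_test = sample(2000)
    gen_err = np.mean((V_test @ w - y_test) ** 2)        # empirical generalisation error
    print(f"n={n:4d}  generalisation error={gen_err:.3f}")
```

In the paper the corresponding learning curves are obtained from a closed-form asymptotic formula; the sketch above only produces the empirical side against which such predictions would be compared.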
Year: 2021
Journal: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS
Volume: 22
First page: 18137
Last page: 18151
Authors: Loureiro, B.; Gerbelot, C.; Cui, H.; Goldt, S.; Krzakala, F.; Mezard, M.; Zdeborova, L.
Files in this record:
No files are associated with this record.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.11767/135754
Citations
  • PMC: ND
  • Scopus: 38
  • Web of Science (ISI): 17