Mastrogiuseppe, Francesca; Ostojic, Srdjan. "A geometrical analysis of global stability in trained feedback networks." Neural Computation 31(6):1139-1182 (2019). ISSN 0899-7667. doi:10.1162/neco_a_01187

A geometrical analysis of global stability in trained feedback networks

Mastrogiuseppe, Francesca; Ostojic, Srdjan
2019

Abstract

Recurrent neural networks have been extensively studied in the context of neuroscience and machine learning due to their ability to implement complex computations. While substantial progress has been achieved in designing effective learning algorithms, a full understanding of trained recurrent networks is still lacking. Specifically, the mechanisms that allow computations to emerge from the underlying recurrent dynamics are largely unknown. Here we focus on a simple yet underexplored computational setup: a feedback architecture trained to associate a stationary output with a stationary input. As a starting point, we derive an approximate analytical description of the global dynamics in trained networks, which assumes uncorrelated connectivity weights in the feedback loop and in the random bulk. The resulting mean-field theory suggests that the task admits several classes of solutions with different stability properties. These classes are characterized by the geometrical arrangement of the readout vector with respect to the input vectors in the high-dimensional space spanned by the network population. We find that this approximate theoretical approach can be used to understand how standard training techniques implement the input-output task in finite-size feedback networks. In particular, our simplified description captures the local and global stability properties of the target solution, and thus predicts training performance.
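
The setup described in the abstract lends itself to a minimal simulation. The sketch below (plain NumPy; the parameter values, the tanh nonlinearity, and the vector names w_in, w_fb, w_out are illustrative assumptions, not the paper's exact formulation) integrates a rate network with a random bulk, a stationary input entering along an input vector, and a scalar readout fed back into the dynamics through a feedback vector:

    import numpy as np

    # Minimal sketch of a feedback network (assumed parameterization, not the
    # paper's exact equations): rate dynamics with random bulk connectivity J,
    # a stationary input u entering along w_in, and a scalar readout
    # z = w_out . phi(x) fed back into the network through w_fb.
    N, g, dt, n_steps = 500, 1.2, 0.05, 2000
    rng = np.random.default_rng(0)

    J = rng.normal(0.0, g / np.sqrt(N), size=(N, N))   # random bulk
    w_in = rng.normal(0.0, 1.0, size=N)                # input vector
    w_fb = rng.normal(0.0, 1.0, size=N)                # feedback vector
    w_out = rng.normal(0.0, 1.0 / np.sqrt(N), size=N)  # readout (trained in practice)
    u = 1.0                                            # stationary input

    x = rng.normal(0.0, 0.1, size=N)                   # network state
    for _ in range(n_steps):
        r = np.tanh(x)                                 # firing rates phi(x)
        z = w_out @ r                                  # scalar readout
        x += dt * (-x + J @ r + w_fb * z + w_in * u)   # Euler step

    print("readout after relaxation:", w_out @ np.tanh(x))

In practice the readout w_out would be trained (for example, by least squares) so that z reaches a target value for the given input; per the abstract, the geometrical arrangement of the readout with respect to the input vectors then determines the local and global stability of the resulting fixed point.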
Year: 2019
Volume: 31
Issue: 6
Pages: 1139-1182
arXiv preprint: https://arxiv.org/abs/1809.02386
PubMed: https://pubmed.ncbi.nlm.nih.gov/30979353/
Authors: Mastrogiuseppe, Francesca; Ostojic, Srdjan
Files for this record:
File: mastrogiuseppe_neuralcomp_2019.pdf
Access: open access
Description: publisher's PDF
License: not specified
Size: 4.8 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.11767/148435
Citations
  • PMC: not available
  • Scopus: 11
  • Web of Science: 12