KD-AHOSVD: Neural Network Compression via Knowledge Distillation and Tensor Decomposition / Meneghetti, Laura; Bianchi, Edoardo; Demo, Nicola; Rozza, Gianluigi. - 15569 LNCS:(2025), pp. 81-92. (Paper presented at the 18th International Workshop on Design and Architecture for Signal and Image Processing, DASIP 2025, held in Barcelona, Spain, 20-22 January 2025) [10.1007/978-3-031-87897-8_7].
KD-AHOSVD: Neural Network Compression via Knowledge Distillation and Tensor Decomposition
Meneghetti, Laura; Bianchi, Edoardo; Demo, Nicola; Rozza, Gianluigi
2025-01-01
Abstract
In the field of Deep Learning, the high number of parameters in models has become a significant concern within the scientific community due to the increased computational resources and memory required for training and inference. Addressing this issue, we propose a novel tensorized technique to compress network architectures. Our approach aims to significantly reduce the network’s size and the number of parameters by integrating Averaged Higher Order Singular Value Decomposition with a novel Knowledge Distillation approach. Specifically, we replace certain layers of the original architecture with layers that perform linear projections onto a reduced space defined by our reduction technique. We conducted experiments on image classification tasks using multiple architectures and datasets. The evaluation focuses on final accuracy, model size, and parameter reduction, comparing our approach with both the original models and quantization, a widely used reduction method. The results underscore the effectiveness of our method in significantly reducing the number of parameters and the overall size of neural networks while maintaining high performance.
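The abstract describes replacing layers with linear projections onto a reduced space obtained from an averaged Higher Order SVD of intermediate feature tensors. Below is a minimal, self-contained sketch of that general idea; it is not the authors' implementation, and the function names, rank choices, and batch-averaging scheme are illustrative assumptions.

```python
# Hypothetical sketch of an averaged-HOSVD projection step (illustrative only;
# names, ranks, and the averaging scheme are assumptions, not the paper's code).
import numpy as np

def mode_unfold(tensor, mode):
    """Unfold a tensor along one mode into a matrix (mode dimension as rows)."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def ahosvd_projectors(feature_batch, ranks):
    """Average each mode's covariance over a batch of feature tensors, then keep
    the leading eigenvectors of every mode as projection matrices."""
    projectors = []
    for mode, rank in enumerate(ranks):
        dim = feature_batch.shape[mode + 1]
        cov = np.zeros((dim, dim))
        for sample in feature_batch:          # average over the batch
            unfolded = mode_unfold(sample, mode)
            cov += unfolded @ unfolded.T
        cov /= len(feature_batch)
        _, eigvecs = np.linalg.eigh(cov)      # eigenvalues in ascending order
        projectors.append(eigvecs[:, -rank:]) # keep the leading directions
    return projectors

def project(sample, projectors):
    """Linearly project a feature tensor onto the reduced space, mode by mode."""
    reduced = sample
    for mode, U in enumerate(projectors):
        reduced = np.moveaxis(np.tensordot(U.T, reduced, axes=(1, mode)), 0, mode)
    return reduced

# Toy usage: 8 feature maps of shape (16, 10, 10) reduced to (4, 5, 5).
features = np.random.rand(8, 16, 10, 10)
Us = ahosvd_projectors(features, ranks=(4, 5, 5))
print(project(features[0], Us).shape)         # -> (4, 5, 5)
```

In a compression setting along these lines, the projection matrices would be folded into replacement layers so that downstream computations operate on the reduced tensors rather than the full-size feature maps.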