The density matrix renormalization group (DRMG) algorithm, a numerical technique that has been successfully used for investigating the low energy properties of one-dimensional (1D) strongly correlated quantum systems, has recently emerged as an effective tool for studying two-dimensional (2D) systems as well. At the core of DMRG is a general decimation procedure that allows the systematic truncation of the Hilbert space leaving only the most relevant basis states. However, studying 2D systems requires more degrees of freedom and greater computational resources. To address this computational roadblock, we develop a massively parallel implementation of the DMRG algorithm that targets a large number of basis states. It relies on parallel linear algebra libraries that distribute the generation and diagonalization of large sparse matrices, as these remain to be the most time-consuming steps in DMRG. We tailor our developed code for efficient performance on two sections of CINECA Marconi, a class Tier-0 supercomputing infrastructure, and evaluate its performance and scalability on up to thousands of processors. From the performance analysis we identify some limitations in scalability and suggest possible ways to rectify them.

Large-Scale Implementation of the Density Matrix Renormalization Group Algorithm(2017 Dec 18).

Large-Scale Implementation of the Density Matrix Renormalization Group Algorithm

-
2017-12-18

Abstract

The density matrix renormalization group (DRMG) algorithm, a numerical technique that has been successfully used for investigating the low energy properties of one-dimensional (1D) strongly correlated quantum systems, has recently emerged as an effective tool for studying two-dimensional (2D) systems as well. At the core of DMRG is a general decimation procedure that allows the systematic truncation of the Hilbert space leaving only the most relevant basis states. However, studying 2D systems requires more degrees of freedom and greater computational resources. To address this computational roadblock, we develop a massively parallel implementation of the DMRG algorithm that targets a large number of basis states. It relies on parallel linear algebra libraries that distribute the generation and diagonalization of large sparse matrices, as these remain to be the most time-consuming steps in DMRG. We tailor our developed code for efficient performance on two sections of CINECA Marconi, a class Tier-0 supercomputing infrastructure, and evaluate its performance and scalability on up to thousands of processors. From the performance analysis we identify some limitations in scalability and suggest possible ways to rectify them.
18-dic-2017
Vance, James
Dalmonte, Marcello
Girotto, Ivan
File in questo prodotto:
File Dimensione Formato  
James Vance.pdf

accesso aperto

Tipologia: Tesi
Licenza: Non specificato
Dimensione 1.31 MB
Formato Adobe PDF
1.31 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11767/68070
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact