Abstract

Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC applications because it not only significantly reduces storage overhead but also can retain high fidelityfor postanalysis. Because supercomputers and HPC applicationsare becoming heterogeneous using accelerator-based architectures,in particular GPUs, several development teams have recently released GPU versions of their lossy compressors. However, existingstate-of-the-art GPU-based lossy compressors suffer from eitherlow compression and decompression throughput or low compression quality. In this paper, we present an optimized GPU version,cuSZ, for one of the best error-bounded lossy compressors-SZ.To the best of our knowledge, cuSZ is the first error-boundedlossy compressor on GPUs for scientific data. Our contributions arefourfold. (1) We propose a dual-qantization scheme to entirelyremove the data dependency in the prediction step of SZ such thatthis step can be performed very efficiently on GPUs. (2) We developan efficient customized Huffman coding for the SZ compressor onGPUs. (3) We implement cuSZ using CUDA and optimize its performance by improving the utilization of GPU memory bandwidth. (4)We evaluate our cuSZ on five real-world HPC application datasetsfrom the Scientific Data Reduction Benchmarks and compare it withother state-of-the-art methods on both CPUs and GPUs. Experiments show that our cuSZ improves SZ's compression throughputby up to 370.1× and 13.1×, respectively, over the production version running on single and multiple CPU cores, respectively, whilegetting the same quality of reconstructed data. It also improves thecompression ratio by up to 3.48× on the tested data compared withanother state-of-the-art GPU supported lossy compressor.

Meeting Name

ACM International Conference on Parallel Architectures and Compilation Techniques, PACT '20 (2020: Oct. 3-7, Virtual)

Department(s)

Computer Science

Comments

This research was supported by the Exascale Computing Project (ECP), Project Number: 17-SC-20-SC. This work was also supported by the National Science Foundation under Grants CCF-1619253, OAC- 2003709, OAC-1948447/2034169, and OAC-2003624/2042084.

International Standard Book Number (ISBN)

978-145038075-1

International Standard Serial Number (ISSN)

1089-795X

Document Type

Article - Conference proceedings

Document Version

Final Version

File Type

text

Language(s)

English

Rights

© 2020 The Authors, All rights reserved.

Creative Commons Licensing

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Publication Date

30 Sep 2020

Share

 
COinS