TY - GEN
T1 - Parallel Compression and Indexing of Large-Scale Geospatial Raster Data with GPGPUs
AU - Kaligirwa, Nathalie
AU - Leal, Eleazar
AU - Gruenwald, Le
AU - Zhang, Jianting
AU - You, Simin
PY - 2017/9/7
Y1 - 2017/9/7
N2 - Global remote sensing and large-scale environment modeling have generated vast amounts of raster geospatial data. Performing spatial queries over such data has applications in many domains, such as climate impact studies, water and wildlife management, and urban planning. Processing those queries is greatly facilitated by the existence of spatial indices. However, I/O transfer is still a major bottleneck in the overall system performance. One of the solutions to this issue is to compress data before sending it over the I/O channel. Therefore, a lossless compression technique that also supports spatial indexing to improve query response time is highly desirable. To fill this gap, in this paper we propose two parallel GPGPU algorithms, called Multi-Block per Tile (MBPT) and One-Block per Tile (OBPT), to compress and index large-scale geospatial raster data using BQ-Trees. Experiments comparing our best performing proposed algorithm, OBPT, against HFPaC, a state-of-the-art geospatial parallel GPGPU compression algorithm, using three real datasets of satellite images, show that our algorithm achieves a compression time speedup of up to 2X, and a 2.5X increment in compression ratio. OBPT also yields a comparable average spatial query response time to HFPaC.
AB - Global remote sensing and large-scale environment modeling have generated vast amounts of raster geospatial data. Performing spatial queries over such data has applications in many domains, such as climate impact studies, water and wildlife management, and urban planning. Processing those queries is greatly facilitated by the existence of spatial indices. However, I/O transfer is still a major bottleneck in the overall system performance. One of the solutions to this issue is to compress data before sending it over the I/O channel. Therefore, a lossless compression technique that also supports spatial indexing to improve query response time is highly desirable. To fill this gap, in this paper we propose two parallel GPGPU algorithms, called Multi-Block per Tile (MBPT) and One-Block per Tile (OBPT), to compress and index large-scale geospatial raster data using BQ-Trees. Experiments comparing our best performing proposed algorithm, OBPT, against HFPaC, a state-of-the-art geospatial parallel GPGPU compression algorithm, using three real datasets of satellite images, show that our algorithm achieves a compression time speedup of up to 2X, and a 2.5X increment in compression ratio. OBPT also yields a comparable average spatial query response time to HFPaC.
KW - query processing on GPGPUs
KW - query processing on raster data
KW - raster data compression
KW - raster data indexing
UR - http://www.scopus.com/inward/record.url?scp=85032387346&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85032387346&partnerID=8YFLogxK
U2 - 10.1109/BigDataCongress.2017.27
DO - 10.1109/BigDataCongress.2017.27
M3 - Conference contribution
AN - SCOPUS:85032387346
T3 - Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017
SP - 137
EP - 144
BT - Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017
A2 - Karypis, George
A2 - Zhang, Jia
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th IEEE International Congress on Big Data, BigData Congress 2017
Y2 - 25 June 2017 through 30 June 2017
ER -