TY - GEN
T1 - GLIN
T2 - 11th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2023
AU - Wang, Congying
AU - Yu, Jia
AU - Zhao, Zhuoyue
N1 - Publisher Copyright: © 2023 ACM.
PY - 2023/11/13
Y1 - 2023/11/13
N2 - Although spatial indexes shorten the query response time, they rely on complex tree structures to narrow down the search space. Such structures in turn yield additional storage overhead and take a toll on index maintenance. Recently, there have been a flurry of efforts attempting to leverage Machine-Learning (ML) models to simplify the index structures. However, existing geospatial indexes can only index point data rather than complex geometries such as polygons and trajectories that are widely available in geospatial data. As a result, they cannot efficiently and correctly answer geometry relationship queries. This paper introduces GLIN, an indexing mechanism for spatial relationship queries on complex geometries. To achieve that, GLIN transforms geometries to Z-address intervals, and then harnesses an existing order-preserving learned index to model the cumulative distribution function between these intervals and the record positions. The lightweight learned index greatly reduces indexing overhead and provides faster or comparable query latency. Most importantly, GLIN augments spatial query windows to support queries exactly for common spatial relationships. Our experiments on real-world and synthetic datasets show that GLIN has 80%-90% lower storage overhead than Quad-Tree and 60% -80% than R-tree and 30% - 70% faster query on medium selectivity. Moreover, GLIN's maintenance throughput is 1.5 times higher on insertion and 3 - 5 times higher on deletion.
AB - Although spatial indexes shorten the query response time, they rely on complex tree structures to narrow down the search space. Such structures in turn yield additional storage overhead and take a toll on index maintenance. Recently, there have been a flurry of efforts attempting to leverage Machine-Learning (ML) models to simplify the index structures. However, existing geospatial indexes can only index point data rather than complex geometries such as polygons and trajectories that are widely available in geospatial data. As a result, they cannot efficiently and correctly answer geometry relationship queries. This paper introduces GLIN, an indexing mechanism for spatial relationship queries on complex geometries. To achieve that, GLIN transforms geometries to Z-address intervals, and then harnesses an existing order-preserving learned index to model the cumulative distribution function between these intervals and the record positions. The lightweight learned index greatly reduces indexing overhead and provides faster or comparable query latency. Most importantly, GLIN augments spatial query windows to support queries exactly for common spatial relationships. Our experiments on real-world and synthetic datasets show that GLIN has 80%-90% lower storage overhead than Quad-Tree and 60% -80% than R-tree and 30% - 70% faster query on medium selectivity. Moreover, GLIN's maintenance throughput is 1.5 times higher on insertion and 3 - 5 times higher on deletion.
KW - learned index
KW - spatial relationship query
UR - https://www.scopus.com/pages/publications/85178648425
U2 - 10.1145/3615833.3628590
DO - 10.1145/3615833.3628590
M3 - Conference contribution
T3 - BigSpatial 2023 - Proceedings of the 11th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
SP - 1
EP - 12
BT - BigSpatial 2023 - Proceedings of the 11th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
A2 - Shashidharan, Ashwin
A2 - Werner, Martin
A2 - Gadiraju, Krishna Karthik
A2 - Chandola, Varun
A2 - Vatsavai, Ranga Raju
PB - Association for Computing Machinery, Inc
Y2 - 13 November 2023
ER -