TurboReg: A framework for scaling up spatial logistic regression models

Ibrahim Sabek; Mashaal Musleh; Mohamed F Mokbel

doi:10.1145/3274895.3274987

TurboReg: A framework for scaling up spatial logistic regression models

Ibrahim Sabek, Mashaal Musleh, Mohamed F Mokbel

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

12 Scopus citations

Abstract

Predicting the presence or absence of spatial phenomena has been of great interest to scientists pursuing research in several applications including epidemic diseases detection, species occurrence prediction and earth observation. In this operation, a geographical space is divided by a two-dimensional grid, where the prediction (i.e, either 0 or 1) is performed at each cell in the grid. A common approach to solve this problem is to build spatial logistic regression models (a.k.a autologistic models) that estimate the prediction at any location based on a set of predictors (i.e., features) at this location and predictions from neighboring locations. Unfortunately, existing methods to build autologistic models are computationally expensive and do not scale up for large-scale grid data (e.g., fine-grained satellite images). This paper introduces TurboReg, a scalable framework to build autologistic models for predicting large-scale spatial phenomena. TurboReg considers both the accuracy and efficiency aspects when learning the regression model parameters. TurboReg is built on top of Markov Logic Network (MLN), a scalable statistical learning framework, where its internals and data structures are optimized to process spatial data. A set of experiments using large real and synthetic data show that TurboReg achieves at least three orders of magnitude performance gain over existing methods while preserving the model accuracy.

Original language	English (US)
Title of host publication	26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018
Editors	Li Xiong, Roberto Tamassia, Kashani Farnoush Banaei, Ralf Hartmut Guting, Erik Hoel
Publisher	Association for Computing Machinery
Pages	129-138
Number of pages	10
ISBN (Electronic)	9781450358897
DOIs	https://doi.org/10.1145/3274895.3274987
State	Published - Nov 6 2018
Event	26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018 - Seattle, United States Duration: Nov 6 2018 → Nov 9 2018

Publication series

Name	GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems

Other

Other	26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018
Country/Territory	United States
City	Seattle
Period	11/6/18 → 11/9/18

Keywords

Autologistic models
Factor graph
First-order logic
Markov logic networks
Spatial regression

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access

10.1145/3274895.3274987

OpenUrl availability

Full text

Cite this

Sabek, I., Musleh, M., & Mokbel, M. F. (2018). TurboReg: A framework for scaling up spatial logistic regression models. In L. Xiong, R. Tamassia, K. F. Banaei, R. H. Guting, & E. Hoel (Eds.), 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018 (pp. 129-138). (GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems). Association for Computing Machinery. https://doi.org/10.1145/3274895.3274987

TurboReg: A framework for scaling up spatial logistic regression models. / Sabek, Ibrahim; Musleh, Mashaal; Mokbel, Mohamed F.
26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018. ed. / Li Xiong; Roberto Tamassia; Kashani Farnoush Banaei; Ralf Hartmut Guting; Erik Hoel. Association for Computing Machinery, 2018. p. 129-138 (GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Sabek, I, Musleh, M & Mokbel, MF 2018, TurboReg: A framework for scaling up spatial logistic regression models. in L Xiong, R Tamassia, KF Banaei, RH Guting & E Hoel (eds), 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018. GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems, Association for Computing Machinery, pp. 129-138, 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018, Seattle, United States, 11/6/18. https://doi.org/10.1145/3274895.3274987

Sabek I, Musleh M, Mokbel MF. TurboReg: A framework for scaling up spatial logistic regression models. In Xiong L, Tamassia R, Banaei KF, Guting RH, Hoel E, editors, 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018. Association for Computing Machinery. 2018. p. 129-138. (GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems). doi: 10.1145/3274895.3274987

Sabek, Ibrahim ; Musleh, Mashaal ; Mokbel, Mohamed F. / TurboReg : A framework for scaling up spatial logistic regression models. 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018. editor / Li Xiong ; Roberto Tamassia ; Kashani Farnoush Banaei ; Ralf Hartmut Guting ; Erik Hoel. Association for Computing Machinery, 2018. pp. 129-138 (GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems).

@inproceedings{01d2e17a8fa64571b509a13993d4af84,

title = "TurboReg: A framework for scaling up spatial logistic regression models",

abstract = "Predicting the presence or absence of spatial phenomena has been of great interest to scientists pursuing research in several applications including epidemic diseases detection, species occurrence prediction and earth observation. In this operation, a geographical space is divided by a two-dimensional grid, where the prediction (i.e, either 0 or 1) is performed at each cell in the grid. A common approach to solve this problem is to build spatial logistic regression models (a.k.a autologistic models) that estimate the prediction at any location based on a set of predictors (i.e., features) at this location and predictions from neighboring locations. Unfortunately, existing methods to build autologistic models are computationally expensive and do not scale up for large-scale grid data (e.g., fine-grained satellite images). This paper introduces TurboReg, a scalable framework to build autologistic models for predicting large-scale spatial phenomena. TurboReg considers both the accuracy and efficiency aspects when learning the regression model parameters. TurboReg is built on top of Markov Logic Network (MLN), a scalable statistical learning framework, where its internals and data structures are optimized to process spatial data. A set of experiments using large real and synthetic data show that TurboReg achieves at least three orders of magnitude performance gain over existing methods while preserving the model accuracy.",

keywords = "Autologistic models, Factor graph, First-order logic, Markov logic networks, Spatial regression",

author = "Ibrahim Sabek and Mashaal Musleh and Mokbel, {Mohamed F}",

year = "2018",

month = nov,

day = "6",

doi = "10.1145/3274895.3274987",

language = "English (US)",

series = "GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems",

publisher = "Association for Computing Machinery",

pages = "129--138",

editor = "Li Xiong and Roberto Tamassia and Banaei, {Kashani Farnoush} and Guting, {Ralf Hartmut} and Erik Hoel",

booktitle = "26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018",

note = "26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018 ; Conference date: 06-11-2018 Through 09-11-2018",

}

TY - GEN

T1 - TurboReg

T2 - 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018

AU - Sabek, Ibrahim

AU - Musleh, Mashaal

AU - Mokbel, Mohamed F

PY - 2018/11/6

Y1 - 2018/11/6

N2 - Predicting the presence or absence of spatial phenomena has been of great interest to scientists pursuing research in several applications including epidemic diseases detection, species occurrence prediction and earth observation. In this operation, a geographical space is divided by a two-dimensional grid, where the prediction (i.e, either 0 or 1) is performed at each cell in the grid. A common approach to solve this problem is to build spatial logistic regression models (a.k.a autologistic models) that estimate the prediction at any location based on a set of predictors (i.e., features) at this location and predictions from neighboring locations. Unfortunately, existing methods to build autologistic models are computationally expensive and do not scale up for large-scale grid data (e.g., fine-grained satellite images). This paper introduces TurboReg, a scalable framework to build autologistic models for predicting large-scale spatial phenomena. TurboReg considers both the accuracy and efficiency aspects when learning the regression model parameters. TurboReg is built on top of Markov Logic Network (MLN), a scalable statistical learning framework, where its internals and data structures are optimized to process spatial data. A set of experiments using large real and synthetic data show that TurboReg achieves at least three orders of magnitude performance gain over existing methods while preserving the model accuracy.

AB - Predicting the presence or absence of spatial phenomena has been of great interest to scientists pursuing research in several applications including epidemic diseases detection, species occurrence prediction and earth observation. In this operation, a geographical space is divided by a two-dimensional grid, where the prediction (i.e, either 0 or 1) is performed at each cell in the grid. A common approach to solve this problem is to build spatial logistic regression models (a.k.a autologistic models) that estimate the prediction at any location based on a set of predictors (i.e., features) at this location and predictions from neighboring locations. Unfortunately, existing methods to build autologistic models are computationally expensive and do not scale up for large-scale grid data (e.g., fine-grained satellite images). This paper introduces TurboReg, a scalable framework to build autologistic models for predicting large-scale spatial phenomena. TurboReg considers both the accuracy and efficiency aspects when learning the regression model parameters. TurboReg is built on top of Markov Logic Network (MLN), a scalable statistical learning framework, where its internals and data structures are optimized to process spatial data. A set of experiments using large real and synthetic data show that TurboReg achieves at least three orders of magnitude performance gain over existing methods while preserving the model accuracy.

KW - Autologistic models

KW - Factor graph

KW - First-order logic

KW - Markov logic networks

KW - Spatial regression

UR - http://www.scopus.com/inward/record.url?scp=85058653614&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058653614&partnerID=8YFLogxK

U2 - 10.1145/3274895.3274987

DO - 10.1145/3274895.3274987

M3 - Conference contribution

AN - SCOPUS:85058653614

T3 - GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems

SP - 129

EP - 138

BT - 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018

A2 - Xiong, Li

A2 - Tamassia, Roberto

A2 - Banaei, Kashani Farnoush

A2 - Guting, Ralf Hartmut

A2 - Hoel, Erik

PB - Association for Computing Machinery

Y2 - 6 November 2018 through 9 November 2018

ER -

TurboReg: A framework for scaling up spatial logistic regression models

Abstract

Publication series

Other

Keywords

UN SDGs

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this