TY - GEN
T1 - DynMDL
T2 - 7th IEEE International Congress on Big Data, BigData Congress 2018
AU - Leal, Eleazar
AU - Gruenwald, Le
PY - 2018/9/7
Y1 - 2018/9/7
N2 - The purpose of trajectory segmentation algorithms is to replace an input trajectory by a sub-trajectory with fewer points than the input, but that is also a good approximation to the original trajectory. As such, trajectory segmentation is an essential pre-processing step for trajectory mining algorithms, such as clustering. Among the segmentation strategies that are commonly used for trajectory clustering is Minimum Description Length (MDL)-based segmentation, which consists in finding a sub-trajectory such that the sum of its distance to the input trajectory and its overall length is minimum. However, there are no efficient algorithms for optimal MDL-based segmentation; there are only approximate algorithms. In this work we fill this gap by proposing a parallel multicore algorithm for MDL-based trajectory segmentation. We use three real-life datasets to show that our algorithm achieves optimal MDL, and compare its performance against Traclus, the state-of-the-art approximate Description Length (DL) segmentation algorithm.
AB - The purpose of trajectory segmentation algorithms is to replace an input trajectory by a sub-trajectory with fewer points than the input, but that is also a good approximation to the original trajectory. As such, trajectory segmentation is an essential pre-processing step for trajectory mining algorithms, such as clustering. Among the segmentation strategies that are commonly used for trajectory clustering is Minimum Description Length (MDL)-based segmentation, which consists in finding a sub-trajectory such that the sum of its distance to the input trajectory and its overall length is minimum. However, there are no efficient algorithms for optimal MDL-based segmentation; there are only approximate algorithms. In this work we fill this gap by proposing a parallel multicore algorithm for MDL-based trajectory segmentation. We use three real-life datasets to show that our algorithm achieves optimal MDL, and compare its performance against Traclus, the state-of-the-art approximate Description Length (DL) segmentation algorithm.
KW - MDL principle
KW - multicore algorithms
KW - parallel computing
KW - trajectory data
KW - trajectory segmentation
UR - http://www.scopus.com/inward/record.url?scp=85057758387&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057758387&partnerID=8YFLogxK
U2 - 10.1109/BigDataCongress.2018.00036
DO - 10.1109/BigDataCongress.2018.00036
M3 - Conference contribution
AN - SCOPUS:85057758387
T3 - Proceedings - 2018 IEEE International Congress on Big Data, BigData Congress 2018 - Part of the 2018 IEEE World Congress on Services
SP - 215
EP - 218
BT - Proceedings - 2018 IEEE International Congress on Big Data, BigData Congress 2018 - Part of the 2018 IEEE World Congress on Services
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 2 July 2018 through 7 July 2018
ER -