Abstract

With the massive amount of data generated, a data engineering method is needed to process the data to obtain new information, which can later be used for further understanding and business decisions. One of the data processing methods is the k-nearest neighbor (k-NN) query. The k-NN queries aim to identify k-objects with the shortest distance based on object references over the network. However, a critical limitation of the k-NN query lies in its inability to process a query when an object has proximity durability to a reference object at a particular time interval. This limitation arises since the k-NN query does not consider the durability and time interval variables in its calculation algorithm. This article discusses the problem of time interval-based durable k-NN queries on time series data by introducing the algorithmic framework and employing dictionary and grid data structure to process these queries. The proposed algorithm, which is a dictionary and grid-based durable k-NN queries method, incorporates parallel computing techniques to efficiently search for a collection of k objects that exhibit durability in their proximity to a reference object within a predefined time interval. The effectiveness and efficiency of the devised algorithms were assessed through extensive testing using both real-world and synthetic datasets.

Original languageEnglish
Title of host publication2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1378-1385
Number of pages8
ISBN (Electronic)9798350300673
DOIs
Publication statusPublished - 2023
Event2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 - Taipei, Taiwan, Province of China
Duration: 31 Oct 20233 Nov 2023

Publication series

Name2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023

Conference

Conference2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
Country/TerritoryTaiwan, Province of China
CityTaipei
Period31/10/233/11/23

Keywords

  • continuous
  • data banks
  • data engineering
  • grid
  • k-NN queries
  • multidimensional
  • time series

Fingerprint

Dive into the research topics of 'Efficient Dictionary and Grid-Based Framework for Answering Durable k-Nearest Neighbor Queries on Time Series Data'. Together they form a unique fingerprint.

Cite this