Skip to main navigation Skip to search Skip to main content

Hybrid Attention-based Super-resolution for Satellite Imagery Using Vast Receptive Field and Swin Attention to Improve Vehicle Detection

  • Budi Setiyono*
  • , Abdul Aziz Ar Rasyid
  • *Corresponding author for this work
  • Institut Teknologi Sepuluh Nopember

Research output: Contribution to journalArticlepeer-review

Abstract

Satellite imagery-based vehicle detection presents a promising approach to support intelligent transportation systems. However, satellite images often suffer from low spatial resolution and external distortions due to long-range imaging. To address these challenges, we propose a hybrid super-resolution framework that integrates Vast Receptive Field Pixel Attention mechanisms with shifted-window attention named VapSwinSR. The proposed model design begins with the VapSR architecture, which utilizes depth-wise dilated convolution, and is then modified by adding a self-attention mechanism. This modification led to an increase in the size of the receptive field, accompanied by a more effective attention weight, resulting in improved super-resolution image quality. Improvements in satellite image resolution have also been shown to improve vehicle detection performance. Experimental results on publicly available satellite imagery datasets including xView and DOTA showed that the proposed method achieved a 2.7% PSNR improvement over Swin Transformer-based methods with fewer parameters, and a 9.95% improvement over classical methods. In the vehicle detection experiment using YOLOv8, the use of super-resolved images resulted in an average increase of 14.26% in mAP50. These findings indicate that improving satellite image resolution not only enhances visual quality but also significantly boosts the effectiveness of vehicle detection, particularly for small objects commonly found in urban traffic environments.

Original languageEnglish
Pages (from-to)128-149
Number of pages22
JournalInternational Journal of Intelligent Engineering and Systems
Volume18
Issue number10
DOIs
Publication statusPublished - 30 Nov 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • Satellite imagery
  • Shifted window attention
  • Super-resolution
  • Vast receptive field pixel attention
  • Vehicle detection

Fingerprint

Dive into the research topics of 'Hybrid Attention-based Super-resolution for Satellite Imagery Using Vast Receptive Field and Swin Attention to Improve Vehicle Detection'. Together they form a unique fingerprint.

Cite this