Automatic Radiology Report Generator Using Transformer With Contrast-Based Image Enhancement

Hilya Tsaniya, Chastine Fatichah*, Nanik Suciati

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Writing radiology reports based on radiographic images is a time-consuming task that demands the expertise of skilled radiologists. Consequently, the integration of technology capable of automated report generation would be advantageous. Developing a coherent predictive text is the main challenge in automatic report generation. It is necessary to develop methods that can increase the relevance of features in producing predictive text. This study constructed a medical report generator model using the transformer approach and image enhancement implementation. To leverage the visual and semantic features, an approach to enhance the noise-prone nature of the medical image is explored in this study along with the transformers method to generate a radiology report based on Chest X-ray images. Four contrast-based image enhancement methods were used to investigate the effect of image enhancement techniques on the radiology report generator. The encoder-decoder model is used with text feature embedding using Bidirectional Encoder Representation from Transformer (BERT) and visual feature extraction utilizing a pre-trained model ChexNet and Multi-Head Attention (MHA) mechanism. The performance of the MHA model with gamma correction is 5% in better with a 0.377 value using the Bilingual Assessment Understudy (BLEU) with 4 n-gram evaluation. MHA also produces 15% better results with a 0.412 value than the baseline model. This method is able to outperform the baseline model and other previous works. It can be concluded that the use of transformer MHA encoder layer and BERT is effective in leveraging visual and text features. Additionally, the inclusion of an image enhancement approach has been found to have a positive impact on the model's performance.

Original languageEnglish
Pages (from-to)25429-25442
Number of pages14
JournalIEEE Access
Volume12
DOIs
Publication statusPublished - 2024

Keywords

  • BERT embedding
  • ChexNet
  • image enhancement
  • medical image captioning
  • multi-head attention

Fingerprint

Dive into the research topics of 'Automatic Radiology Report Generator Using Transformer With Contrast-Based Image Enhancement'. Together they form a unique fingerprint.

Cite this