Skip to main navigation Skip to search Skip to main content

Optimizing Mel-Frequency Cepstral Coefficients for Improved Robot Speech Command Recognition Accuracy

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this study, we optimized Mel-Frequency Cepstral Coefficients (MFCC) for speech recognition. The experiment involved recording eight words by one hundred individuals with tempo, pitch, duration, resolution, and speech clarity variations. The results showed that the image's shape for each word remained consistent despite variations in pronunciation, confirming that MFCC is an effective method for feature extraction in speech recognition, particularly in robotic environments. Using logarithms in MFCC clarifies minor differences in amplitude scale, resulting in sharper contours and more visible details. This study recommends optimizing MFCC parameters, testing on larger datasets, and applying the method in various operational conditions to enhance its reliability in the future.

Original languageEnglish
Title of host publicationProceedings - 2024 International of Seminar on Application for Technology of Information and Communication
Subtitle of host publicationSmart And Emerging Technology for a Better Life, iSemantic 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages284-289
Number of pages6
ISBN (Electronic)9798350366648
DOIs
Publication statusPublished - 2024
Event2024 International of Seminar on Application for Technology of Information and Communication, iSemantic 2024 - Semarang, Indonesia
Duration: 21 Sept 202422 Sept 2024

Publication series

NameProceedings - 2024 International of Seminar on Application for Technology of Information and Communication: Smart And Emerging Technology for a Better Life, iSemantic 2024

Conference

Conference2024 International of Seminar on Application for Technology of Information and Communication, iSemantic 2024
Country/TerritoryIndonesia
CitySemarang
Period21/09/2422/09/24

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Accuracy
  • Recognition
  • Signal
  • Speech
  • feature extraction

Fingerprint

Dive into the research topics of 'Optimizing Mel-Frequency Cepstral Coefficients for Improved Robot Speech Command Recognition Accuracy'. Together they form a unique fingerprint.

Cite this