Abstract
In this study, we optimized Mel-Frequency Cepstral Coefficients (MFCC) for speech recognition. The experiment involved recording eight words by one hundred individuals with tempo, pitch, duration, resolution, and speech clarity variations. The results showed that the image's shape for each word remained consistent despite variations in pronunciation, confirming that MFCC is an effective method for feature extraction in speech recognition, particularly in robotic environments. Using logarithms in MFCC clarifies minor differences in amplitude scale, resulting in sharper contours and more visible details. This study recommends optimizing MFCC parameters, testing on larger datasets, and applying the method in various operational conditions to enhance its reliability in the future.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2024 International of Seminar on Application for Technology of Information and Communication |
| Subtitle of host publication | Smart And Emerging Technology for a Better Life, iSemantic 2024 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 284-289 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798350366648 |
| DOIs | |
| Publication status | Published - 2024 |
| Event | 2024 International of Seminar on Application for Technology of Information and Communication, iSemantic 2024 - Semarang, Indonesia Duration: 21 Sept 2024 → 22 Sept 2024 |
Publication series
| Name | Proceedings - 2024 International of Seminar on Application for Technology of Information and Communication: Smart And Emerging Technology for a Better Life, iSemantic 2024 |
|---|
Conference
| Conference | 2024 International of Seminar on Application for Technology of Information and Communication, iSemantic 2024 |
|---|---|
| Country/Territory | Indonesia |
| City | Semarang |
| Period | 21/09/24 → 22/09/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Accuracy
- Recognition
- Signal
- Speech
- feature extraction
Fingerprint
Dive into the research topics of 'Optimizing Mel-Frequency Cepstral Coefficients for Improved Robot Speech Command Recognition Accuracy'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver