Voiced/unvoiced determination of speech signal in noisy environment using harmonicity measure based on instantaneous frequency

Dhany Arifianto*, Takao Kobayashi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

This paper presents a voiced/unvoiced determination algorithm using instantaneous frequency amplitude spectrum (IFAS) in adverse environment. The proposed algorithm measures the degree of periodicity of speech signal, defined as harmonicity measure, where the difference between voiced part and unvoiced speech can be quantitatively obtained. We describe a new technique for voicing decision using IFAS-based F0 evaluation function with variable window length and IF band selection. The proposed technique is evaluated with speech signal corrupted by additive white Gaussian, pink, and traffic noises. The results show that the proposed method outperforms ESPS, AMDF and TEMPO for both female and male speakers in all simulated conditions.

Original languageEnglish
Title of host publication2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages877-880
Number of pages4
ISBN (Print)0780388747, 9780780388741
DOIs
Publication statusPublished - 2005
Externally publishedYes
Event2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
Duration: 18 Mar 200523 Mar 2005

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
VolumeI
ISSN (Print)1520-6149

Conference

Conference2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
Country/TerritoryUnited States
CityPhiladelphia, PA
Period18/03/0523/03/05

Fingerprint

Dive into the research topics of 'Voiced/unvoiced determination of speech signal in noisy environment using harmonicity measure based on instantaneous frequency'. Together they form a unique fingerprint.

Cite this