TY - JOUR
T1 - A text-to-audiovisual synthesizer for Indonesian by morphing Viseme
AU - Arifin,
AU - Sumpeno, Surya
AU - Hariadi, Mochamad
AU - Haryanto, Hanny
N1 - Publisher Copyright:
© 2015 Praise Worthy Prize S.r.l. - All rights reserved.
PY - 2015/11
Y1 - 2015/11
N2 - There are many researches held on the text-to-audiovisual, but only a few are applied on Indonesian language. The results of the present research can be applied to a very wide field, e.g. gaming industry, animation industry, human computer interaction systems, etc. The correspondence among speech, mouth movements (visual phoneme/viseme) and phoneme spoken is needed to produce a realistic text-to-audiovisual. This research aims to develop a text-toaudiovisual synthesizer for Indonesian language based on inputted Indonesian text called TTAVI (Text-To-AudioVisual synthesizer for Indonesian language). The method consists of four major parts, namely, building the models of Indonesian’s viseme, converting a text-to-speech, synchronization process, and stringing the visemes by using the morphing viseme algorithm. Morphing viseme algorithm shows that a virtual character of the phonemes pronunciation resulting from the TTAVI synthesizer is smoother. 10 Indonesian texts inputted to TTAVI synthesizer were examined by 30 users. The appraisal results of users were calculated by applying Mean Opinion Score (MOS) methods. The average of the MOS score is 4.106 with a value range from 1 to 5. This shows that TTAVI synthesizer is considered good, and morphing viseme algorithm is able to make the result of TTAVI synthesizer smoother.
AB - There are many researches held on the text-to-audiovisual, but only a few are applied on Indonesian language. The results of the present research can be applied to a very wide field, e.g. gaming industry, animation industry, human computer interaction systems, etc. The correspondence among speech, mouth movements (visual phoneme/viseme) and phoneme spoken is needed to produce a realistic text-to-audiovisual. This research aims to develop a text-toaudiovisual synthesizer for Indonesian language based on inputted Indonesian text called TTAVI (Text-To-AudioVisual synthesizer for Indonesian language). The method consists of four major parts, namely, building the models of Indonesian’s viseme, converting a text-to-speech, synchronization process, and stringing the visemes by using the morphing viseme algorithm. Morphing viseme algorithm shows that a virtual character of the phonemes pronunciation resulting from the TTAVI synthesizer is smoother. 10 Indonesian texts inputted to TTAVI synthesizer were examined by 30 users. The appraisal results of users were calculated by applying Mean Opinion Score (MOS) methods. The average of the MOS score is 4.106 with a value range from 1 to 5. This shows that TTAVI synthesizer is considered good, and morphing viseme algorithm is able to make the result of TTAVI synthesizer smoother.
KW - A model of indonesian’s visemes
KW - Audiovisual
KW - Indonesian text
KW - Morphing viseme
KW - Viseme
UR - http://www.scopus.com/inward/record.url?scp=84956962132&partnerID=8YFLogxK
U2 - 10.15866/irecos.v10i11.7833
DO - 10.15866/irecos.v10i11.7833
M3 - Article
AN - SCOPUS:84956962132
SN - 1828-6003
VL - 10
SP - 1149
EP - 1156
JO - International Review on Computers and Software
JF - International Review on Computers and Software
IS - 11
ER -