A Multimodal Approach to Audiovisual Text-to-Speech Synthesis (2013)
Abstract / truncated to 115 words
Speech, consisting of an auditory and a visual signal, has always been the most important means of communication between humans. It is well known that an optimal conveyance of the message requires that both the auditory and the visual speech signal can be perceived by the receiver. Nowadays people interact countless times with computer systems in every-day situations. Since the ultimate goal is to make this interaction feel completely natural and familiar, the most optimal way to interact with a computer system is by means of speech. Similar to the speech communication between humans, the most appropriate human-machine interaction consists of audiovisual speech signals. In order to allow the computer system to transfer a spoken ...
visual speech synthesis – audiovisual speech synthesis – audiovisual speech perception – phoneme-to-viseme mapping
Information
- Author
- Mattheyses, Wesley
- Institution
- Vrije Universiteit Brussel
- Supervisor
- Publication Year
- 2013
- Upload Date
- Oct. 2, 2013
The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.
The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.