Visual Speech Synthesis by Morphing Visemes

dc.date.accessioned	2004-10-20T21:04:35Z
dc.date.accessioned	2018-11-24T10:23:35Z
dc.date.available	2004-10-20T21:04:35Z
dc.date.available	2018-11-24T10:23:35Z
dc.date.issued	1999-05-01	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/7263
dc.identifier.uri	http://repository.aust.edu.ng/xmlui/handle/1721.1/7263
dc.description.abstract	We present MikeTalk, a text-to-audiovisual speech synthesizer which converts input text into an audiovisual speech stream. MikeTalk is built using visemes, which are a small set of images spanning a large range of mouth shapes. The visemes are acquired from a recorded visual corpus of a human subject which is specifically designed to elicit one instantiation of each viseme. Using optical flow methods, correspondence from every viseme to every other viseme is computed automatically. By morphing along this correspondence, a smooth transition between viseme images may be generated. A complete visual utterance is constructed by concatenating viseme transitions. Finally, phoneme and timing information extracted from a text-to-speech synthesizer is exploited to determine which viseme transitions to use, and the rate at which the morphing process should occur. In this manner, we are able to synchronize the visual speech stream with the audio speech stream, and hence give the impression of a photorealistic talking face.	en_US
dc.format.extent	5662753 bytes
dc.format.extent	1408669 bytes
dc.language.iso	en_US
dc.title	Visual Speech Synthesis by Morphing Visemes	en_US

Files in this item

Files	Size	Format	View
AIM-1658.pdf	1.408Mb	application/pdf	View/Open
AIM-1658.ps	5.662Mb	application/postscript	View/Open