Show simple item record

The effect of using normalized models in statistical speech synthesis

dc.creatorShannon, Matt
dc.creatorZen, Heiga
dc.creatorByrne, William Joseph
dc.date.accessioned2018-11-24T13:11:52Z
dc.date.available2013-04-09T19:18:33Z
dc.date.available2018-11-24T13:11:52Z
dc.date.issued2011-08-27
dc.identifierhttp://www.dspace.cam.ac.uk/handle/1810/244406
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/123456789/3039
dc.description.abstractThe standard approach to HMM-based speech synthesis is inconsistent in the enforcement of the deterministic constraints between static and dynamic features. The trajectory HMM and autoregressive HMM have been proposed as normalized models which rectify this inconsistency. This paper investigates the practical effects of using these normalized models, and examines the strengths and weaknesses of the different models as probabilistic models of speech. The most striking difference observed is that the standard approach greatly underestimates predictive variance. We argue that the normalized models have better predictive distributions than the standard approach, but that all the models we consider are still far from satisfactory probabilistic models of speech. We also present evidence that better intra-frame correlation modelling goes some way towards improving existing normalized models.
dc.languageen
dc.publisherISCA (International Speech Communication Association)
dc.rightshttp://creativecommons.org/licenses/by/2.0/uk/
dc.rightsAttribution 2.0 UK: England & Wales
dc.subjectHMM-based speech synthesis
dc.subjectacoustic modelling
dc.subjectautoregressive HMM
dc.subjecttrajectory HMM
dc.subjectnormalization
dc.titleThe effect of using normalized models in statistical speech synthesis
dc.typeConference Object


Files in this item

FilesSizeFormatView
shannon2011effect.pdf820.6Kbapplication/pdfView/Open

This item appears in the following Collection(s)

Show simple item record