000 02267nam a2200337Ia 4500
000 03060nam a22003735i 4500
001 978-981-99-0827-1
003 DE-He213
005 20240319120915.0
007 cr nn 008mamaa
008 230529s2023 si | s |||| 0|eng d
020 _a9789819908271
_9978-981-99-0827-1
082 _a006.35
100 _aTan, Xu.
_932228
245 _aNeural Text-to-Speech Synthesis
_h[electronic resource] /
_cby Xu Tan.
250 _a1st ed. 2023.
260 _aSingapore
_bSpringer Nature Singapore
_c2023
300 _aXXV, 201 p. 24 illus. in color.
_bonline resource.
520 _aText-to-speech (TTS) aims to synthesize intelligible and natural speech from a given text. It is a hot topic in language, speech, and machine learning research and has broad applications in industry. This book introduces neural network-based TTS in the era of deep learning, aiming to provide a good understanding of neural TTS, current research and applications, and future research trends. The book first introduces the history of TTS technologies and gives an overview of neural TTS, then provides preliminary knowledge on language and speech processing, neural networks and deep learning, and deep generative models. It then introduces neural TTS from the perspective of key components (text analyses, acoustic models, vocoders, and end-to-end models) and advanced topics (expressive and controllable, robust, model-efficient, and data-efficient TTS). It also points out some future research directions and collects resources related to TTS. This book is the first to introduce neural TTS in a comprehensive and easy-to-understand way and can serve both academic researchers and industry practitioners working on TTS.
650 _aArtificial intelligence.
_932229
650 _aArtificial Intelligence.
_932230
650 _aMachine learning.
_932231
650 _aMachine Learning.
_932232
650 _aNatural language processing (Computer science).
_932233
650 _aNatural Language Processing (NLP).
_932234
650 _aSignal processing.
_932235
650 _aSpeech and Audio Processing.
_932236
650 _aSpeech processing systems.
_932237
856 _uhttps://doi.org/10.1007/978-981-99-0827-1
942 _cEBK
_2ddc
999 _c15283
_d15283