5 books on Speech Recognition [PDF]

October 24, 2024

Books on Speech Recognition are covering various aspects of automatic speech-to-text conversion - from acoustic modeling and language modeling to voice assistant systems. They describe the technologies that allow to transcribe spoken language, such as deep learning and neural networks. Moreover, these books often provide practical examples and best practices, enabling startups to reach higher accuracy in their real-world applications.

1. Speech Recognition: Fundamentals and Applications
2023 by Fouad Sabry



From this book I found out that automatic speech recognition integrates computer science, linguistics and engineering. Nowadays deep learning and recurrent neural networks play a key role in advancing these technologies and large language models (as well as long short-term memory (LSTM)) are improving the accuracy of speech-to-text conversions. Popular applications of ASR include voice assistants, transcription and voice commands.
Download PDF

2. Deep Learning for NLP and Speech Recognition
2019 by Uday Kamath, John Liu, James Whitaker



This book contains hands-on case studies that demonstrate the implementation of deep learning in speech recognition. It proves that deep learning models (that are used for better understanding of spoken words) are essential for speech recognition and machine translation. NLP-based speech recognition can be integrated into various industries like finance and healthcare. Practical applications of deep learning in NLP include document classification and voice analysis.
Download PDF

3. Speech Recognition Using Articulatory and Excitation Source Features
2017 by K. Sreenivasa Rao, Manjunath K E



These authors dig deeper in articulatory features that enhance the performance of speech recognition systems. Besides, excitation source information helps differentiate sound units during speech production. Thus, combining spectral, articulatory and source features improves speech recognition accuracy. Speech recognition can be adapted for scripted, spontaneous and conversational speech. The book also lists different models that capture sound unit-specific insights from articulatory and excitation features.
Download PDF

4. Automatic Speech Recognition: A Deep Learning Approach
2014 by Dong Yu, Li Deng



I think this was the first book that focused exclusively on the deep learning approach to ASR. If you read it you'll learn the theoretical insights that underpin many successful deep learning models for ASR. This technology has revolutionized the accuracy and efficiency of voice-to-text. The book also provides a mathematical understanding of machine learning applied in ASR.
Download PDF

5. Advances in Speech Recognition: Mobile Environments, Call Centers and Clinics
2010 by Amy Neustein



This book was written in the times when speech recognition moved from experimental technology to practical applications and mobile phones started to use voice interfaces (VUI) instead of traditional GUIs. Voice recognition became vital for environments like call centers and clinical settings and multimodal interfaces combining voice and graphical elements emerged. The book explores how speech recognition could be critical for modern human-computer interaction, especially in constrained environments.
Download PDF



How to download PDF:

1. Install Google Books Downloader

2. Enter Book ID to the search box and press Enter

3. Click "Download Book" icon and select PDF*

* - note that for yellow books only preview pages are downloaded