
Voice and Speech Recognition: From Fundamentals to Full Implementation

Author(s): Dr. Riddhi Panchal, Dr Gopal Sakarkar, Dr. Deepali Bhende, Dr. Vijayalakshmi. P, Mr. Satyam Raval, Mr. Aaryan Mavani, Mr. Vinayak Chavan

  • Language: English
  • ISBN13: 9789369267651
  • ISBN10: 9369267654
  • Format: Paperback
  • Trim: 6x9
  • Pages: 161
  • Publication date: 21-Aug-2025

  •   Available, Ships in 3-5 days
  •   10 Days Replacement Policy

This book offers a comprehensive guide to voice, speech, and speaker recognition, blending theory, coding, datasets, and real-world applications. It traces speech technology’s evolution from early prototypes to modern transformer-based systems, highlighting applications in virtual assistants, accessibility, and biometrics. Covering speech anatomy, audio preprocessing, and feature extraction techniques like MFCC, it explains libraries such as Librosa and PyDub. Machine learning and deep learning models, including CNNs, RNNs, and hybrid approaches, are detailed with code examples. Practical projects use datasets like CommonVoice and LibriSpeech, culminating in voice-controlled applications. The book emphasizes future trends, challenges, and career opportunities in speech AI.

Dr. Riddhi Panchal

Dr. Riddhi Panchal, holding MCA, MPhil, and Ph.D. degrees from Savitribai Phule Pune University, is presently an Assistant Professor and Program Head for M.Sc.
Dr Gopal Sakarkar

Dr Gopal Sakarkar completed his Master of Computer Applications (MCA) in 2006 and his PhD in 2017 at S.G.B. Amravati University, Amravati. He has more than 16 years of teaching and research experience.
Dr. Deepali Bhende

Dr. Deepali Bhende, Head of MCA Department at Wainganga College, Nagpur, holds MCA, M.Tech, and Ph.D. degrees in Computer Science. With 20+ years of teaching and research experience, she has published numerous papers, including SCOPUS-indexed ones, presented globally, and holds a patent, showcasing her commitment to innovation and excellence.
Dr. Vijayalakshmi. P

Dr. Vijayalakshmi P, Professor at Presidency University, Bangalore, holds a PhD in Computer Science from Karunya University. With 23 years of teaching experience, she has guided research, delivered expert talks, generated funds for workshops, published widely, served as journal editor, and specializes in soft computing, AI, and wireless mobility management.
Mr. Satyam Raval

Mr. Satyam Raval specializes in data science, with a strong foundation in Python, data analysis, machine learning, and NLP. He has led projects in diverse areas such as voice and speech recognition, tourism analytics, and air quality modeling. Proficient in tools such as pandas, NumPy, TensorFlow, and Power BI, he applies advanced analytics to drive insights and solutions.
Mr. Aaryan Mavani

Mr. Aaryan Mavani, a data science specialist, excels in Python, machine learning, SQL, and data analysis. He has worked on projects such as crop yield prediction, tourism trend visualization, and Olympic performance prediction. Skilled in pandas, NumPy, TensorFlow, Tableau, and Power BI, he delivers innovative, data-driven solutions.
Mr. Vinayak Chavan

Mr. Vinayak Sanjay Chavan, a Data Science and Big Data Analytics professional, specializes in Python, SQL, machine learning, AI, and NLP. Experienced with Deloitte USI, Finlatics, and NSDIC, he has delivered impactful projects in CRM automation, predictive modeling, climate forecasting, trade analysis, dashboards, and voice recognition.
