Speech Processing

2025.04
Morgan & Claypool, 2013. — 164 p. This book introduces the theory, algorithms, and implementation techniques for efficient decoding in speech recognition mainly focusing on the Weighted Finite-State Transducer (WFST) approach. The decoding process for speech recognition is viewed as a search problem whose goal is to find a sequence of words that best matches an input speech...
  • No. 1
  • 1.38 MB
2024.04
Textbook. — Saint Petersburg: ITMO University, 2024. — 97 p. The book presents the material of the second part of the lecture course "Digital Processing of Speech Signals", delivered over a number of years to students in the "Information Systems and Technologies" program. The book assumes familiarity with the course "Digital Signal Processing". It presents the main terms and...
  • No. 2
  • 4.19 MB
2024.03
Cambridge University Press, 2015. — xxii, 424 p. — ISBN 978-1-107-05557-5. With this comprehensive guide you will learn how to apply Bayesian machine learning techniques systematically to solve various problems in speech and language processing. A range of statistical models is detailed, from hidden Markov models to Gaussian mixture models, n-gram models and latent topic...
  • No. 3
  • 6.83 MB
2023.11
2nd Edition. — Wiley, 2024. — 595 p. — ISBN 9781119060994. Enables readers to understand the latest developments in speech enhancement/transmission due to advances in computational power and device miniaturization. The Second Edition of Digital Speech Transmission and Enhancement has been updated throughout to provide all the necessary details on the latest advances in the...
  • No. 4
  • 16.32 MB
2023.08
Monograph. — Ljubljana: Znanstvenoraziskovalni center Slovenske akademije znanosti in umetnosti (ZRC SAZU), 2000. — 149 p. — (Linguistica et philologica, 3; ISSN 2712-2689). — ISBN 961-6358-21-9. The monograph "Samodejno tvorjenje govora iz besedil" (Automatic Speech Generation from Text) is a reworked doctoral dissertation completed at the Faculty of Electrical Engineering in Ljubljana. The original text of the doctoral...
  • No. 5
  • 8.17 MB
Monograph. — Ljubljana: Znanstvenoraziskovalni center Slovenske akademije znanosti in umetnosti (ZRC SAZU), 2000. — 149 p. — (Linguistica et philologica, 3; ISSN 2712-2689). — ISBN 961-6358-21-9. The monograph "Samodejno tvorjenje govora iz besedil" (Automatic Speech Generation from Text) is a reworked doctoral dissertation completed at the Faculty of Electrical Engineering in Ljubljana. The original text of the doctoral...
  • No. 6
  • 3.59 MB
2023.07
De Gruyter, 2019. — 287 p. — (Speech Technology and Text Mining in Medicine and Health Care). — ISBN 978-1-61451-759-7. Signal and Acoustic Modeling for Speech and Communication Disorders demonstrates how speech signal processing and acoustic modeling can be instrumental in early detection and successful intervention with speech deficits resulting from Parkinson’s disease,...
  • No. 7
  • 1.76 MB
2023.06
Springer, 2023. — 214 p. — (Artificial Intelligence: Foundations, Theory, and Algorithms). — ISBN 978-981-99-0826-4. Text-to-speech (TTS) synthesis is an Artificial Intelligence (AI) technique that renders a preferably naturally sounding speech given an arbitrary text. It is a key technological component in many important applications, including virtual assistants, AI-generated...
  • No. 8
  • 9.01 MB
2023.02
Moscow: Infra-M, 2015. — 346 p. The monograph covers the theory, algorithms, and practical implementation methods of digital processing and recognition of speech signals. It presents the fundamentals of the mathematical analysis of digital signals needed for speech processing. The acoustic theory of speech production is briefly outlined, with the construction of a general discrete model. It examines the main characteristic...
  • No. 9
  • 76.11 MB
2022.11
Nova Science Publishers, 2022. — 240 p. — (Computer Science, Technology and Applications). Speech represents the most natural means of communication between humans. By using Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) systems, machines also become able to interact with humans using speech. This is of particular importance for building interactive robots or...
  • No. 10
  • 20.33 MB
2022.04
Vyshcha Shkola, 1983. — 169 p. The monograph examines dynamic spectrograms of the sounds, syllables, words, and continuous phrases of Russian speech. The focus is on how spectrograms reflect the work of the articulatory organs during the pronunciation of speech signals. Particular importance is attached to studying the dynamics of the articulatory process as reflected in dynamic speech spectra....
  • No. 11
  • 11.80 MB
2022.01
Moscow: Voenizdat, 1974. — 136 p.: ill. The brochure presents one of the most complex problems of our time: automatic recognition of speech signals and machine (artificial) reproduction of connected speech. The brochure covers all the main aspects of this problem and formulates the prerequisites that made it necessary to create technology for direct spoken communication of a human...
  • No. 12
  • 2.31 MB
Moscow: Voenizdat, 1974. — 136 p.: ill. The brochure presents one of the most complex problems of our time: automatic recognition of speech signals and machine (artificial) reproduction of connected speech. The brochure covers all the main aspects of this problem and formulates the prerequisites that made it necessary to create technology for direct spoken communication of a human...
  • No. 13
  • 4.24 MB
2021.06
Springer, 2021. — 180 p. — (T-Labs Series in Telecommunication Services). — ISBN 978-3-030-71388-1. Human Information Processing in Speech Quality Assessment. This book provides a new multi-method, process-oriented approach towards speech quality assessment, which allows readers to examine the influence of speech transmission quality on a variety of perceptual and cognitive...
  • No. 14
  • 4.43 MB
Wiley-ISTE, 2021. — 208 p. — (Cognitive Science Series). — ISBN 978-1-78630-319-6. The text sets out in simple and accessible terms the various methods of acoustic analysis of speech, placing them in their historical context, allowing a better understanding of the mathematical and technical solutions adopted today in phonetics and experimental phonology. Without mathematical...
  • No. 15
  • 24.65 MB
2021.04
Lausanne: Frontiers Media SA, 2020. — 310 p. Spoken language is conveyed via well-coordinated speech movements, which act as coherent units of control referred to as gestures. These gestures and their underlying movements show several distinctive features. However, currently, no existing theory successfully accounts for all properties of these movements. Even though models in...
  • No. 16
  • 53.85 MB
2021.02
New York: Academic Press, 2021. — 191 p. Applied Speech Processing: Algorithms and Case Studies is concerned with supporting and enhancing the utilization of speech analytics in several systems and real-world activities, including sharing data analytics related information, creating collaboration networks between several participants, and the use of video-conferencing in...
  • No. 17
  • 12.63 MB
Moscow: Svyazizdat, 1963. — 452 p. The book is devoted to speech transformations as applied to problems in communications engineering and cybernetics. It is intended for specialists in communications engineering, automation, and cybernetics, and for engineers, graduate students, and researchers studying speech transformation.
  • No. 18
  • 10.33 MB
Moscow: Svyazizdat, 1963. — 452 p. The book is devoted to speech transformations as applied to problems in communications engineering and cybernetics. It is intended for specialists in communications engineering, automation, and cybernetics, and for engineers, graduate students, and researchers studying speech transformation.
  • No. 19
  • 24.72 MB
2021.01
New York, USA: Routledge, 2020. — 260 p. — (Routledge Research in Language Education). — ISBN 978-1-138-73314-5. Quantitative Data Analysis for Language Assessment Volume II: Advanced Methods demonstrates advanced quantitative techniques for language assessment. The volume takes an interdisciplinary approach and taps into expertise from...
  • No. 20
  • 3.83 MB
New York, USA: Routledge, 2020. — 260 p. — (Routledge Research in Language Education). — ISBN 978-1-138-73314-5. Quantitative Data Analysis for Language Assessment Volume II: Advanced Methods demonstrates advanced quantitative techniques for language assessment. The volume takes an interdisciplinary approach and taps into expertise from...
  • No. 21
  • 6.46 MB
New York, USA: Routledge, 2019. — 289 p. — (Routledge Research in Language Education). — ISBN 978-1-138-73312-1. Quantitative Data Analysis for Language Assessment Volume I: Fundamental Techniques is a resource book that presents the most fundamental techniques of quantitative data analysis in the field of language assessment. Each chapter...
  • No. 22
  • 4.04 MB
Saint Petersburg: NRU ITMO, 2021. – 101 p. The textbook is intended for master's students in the "Information Systems and Technologies" program specializing in "Speech Information Systems". It presents the fundamentals of speech signal analysis and processing. The material provides a basis for subsequently mastering advanced courses in speech signal processing....
  • No. 23
  • 1.37 MB
2020.12
New York: Prentice-Hall, 2006. — 802 p. Essential principles, practical examples, current applications, and leading-edge research. In this book, Thomas F. Quatieri presents the field's most intensive, up-to-date tutorial and reference on discrete-time speech signal processing. Building on his MIT graduate course, he introduces key principles, essential applications, and...
  • No. 24
  • 6.10 MB
2020.08
Springer, 2020. — 808 p. — (Modern Acoustics and Signal Processing). — ISBN 978-3-030-00385-2. This book offers a computational framework for modeling active exploratory listening that assigns meaning to auditory scenes. Understanding auditory perception and cognitive processes involved with our interaction with the world are of high relevance for a vast variety of ICT systems...
  • No. 25
  • 31.29 MB
2020.07
Cambridge University Press, 2020. — 329 p. — ISBN: 978-1-108-42812-5. This book will help readers understand fundamental and advanced statistical models and deep learning models for robust speaker recognition and domain adaptation. This useful toolkit enables readers to apply machine learning techniques to address practical issues, such as robustness under adverse acoustic...
  • No. 26
  • 17.44 MB
2020.03
New York: Taylor & Francis, 2007. — 608 p. — ISBN: 0849350328, 9780849350320. The first book to provide comprehensive and up-to-date coverage of all major speech enhancement algorithms proposed in the last two decades, Speech Enhancement: Theory and Practice is a valuable resource for experts and newcomers in the field. The book covers traditional speech enhancement algorithms,...
  • No. 27
  • 56.18 MB
New York: Taylor & Francis, 2007. — 608 p. — ISBN: 0849350328, 9780849350320. The first book to provide comprehensive and up-to-date coverage of all major speech enhancement algorithms proposed in the last two decades, Speech Enhancement: Theory and Practice is a valuable resource for experts and newcomers in the field. The book covers traditional speech enhancement algorithms,...
  • No. 28
  • 140.71 MB
2019.12
Albany: Singular Publishing Group, 2001. — 319 p. An Introduction to the Study of Speech Acoustics. Acoustic Theory of Speech Production. Introduction to the Acoustic Analysis of Speech. The Acoustic Characteristics of Vowels and Diphthongs. The Acoustic Characteristics of Consonants. The Acoustic Correlates of Speaker Characteristics. Suprasegmental Properties of Speech. Speech...
  • No. 29
  • 61.18 MB
2019.09
Translated from English. — Edited by Yu. N. Prokhorov and V. S. Zvezdin. — Moscow: Svyaz, 1980. — 308 p. The book gives a full treatment of the issues involved in processing speech signals with linear prediction methods. It presents speech analysis algorithms and procedures for synthesizing speech from a set of informative parameters, carried through to programs in FORTRAN. It considers questions...
  • No. 30
  • 11.50 MB
Kishinev: Shtiintsa, 1987. — 175 p. General issues in building automatic speech recognition and synthesis systems are considered. The book contains information on speech production and speech signals and on digital speech processing, and briefly describes modern domestic and foreign speech recognition and synthesis systems. It is intended for general readers, students of technical universities,...
  • No. 31
  • 9.05 MB
Singapore: Springer, 2019. — 426 p. This book is about recent research in the area of profiling humans from their voice, which seeks to deduce and describe the speaker's entire persona and their surroundings from voice alone. It covers several key aspects of this technology, describing how the human voice is unique in its ability to both capture and influence the human persona...
  • No. 32
  • 14.72 MB
2019.07
Springer, 2019. — 282 p. — ISBN: 978-3-030-15852-1. This book explores the processes of spoken language production and perception from a neurobiological perspective. After presenting the basics of speech processing and speech acquisition, a neurobiologically-inspired and computer-implemented neural model is described, which simulates the neural processes of speech processing...
  • No. 33
  • 13.58 MB
2019.04
Academic Press, 2019. — 199 p. — ISBN: 978-0-12-818130-0. This book investigates the utilization of speech analytics across several systems and real-world activities, including sharing data analytics, creating collaboration networks between several participants, and implementing video-conferencing in different application areas. Chapters focus on the latest applications of...
  • No. 34
  • 11.06 MB
Eamon Dolan/Houghton Mifflin Harcourt, 2019. — 259 p. — ISBN10: 1328799301, 13 978-1328799302. The next great technological disruption is coming. The titans of Silicon Valley are racing to build the last, best computer that the world will ever need. They know that whoever successfully creates it will revolutionize our relationship with technology—and make billions of dollars in...
  • No. 35
  • 4.22 MB
2019.02
Springer, 2018. — 120 p. This book shows ways of augmenting the capabilities of Natural Language Processing (NLP) systems by means of cognitive-mode language processing. The authors employ eye-tracking technology to record and analyze shallow cognitive information in the form of gaze patterns of readers/annotators who perform language processing tasks. The insights gained from...
  • No. 36
  • 2.57 MB
Springer, 2018. — 120 p. This book shows ways of augmenting the capabilities of Natural Language Processing (NLP) systems by means of cognitive-mode language processing. The authors employ eye-tracking technology to record and analyze shallow cognitive information in the form of gaze patterns of readers/annotators who perform language processing tasks. The insights gained from...
  • No. 37
  • 4.72 MB
2018.12
Springer, 2019. — 70 p. — ISBN: 978-1-4614-1158-1. Human beings recognize speaker, language, emotion, and speech using multiple cues present in speech signal and evidences are combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language, emotion, and speech recognition systems mostly rely on...
  • No. 38
  • 548.68 KB
2018.11
Cambridge: Cambridge University Press, 2015. — 446 p. — ISBN 978-1-107-05557-5. With this comprehensive guide you will learn how to apply Bayesian machine learning techniques systematically to solve various problems in speech and language processing. A range of statistical models is detailed, from hidden Markov models to Gaussian mixture models, n-gram models and latent topic...
  • No. 39
  • 12.78 MB
2018.09
Springer, 2019. — 70 p. Human beings recognize speaker, language, emotion, and speech using multiple cues present in speech signal and evidences are combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language, emotion, and speech recognition systems mostly rely on spectral/cepstral features which...
  • No. 40
  • 1.93 MB
2018.07
2014. — 88 p. — ASIN B00NV4DZ86. Learn to love Dragon Naturally Speaking with just 100+ Commands. Get off to a flying start, improve your skills, speak with confidence - using this new 60 page, illustrated colour guide. Dragon speech recognition can transform the way people work with their computers - students, doctors, writers, family historians, people with dyslexia or...
  • No. 41
  • 3.08 MB
3rd ed. — Moscow: KomKniga, 2012. — 328 p. The book is devoted to the problems of controlling technical devices by spoken language, which bears directly on the development of voice-controlled robotic systems. The work covers various aspects of the linguistic component of such systems. It emphasizes the particular importance of research in fundamental and...
  • No. 42
  • 49.11 MB
2018.06
Monograph. — Kherson: FOP Vyshemyrskyi V.S. Publishing, 2018. — 168 p. Current methods of analyzing the human voice signal are reviewed. Modern methods of personal authentication based on voice signal analysis are studied. A local maxima method is developed that gives more accurate voice signal segmentation results than existing...
  • No. 43
  • 27.19 MB
2018.04
Wiesbaden: Springer, 2016. — 148 p. Almut Braun carried out forensic phonetic speaker identification experiments (voice lineups) with 306 lay listeners. Blind listeners significantly outperformed sighted listeners when the speech recordings were presented in studio quality. For recordings in mobile phone quality or of whispering voices, blind and sighted listeners achieved...
  • No. 44
  • 7.43 MB
New York: Springer, 2018. — 112 p. This book presents and develops several important concepts of speech enhancement in a simple but rigorous way. Many of the ideas are new; not only do they shed light on this old problem but they also offer valuable tips on how to improve on some well-known conventional approaches. The book unifies all aspects of speech enhancement, from single...
  • No. 45
  • 988.32 KB
2018.01
2nd Ed. — Springer, 2017. — 115 p. — (SpringerBriefs in Electrical and Computer Engineering). — ISBN10: 3319690019, 13 978-3319690018. This new edition provides an updated and enhanced survey on employing wavelets analysis in an array of applications of speech processing. The author presents updated developments in topics such as speech enhancement, noise suppression, spectral...
  • No. 46
  • 1.11 MB
2nd Ed. — Springer, 2017. — 96 p. — (SpringerBriefs in Electrical and Computer Engineering). — ISBN10: 3319690019, 13 978-3319690018. This new edition provides an updated and enhanced survey on employing wavelets analysis in an array of applications of speech processing. The author presents updated developments in topics such as speech enhancement, noise suppression, spectral...
  • No. 47
  • 2.71 MB
2017.12
Springer, 2018. — 82 p. With the invention of less expensive means of internet access, voice communication via social media is on the rise, which often comprises threats and distortions. Incorrect speaker/speech identification may sometimes lead to ambiguities in speaker identification and misunderstandings. Therefore, proper identification of speech is a must in speech...
  • No. 48
  • 2.77 MB
Moscow: Radio i Svyaz, 2004. — 164 p. The book covers digital speech processing methods for forming a sequence of feature vectors, and two types of speech signal classification tasks: continuous speech recognition and speaker identification by voice. For feature vector formation, the focus is on methods of detecting and filtering...
  • No. 49
  • 2.45 MB
Moscow: Radio i Svyaz, 1989. — 248 p. The monograph describes the current state of technology that uses speech communication between a human and a machine (robot). This field of research and engineering is advancing steadily in the most technologically developed countries, which is due primarily to the adoption of computing technology and...
  • No. 50
  • 5.38 MB
Textbook. — Saint Petersburg: ITMO University, 2017. — 152 p. The textbook covers methods of automatic speech recognition. The material is divided into 16 sections. The first two sections deal with speech production and perception by the auditory system. Each section gives brief theoretical and/or practical information. The textbook can be...
  • No. 51
  • 3.79 MB
2017.11
Boston: Pearson, 2010. — 1060 p. Speech signal processing has been a dynamic and constantly developing field for more than 70 years. The earliest speech processing systems were analog systems. They included, for example, the Voder (voice demonstration recorder) for synthesizing speech by manual controls, developed by Homer Dudley and colleagues at Bell Labs in the 1930s and...
  • No. 52
  • 14.33 MB
Arunachal Pradesh: Technical and Scientific Publisher, 2017. — 11 p. Speech editing is nothing more than moving about some arrays of numbers. Enhancement filters can be used to remove both natural and intentional noise, to a reasonable extent. And pitch and formant analysis can be used to give a general idea of whether two speakers are the same person or not. There are also other...
  • No. 53
  • 473.64 KB
Springer, 2018. — 417 p. The recent progress on machine learning and signal processing has enabled the development of technologies for automatic analysis of sound scenes and events by computational means. This has attracted several research groups and companies to investigate this new field, which has potential in several applications and also has several research challenges....
  • No. 54
  • 7.21 MB
Springer, 2018. — 144 p. This book presents the consolidated acoustic data for all phones in Standard Colloquial Bengali (SCB), commonly known as Bangla, a Bengali language used by 350 million people in India, Bangladesh, and the Bengali diaspora. The book analyzes the real speech of selected native speakers of the Bangla dialect to ensure that a proper acoustical database is...
  • No. 55
  • 3.08 MB
2017.10
Springer, 2017. — 436 p. — ISBN: 9783319646794. The text provides insights and detailed descriptions of some of the new concepts and key technologies in the field, including novel architectures for speech enhancement, microphone arrays, robust features, acoustic model adaptation, training data augmentation, and training criteria. The contributed chapters also include...
  • No. 56
  • 4.22 MB
New York: Springer, 2017. — 436 p. The text provides insights and detailed descriptions of some of the new concepts and key technologies in the field, including novel architectures for speech enhancement, microphone arrays, robust features, acoustic model adaptation, training data augmentation, and training criteria. The contributed chapters also include descriptions of...
  • No. 57
  • 8.83 MB
2017.07
Springer, 2017. — 251 p. This book provides scientific understanding of the most central techniques used in speech coding both for advanced students as well as professionals with a background in speech audio and/or digital signal processing. It provides a clear connection between the Why’s?, How’s?, and What’s, such that the necessity, purpose and solutions provided by tools...
  • No. 58
  • 8.49 MB
Springer, 2017. — 170 p. Text-to-Speech (TTS) synthesis, i.e., artificially produced speech, has finally attained a quality level that makes it possible to include it into ordinary services that are used by common people. With the increasing processing power of smartphones and the development of intelligent personal assistants like Siri, Cortana, and Google Now, synthetic...
  • No. 59
  • 3.05 MB
2017.06
Saint Petersburg: ITMO University, 2014. — 92 p. The textbook covers technologies for synthesizing intonational speech. Speech synthesis is one of the most important tasks in speech processing and is widely used in modern information technology. The material is divided into 6 sections. It outlines the history of the subject and the main stages in developing automatic synthesis systems. The textbook...
  • No. 60
  • 2.55 MB
Springer, 2017. — 100 p. The goal of developing a phone recognition system (PRS) is to derive the sequence of basic sound units from the speech signal. Most of the state-of-the-art PRSs are developed using spectral features such as Mel frequency cepstral coefficients. Spectral features mainly represent the gross shape of the vocal tract, but not the information related to the...
  • No. 61
  • 2.06 MB
Springer, 2017. — 233 p. — ISBN: 3319536117. This book focuses on speech signal phenomena, presenting a robustification of the usual speech generation models with regard to the presumed types of excitation signals, which is equivalent to the introduction of a class of nonlinear models and the corresponding criterion functions for parameter estimation. Compared to the general...
  • No. 62
  • 4.97 MB
2017.04
Moscow: Mir, 1985. — 237 p. — (In the World of Science and Technology). The book describes theoretical research and practical developments in speech synthesis technology. The author also gives concrete circuit diagrams of electronic units used in real speech synthesizers. The book is addressed to a wide range of readers interested in the achievements of modern technology; it will be especially useful...
  • No. 63
  • 30.48 MB
2017.02
Springer, 2012. — 546 p. — ISBN: 978-1-4614-0263-3. Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism is an anthology of the research findings of thirty-five speaker recognition experts from around the world. The book provides a multidimensional look at the complex science involved in determining whether a suspect’s voice truly matches forensic speech samples,...
  • No. 64
  • 7.29 MB
2017.01
Dissertation, Cambridge University, 1995. — 157 p. The research presented in this thesis addresses the topic of ad hoc retrieval of information from collections of spoken items such as radio news bulletins. Modern digital computers are becoming increasingly adept at processing nontextual data, such as speech. Consequently, new methods are required to allow users to pin-point...
  • No. 65
  • 1.04 MB
2016.12
Study guide for full-time students of the "Electronic Computing Facilities" specialty. — Minsk: BSUIR, 2005. — 51 p. The guide describes algorithms used for speech processing: a speech detector, linear-prediction-based analysis, and vector quantization. Examples are given of applying vector quantization to...
  • No. 66
  • 1.85 MB
Springer, 2017. — 109 p. Speech communication assumes a dominant role in how we communicate, and it is nowadays available to support interaction with machines in a wide range of scenarios, ranging from personal assistants for smartphones to home entertainment. While in many circumstances audible speech may suffice, there are a multitude of scenarios for which it is inadequate...
  • No. 67
  • 2.98 MB
Springer, 2017. — 77 p. In the last few years, we saw the rise of practical speech recognition applications, which work well in English and a few other languages. There is no doubt that this trend will continue and a more natural interaction between humans and technology will become part of our lives. Language is one of the most important components of one’s culture and identity....
  • No. 68
  • 1.15 MB
2016.11
John Wiley, 2012. — 302 p. Advances in computing–in terms of both the creation of novel mathematical techniques and the design of data-driven technologies–have fuelled the ubiquitous development and deployment of speech technologies over the last two decades. Some of the core speech technologies and their applications to coding, recognition, synthesis, enhancement and such have...
  • No. 69
  • 1.20 MB
Tbilisi: Metsniereba, 1976. — 183 p. The monograph is devoted to the problem of automatic voice identification. It touches on a range of questions concerning the study of individual voice characteristics that appear during a person's real speech activity. It discusses in detail the role of individual phonemes and their combinations, as well as of more complex semantic units of speech, in conveying...
  • No. 70
  • 5.11 MB
2016.10
Springer, 2013. — 415 p. Summarising a research programme that lasted for more than 6 years is a demanding task due to the wealth of deliverables, publications and final results of each of the projects concerned. In addition to the content-related topics, which interest scientists, research programmes also lead to new insights for policy makers and programme managers. The former...
  • No. 71
  • 3.70 MB
Springer, 2015. — 72 p. This book presents state-of-the-art research in speech emotion recognition. Readers are first presented with basic research and applications – gradually more advanced information is provided, giving readers comprehensive guidance for classifying emotions through speech. Simulated databases are used and results extensively compared, with the features and the...
  • №72
  • 716.82 KB
Springer, 2015. — 250 p. This book addresses the subject of emotional speech, especially its encoding and decoding process during interactive communication, based on an improved version of Brunswik’s Lens Model. The process is shown to be influenced by the speaker’s and the listener’s linguistic and cultural backgrounds, as well as by the transmission channels used. Through...
  • №73
  • 5.57 MB
Cambridge: Cambridge University Press, 2012. — 508 p. When we speak, we configure the vocal tract which shapes the visible motions of the face and the patterning of the audible speech acoustics. Similarly, we use these visible and audible behaviors to perceive speech. This book showcases a broad range of research investigating how these two types of signals are used in spoken...
  • №74
  • 9.45 MB
2016.09
Springer, 2015. — 113 p. This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations. Typically, the recovery process is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since both the desired speech and undesired noise are...
  • №75
  • 1.43 MB
Cambridge: Cambridge University Press, 2012. - 155 p. The mechanism of speech is a very complex one and in order to undertake any analysis of language it is important to understand the processes that go to make up the message that a speaker transmits and a listener receives. Professor Fry therefore first takes the reader through the various stages of the speech chain: from...
  • №76
  • 6.30 MB
Springer, 2016. — 126 p. Speech enhancement is incorporated as an essential component in all voice communication devices to improve their performance in noisy environments. Speech enhancement is an important issue for mobile phones, hands-free telephones and also for hearing aids. It has been a challenging problem for researchers to develop new enhancement algorithms that...
  • №77
  • 3.00 MB
2016.06
Academic Press, 2016. — 303 p. Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise- and reverberation-robust techniques that have been developed over the past thirty years, with...
  • №78
  • 4.08 MB
2016.03
Springer, 2016. — 288 p. This volume brings together through a peer-revision process advanced research results obtained on nonlinear speech processing, following the tradition initiated by the European COST Action 277: “Nonlinear Speech Processing” (http://www.cost.eu/COST_Actions/ict/277). The research published in this book was discussed for the first time at the 7th edition...
  • №79
  • 6.42 MB
Textbook. — St. Petersburg: ITMO University, 2016. — 138 p. This textbook covers methods of automatic speech recognition. The material is divided into 16 sections. The first two sections deal with speech production and with perception by the auditory system. Each section gives brief theoretical and/or practical information. The textbook can be used in...
  • №80
  • 3.73 MB
2016.02
Springer, 2012. — 136 p. During production of speech human beings impose durational constraints and intonation patterns on the sequence of sound units to convey the intended message. This inherent ability of the human beings in using the prosody (duration and intonation) knowledge is naturally acquired, and is difficult to articulate. But for synthesizing speech from a text by...
  • №81
  • 1.67 MB
2016.01
O’Reilly Media, Inc., 2013. — 242 p. Go under the hood of an operating Voice over IP network, and build your knowledge of the protocols and architectures used by this Internet telephony technology. With this concise guide, you’ll learn about services involved in VoIP and get a first-hand view of network data packets from the time the phones boot through calls and subsequent...
  • №82
  • 13.87 MB
2015.11
Emerald Group, 2012. — 459 p. The last 15 years have seen a revolution in auditory physiology, but the new ideas have been slow to gain currency outside specialist circles. Undoubtedly, one of the main reasons for this has been the lack of a general source for non-specialists, and it is hoped that this book will bring current thinking to a much wider audience. While the book is...
  • №83
  • 4.52 MB
2015.10
Springer, 2010. — 354 p. This book describes the development and evaluation of a novel type of spoken language dialogue system that proactively interacts in the conversation with two users. Spoken language dialogue systems are increasingly deployed in more and more application domains and environments. As a consequence, the demands posed on the systems are rising rapidly. In...
  • №84
  • 1.23 MB
Pergamon Press, 1976. — 149 p. The study of speech is a multidisciplinary subject, and the topic of this book is no exception. The production of speech is properly the province of the anatomist and the physiologist, but in practice it has been studied mainly by the phonetician with help from the physicist. The sounds of speech have been classified by the phonetician, and...
  • №85
  • 2.09 MB
2015.07
IGI Global, 2009. — 573 p. It has been widely accepted that speech perception is a multimodal process and involves information from more than one sensory modality. The famous McGurk effect [McGurk and MacDonald, Nature 264(5588): 746–748, 1976] shows that visual articulatory information is integrated into our perception of speech automatically and unconsciously. For example, a...
  • №86
  • 117.59 MB
Springer, 2004. — 431 p. The present coming of age of speech technologies coincides with the advent of mobile computing and the accompanying need for ubiquitous information access. This has generated enormous commercial interest around deploying speech interaction to IT-based services. In his book, Michael gives an in-depth review of the nuts and bolts of constructing speech...
  • №87
  • 2.75 MB
Springer, 2012. — 184 p. — ISBN 978-1-4614-4802-0, ISBN 978-1-4614-4803-7. Data driven methods have long been used in Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) synthesis and have more recently been introduced for dialogue management, spoken language understanding, and Natural Language Generation. Machine learning is now present end-to-end in Spoken Dialogue...
  • №88
  • 1.55 MB
John Wiley, 2011. — 471 p. There are a number of books and textbooks on speech processing or natural language processing (even some covering speech and language processing), there are no books focusing on spoken language understanding (SLU) approaches and applications. In that respect, living between two worlds, SLU has not received the attention it deserves in spoken language...
  • №89
  • 3.29 MB
John Wiley, 2008. — 555 p. When the book Digital Speech Transmission – Enhancement, Coding and Error Concealment by Peter Vary and Rainer Martin appeared in 2006, it was clear that a subject of this importance and this range could not be treated in all its details on 600-some pages. Important aspects had to be left out and had to be postponed to a succeeding volume. The...
  • №90
  • 7.97 MB
Springer, 1991. — 376 p. Speech coding has been an ongoing area of research for several decades, yet the level of activity and interest in this area has expanded dramatically in the last several years. Important advances in algorithmic techniques for speech coding have recently emerged and excellent progress has been achieved in producing high quality speech at bit rates as low...
  • №91
  • 10.52 MB
Springer, 2013. — 227 p. One of the main reasons for the complexity of spoken dialogue systems (SDSs) development constitutes the multi-domain and thus the multi-topic nature of real-life processes. If the application domain is not clearly defined, collecting a corpus or establishing valid rules to control the dialogue flow of the SDS becomes a complex task. Within the framework...
  • №92
  • 2.35 MB
Springer, 1996. — 682 p. This book is one outcome of the NATO Advanced Studies Institute (ASI) Workshop, "Speechreading by Man and Machine," held at the Chateau de Bonas, Castera-Verduzan (near Auch, France) from August 28 to September 8, 1995 - the first interdisciplinary meeting devoted to the subject of speechreading ("lipreading"). The forty-five attendees from twelve...
  • №93
  • 12.55 MB
Springer, 1983. — 503 p. This volume contains invited and contributed papers presented at the NATO Advanced Study Institute on "Recent Advances in Speech Understanding and Dialog Systems" held in Bad Windsheim, Federal Republic of Germany, July 5 to July 18, 1987. It is divided into the three parts Speech Coding and Segmentation, Word Recognition, and Linguistic...
  • №94
  • 16.50 MB
Springer, 2000. — 302 p. This book originates from the Fifth European Summer School on Language and Speech Communication that was held in the summer of 1997 in Leuven, Belgium, under the auspices of the European Language and Speech Network (ELSNET). The central topic of the summer school was "Lexicon Development for Language and Speech Processing"; the choice of this theme was...
  • №95
  • 4.72 MB
Kluwer, 1990. — 454 p. Speech sound production is one of the most complex human activities: it is also one of the least well understood. This is perhaps not altogether surprising as many of the complex neurological and physiological processes involved in the generation and execution of a speech utterance remain relatively inaccessible to direct investigation, and must be inferred...
  • №96
  • 7.88 MB
Springer, 2015. — 119 p. This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Implicit processing of linear prediction (LP) residual and...
  • №97
  • 2.74 MB
Kluwer, 1989. — 169 p. In order to perceive speech and other sounds, the incoming sound wave must be transformed into a variety of representations, each bringing forth different aspects of the signal, its source, and meaning. Understanding how we perceive and how machines can be made to perceive auditory signals means, in part, discovering appropriate representations for the...
  • №98
  • 5.64 MB
IOS Press, 2006. — 389 p. That speech is a dynamic process strikes as a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events-positions of...
  • №99
  • 4.46 MB
John Wiley, 2014. — 345 p. It might be safe to claim that 20 years ago, neither the term ‘computational paralinguistics’ nor the field it denotes existed. Some 10 years ago, the term did not yet exist either. However, in hindsight, the field had begun to exist if we think of the first steps towards the automatic processing of emotions in speech in the mid-1990s. For example,...
  • №100
  • 4.72 MB
2015.05
N.-Y.: CRC Press, 2013. — 705 p. This text is, in part, an outgrowth of a graduate course on speech signal processing at the University of Texas at Dallas since the fall of 1999. The fact that no textbook existed at the time on speech enhancement, other than a few edited books suitable for the experts, made it difficult to teach the fundamental principles of speech enhancement in...
  • №101
  • 17.51 MB
Springer, 2015. — 212 p. The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based...
  • №102
  • 4.84 MB
2015.03
Translated by R. Popov, Kemerovo, 2000. - 79 p. The original work was published in 1993. In this work we examine the components of signal processing algorithms. These algorithms are presented as part of a general overview of the signal parameterization task, which is divided into three areas: measurement, transformation, and statistical modeling. To this end, the work includes...
  • №103
  • 824.70 KB
Cambridge. Technical Report Number 740, 2009. ISSN: 1476-2986. The focus of this research is on the analysis of a wide range of emotions and mental states from non-verbal expressions in speech, in particular on the inference of complex mental states, beyond the set of basic emotions, including naturally evoked subtle expressions and mixtures of expressions.
  • №104
  • 2.57 MB
University of Ljubljana, 2012. - 116 p. The two main objectives of this project are to analyse the efficiency of several techniques widely used among the field of emotion recognition through spoken audio signals, and, secondly, obtain empirical data that proves that it is actually plausible to do so with a more than acceptable performance rate. For that purpose, our research will...
  • №105
  • 2.58 MB
2015.02
Springer, 2014. — 321 p. — ISBN10: 1447157788, ISBN13: 978-1-4471-5778-6. This book provides a comprehensive overview of the recent advancement in the field of automatic speech recognition with a focus on deep learning models including deep neural networks and many of their variants. This is the first automatic speech recognition book dedicated to the deep learning approach. In...
  • №106
  • 7.56 MB
Lippincott Williams & Wilkins, 2011. - 416 p. Written in a clear, reader-friendly style, Speech Science Primer serves as an introduction to speech science and covers basic information on acoustics, the acoustic analysis of speech, speech anatomy and physiology, and speech perception. It also includes topics such as research methodology, speech motor control, and history/evolution...
  • №107
  • 8.83 MB
2015.01
The Distinctive Features and Their Correlates. The M.I.T. Press, 1952. - 74 p. This report proposes some questions to be discussed by specialists working on various aspects of speech communication. These questions concern the ultimate discrete components of language, their specific structure, their inventory in the languages of the world, their identification on the acoustical...
  • №108
  • 1.32 MB
Springer, 2015. — 156 p. "Ultra Low Bit-Rate Speech Coding" focuses on the specialized topic of speech coding at very low bit-rates of 1 Kbits/sec and less, particularly at the lower ends of this range, down to 100 bps. The authors set forth the fundamental results and trends that form the basis for such ultra low bit-rates to be viable and provide a comprehensive overview of...
  • №109
  • 2.28 MB
2014.12
Springer, 2015. — 87 p. Voice-based call centers or business process outsourcing units generate huge amounts of speech data everyday during their day-to-day operations. Large and diverse types of information are hidden in these natural language conversations, which is begging to be exploited. The whole area of voice analytics deals with the aspect of deriving usable information...
  • №110
  • 2.29 MB
John Wiley, 2015. — 583 p. Emotion represents a psychological state of the human mind. Researchers from different domains have diverse opinions about the developmental process of emotion. Philosophers believe that emotion originates as a result of substantial (positive or negative) changes in our personal situations or environment. Biologists, however, consider our nervous and...
  • №111
  • 4.55 MB
Springer, 2015. — 187 p. If we want the vocal human–computer interaction to become more intuitive, it is inevitable to make the computer notice, interpret, and react to human ways of expression and patterns in communication beyond the recognition of the mere word strings. This is specifically important when it comes to subtle or hidden characteristics carrying connotations or...
  • №112
  • 2.98 MB
Elsevier, 2015. — 194 p. In the information communication field, speech communication via network becomes an important way to transfer information. With the development of information technology, speech communication is widely used for military, diplomatic, and economic purposes as well as in cultural life and scientific research. Therefore, speech secure communication and the...
  • №113
  • 4.47 MB
Springer, 2015. — 336 p. This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech...
  • №114
  • 6.02 MB
2014.11
Springer, 2013. — 142 p. Speech is the most natural mode of communication and yet attempts to build systems which support robust habitable conversations between a human and a machine have so far had only limited success. A key reason is that current systems treat speech input as equivalent to a keyboard or mouse, and behaviour is controlled by pre-defined scripts that try to...
  • №115
  • 1.85 MB
Derkach M.F., Gumetsky R.Ya., Gura B.M., Chaban M.E. Lviv: Vyshcha Shkola, 1983. — 168 p. The monograph examines dynamic spectrograms of sounds, syllables, words, and continuous phrases of Russian speech. The main focus is on how spectrograms reflect the work of the articulatory organs during the pronunciation of speech signals. Particular importance is attached to studying the dynamics of the articulatory...
  • №116
  • 4.08 MB
2nd edition. — Springer Vieweg, 2013. — xv, 398 p. — ISBN: 978-3-642-31502-2, ISBN: 978-3-642-31503-9. A classic of speech processing brought up to the state of the art, which always relates the theoretical foundations to practical application. With new chapters on the fundamentals of signal analysis and on spoken dialogue systems. Supplementary electronic material is available at...
  • №117
  • 13.98 MB
2014.10
München: Lincom Europa, 2005. – 143 p. This monograph describes an experiment in Forensic Speaker Identification, showing how speech samples from the same speaker can be discriminated from speech from different speakers with acoustic features commonly used in forensics. It also explains what is now considered the legally and logically correct approach to Forensic Speaker...
  • №118
  • 40.27 MB
Second Edition, Revised and Expanded. — Marcel Dekker, 2001. — 477 p. More than a decade has passed since the first edition of Digital Speech Processing, Synthesis, and Recognition was published. The book has been widely used throughout the world as both a textbook and a reference work. The clear need for such a book stems from the fact that speech is the most natural form of...
  • №119
  • 4.87 MB
Newnes, 2011. — 381 p. Voice over IP (VoIP) in particular and Voice over Packet (VoP) in general have been advocated and studied since the mid 1970s. It was the advent of DSP technology for voice compression in the late 1980s and early 1990s that gave these services the impetus they needed to enter the mainstream. Commercial-grade technologies and services started to appear in...
  • №120
  • 9.20 MB
2014.09
CRC Press, 2010. — 381 p. It is becoming increasingly apparent that all forms of communication—including voice—will be transmitted through packet-switched networks based on the Internet Protocol (IP). Therefore, the design of modern devices that rely on speech interfaces, such as cell phones and PDAs, requires a complete and up-to-date understanding of the basics of speech coding....
  • №121
  • 9.07 MB
Springer, 1997. — 306 p. The field of speech synthesis has seen a large increase in commercial applications in the last ten years. As recently as 1986, there were only a few companies in the synthesis market, all exploiting one of two basic technologies-either formant-based phonemic synthesis or LPC-based diphone synthesis. While these approaches still form the basis of most...
  • №122
  • 6.75 MB
2014.08
MIT Press, 1990. — 854 p. Auditory Scene Analysis addresses the problem of hearing complex auditory environments, using a series of creative analogies to describe the process required of the human auditory system as it analyzes mixtures of sounds to recover descriptions of individual sounds. In a unified and comprehensive way, Bregman establishes a theoretical framework that...
  • №123
  • 4.85 MB
2014.07
Springer, 2004. — 487 p. Springer Handbook of Auditory Research. Volume 18. Although our sense of hearing is exploited for many ends, its communicative function stands paramount in our daily lives. Humans are, by nature, a vocal species and it is perhaps not too much of an exaggeration to state that what makes us unique in the animal kingdom is our ability to communicate via the...
  • №124
  • 2.77 MB
Springer, 2012. — 112 p. This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different...
  • №125
  • 1.45 MB
2014.06
Springer, 2014. — 215 p. Speech and hearing sciences are fundamental to numerous technological advances of the digital world in the past decade, from music compression in MP3 to digital hearing aids, from network based voice enabled services to speech interaction with mobile phones. Mathematics and computation are intimately related to these leaps and bounds. On the other hand,...
  • №126
  • 15.70 MB
L.: A Bradford Book, 1998. - 305 p. This book reflects decades of important research on the mathematical foundations of speech recognition. It focuses on underlying statistical techniques such as hidden Markov models, decision trees, the expectation-maximization algorithm, information theoretic goodness criteria, maximum entropy probability estimation, parameter and data...
  • №127
  • 2.06 MB
Springer, 1999. — 315 p. This book is intended for researchers who want to keep abreast of current developments in corpus-based natural language processing. It is not meant as an introduction to this field; for readers who need one, several entry-level texts are available, including those of (Church and Mercer, 1993; Charniak, 1993; Jelinek, 1997). This book captures the...
  • №128
  • 3.61 MB
Springer, 1999. — 315 p. This book is intended for researchers who want to keep abreast of current developments in corpus-based natural language processing. It is not meant as an introduction to this field; for readers who need one, several entry-level texts are available, including those of (Church and Mercer, 1993; Charniak, 1993; Jelinek, 1997). This book captures the...
  • №129
  • 5.26 MB
2014.05
Academic Press, 2014. — 138 p. Speech enhancement is a classical problem in signal processing, yet still largely unsolved. Two of the conventional approaches for solving this problem are linear filtering, like the classical Wiener filter, and subspace methods. These approaches have traditionally been treated as different classes of methods and have been introduced in somewhat...
  • №130
  • 1.95 MB
Springer, 2014. — 199 p. Speech is a naturally occurring nonstationary signal essential not only for person-to-person communication but has become an important aspect of Human Computer Interaction (HCI). Some of the issues related to analysis and design of speech-based applications for HCI have received widespread attention. With continuous upgradation of processing techniques,...
  • №131
  • 3.13 MB
2014.03
Springer, 2013. — 278 p. Since the release of the first Internet Phone in 1995, Voice over Internet Protocol (VoIP) has grown exponentially, from a lab-based application to today’s established technology, with global penetration, for real-time communications for business and daily life. Many organisations are moving from the traditional PSTN networks to modern VoIP solutions...
  • №132
  • 6.83 MB
2014.02
Kluwer, 2002. — 193 p. As the performance of speaker-independent continuous speech recognition has improved over the last decade, increasing attention has been given to the poor recognition performance obtained for some speakers, noisy conditions and environments where the quality and the type of the communication channel is unknown. At the same time an increasing number of...
  • №133
  • 11.31 MB
Kluwer, 1996. — 524 p. The term speech and speaker recognition often refers to the science and technology of developing algorithms and implementing them on machines to recognize the linguistic content in a spoken utterance and to identify the talker who speaks the utterance. Since speech is the most natural means of communication among human beings, it also plays a key role in the...
  • №134
  • 7.84 MB
Springer, 1995. — 517 p. This book collects the contributions to the NATO Advanced Study Institute on New Advances and Trends in Speech Recognition and Coding, held in Bubión, Granada (Spain), from June 28th to July 10th 1993. The goal of the ASI was to bring together the most important experts on speech recognition and coding to discuss and disseminate their most recent...
  • №135
  • 11.59 MB
Springer, 2014. — 129 p. Robust speech systems in mobile environment have gained a special interest in recent years in order to enable access to remote voice-activated services. In this context, three major challenges that need to be considered are: varying background conditions, speech coding, and transmission channel errors. In this book, we focus on improving the recognition...
  • №136
  • 3.91 MB
John Wiley, 2013. — 501 p. The term computer speech recognition conjures up visions of the science-fiction capabilities of HAL 9000 in 2001: A Space Odyssey, or Data, the anthropoid robot in Star Trek, who can communicate through speech with as much ease as a human being. However, our real-life encounters with automatic speech recognition are usually rather less impressive,...
  • №137
  • 7.75 MB
NY: Springer International Publishing, 2014. — 53 p. This book provides a survey of the widespread use of wavelet analysis in different applications of speech processing. The author examines development and research in different applications of speech processing. The book also summarizes the state-of-the-art research on wavelets in speech processing.
  • №138
  • 1.19 MB
Springer, 2014. — 188 p. Most applications of digital speech processing deal with speech or speaker pattern recognition. To understand the practical implementation of speech or speaker recognition techniques, one needs to understand the concepts of digital speech processing and pattern recognition. This book aims to give a balanced treatment of...
  • №139
  • 9.37 MB
Springer, 2014. — 53 p. As wavelets gain wide application in different fields, especially within the signal processing realm, this chapter provides a survey of the widespread use of wavelet analysis in different applications of speech processing. Many speech processing algorithms and techniques still lack some sort of robustness which can be improved through the use...
  • №140
  • 946.21 KB
Springer, 1972. — 446 p. The second, expanded edition of James Flanagan's monograph "Speech Analysis, Synthesis and Perception" (the first edition, from 1965, was translated into Russian in 1968 by the publisher Svyaz). For those studying speech signal processing.
  • №141
  • 13.68 MB
2014.01
Draft, 2nd edition: Prentice Hall, 2008 — 1024 p. An explosion of Web-based language techniques, the merging of distinct fields, the availability of phone-based dialogue systems, and much more make this an exciting time in speech and language processing. The first of its kind to thoroughly cover language technology – at all levels and with all modern technologies – this book...
  • №142
  • 18.89 MB
The MIT Press, 2012. — 339 p. — ISBN: 978-0-262-01685-8. In English. In The Voice in the Machine, Roberto Pieraccini examines six decades of work in science and technology to develop computers that can interact with humans using speech and the industry that has arisen around the quest for these technologies. He shows that although the computers today that understand speech...
  • №143
  • 7.19 MB
2013.12
John Wiley, 2002. — 407 p. Making machines speak like humans is a dream that is slowly coming to fruition. When the first automatic computer voices emerged from their laboratories twenty years ago, their robotic sound quality severely curtailed their general use. But now after a long period of maturation, synthetic speech is beginning to reach an initial level of acceptability....
  • №144
  • 2.67 MB
Springer, 2012. — 136 p. During production of speech human beings impose durational constraints and intonation patterns on the sequence of sound units to convey the intended message. This inherent ability of the human beings in using the prosody (duration and intonation) knowledge is naturally acquired, and is difficult to articulate. But for synthesizing speech from a text by...
  • №145
  • 1.67 MB
Kluwer, 2001. — 328 p. Modern speech synthesis began in the 1950s with the development of electronic formant synthesisers, such as PAT (Parametric Artificial Talker) designed by Walter Lawrence in the UK and OVE designed by Gunnar Fant in Sweden. Many others followed and, with the widespread introduction of fast digital computers, became implemented as computer programs. The best...
  • №146
  • 4.65 MB
Kluwer, 1993. — 267 p. This volume contains 34 chapters, loosely grouped into six topical areas. The chapters in this volume reflect the progress and present the state of the art in low bit rate speech coding primarily at bit rates from 2.4 kbit/s to 16 kbit/s. Together they represent important contributions from leading researchers in the speech coding community. The book...
  • №147
  • 7.52 MB
Morgan Kaufmann, 1990. — 630 p. Despite several decades of research activity, speech recognition still retains its appeal as an exciting and growing field of scientific inquiry. Many advances have been made during these past decades; but every new technique and every solved puzzle opens a host of new questions and points us in new directions. Indeed, speech is such an intimate...
  • №148
  • 17.12 MB
2013.11
St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences, 2013, 316 p. The monograph outlines the range of problems associated with the automatic analysis of conversational Russian speech in interactive dialogue systems. It describes methods for distant speech recording, accounting for pronunciation variability in conversational speech, compact vocabulary representation, and...
  • №149
  • 6.32 MB
Conference talk slides. 43 pp. Outline: Fundamentals of automatic speech recognition; Acoustic modeling; Language modeling; Database (corpus) and task evaluation; Transcription and dialogue systems; Spontaneous speech recognition; Speech understanding; Speech summarization; Summary. Abstract: Speech recognition technology has made significant progress with many potential...
  • №150
  • 1.19 MB
Springer, 2013. — 146 p. In this book, hierarchical structures based on neural networks are investigated for automatic speech recognition. These structures are mainly evaluated in the task of phoneme recognition under the Hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) paradigm. The baseline hierarchical scheme consists of two levels where each level is based on...
  • №151
  • 1.79 MB
Springer, 2013. — 301 p. The book covers a wide range of disciplines related to speech and language and vocal communication in animals. In Part I, the first chapter deals with the current state of understanding of the neurology of speech and language in terms of brain substrates, representation, and theoretical models. The second chapter is a review of what is known about the...
  • №152
  • 4.05 MB
Springer, 2013. — 72 p. AT&T, Yahoo! Research, and other companies, along with academicians, technology developers, and market analysts. They analyze the growing markets for mobile speech, new methodological approaches to the study of natural language, empirical research findings on natural language and mobility, and future trends in mobile speech. This book is divided into...
  • №153
  • 4.71 MB
Springer, 2013. — 134 p. During production of speech human beings impose emotional cues on the sequence of sound units to convey the intended message. Speech without emotional information is unnatural and monotonous. Most of the existing speech systems are able to process studio recorded neutral speech. However, in the present real world communication scenario, speech systems...
  • №154
  • 1.46 MB
Springer, 2013. — 59 p. A leading use of speech recognition technology is the conversion of large speech databases into text for indexing and retrieval purposes. Using a large vocabulary continuous speech recognition (LVCSR) engine seems to provide a natural solution, as speech can be fully converted into text and then indexed and searched. One method used for searching speech...
  • №155
  • 592.68 KB
St. Petersburg: GUAP, 2013. — 314 p. The monograph outlines the range of problems associated with the automatic analysis of conversational Russian speech in interactive dialogue systems. It describes methods for distant speech recording, accounting for pronunciation variability, compact vocabulary representation, and syntactic-statistical language modeling in systems for automatic...
  • №156
  • 42.80 MB
2013.10
Blackwell, 2010. — 279 p. In undergraduate courses that include phonetics, students typically acquire skills both in ear-training and an understanding of the acoustic, physiological, and perceptual characteristics of speech sounds. But there is usually less opportunity to test this knowledge on sizeable quantities of speech data partly because putting together any database that is...
  • №157
  • 22.44 MB
Kluwer, 1999. — 328 p. This book is the development of a series of lectures to undergraduate and postgraduate students at Macquarie University on basic principles in acoustic phonetics and speech signal processing. The first part of the book (Chapters 1 to 4) is intended to provide students with the ability to interpret acoustic records of speech signals in their various forms....
  • №158
  • 5.96 MB
Kluwer, 2000. — 359 p. The study of prosody is perhaps the area of speech research which has undergone the most noticeable development during the past ten to fifteen years. As an indication of this, one can note, for example, that at the latest International Conference on Spoken Language Processing in Philadelphia (October 1996), there were more sessions devoted to prosody than...
  • №159
  • 5.60 MB
Springer, 1976. — 300 p. During the past ten years a new area in speech processing, generally referred to as linear prediction, has evolved. As with all scientific research, results did not always get published in a logical order and terminology was not always consistent. In mid-1974, we decided to begin an extra hours and weekends project of organizing the literature in linear...
  • №160
  • 4.80 MB
Kluwer, 2001. — 277 p. Consider a computer system that you can talk to using ordinary speech (either directly or perhaps using your telephone), and that you can ask questions concerning such things as timetables for public transportation. For example, you might ask the system the departure time of a train from Brussels to Amsterdam, specifying that you wish to arrive in...
  • №161
  • 4.09 MB
Springer, 2012. — 161 p. This book came out of approximately ten years of continuing research at Yamagata University. With the emergence of numerous algorithms for a variety of speech processing applications, such as coding, enhancement, and synthesis, a variety of distortion can now be observed. These disturbances degrade the speech quality in an unexpected manner. For...
  • №162
  • 11.21 MB
Springer, 1983. — 713 p. Pitch (i.e., fundamental frequency F0 and fundamental period T0) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The...
  • №163
  • 12.93 MB
Kluwer, 1992. — 254 p. After almost three scores of years of basic and applied research, the field of speech processing is, at present, undergoing a rapid growth in terms of both performance and applications and this is fuelled by the advances being made in the areas of microelectronics, computation and algorithm design. Speech processing relates to three aspects of voice...
  • №164
  • 3.78 MB
Kluwer, 1995. — 471 p. The term speech processing refers to the scientific discipline concerned with the analysis and processing of speech signals for getting the best benefit in various practical scenarios. These different practical scenarios correspond to a large variety of applications of speech processing research. Examples of some applications include enhancement, coding,...
  • №165
  • 6.73 MB
Springer, 1997. — 399 p. This book presents a collection of papers from the Spring 1995 Workshop on Computational Approaches to Processing the Prosody of Spontaneous Speech, hosted by the ATR Interpreting Telecommunications Research Laboratories in Kyoto, Japan. The workshop brought together leading researchers in the fields of speech and signal processing, electrical...
  • №166
  • 5.92 MB
Kluwer, 1997. — 247 p. This book originates from the 2nd European Summer School on Language and Speech Communication that was held in the summer of 1994 in Utrecht, The Netherlands. During two weeks, 90 participants enjoyed 14 courses that were focussed on the theme "Corpus-Based Methods in Language and Speech Processing". The enthusiasm of the participants for the topic and the...
  • №167
  • 3.58 MB
Kluwer, 2000. — 397 p. As the title indicates, "Intonation: Analysis, Modelling and Technology" is a contribution to the study of prosody, with major emphasis on intonation. Intonation and tonal themes are thus the central object of the volume, although temporal and dynamic aspects are also taken into consideration by a good number of papers. Although tonal and prosodic...
  • №168
  • 6.21 MB
Kluwer, 2004. — 124 p. Speech is the most natural form of communication among humans. As machines become ever more capable and their use more widespread due to advances in computing, the need to allow natural communication between a human and a machine also gains critical significance. In order to realize such a system, it is essential that the speech communication process is well...
  • №169
  • 3.93 MB
Springer, 2004. — 292 p. The importance of speech and language technologies continues to grow as information, and information needs, pervade every aspect of our lives and every corner of the globe. Speech and language technologies are used to automatically transcribe, analyze, route and extract information from high-volume streams of spoken and written information. Equally...
  • №170
  • 5.85 MB
Springer, 2012. — 109 p. Speech production and perception, man’s most widely used means of communication, has been the subject of research and intense study for more than 10 decades. Conventional theories of speech production are based on linearization of pressure and volume velocity relations and the speech production system is modeled as a linear source-filter model. This...
  • №171
  • 1.32 MB
Springer, 2012. — 251 p. — ISBN10: 1461445922, ISBN13: 9781461445920. In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang Minker investigate statistical approaches that allow for recognition of negative dialog patterns in Spoken Dialog Systems (SDS). The presented stochastic methods allow a flexible, portable and accurate use. Beginning with the...
  • №172
  • 4.81 MB
Springer, 1989. — 216 p. Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates;...
  • №173
  • 3.17 MB
Springer, 2012. — 70 p. Human beings recognize speaker, language and speech using multiple cues present in speech signal and evidences are combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language and speech recognition systems mostly rely on spectral/cepstral features which are affected by...
  • №174
  • 1.47 MB
Springer, 2004. — 399 p. The first edition having been sold out, gives me a welcome opportunity to augment this volume by some recent applications of speech research. A new chapter, by Holger Quast, treats speech dialogue systems and natural language processing. Dictation programs for word processors, voice dialing for mobile phones, and dialogue systems for air travel...
  • №175
  • 6.98 MB
ISTE/John Wiley, 2013. — 221 p. The preparation of this book was carried out while preparing an accreditation to supervise research. This is a synthesis covering the past 10 years of research, since my doctorate [LAN 04], in the field of man–machine dialogue. The goal here is to outline the theories, methods, techniques and challenges involved in the design of computer programs...
  • №176
  • 911.29 KB
Springer Science+Business Media, 2012. — 120 p. — ISBN 978-1-4614-1905-1, e-ISBN 978-1-4614-1906-8. This book describes novel approaches to improve automatic speech recognition for dialectal Arabic. Since the existing dialectal Arabic speech resources that are available for the task of training speech recognition systems are very sparse and are lacking quality, we describe how...
  • №177
  • 1.22 MB
Plenum Press, 1983. — 505 p. The work reported in this book results from years of research oriented toward the goal of making an experimental model capable of understanding spoken sentences of a natural language. This is, of course, a modest attempt compared to the complexity of the functions performed by the human brain. A method is introduced for conceiving modules performing...
  • №178
  • 7.76 MB
Springer, 2012. — 83 p. The fast pace of the advancement in information and communications technology is reshaping our society and vastly increasing our capabilities for faster learning, higher achievements, and better and wider communication, in addition to more effective and productive collaboration among speech scientists and engineers. One of the important frontiers of...
  • №179
  • 758.60 KB
Kluwer, 1993. — 197 p. The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker...
  • №180
  • 2.84 MB
Kluwer, 1998. — 249 p. This book is a revised version of my doctoral thesis which was submitted in April 1993. The main extension is a chapter on evaluation of the system described in Chapter 8 as this is clearly an issue which was not treated in the original version. This required the collection of data, the development of a concept for diagnostic evaluation of linguistic word...
  • №181
  • 3.59 MB
Kluwer, 1994. — 329 p. This book describes how large multi-layer perceptron networks containing more than 150,000 weights were trained and integrated into a state-of-the-art Hidden Markov Model (HMM) recognizer to provide improved acoustic-phonetic modeling and improved recognition accuracy. The lessons learned along the way form a case study which demonstrates how hybrid...
  • №182
  • 4.51 MB
Springer, 2013. — 207 p. In the present book, speech transmission quality is modeled on the basis of perceptual dimensions that are relevant for today’s public-switched and packet-based telecommunication systems. The complete transmission path from the mouth of the speaker to the ear of the listener is regarded, and both narrowband (300–3400 Hz) as well as wideband (50–7000 Hz)...
  • №183
  • 3.18 MB
John Wiley, 2013. — 355 p. This book came about as a result of the standing-room-only special session on crowdsourcing for speech processing at Interspeech 2011. There has been a great amount of interest in this new technique as a means to solve some persistent issues. Some researchers dived in head first and have been using crowdsourcing for a few years by now. Others waited...
  • №184
  • 3.03 MB
Springer, 2013. — 74 p. The diagnosis and monitoring of many common neurological conditions routinely involve acoustic analysis of the subject’s speech by an expert clinician. There are two significant problems with this: one is that the analysis is time-consuming, hence expensive, and therefore often performed too infrequently, and the other is that the results of the analysis...
  • №185
  • 761.03 KB
Springer, 2013. — 127 p. — ISBN 978-1-4614-6359-7, ISBN 978-1-4614-6360-3. Human beings use speech as a primary mode of communication for conveying messages. A speech signal carries multiple cues related to intended message, speaker and language identities, behavioural and emotional mood of the speaker and characteristics of background environment. Human beings exploit all...
  • №186
  • 1.94 MB
Moscow: Radio i Svyaz, 2000. — 456 p. Examines the problems of digital speech processing and transmission in systems with compression, statistical multiplexing, and packet switching, in IP telephony, and in ATM and Frame Relay networks. Analyzes the design principles, characteristics, and operating features of waveform coders, vocoders, and hybrid coders implementing the CELP, LD-CELP, ACELP, MBE,...
  • №187
  • 10.96 MB
2013.09
Springer, 2004. — 237 p. Spoken dialog systems allow people to get information, conduct business, and be entertained, simply by speaking to a computer. There are hundreds of these systems currently in use, handling millions of interactions every day. How do they work? What problems do they solve? The goal of this book is to answer these questions and others like them, including:...
  • №188
  • 4.00 MB
2013.07
Kyiv: Poligraf Konsalting, 2005. — 138 p. The book presents a spectral-temporal description of speech signals as functions of several variables. It gives solutions to the problems of finding the parameters of the frequency function of the speech production system, in the one- and two-dimensional cases, from the spectral function of the speech signal. It also presents some algorithms and a classification of the tasks of the knowledge base for recognition...
  • №189
  • 10.57 MB
John Wiley & Sons, Inc., 2013. — 384 p. — 3rd Edition. In English. Fully updated for the latest speech recognition tools and features, this bestselling guide helps you conquer Dragon NaturallySpeaking and gets you started creating documents, sending e-mail, searching the web, and more using only your voice. You’ll learn Dragon basics like dictation, formatting, and...
  • №190
  • 9.51 MB
John Wiley & Sons, Inc., 2013. — 384 p. — 3rd Edition. In English. Fully updated for the latest speech recognition tools and features, this bestselling guide helps you conquer Dragon NaturallySpeaking and gets you started creating documents, sending e-mail, searching the web, and more using only your voice. You’ll learn Dragon basics like dictation, formatting, and...
  • №191
  • 19.08 MB
2013.06
Speech Repairs, Intonational Boundaries and Discourse Markers: Modeling Speakers’ Utterances in Spoken Dialog, by Peter Anthony Heeman. University of Rochester, Rochester, New York, 1997. Abstract: Interactive spoken dialog provides many new challenges for natural language understanding systems. One of the most critical challenges is simply determining the speaker’s...
  • №192
  • 849.23 KB
2013.03
O’Reilly Media, 2013. — 242 p. Go under the hood of an operating Voice over IP network, and build your knowledge of the protocols and architectures used by this Internet telephony technology. With this concise guide, you’ll learn about services involved in VoIP and get a first-hand view of network data packets from the time the phones boot through calls and subsequent...
  • №193
  • 24.71 MB
2013.02
Guide to laboratory and practical classes for the course "Life Safety. Part 2: Information Security". TTI SFedU Publishing, Taganrog, 2011. 48 p. Intended for university students in radio engineering specialties, for studying the types, characteristics, design principles, and algorithmic models of analog time-domain and frequency-domain scramblers...
  • №194
  • 2.21 MB
2012.11
InTech, 2012. — 326 p. — ISBN: 9535108313, ISBN: 9789535108313. This book focuses primarily on speech recognition and the related tasks such as speech enhancement and modeling. This book comprises 3 sections and thirteen chapters written by eminent researchers from USA, Brazil, Australia, Saudi Arabia, Japan, Ireland, Taiwan, Mexico, Slovakia and India. Section 1 on speech...
  • №195
  • 12.08 MB
Bradford Book, 1995. — 549 p. The chapters in this book represent the outcome of a research workshop held at the Park Hotel Fiorelle, Sperlonga, 16-20 May 1988. Twenty-five participants gathered in this small coastal village in Italy, where the Emperor Tiberius kept a Summer house, to discuss psycholinguistic and computational issues in speech and natural-language processing....
  • №196
  • 9.82 MB
2012.10
John Wiley, 2002. — 403 p. Playing with a new technology is fun. I have been a teacher in one form or another for over 20 years, but it still gets me excited when I see something that seems so obvious and so simple that it is shocking it hasn’t been done before. That’s the way I feel about VoiceXML. VoiceXML makes it possible for anyone who can build a basic Web page to create...
  • №197
  • 2.06 MB
John Wiley, 2003. — 222 p. In general, voice transmission over the Internet protocol (IP), or VoIP, means transmission of real-time voice signals and associated call control information over an IP-based (public or private) network. The term IP telephony is commonly used to specify delivery of a superset of the advanced public switched telephone network (PSTN) services using IP...
  • №198
  • 5.71 MB
World Scientific, 2007. — 563 p. It is generally agreed that speech will play a major role in defining next-generation human-machine interfaces because it is the most natural means of communication among humans. To push forward this vision, speech research has enjoyed a long and glorious history spanning the entire twentieth century. As a result in the last three decades we...
  • №199
  • 15.70 MB
Delmar, Cengage Learning, 2009. — 396 p. — ISBN: 1435427270. Understanding Voice Over IP Technology provides students with the in-depth knowledge of Voice over IP and the TCP/IP protocol that it is based on. Voice over IP technology, or making telephone calls over data networks such as the Internet, has now reached the tipping point, and is expected to eventually become the...
  • №200
  • 12.70 MB
2012.09
ISTE/John Wiley, 2009. — 505 p. This book, entitled Spoken Language Processing, addresses all the aspects covering the automatic processing of spoken language: how to automate its production and perception, how to synthesize and understand it. It calls for existing know-how in the field of signal processing, pattern recognition, stochastic modeling, computational linguistics,...
  • №201
  • 4.23 MB
2012.06
Springer, 2012. — 264 p. This book is organized by research topic. Each chapter focuses on a major topic and can be read independently. Each chapter contains advanced algorithms along with real speech examples and evaluation results to validate the usefulness of the selected topics. Special attention has been given to the topics related to improving overall system robustness...
  • №202
  • 4.53 MB
2012.05
Moscow: Svyaz, 1968. — 395 p. This monograph by J. Flanagan, a well-known American scientist, examines in detail a wide range of questions concerning the properties of speech as a carrier of information, its main parameters, and the problems of analysis, synthesis, and automatic recognition. The characteristics of speech communication channels are assessed. Much attention is given to the problems...
  • №203
  • 9.21 MB
InTech, 2012. — 149 p. Speech processing is the process by which speech signals are interpreted, understood, and acted upon. Interpretation and production of coherent speech are both important in the processing of speech. It is done by automated systems such as voice recognition software or voice-to-text programs. Speech processing includes speech recognition, speaker recognition,...
  • №204
  • 5.72 MB
Springer, 2011. — 267 p. The telephony network broadly changed during the last decades with the intensive introduction of Voice over Internet Protocol (VoIP) technology and third generation mobile networks. These networks enable new transmission paradigms that affect the perceived quality of speech signals. The perceived characteristics of a speech signal transmitted by a VoIP...
  • №205
  • 1.65 MB
Springer, 2011. — 113 p. Soft Computing (SC) techniques have been recognized nowadays as attractive solutions for modeling highly nonlinear or partially defined complex systems and processes. These techniques resemble biological processes more closely than conventional (more formal) techniques. However, despite its increasing popularity, soft computing lacks a precise...
  • №206
  • 1.74 MB
Springer, 2011. — 88 p. Signal enhancement is a fundamental topic of signal processing in general and of speech processing in particular [1]. In audio and speech applications such as cell phones, teleconferencing systems, hearing aids, human–machine interfaces, and many others, the microphones installed in these systems always pick up some interferences that contaminate the...
  • №207
  • 478.94 KB
Springer, 2011. — 1029 p. — ISBN10: 0387775919, ISBN13: 978-0387775913. When I was being interviewed at the handwriting recognition group of IBM T.J. Watson Research Center in December of 1990, one of the interviewers asked me why, being a mechanical engineer, I was applying for a position in that group. Well, he was an electrical engineer and somehow was under the impression...
  • №208
  • 13.74 MB
John Wiley, 2008. — 592 p. Voice over IP (VoIP) gained popularity through actual deployments and by making use of VoIP-based telephone and fax calls with global roaming and connectivity via the Internet. Several decades of effort have gone into VoIP, and these efforts are benefitting real applications. Several valuable books have been published by experts in the field. While I...
  • №209
  • 4.98 MB
John Wiley, 2006. — 338 p. VoIP means transmitting speech over computer networks. In contrast to classical telephony, where research into the relation between physical transmission parameters, the resulting speech signal and the related speech quality has a longer tradition, speech quality of VoIP has only recently become an issue. The present book tries to merge knowledge of the...
  • №210
  • 2.69 MB
2012.04
McGraw-Hill, 2003. — 338 p. The focus of this book is the narrow question of how to assess quality of packet-switched voice services in general and VoIP services in particular. The approach taken in answering this vexing question is one that I have exploited to very good effect in more than 35 years’ working in the general area of test and evaluation of telecommunications...
  • №211
  • 1.84 MB
Springer, 2011. — 185 p. A self-learning speech controlled system has been developed for unsupervised speaker identification and speech recognition. The benefits of a speech controlled device which identifies its main users by their voice characteristics are obvious: The human-computer interface may be personalized. New ways for interacting with a speech controlled system may...
  • №212
  • 1.61 MB
2012.03
Springer, 2011. — 163 p. Many of the things we think about, actions we take, the way we react to stimuli, generate a feeling or subjective experience, for example, an emotion, or a mood. The generic term used in the twentieth century psychology and philosophy literature to denote such an emotion or mood is an old, Middle English (fourteenth century) word affect. The outward...
  • №213
  • 1.52 MB
Springer, 2011. — 125 p. The preparation of the present brief book was motivated by the significant and long-standing interest of the speech processing community to short-time cepstrum-based parameterization of speech. In approximately 100 pages, this volume brings together relevant information about 11 speech parameterization techniques and some of their variants that emerged...
  • №214
  • 2.24 MB
Springer, 2011. — 125 p. Automatic speech recognition systems are increasingly applied for modern communication. One example are call centers, where speech recognition based systems provide information or help sorting customer queries in order to forward them to the according experts. The big advantage of those systems is that the computers can be online 24 h a day to process...
  • №215
  • 1.18 MB
Springer, 2011. — 82 p. Spoken dialog systems have been the object of intensive research interest over the past two decades, and hundreds of scientific articles as well as a handful of text books such as [25, 52, 74, 79, 80, 83] have seen the light of day. What most of these publications lack, however, is a link to the real world, i.e., to conditions, issues, and environmental...
  • No. 216
  • 892.52 KB
2012.02
Springer, 2010. — 279 p. During the past years the mystery of emotions has increasingly attracted interest in research on human–computer interaction. In this work we investigate the problem of how to incorporate the user’s emotional state into a spoken language dialogue system. The book describes the recognition and classification of emotions and proposes models integrating...
  • No. 217
  • 4.74 MB
2012.01
Oxford University Press, 1994. — 314 p. The most sophisticated and efficient means of communication between humans is spoken natural language (NL). It is a rare circumstance when two people choose to communicate via another means when spoken natural language is possible. Ochsman and Chapanis [OC74] conducted a study involving two person teams solving various problems using...
  • No. 218
  • 5.79 MB
EURASIP Journal on Audio, Speech, and Music Processing, 2010. — 90 p. One of the most important aspects of spoken language is its large degree of variability. Variability in speech is caused by many different sources, for instance, changes of the acoustic environment or transmission channel and differences between speakers or various speaking styles. Successful speech processing...
  • No. 219
  • 5.08 MB
EURASIP Journal on Audio, Speech, and Music Processing, 2009. — 66 p. The aim of this special issue is to provide a detailed description of state-of-the-art systems for animating faces during speech, and identify new techniques that have recently emerged from both the audiovisual speech and computer graphics research communities. This special issue is a followup to the first LIPS...
  • No. 220
  • 10.61 MB
IGI Global, 2010. — 342 p. As social scientists often define it, technology refers to devices and processes that extend our natural capabilities. Microscopes make it possible to see smaller things and telescopes enable us to see things that are further away. Cars extend the amount of space that we are able to travel far beyond where our feet can take us during a given period of...
  • No. 221
  • 2.44 MB
Springer, 2005. — 371 p. The chapters in this book jointly contribute to what we shall call the field of natural and multimodal interactive systems engineering. This is not yet a well-established field of research and commercial development but, rather, an emerging one in all respects. It brings together, in a process that, arguably, was bound to happen, contributors from many...
  • No. 222
  • 8.02 MB
Springer, 1995. — 589 p. Text-to-speech synthesis involves the computation of a speech signal from input text. Accomplishing this requires a system that consists of an astonishing range of components, from abstract linguistic analysis of discourse structure to speech coding. Several implications flow from this fact. First, text-to-speech synthesis is inherently...
  • No. 223
  • 3.46 MB
John Wiley, 2006. — 644 p. The digital processing, storage, and transmission of speech signals have gained great practical importance. The main application areas are digital mobile radio, acoustic human–machine communication, and digital hearing aids. In fact, these applications are the driving force behind many scientific and technological developments in this field. A...
  • No. 224
  • 19.33 MB
Springer, 2008. — 483 p. Years ago when speech technology was younger, the designers of telephony-based speech recognition applications discovered something interesting. If human factors design, now often called user interface design, is applied to the prompts and flow of these applications, the result is improved system performance. Previously, nearly the only path of performance...
  • No. 225
  • 3.20 MB
EURASIP Journal on Advances in Signal Processing, 2010. — 94 p. Significant knowledge about microphone arrays has been gained from years of intense research and product development. There have been numerous applications suggested, for example, from large arrays (in the order of 100 elements) for use in auditoriums to small arrays with only 2 or 3 elements for hearing aids and...
  • No. 226
  • 7.14 MB
2011.12
Cambridge University Press, 2009. — 642 p. Speech processing technology has been a mainstream area of research for more than 50 years. The ultimate goal of speech research is to build systems that mimic (or potentially surpass) human capabilities in understanding, generating and coding speech for a range of human-to-human and human-to-machine interactions. In the area of speech...
  • No. 227
  • 3.91 MB
Cambridge University Press, 2009. — 642 p. Speech processing technology has been a mainstream area of research for more than 50 years. The ultimate goal of speech research is to build systems that mimic (or potentially surpass) human capabilities in understanding, generating and coding speech for a range of human-to-human and human-to-machine interactions. In the area of speech...
  • No. 228
  • 4.95 MB
Springer, 2008. — 403 p. The remarkable advances in computing and networking have sparked an enormous interest in deploying Automatic Speech Recognition on Mobile Devices and Over Communication Networks, and the trend is accelerating. This yields an abundance of practical systems, operational algorithms and scientific publications. There is, however, no integrated book...
  • No. 229
  • 2.13 MB
John Wiley, 2009. — 181 p. State-of-the-art speech and language technology has reached a level that allows us to build interactive applications which the users can have short conversations with in order to search for information. We are already dealing with electronic banking facilities, information providing systems, restaurant guides, timetable services, assisting translation...
  • No. 230
  • 1.09 MB
Springer, 2008. — 176 p. Applications of Discrete Wavelet Transform and Wavelet Denoising to Speech Classification, Speech Enhancement and Robust Speech Recognition. In this work, we study the application of wavelet analysis for robust speech processing. Reliable time-scale features (TS) which characterize the relevant phonetic classes such as voiced (V), unvoiced (UV), silence...
  • No. 231
  • 9.65 MB
Springer, 2007. — 279 p. The last meeting of the Management Committee of the COST Action 277: Nonlinear Speech Processing was held in Heraklion, Crete, Greece, September 20–23, 2005 during the Workshop on Nonlinear Speech Processing (WNSP). This was the last event of COST Action 277. The Action started in 2001. During the workshop, members of the Management Committee and...
  • No. 232
  • 4.05 MB
8th ELSNET Summer School, Chios Island, Greece, July 15-30 2000, Revised Lectures. — Springer, 2003. — 202 p. This book originated from the 8th ELSNET Summer School on Language and Communication that was held in the summer of 2000 on the island of Chios in Greece. ELSNET is the European Network in Human Language Technologies, a network of some 140 academic institutions and private...
  • No. 233
  • 1.67 MB
Springer, 2010. — 352 p. More and more devices for human-to-human and human-to-machine communications, where sound pickup and rendering is necessary, require some sophisticated algorithms. This is due to the fact that the acoustic environment in which we live in and communicate is extremely challenging. The difficult problems encountered in this environment are very well known...
  • No. 234
  • 9.16 MB
Springer, 2010. — 177 p. Speech Processing has rapidly emerged as one of the most widespread and well-understood application areas in the broader discipline of Digital Signal Processing. Besides the telecommunications applications that have hitherto been the largest users of speech processing algorithms, several nontraditional embedded processor applications are enhancing their...
  • No. 235
  • 1.84 MB
Taylor & Francis, 2002. — 359 p. This book is about an aspect of applied scholarly endeavour, forensic phonetics, that carries with it very serious social responsibilities. The book makes it clear that forensic speaker identification requires scholarly expertise, and in several disparate areas. Expertise, like forensically useful fundamental frequency, is a long-term thing. It...
  • No. 236
  • 3.44 MB
John Wiley, 2005. — 273 p. In many situations, the dialogue between two human beings seems to be performed almost effortlessly. However, building a computer program that can converse in such a natural way with a person, on any task and under any environmental conditions, is still a challenge. One reason why is that a large amount of different types of knowledge is involved in...
  • No. 237
  • 2.71 MB
John Wiley, 2006. — 274 p. The total number of mobile phone subscribers worldwide is expected to exceed two billion in 2006. While ordinary voice calling remains the dominant application, mobile devices are becoming increasingly sophisticated, with features like multimedia messaging, cameras, web browsers, games, video, and music. The data capabilities of mobile networks are...
  • No. 238
  • 2.38 MB
Entropics Ltd., 1999. — 667 p. The HTK Application Programming Interface (HAPI) is a library of functions providing the programmer with an interface to any speech recognition system supplied by Entropic or developed using the Hidden Markov Model Toolkit (HTK). HTK is a set of Unix tools which are used to construct all the components of a modern speech recogniser. One of the...
  • No. 239
  • 1.68 MB
IEEE Press, 2000. — 560 p. Speech communication is an interdisciplinary subject. Although much of the research material for the book comes from engineering literature (e.g., IEEE journals), a wide variety of sources is employed (especially for Chapters 3-5). The book is directed primarily at an engineering audience (e.g., to a final-year undergraduate or graduate course in...
  • No. 240
  • 34.40 MB
Springer, 2009. — 206 p. State-of-the-art automatic speech recognition (ASR) systems use statistical data-driven methods based on hidden Markov models (HMMs). Although such approaches have proved to be efficient choices, ASR systems often perform much worse than human listeners, especially in the presence of unexpected acoustic variability. To improve performance, we usually...
  • No. 241
  • 2.08 MB
Kluwer, 2004. — 104 p. The conjunction of several factors having occurred throughout the past few years will make humans significantly change their behavior vis-à-vis machines. In particular the use of speech technologies will become normal in the professional domain, but also in everyday life. The performance of speech recognition components has significantly improved: only...
  • No. 242
  • 2.26 MB
Springer, 2007. — 362 p. The best way to introduce this textbook is by using the words Volker Dellwo and his colleagues had chosen to begin their chapter How Is Individuality Expressed in Voice? While they use this statement to motivate the introductory chapter on speech production and the phonetic description of speech, it constitutes a framework of the entire book as...
  • No. 243
  • 4.17 MB
Springer, 2007. — 316 p. The best way to introduce this textbook is by using the words Volker Dellwo and his colleagues had chosen to begin their chapter How Is Individuality Expressed in Voice? While they use this statement to motivate the introductory chapter on speech production and the phonetic description of speech, it constitutes a framework of the entire book as...
  • No. 244
  • 5.04 MB
Springer, 2010. — 382 p. Advances in Speech Recognition: Mobile Environments, Call Centers and Clinics provides a forum for today’s speech technology industry leaders – drawn from private enterprises and academic institutions all over the world – to discuss the challenges, advances, and aspirations of voice technology. The collection of essays contained in this volume...
  • No. 245
  • 7.81 MB
Taylor & Francis, 1993. — 225 p. This text deals with two important technologies in human-computer interaction: computer generation of synthetic speech and computer recognition of human speech. These technologies are quite different and the ergonomics problems in implementation are also different. Nonetheless, synthetic speech and speech recognition are usually dealt with in the...
  • No. 246
  • 825.89 KB
Springer, 2005. — 490 p. An increasing number of telephone services are offered in a fully automatic way with the help of speech technology. The underlying systems, called spoken dialogue systems (SDSs), possess speech recognition, speech understanding, dialogue management, and speech generation capabilities, and enable a more-or-less natural spoken interaction with the human...
  • No. 247
  • 19.25 MB
Springer, 2010. — 490 p. Speech dereverberation has been on the agenda of the signal processing community for several years. It is only in the last decade, however, that the topic has really taken off, as seen from the growing number of publications appearing in the journals and at conferences. One of the reasons that the topic has become more popular is the rapidly growing...
  • No. 248
  • 9.19 MB
CRC Press, 2000. — 798 p. Speech has evolved over a period of tens of thousands of years as the primary means of communication between human beings. Since the evolution of speech and of homo sapiens have proceeded hand-in-hand, it seems reasonable to assume that human speech production mechanisms, and the resulting acoustic signal, are optimally adapted to human speech perception...
  • No. 249
  • 2.97 MB
Springer, 1999. — 212 p. Automatic speech recognition and processing has received a lot of attention during the last decade. Prototypes for speech-to-speech translation are currently being developed that show first impressive results for this highly complex endeavor. They demonstrate that machines can actually be helpful in communicating information between persons speaking...
  • No. 250
  • 1.65 MB
Springer, 2008. — 445 p. Cost reduction is of increasing importance for medium and large enterprises. Seen in this context, Interactive Voice Response (IVR) systems are becoming more and more significant. IVR systems can help to automate business processes as for example in call centers, which are now a growing market for IVR systems. Automatic speech recognition (ASR) is the...
  • No. 251
  • 1.92 MB
Springer, 1997. — 367 p. Speech technology, the automatic processing of (spontaneously) spoken words and utterances, now is known to be technically feasible and will become the major tool for handling the confusion of languages. The economic implications of this tool are obvious, in particular in the multilingual European Union. Potential and current applications are dictation...
  • No. 252
  • 7.09 MB
Springer, 2011. — 387 p. Automatic speech recognition suffers from a lack of robustness with respect to noise, reverberation and interfering speech. The growing field of speech recognition in the presence of missing or uncertain input data seeks to ameliorate those problems by using not only a preprocessed speech signal but also an estimate of its reliability, to selectively...
  • No. 253
  • 4.52 MB
Springer, 2007. — 438 p. We are surrounded by sounds. Such a noisy environment makes it difficult to obtain desired speech and it is difficult to converse comfortably there. This makes it important to be able to separate and extract a target speech signal from noisy observations for both man–machine and human–human communication. Blind source separation (BSS) is an approach for...
  • No. 254
  • 13.07 MB
Morgan & Claypool, 2010. — 167 p. Considerable progress has been made in recent years in the development of dialogue systems that support robust and efficient human–machine interaction using spoken language. Spoken dialogue technology allows various interactive applications to be built and used for practical purposes, and research focuses on issues that aim to increase the...
  • No. 255
  • 1.81 MB
Prentice Hall, 2001. — 965 p. Recognition and understanding of spontaneous unrehearsed speech remains an elusive goal. To understand speech, a human considers not only the specific information conveyed to the ear, but also the context in which the information is being discussed. For this reason, people can understand spoken language even when the speech signal is corrupted by...
  • No. 256
  • 9.62 MB
Morgan & Claypool, 2008. — 121 p. In this book, we introduce the background and mainstream methods of probabilistic modeling and discriminative parameter optimization for speech recognition. The specific models treated in depth include the widely used exponential-family distributions and the hidden Markov model. A detailed study is presented on unifying the common objective...
  • No. 257
  • 2.87 MB
2nd edition. — Taylor & Francis, 2001. — 317 p. As information technology continues to make more impact on many aspects of our daily lives, the problems of communication between human beings and information-processing machines become increasingly important. Up to now such communication has been almost entirely by means of keyboards and screens, but there are substantial...
  • No. 258
  • 2.42 MB
Springer, 1987. — 168 p. This book has its origins in a programme of work conducted at British Telecom Research Laboratories, aimed at developing easily usable, intelligent systems, based on human-computer interaction via spoken and written language, particularly the former. This involved the authors, as members of the Human Factors Division, in conducting a series of...
  • No. 259
  • 3.06 MB
Springer, 2005. — 207 p. As part of the steady progress being made in the field of information and telecommunication techniques, voice and speech quality assessment of systems has gained in importance over the last years. An engineering approach to voice and speech quality of systems includes the consideration of how a system is perceived by its users, and how the needs and...
  • No. 260
  • 1.17 MB
Addison Wesley, 2003. — 155 p. Most people have experienced an automated speech-recognition system when calling a company. Instead of prompting callers to choose an option by entering numbers, the system asks questions and understands spoken responses. With a more advanced application, callers may feel as if they're having a conversation with another person. Not only will the...
  • No. 261
  • 888.10 KB
Ellis Horwood Limited, 1987. — 282 p. An increased understanding of human speech comprehension is a major goal for research groups working in a number of closely related disciplines. We take the position that genuine advances in our understanding of speech comprehension will be based on explicit computational models of aspects of this process which yield predictions testable...
  • No. 262
  • 3.50 MB
2011.11
Springer, 2006. — 398 p. There is no question of the value of applying automatic speech recognition technology as one of the interaction tools between humans and different computational systems. There are many books on design standards and guidelines for different practical issues, such as Gibbon's book Handbook of Standards and Resources for Spoken Language System (1997) and...
  • No. 263
  • 19.07 MB
Springer, 2002. — 134 p. Speech recognition technology is being increasingly employed in human-machine interfaces. Two of the key problems affecting such technology, however, are its robustness across different speakers and robustness to non-native accents, both of which still create considerable difficulties for current systems. In this book methods to overcome these problems...
  • No. 264
  • 1.10 MB
Springer, 2011. — 200 p. Many existing natural language and spoken language dialogue systems are either very limited in the scope of domain functionality or require a rather cumbersome interaction. With an increasing number of application domains, ranging from unified messaging to trip planning and appointment scheduling, it seems to be obvious that the current interfaces need...
  • No. 265
  • 2.41 MB
Marcel Dekker, 1992. — 871 p. This book originated in an invitation from Marcel Dekker, Inc., to put together a book of original articles on various aspects of speech signal processing. After discussing the possible scope of such a book with several of our colleagues, we decided that the chapters should stress the advances during the past five to ten years. The past decade has...
  • No. 266
  • 7.13 MB
Springer, 2010. — 187 p. The idea for this book was formed during the doctorate of Bernd Iser. Bernd Iser was working on efficient and robust bandwidth extension algorithms in hands-free systems for Harman/Becker Automotive Systems. It turned out that bandwidth extension of speech signals was a topic of appreciable interest, where lots of scientific publications discussing...
  • No. 267
  • 7.11 MB
Springer, 2011. — 221 p. The analysis and measurement of the spectrum of a speech signal is one of the most important areas of sound signal processing for a number of fields, yet it is not an area to which a book has been specifically devoted. The accurate determination of the speech spectrum is commonly pursued in diverse areas including speech processing, recognition, and...
  • No. 268
  • 5.16 MB
NOWPress, 2007. — 24 p. — (Foundations and Trends in Signal Processing). Hidden Markov Models (HMMs) provide a simple and effective framework for modelling time-varying spectral vector sequences. As a consequence, almost all present day large vocabulary continuous speech recognition (LVCSR) systems are based on HMMs. Whereas the basic principles underlying HMM-based LVCSR are...
  • No. 269
  • 707.27 KB
Springer, 2009. — 228 p. The development of computer and telecommunication technologies led to a revolution in the way that people work and communicate with each other. One of the results is that large amount of information will increasingly be held in a form that is natural for users, as speech in natural language. In the presented work, we investigate the speech signal...
  • No. 270
  • 4.34 MB
Springer, 2011. — 137 p. I know what you are asking yourself – there are a lot of books available in speech processing, what is novel in this book? Well, I can summarize the answer for this question in the following points: You always see different algorithms for speech enhancement, deconvolution, signal separation, watermarking, and encryption, separately, without specific...
  • No. 271
  • 3.74 MB
CMP Books, 2001. — 338 p. In the summer of 2000, I came across the VoiceXML 1.0 standard published by the VoiceXML Forum. I downloaded the specification and began to read it. I had been working on software development in computer telephony for more than 10 years, but I was completely baffled; I couldn't understand most of the specification. I had no idea what the motivation or...
  • No. 272
  • 2.85 MB
Springer, 2008. — 338 p. — (Text, Speech and Language Technology Series 39). This book edition highlights recent trends and important issues that still remain only partially solved or even unsolved within the broad field of discourse and dialogue. The field is discussed and illustrated both from an overall spoken (multimodal) dialogue system perspective as well as from a more...
  • No. 273
  • 2.08 MB
Springer, 2008. — 305 p. This book has its point of departure in courses held at the Tenth European Language and Speech Network (ELSNET) Summer School on Language and Speech Communication which took place at NISLab in Odense, Denmark, in July 2002. The topic of the summer school was Evaluation and Assessment of Text and Speech Systems. Nine (groups of) lecturers contributed to...
  • No. 274
  • 3.22 MB
John Wiley, 2007. — 373 p. The Media Resource Control Protocol (MRCP) is a key enabling technology delivering standardised access to advanced media processing resources including speech recognisers and speech synthesisers over IP networks. MRCP leverages Internet and Web technologies such as SIP, HTTP, and XML to deliver an open standard, vendor-independent, and versatile...
  • No. 275
  • 1.97 MB
Springer, 2009. — 235 p. Noise is everywhere and in most applications that are related to audio and speech, such as human-machine interfaces, hands-free communications, voice over IP (VoIP), hearing aids, teleconferencing/telepresence/telecollaboration systems, and so many others, the signal of interest (usually speech) that is picked up by a microphone is generally...
  • No. 276
  • 4.26 MB
Springer, 1998. — 130 p. Once in a while, something nice happens, as if by coincidence, serendipitously. It happened to me when T.V. Raman asked me to supervise his Ph.D. thesis on building a system to speak documents, especially those with technical content or a lot of structure. The project had many interesting points, for example: the need for a programming language for writing...
  • No. 277
  • 1.92 MB
Kluwer, 2005. — 327 p. There is a serious problem in the recognition of sounds. It derives from the fact that they do not usually occur in isolation but in an environment in which a number of sound sources (voices, traffic, footsteps, music on the radio, and so on) are active at the same time. When these sounds arrive at the ear of the listener, the complex pressure waves...
  • No. 278
  • 14.23 MB
Morgan & Claypool, 2006. — 118 p. Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech chain starts with the formation of a linguistic message in a speaker’s brain and ends with the arrival of the message in a listener’s brain. Given the intricacy of the dynamic speech process and its fundamental importance...
  • No. 279
  • 1.68 MB
IEEE/Wiley-Interscience, 2000. — 1041 p. Purposes and Scope. The purposes of this book are severalfold. Principally, of course, it is intended to provide the reader with solid fundamental tools and sufficient exposure to the applied technologies to support advanced research and development in the array of speech processing endeavors. As an academic instrument, however, it may also...
  • No. 280
  • 14.47 MB
Kluwer, 1987. — 278 p. It is well-known that phonemes have different acoustic realizations depending on the context. Thus, for example, the phoneme /t/ is typically realized with a heavily aspirated strong burst at the beginning of a syllable as in the word Tom, but without a burst at the end of a syllable in a word like cat. Variation such as this is often considered to be problematic...
  • No. 281
  • 10.05 MB
CRC Press, 2003. — 385 p. Approaches to the problems of designing speech and language processing algorithms for human machine communication used to be taken from the perspectives of linguistics and speech science, until the late 1970s. Due to the advances in computing and statistical modeling, data driven pattern recognition methods have become a fast moving research area during...
  • No. 282
  • 3.66 MB
Springer, 2010. — 351 p. In recent years spoken language research has been successful in establishing technology which can be used in various applications, and which has also brought forward novel research topics that advance our understanding of the human speech and communication processes in general. This book got started in order to collect these different trends together,...
  • No. 283
  • 3.59 MB
Springer, 1999. — 212 p. Automatic speech recognition and processing has received a lot of attention during the last decade. Prototypes for speech-to-speech translation are currently being developed that show first impressive results for this highly complex endeavor. They demonstrate that machines can actually be helpful in communicating information between persons speaking...
  • No. 284
  • 960.81 KB
Cambridge University Press, 2004. — 226 p. Although widely employed in image processing, the use of fractal techniques and the fractal dimension for speech characterization and recognition is a relatively new concept, which is now receiving serious attention. This book represents the fruits of research carried out to develop novel fractal-based techniques for speech and audio...
  • No. 285
  • 5.77 MB
Morgan & Claypool, 2011. — 112 p. This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations. Typically, the recovery process is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since both the desired speech and undesired...
  • No. 286
  • 1.39 MB
Springer, 2005. — 415 p. We live in a noisy world! In all applications (telecommunications, hands-free communications, recording, human-machine interfaces, etc) that require at least one microphone, the signal of interest is usually contaminated by background noise and reverberation. As a result, the microphone signal has to be "cleaned" with digital signal processing tools...
  • No. 287
  • 5.41 MB
Springer, 2005. — 203 p. The goal of this book is to present a discussion of the ideas arising from the European Special Event (ESE) on the Integration of Phonetic Knowledge in Speech Technology at Eurospeech 2001 in Aalborg. Where there is discussion, there must be unresolved questions, doubts must exist, integration is not a fait accompli. The different questions asked,...
  • No. 288
  • 5.68 MB
  • added
  • description edited
Morgan & Claypool, 2005. — 136 p. Immediately following the Second World War, between 1947 and 1955, several classic papers quantified the fundamentals of human speech information processing and recognition. In 1947 French and Steinberg published their classic study on the articulation index. In 1948 Claude Shannon published his famous work on the theory of information. In 1950...
  • No. 289
  • 1.43 MB
  • added
  • description edited
2011.09
InTech, 2011. — 442 p. The book Speech Technologies addresses different aspects of the research field and a wide range of topics in speech signal processing, speech recognition and language processing. The chapters are divided into three sections: Speech Signal Modeling, Speech Recognition and Applications. The chapters in the first section cover some essential topics...
  • No. 290
  • 25.54 MB
  • added
  • description edited
InTech, 2010. — 174 p. Speech processing has come a long way since the year of 1947, when R. K. Potter, G. A. Kopp, and H. Green from Bell Labs introduced the sound spectrograph, the first instrument to produce human voice-prints in the short-time Fourier-transform domain. Ever since, speech recognition has been constantly evolving. From isolated word recognition with small...
  • No. 291
  • 6.83 MB
  • added
  • description edited
John Wiley, 2009. — 584 p. A serious book on modern speech technologies. As the authors of Distant Speech Recognition note, automatic speech recognition is the key enabling technology that will permit natural interaction between humans and intelligent machines. Core speech recognition technology has developed over the past decade in domains such as office dictation and...
  • No. 292
  • 19.73 MB
  • added
  • description edited
InTech, 2008. — 576 p. After decades of research activity, speech recognition technologies have advanced in both the theoretical and practical domains. The technology of speech recognition has evolved from the first attempts at speech analysis with digital computers by James Flanagan’s group at Bell Laboratories in the early 1960s, through to the introduction of dynamic...
  • No. 293
  • 41.85 MB
  • added
  • description edited
InTech, 2007. — 470 p. Digital speech processing is a major field in current research all over the world, in particular for automatic speech recognition (ASR). Very significant achievements have been made since the first attempts of digit recognizers in the 1950s and 1960s, when spectral resonances were determined by analogue filters and logical circuits. As Prof. Furui...
  • No. 294
  • 9.21 MB
  • added
  • description edited
Translated from English. Edited by M. V. Nazarov and Yu. N. Prokhorov. — Moscow: Radio i Svyaz, 1981. — 496 p.: ill. Covers digital processing of speech signals in information transmission systems and in voice control of computers. Presents the problems of digital representation of speech signals: time sampling, interpolation, quantization, and digital filter design. Discusses...
  • No. 295
  • 8.13 MB
  • added
  • description edited
Moscow: State Publishing House for Literature on Communications and Radio, 1963. — 452 p. The book is devoted to speech transformations as applied to problems of communication engineering and cybernetics. It is intended for specialists in communication engineering, automation, and cybernetics, and for engineers, graduate students, and researchers studying speech transformation.
  • No. 296
  • 5.42 MB
  • added
  • description edited
???
Wai C. Chu. Speech Coding Algorithms: Foundation and Evolution of Standardized Coders. Mobile Media Laboratory, DoCoMo USA Labs, San Jose, California. Wiley & Sons. 578 pages. Speech coding is a highly mature branch of signal processing deployed in products such as cellular phones, communication devices, and more recently, voice over internet protocol. This book...
  • No. 297
  • 3.48 MB
  • date added unknown
  • description edited
Second Edition — John Wiley & Sons Ltd, 2004. — 459 p. This Second Edition continues to provide the fundamental technical background required for low bit rate speech coding and the hottest developments in digital speech coding techniques that are applicable to evolving communication systems. Features new chapters on Pitch Estimation and Voice-Unvoiced Classification of Speech,...
  • No. 298
  • 9.44 MB
  • date added unknown
  • description edited
Now Publishers, 2007. — 194 p. — (Foundations and Trends in Signal Processing). A concise overview of modern approaches to digital speech processing. Since even before the time of Alexander Graham Bell’s revolutionary invention, engineers and scientists have studied the phenomenon of speech communication with an eye on creating more efficient and effective systems of human-to-human and...
  • No. 299
  • 3.19 MB
  • date added unknown
  • description edited
Prentice Hall, 1978. — 512 p. A classic book on digital processing of speech signals. Contents: Fundamentals of Digital Processing; Digital Models for Speech Signal; Time-Domain Methods for Speech Processing; Digital Representations of the Speech Waveform; Short-Time Fourier Analysis; Homomorphic Speech Processing; Linear Predictive Coding of Speech; Digital Speech Processing for Man-Machine...
  • No. 300
  • 35.57 MB
  • date added unknown
  • description edited
Prentice-Hall, 2002. — 800 p. Speech and hearing, man's most used means of communication, have been the objects of intense study for more than 150 years, from the time of von Kempelen's speaking machine to the present day. With the advent of the telephone and the explosive growth of its dissemination and use, the engineering and design of ever more bandwidth-efficient and...
  • No. 301
  • 18.80 MB
  • date added unknown
  • description edited
Moscow: Mir, 1985. — 237 p. — (In the World of Science and Technology). The book describes theoretical research and practical developments in speech synthesis technology. The author also gives concrete circuit diagrams of electronic units used in real speech synthesizers. Contents: Fundamentals of computer speech synthesis; How we speak; A little linguistics; Behavioral ethics of a speech-synthesizing computer; A little...
  • No. 302
  • 4.32 MB
  • date added unknown
  • description edited
Now Publishers, 2010. — 152 p. — (Foundations and Trends in Signal Processing). In December 1974 the first real-time conversation on the ARPAnet took place between Culler-Harrison Incorporated in Goleta, California, and MIT Lincoln Laboratory in Lexington, Massachusetts. This was the first successful application of real-time digital speech communication over a packet network and...
  • No. 303
  • 8.74 MB
  • date added unknown
  • description edited
CRC Press, 2002. — 400 p. A wide range of potential sources of noise and distortion can degrade the quality of the speech signal in a communication system. Noise Reduction in Speech Applications explores the effects of these interfering sounds on speech applications and introduces a range of techniques for reducing their influence and enhancing the acceptability, intelligibility,...
  • No. 304
  • 10.08 MB
  • date added unknown
  • description edited
CRC Press, 2000. — 247 p. A comprehensive description of speech coding algorithms and methods, with details of how these algorithms are implemented in widely used speech codecs. Contents: Speech Production; The Speech Chain; Articulation; Excitation; Vocal Tract; Phonemes; Source-Filter Model; Speech Analysis Techniques; Sampling the Speech Waveform; Systems and Filtering; Z-Transform; Fourier Transform; Discrete...
  • No. 305
  • 4.23 MB
  • date added unknown
  • description edited
Edited by M. A. Sapozhkov. — Moscow: Radio i Svyaz, 1987. — 168 p. Many research centers in the USSR and abroad are conducting intensive research on transmitting speech signals over narrowband communication channels, automatic recognition of spoken commands in data processing and transmission systems, teaching people with hearing and speech impairments and speakers of other languages, and more. Devoted to this research are...
  • No. 306
  • 5.72 MB
  • date added unknown
  • description edited
Academic Press, 2006. Natural language processing from a multilingual perspective. Contents: Language Characteristics; Linguistic Data Resources; Multilingual Acoustic Modeling; Multilingual Dictionaries; Multilingual Language Modeling; Multilingual Speech Synthesis; Automatic Language Identification; Other Challenges: Non-native Speech, Dialects, Accents, and Local Interfaces; Speech-to-Speech...
  • No. 307
  • 2.63 MB
  • date added unknown
  • description edited
Moscow: Nauka, 1992. — 392 p. — ISBN 5-02-014665-X. Computer speech synthesis is an integral part of modern information technology. Speech synthesis methods are widely used in information and reference systems, computer-aided learning systems, and so on. In this book the reader can become acquainted with various methods of modeling the processes...
  • No. 308
  • 11.85 MB
  • date added unknown
  • description edited
Wiley, 2005. — xi, 342 p. — ISBN 978-0470012604. With a growing need for understanding the process involved in producing and perceiving spoken language, this timely publication answers these questions in an accessible reference. Containing material resulting from many years’ teaching and research, Speech Synthesis provides a complete account of the theory of speech. By bringing...
  • No. 309
  • 3.03 MB
  • date added unknown
  • description edited
A study of digital speech processing, synthesis and recognition. This edition contains sections on the international standardization of robust and flexible speech coding techniques, waveform unit concatenation-based speech synthesis, large vocabulary continuous-speech recognition based on statistical pattern recognition, and more.
  • No. 310
  • 2.40 MB
  • date added unknown
  • description edited
Monograph. — Moscow: State Publishing House for Literature on Communications and Radio, 1962. — 391 p. The monograph "Calculation and Measurement of Speech Intelligibility" sets out the theory of intelligibility, with a qualitative and quantitative description of the properties and acoustic characteristics of speech and hearing that determine the amount of phonetic and semantic information transmitted over telephone and...
  • No. 311
  • 5.31 MB
  • date added unknown
  • description edited
Moscow: Radio i Svyaz, 1981. — 496 p., ill. Covers digital processing of speech signals in information transmission systems and in voice control of computers. Presents the problems of digital representation of speech signals: time sampling, interpolation, quantization, and digital filter design. Discusses methods of building digital transmission systems, systems...
  • No. 312
  • 38.54 MB
  • date added unknown
  • description edited
Moscow: Radio i Svyaz, 1989. — 248 p., ill. — ISBN: 5-256-00267-8. The monograph describes the current state of technology that makes use of spoken communication between a human and a machine (robot). This field of research and engineering is advancing steadily in the most technologically developed countries, which is due primarily to...
  • No. 313
  • 2.98 MB
  • date added unknown
  • description edited
Prentice-Hall International, Inc., Englewood Cliffs, New Jersey, 1993. — 507 p. From the preface of the book: "...the fundamental goal of the book would be to provide a theoretically sound, technically accurate, and reasonably complete description of the basic knowledge and ideas that constitute a modern system for speech recognition by machine."
  • No. 314
  • 4.16 MB
  • date added unknown
  • description edited
J. A. Barnett, M. I. Bernstein, et al. Methods of Automatic Speech Recognition: in 2 books. Translated from English. Edited by W. Lea. – Moscow: Mir, 1983. – Book 2. 392 p., ill. The monograph was written by leading speech recognition specialists from the USA, France, Italy, Japan, and the Polish People's Republic. The Russian translation is published in two books. Book 2 is devoted to specific systems...
  • No. 315
  • 9.92 MB
  • date added unknown
  • description edited
W. A. Lea, E. P. Neuburg, T. B. Martin, J. R. Welch, V. W. Zue, R. M. Schwartz, J. E. Shoup, A. R. Smith, M. R. Sambur, F. Hayes-Roth, G. Goodman, R. Reddy. Methods of Automatic Speech Recognition: in 2 books. Translated from English. Edited by W. Lea. – Moscow: Mir, 1983. – Book 1. 328 p., ill. The monograph was written by leading specialists from the USA, France, Italy, Japan, and the Polish People's Republic in...
  • No. 316
  • 8.69 MB
  • date added unknown
  • description edited
Kiev: Naukova Dumka, 1987. – 264 p. The monograph examines automatic analysis, recognition, semantic interpretation, synthesis, and compressed transmission of speech signals as applied to spoken dialogue between a human and a computer in formalized and natural domain-specific languages, for use in human-machine systems for collecting and processing information and...
  • No. 317
  • 3.57 MB
  • date added unknown
  • description edited
Moscow: Radio i Svyaz Publishing House, 2004. 164 p. Abstract. The book covers digital speech processing methods for forming a sequence of feature vectors, and two types of speech signal classification tasks: continuous speech recognition and speaker identification by voice. For the task of forming feature vectors, the main attention is given to methods...
  • No. 318
  • 2.15 MB
  • date added unknown
  • description edited
Translated from English. — Edited by Yu. N. Prokhorov and V. S. Zvezdin. — Moscow: Svyaz, 1980. — 308 p.: ill. The book gives a full treatment of the issues involved in processing speech signals with linear prediction methods. It presents speech analysis algorithms and procedures for synthesizing speech from a set of informative parameters, carried through to programs in FORTRAN. Covered are...
  • No. 319
  • 2.74 MB
  • date added unknown
  • description edited
Translated from English by A. A. Pirogov. — Moscow: Svyaz, 1968. — 397 p. This monograph by J. Flanagan, a well-known American scientist, examines in detail a wide range of questions concerning the properties of speech as a carrier of information, its main parameters, and the problems of analysis, synthesis, and automatic recognition. The characteristics of speech communication channels are evaluated. Considerable attention...
  • No. 320
  • 4.66 MB
  • date added unknown
  • description edited