Speech Recognition: Principles And Applications Essay, Research Paper
Table of contents
Abstract 3
Overview of the Characteristics of Automatic Speech Recognition Systems 4
Number of Words 4
Use of Grammar 5
Continuous vs. Discrete Speech 5
Speaker Dependency 6
Early Approaches to Automatic Speech Recognition 6
Acoustic-Phonetic Approach 7
Statistical Pattern Recognition Approach 8
Modern Approach to Automatic Speech Recognition 8
Hidden Markov Models 9 Training of an Automatic Speech Recognition System Based on HMMs 11 Sub-Word Units 11
Applications of Automatic Speech Recognition Systems 12
Automated Call-Type Recognition 13
Data Entry 13
Future Applications Using Automatic Speech Recognition Systems 14
Conclusion 14
References 15
Abstract
With the advances of technology, a lot of people may think that integrating the ability of understanding human speech in a computer system is a piece of cake. However, scientists disagree. Since the early nineteen fifties, scientists have tried to implement the perfect automatic speech recognition system, but they failed. They were successful in making the computer recognise a large number of words, but till now, a computer that understands everything without meeting any conditions does not exist. Due to the enormous applications, a lot of money and time is spent in improving speech recognition systems.
SPEECH RECOGNITION: PRINCIPLES AND APPLICATIONS
Nowadays, computer systems play a major role in our lives. They are used everywhere beginning with homes, offices, restaurants, gas stations, and so on. Nonetheless, for some, computers still represent the machine they will never know how to use. Communicating with a computer is done using a keyboard or a mouse, devices many people are not comfortable using. Speech recognition solves this problem and destroys the boundaries between humans and computers. Using a computer will be as easy as talking with your friend.
Unfortunately, scientists have discovered that implementing a perfect speech recognition system is no easy task. This report will present the principles and the major approaches to speech recognition systems along with some of their applications.
Overview of the Characteristics of Automatic Speech Recognition Systems
How can we evaluate a speech recognition system. Obviously describing it by good or bad isn’t enough since the performance of such a system may be outstanding in one application and poor in another. In fact, speech recognition systems are designed according to the application. Some of these variable characteristics are presented below.
Number of Words
The major characteristic of a speech recognition system is the number of words it can recognise. The question that comes to mind is how many words are enough so that the performance of a speech recognition system is acceptable. The answer depends on the application (6, p98). Some applications may require few words, like automated call-type recognition, others may require thousands, like data entry. However, increasing the number of words or the vocabulary of a speech recognition system increases its complexity and decreases its performance (probability of error is higher)(6, p.98). Systems with large vocabularies are also slower since more time is needed to search a word in a large vocabulary. Increasing the number of words isn’t enough because the speech recognition system is unable to differentiate words like ‘to’ and ‘two’ or ‘right’ and ‘write’ (6 ,p.98).
Use of Grammar
Using grammar, differentiating words like ‘to’ and ‘two’ or ‘right’ and ‘write’ is possible. Grammar is also used to speed up a speech recognition system by narrowing the range of the search (6,p.98). Grammar also increases the performance of a speech recognition system by eliminating inappropriate word sequencing. However, grammar doesn’t allow random dictation which is a problem for some applications (6, p.98).
Continuous vs. Discrete Speech
When speaking to each other, we don’t pause between words. In other words, we use continuous speech. However, for speech recognition systems, there is difficulty in dealing with continuous speech (6, p.98). The easy way out will be using discrete speech where we pause between words (6, p.100). With discrete speech input, the silent gap between words is used to determine the boundary of the word, whereas in continuous speech, the speech recognition system must separate words using an algorithm which is not a hundred per cent accurate. Still, for a small vocabulary and using grammar, continuous speech recognition systems are available. They are reliable and do not require great computational power (6, p.100). However, for large vocabulary, continuous speech recognition systems are very difficult to achieve, require huge computational power, as well as being slow. In fact, processing a speech sample can take three to ten times the time required for a person to say it (6, p.100).
Speaker Dependency
Speech recognition system designers must consider another important issue: whether their systems are speaker-dependent or speaker-independent. Each person pronounces a word differently. Although it is easy for humans t
Наверняка у вас есть товары или услуги, продажа которых приносит вам максимальную прибыль. Для быстрого старта в сети вам необходимо создание посадочной страницы (одностраничного сайта), на которой будет размещена информация о маржинальных товарах/услугах интернет магазина. За 8 лет опыта разработки конверсионных страниц мы выработали оптимальную структуру, которая позволит привлекать через landing page больше продаж. На такую структуру «одевается» ваш контент — фирменный стиль, тексты, фотографии, уникальные торговые предложения, после чего страница выходит в свет. Разработка лендинга и запуск в сети — до 7 рабочих дней. Стоит отметить, что в разработку самой посадочной страницы входит и написание копирайтером продающих текстов для вашего бизнеса, чтобы каждый посетитель страницы захотел совершить покупку именно у вас. Результат: качественно разработаная продающая посадочная страница, которая готова приносить вам новых клиентов.