We’d like to let you know a little more about our product at SpeechAngel and give a short explanation of speech recognition 🙂
There are two types of speech recognition. One is called speaker-dependent and the other speaker-independent. Speaker-dependent solutions are found in specialsed use cases where there a limited number of words that need to be recognized with high accuracy, while speaker–independent software is more often found in telephone applications.
Speaker–dependent software operates by learning the unique, individual characteristics of a single person’s voice, in a way similar to voice recognition. New users must first “train” the software by speaking to it, so the computer can analyse the way in which the person talks. This usually means users have to read a few pages of text to the computer before they can use the speech recognition software.
Speaker–independent software is designed to recognise anyone’s voice, so it requires no training. This means it is the only concrete option for applications such as interactive voice response systems — where businesses can’t ask callers to read pages of text before using the system. The downside is that speaker–independent software is generally speaking less accurate than speaker–dependent software.
Speech recognition engines that are speaker independent generally deal with this fact by limiting the grammars they use. By using a smaller list of recognized words, the speech engine is more likely to correctly recognize what a speaker said.
This makes speaker–independent software ideal for most IVR systems, and any application where a large number of people will be using the same system.
We hope this paints a clearer picture!
The SpeechAngel team.