World's most popular travel blog for travel bloggers.

[Solved]: Build Automatic Speech Recognition (ASR) from scratch

, , No Comments
Problem Detail: 

I want to build a Automatic Speech Recognition (ASR) engine for myself, but I've no idea from where to start.

I've read that most ASR's are build upon Hidden Markov Models, but also I've read that HMM is limited somehow and a better approach is to build a ASR upon Machine Learning features.

Overall I am confused.

Can someone recommend me books on topics from which I can start?

Thanks (:

Asked By : 0xdeadcode

Answered By : Nikolay Shmyrev

I want to build a Automatic Speech Recognition (ASR) engine for myself, but I've no idea from where to start.

Start with trying existing open source speech recognition system, learn how they work, play with them. Check http://cmusphinx.sourceforge.net.

I've read that most ASR's are build upon Hidden Markov Models, but also I've read that HMM is limited somehow and a better approach is to build a ASR upon Machine Learning features.

HMM which provide time variance is still a reasonable model for speech, most of the system use it in one form or another. Limitations of HMM exist, but you do not need to care about them on that level.

You can implement a basic speech recognition system without HMM. For example check

http://de.mathworks.com/company/newsletters/articles/developing-an-isolated-word-recognition-system-in-matlab.html

That would be enough for "yes" and "no" discrimination. For any more serious use you will need HMM.

Can someone recommend me books on topics from which I can start?

There are two good books on topic:

A basic guide to start:

http://www.amazon.com/Spoken-Language-Processing-Algorithm-Development/dp/0130226165

A modern state of the art

http://www.amazon.com/Automatic-Speech-Recognition-Communication-Technology/dp/1447157788/

Best Answer from StackOverflow

Question Source : http://cs.stackexchange.com/questions/37860

0 comments:

Post a Comment

Let us know your responses and feedback