Open Source Dictation: Language Model

A language model defines probable word succession probabilities: For example “now a daze” and “nowadays” are pronounced exactly the same, but because of context we know that “Now a daze I have a smartphone” is far less likely than “Nowadays I have a smartphone”. To model such contextual information, speech recognition systems usually use an n-gram that contains information of how likely a specific word is, given the context of the sentence. Continue Reading

What speech recognition application are you most looking forward to?

With the rising popularity of speech recognition in cars and mobile devices it’s not hard to see that we’re on the cusp of making speech recognition a first-class input method across our devices.
However, it shouldn’t be forgotten that what we’re seeing in our smart phones or laptops today is merely the beginning. I am convinced that we will see much more interesting applications of speech recognition technologies in the future.

So today, I wanted to ask: What application of speech recognition technology are you looking forward to the most? Continue Reading

Coming up: Simon 0.4.1

I’ve been quietly fixing various bugs and annoyances since the release of Simon 0.4.0 and I think this warrants a small maintenance release before diving into new features for the next major Simon version.

So without much ado, I’d like to announce Simon 0.4.1 coming to a mirror near you on 24th of June, 2013.

EDIT: The earlier version of this post erroneously said “July” instead of “June”. The release of Simon 0.4.1 is indeed planned for next Monday, 24th of June, 2013.