Rockbox General > Rockbox General Discussion

Can Rockbox be made to turn a Sansa into a voice recognition mp3 player?

<< < (2/2)

pondlife:
As a start, maybe just monitor the mic input and allow any sharp transient (say, hand clapping) trigger a "next track".  

Of course, no matter what calibration you can do, playback at loud volume is quite likely to result in unwanted skipping, but you'd be on the first step towards voice input.

pondlife

dd:
heya,

I don't know whether it's possible to use the source code of PocketSphinx [1]. It is a speech recognition program specifically designed for handhelds running linux (including code also for ARM-processors). It's a fork of Sphinx2 which uses HMMs (Hidden Markov Models) with semi-continuous output probability density functions. License is BSD-like.

It seems to be fast enough for our problem (but i don't know anything about ARM-processors…). I think, we will have another problem about speech recognition: The surrounding noise. My sansa has a microphone which can record people talking 10 meters apart…  Maybe one could control this by »loudness-equalizing«⁇
Another problem could be the dialect especially on Artist's names like Madonna… Native English/Italian/German speakers would all spell it out in another way. This problem could be solved by generating  language specific dictionarys.
So, if anyone with knowledge on ARM-processor-power could check it out, we'd propably get a neat thing called voice input. :D

dd

[1]: http://www.speech.cs.cmu.edu/pocketsphinx/

zajacattack:
Well, in my opinion, it wouldn't be worth it. To make optimized AND accurate voice recognition in rockbox is an oxymoron. The processors are no where near powerful enough.

GodEater:
If the product is designed for embedded devices, which it says it is - it should be possible to run it on at least SOME Rockbox targets.

Navigation

[0] Message Index

[*] Previous page

Go to full version