The mic I used in the video can actually cope with very noisy environments. With lesser mics, speech recognition is useless with even mild background noise.
Hi, I'm the guy in the video. You might also be interested in a presentation I gave last Sept at Strangeloop with a much longer demo of coding in Clojure and Elisp: http://www.infoq.com/presentations/Programming-Voice
What's the next big leap for speech to text programming? A language designed specifically to be speakable, ie, all keywords and no symbols?
I mean, I'd like speech recognition to get more natural error correction, drawing more from the way we use inflection to give feedback about which syllables to correct. (I love how Google on mobile now gives visual indication of which syllables it heard clearly, and which it didn't. I just wish it would understand when I shout "No, X not Y" to replace just that one misheard word.)
It'd be interesting to hear about where voice is heading from someone who uses the technology far more.