Practicing example WK sentences (speach-to-text recognition tool)

Hi,
when studying (doing lessons) I try to read all example patterns of use and example sentences (turns out it GREATLY improve the retention :D).

However, while it may sound OK-ish for me I’m looking for something that would either record and compare it with the example or transcribe-back what I said so I could compare if at least “the computer” understood.

For all such voice interactions on my computer I’m using Handy (GitHub - cjpais/Handy: A free, open source, and extensible speech-to-text application that works completely offline.; with Parakeet V3 model) and it works wonderful… but only for english/spanish/polish as it doesn’t seem to support Japanese :frowning:

Any suggestion how to do handle that? or user-script? (I know there is one for voice input reviews but it’s only for vocabulary)

I don’t know exactly how it is implemented, but there is already a voice input script for doing regular reviews. I haven’t used it in a while, but it used to work great for anything longer than one or two syllables. You could look at the implementation or ask the creator for advice.

For a more low tech option, if you just want to check if your output is parsable without doing anything with the result, you could try to use the voice input mode of google translate.

Windows, and I assume most other operating systems as well, also comes with a build in speech to text application which you could use to transcribe what you are saying to any textbox.

Yes, this is the script I mentioned but it looks like it only works with the main item :frowning:

Yes, this is what I’m looking for basically and what said Handy is doing :slight_smile:

But would love to have something build-in (script in WK) or readily available (like Handy, which basically runs constantly in the background and I have a quick shortcut, and then it basically outputs what I said; yes, basically I dictated most of this reply with handy :smiley: )

Reading aloud on a smartphone with voice typing? Depending on the keyboard, it might not need to be online.

Windows text-to-speech (Win+H) might work just similarly.

Otherwise, on a PC, some browsers may support SpeechRecognition API – Demo

1 Like

Yeah, I was doing that with Microsoft translate but it’s a bit cumbersome and disrupts a bit study flow (also, I try to avoid having a phone near me to avoid distractions :frowning: )

I’m on macOS and it’s text-to-speech is absurdly terrible (hence using external tool)

Despite it’s being on MDN it’s not supported by Firefox :frowning:


(1456885 - Not able to use Voice for Google Docs Integration due to missing SpeechRecognition support, 1856507 - Sites breaking due to missing SpeechRecognition support)


So yeah, to narrow it down: something multiplatform (with macOS support), working on computer or in a browser but not relying on SpeechRecognition API…

Maybe a LLM model that I could ask to be added to Handy?

I had made a STT deck for sentence shadowing and thought my iphone was a decent threshold for clean pronunciation and used it for text input (though they were common phrases). The only issue I’ve heard from native users is that when they have thick dialects, it tends to break. My understanding is STT has some context dependency and many WK sentences are intentionally absurd, though one would think common collocations should be ok (never tried it myself)….might be a good experiment if you know a native speaker for STT on the WK sentences and see what they get.

1 Like

Hmm… that’s interesting. Though, suprising as it may be, I’m not sure macOS and iOS use the same tech for STT :thinking:

1 Like