Vocabulary reading sounds API


#1

Hi everyone. I have a pretty big Anki deck that I created myself from words I encountered pretty much anywhere (Anki, manga, anime, songs, and random words from the internet), and it is perfectly tailored to me. The only thing I am missing is audio readings for the vocabulary. I have a Python script that pulls stroke orders from Jisho in batches, and it’s super convenient. I want to write a similar script that pulls pronunciations (sound files) for the words in my deck. However, I do not know of any website that has an API for this, or at least static links that can be easily generated, like Jisho’s http://jisho.org/kanji/details/辿 where you can just replace the last character with the kanji you need and pull the stroke order diagram from the page.
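
To make "static link" concrete, here is a minimal sketch of building one details URL per kanji from the pattern above. The function name is made up for illustration, and fetching and parsing the page is left out; the only thing shown is that the kanji gets percent-encoded before it goes into the URL.

```python
from urllib.parse import quote

# URL pattern taken from the example link in this post.
JISHO_KANJI_DETAILS = "http://jisho.org/kanji/details/"


def kanji_detail_urls(kanji_list):
    """Return one details URL per kanji, percent-encoding each character."""
    return [JISHO_KANJI_DETAILS + quote(kanji) for kanji in kanji_list]


if __name__ == "__main__":
    # Print the URLs a batch script would request.
    for url in kanji_detail_urls(["辿", "鰐"]):
        print(url)
```

A site is easy to script when every word maps to a URL this predictably, which is what I am looking for on the audio side.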


#2

Would forvo work?
http://www.forvo.com/word/鰐/#ja


#3

I am not sure what help this is, but I used to have a user script that would pull from Japanese Pod 101 pronunciations to fill in for the lack of pronunciations here during reviews. My computer got wiped since I had it, so I don’t have it now. But maybe a helpful lead.


#4
Omnimancer said... Would forvo work?
http://www.forvo.com/word/鰐/#ja
Thanks for the answer. The website is great (I will use it from now on, cheers for that :-)), but I cannot find static links to the sound files.

If you know programming: I was able to work out that the sound for 鰐 comes from this file, but as far as I can tell there is no easy way to get the link automatically. The links are generated by a JS function called Play, which calls a PHP script that returns the path to an ogg (or mp3, if you prefer) file. So while it's possible to get the raw mp3 files, it is hard to automate with Python: you would basically need to simulate the onClick event of the 'play' button, then read a JS variable (from Python!) to get the path to the mp3, and only then download it. I am not sure how to do that with Python, but if you (or anyone) know how, please let me know.

If you don't know programming: It's hard to get the link to the mp3 files because of the way the website is programmed.

Thanks for the link. It's a great website.

#5
AlexAlex said... I am not sure what help this is, but I used to have a user script that would pull from Japanese Pod 101 pronunciations to fill in for the lack of pronunciations here during reviews. My computer got wiped since I had it, so I don't have it now. But maybe a helpful lead.
Thanks for mentioning it. I looked at it and it works pretty well. I was able to get this file to work. The only problems are the annoying "File is not uploaded yet" message and the fact that you need to give both the kana and the kanji for it to work. But I guess it will do for now. Thank you very much :-)
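
In case it helps anyone else, a rough sketch of the two workarounds in Python. Everything specific here is an assumption: the endpoint is a placeholder (the real audio URL is not spelled out in this thread), the `kanji` and `kana` parameter names are guesses, and the placeholder check assumes the "not uploaded yet" response is always the same bytes, so that one saved copy can be hashed and compared.

```python
import hashlib
from urllib.parse import urlencode

# Placeholder endpoint, NOT the real service URL.
AUDIO_ENDPOINT = "https://example.invalid/audio"


def audio_url(kanji, kana, endpoint=AUDIO_ENDPOINT):
    """Build a lookup URL carrying both spellings (parameter names assumed)."""
    return endpoint + "?" + urlencode({"kanji": kanji, "kana": kana})


def is_placeholder(audio_bytes, placeholder_sha256):
    """Return True if the bytes match the saved 'not uploaded yet' response."""
    return hashlib.sha256(audio_bytes).hexdigest() == placeholder_sha256
```

The idea would be to download the placeholder once by hand, store its SHA-256 digest, and skip any word whose download hashes to the same value.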

#6

There are direct links to the mp3 file on each page, but there are still numbers appended to the end of the URL that are different from file to file.

http://www.forvo.com/download/mp3/鰐/ja/2531211


#7
Omnimancer said... There are direct links to the mp3 file on each page, but there are still numbers appended to the end of the URL that are different from file to file.

http://www.forvo.com/download/mp3/鰐/ja/2531211
Oh, I didn't notice. You probably need to be logged in to see it. Thanks a lot. You can get the number from the HTML for the play button. I already pulled all the pronunciations from JapanesePod101, but will pull the missing ones from here.
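
A minimal sketch of that last step, with loud caveats: the regular expression assumes the number appears as the first argument of a `Play(...)` call in the button markup, which should be checked against the real page source, and both function names are made up. The download URL follows the pattern quoted above. Fetching the page and downloading the file while logged in are not shown.

```python
import re
from urllib.parse import quote

# Assumed markup shape: the number is the first argument of Play(...).
PLAY_ID = re.compile(r"Play\(\s*(\d+)")


def pronunciation_ids(html):
    """Return every number found as the first argument of a Play(...) call."""
    return [int(match) for match in PLAY_ID.findall(html)]


def download_url(word, pronunciation_id, lang="ja"):
    """Build the mp3 download URL in the pattern quoted in this thread."""
    return f"http://www.forvo.com/download/mp3/{quote(word)}/{lang}/{pronunciation_id}"
```

Since a word page can list several recordings, `pronunciation_ids` returns a list; taking the first entry is the simplest policy, though not necessarily the best recording.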