This website/script inserts word frequencies from the InnocentCorpus (5000+ novels) or BCCWJ into your Anki cards. That way, you can choose more common words to learn first. (Enables sorting!)
(NEW: a version using the BCCWJ corpus (Contemporary Written Japanese). Its frequencies are relative ranks: 100 = the 100th most common word. ~5.8MB download on first visit.)
Yomichan shows you these frequencies for most words (that aren’t too rare):
This number just tells you how often the word occurs in a corpus of ~5000 books.
Anything over 10k is very common, below 100 is rather rare (私: ~900k, 新聞: ~30k, とろ火: 72).
(<100 doesn’t mean obscure though, the word can still be useful)
Yomichan can even export these frequencies with the Anki export feature (which is great, see plus sign). However, it puts HTML around it, which makes it hard to sort. Also, it can’t add frequencies to existing Anki cards.
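To illustrate why the HTML gets in the way of sorting, here is a minimal Python sketch that pulls a plain number out of an HTML-wrapped frequency value. The exact markup Yomichan exports is not shown in this post, so the sample string and the function name extract_frequency are assumptions for illustration, not the script's actual code.

```python
import re
from html import unescape


def extract_frequency(exported: str):
    """Return the first integer found in an exported frequency value, or None."""
    # Replace anything that looks like an HTML tag with a space, then decode entities.
    text = unescape(re.sub(r"<[^>]*>", " ", exported))
    # Take the first run of digits, allowing thousands separators such as "30,000".
    match = re.search(r"\d[\d,]*", text)
    if match is None:
        return None
    return int(match.group(0).replace(",", ""))


if __name__ == "__main__":
    # Hypothetical markup; the real export may look different.
    sample = "<ul><li>Innocent Corpus: 30000</li></ul>"
    print(extract_frequency(sample))
```

This takes the first number it finds, so a dictionary name that itself contains digits would need extra handling.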
Requirements
- The Anki addon AnkiConnect needs to be installed (which should already be the case if you use Yomichan and its Anki export).
- Anki needs to be running.
- Your notes need to have a field FrequencyInnocent or similar (you can change that name in the “frequency field” option near the top).
  If your notes don’t have that field yet, you can add it in Anki via Tools → Manage Note Types → Fields.
- You should close the Anki Browse window while doing the changes. I think the worst that can happen is that the currently opened card will not be updated. I tested this with ~900 changes and the rest was fine.
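For anyone curious what talking to AnkiConnect looks like, here is a rough Python sketch of a request that writes a frequency into one note. It assumes AnkiConnect is listening on its default address (http://127.0.0.1:8765) and uses its updateNoteFields action. The helper names (build_request, build_frequency_update, invoke) are made up for this example, and this is not necessarily how the script itself does it.

```python
import json
import urllib.request

ANKI_CONNECT_URL = "http://127.0.0.1:8765"  # AnkiConnect's default address (assumed unchanged)


def build_request(action, **params):
    """Wrap an action and its parameters in AnkiConnect's JSON request shape."""
    return {"action": action, "version": 6, "params": params}


def build_frequency_update(note_id, frequency, field="FrequencyInnocent"):
    """Build an updateNoteFields request that sets one field on one note."""
    # Anki field values are strings, so the number is converted here.
    return build_request(
        "updateNoteFields",
        note={"id": note_id, "fields": {field: str(frequency)}},
    )


def invoke(payload, url=ANKI_CONNECT_URL):
    """Send a request to AnkiConnect. Anki must be running for this to work."""
    data = json.dumps(payload).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        reply = json.load(response)
    if reply.get("error") is not None:
        raise RuntimeError(reply["error"])
    return reply["result"]


if __name__ == "__main__":
    # The note id below is a placeholder, not a real note.
    print(json.dumps(build_frequency_update(1234567890123, 30000), indent=2))
```

Building the payload is kept separate from sending it, so the request can be inspected without touching the collection.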
Disclaimer
This script should be very safe, since it only updates the FrequencyInnocent field of notes, and only if that field already exists. But please back up your Anki collection via File → Export beforehand; it’s a good idea anyway. Use the script at your own risk: I won’t be responsible for changes to your Anki decks. The code is public though, you can check it here.
Using this to sort/search your Anki cards by frequency in Browse
You can either use another addon like Advanced Browser to be able to sort by custom fields:
Or, if you just want to search without sorting or addons, you can use a query like deck:Yomichan FrequencyInnocent:9___ (3 underscores), which will find all cards with frequency 9xxx. For frequencies of 10k and above, use FrequencyInnocent:_____* (5 underscores + the * wildcard).
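If you want to check which frequencies a wildcard pattern would catch before typing it into Browse, the matching can be approximated in Python. This sketch assumes the field contains nothing but digits and that Anki's _ means exactly one character and * means zero or more. The function names matches_anki_wildcard and band_query are made up for the example.

```python
from fnmatch import fnmatchcase


def matches_anki_wildcard(value: str, pattern: str) -> bool:
    """Approximate Anki's whole-field wildcard match for plain digit strings."""
    # Anki's "_" (exactly one character) corresponds to fnmatch's "?";
    # "*" (zero or more characters) means the same in both.
    return fnmatchcase(value, pattern.replace("_", "?"))


def band_query(field: str, pattern: str, deck: str = "") -> str:
    """Build a Browse query such as 'deck:Yomichan FrequencyInnocent:9___'."""
    field_term = f"{field}:{pattern}"
    return f"deck:{deck} {field_term}" if deck else field_term


if __name__ == "__main__":
    print(band_query("FrequencyInnocent", "9___", deck="Yomichan"))
    for freq in ["72", "9123", "30000", "900000"]:
        print(freq, matches_anki_wildcard(freq, "9___"), matches_anki_wildcard(freq, "_____*"))
```

Because the match is on the number of characters, this only works as long as the field holds a bare number with no HTML or spaces around it.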
To learn the most frequent words first, what I do is select some cards → right click → Reschedule / Set Due Date → place in review queue (0/0).
There’s probably a smarter way, since this makes the first interval 3 days for me, so I have to mark it ‘Again’ on the first review.
This is also just nice information, even if you don’t want to always learn the most frequent words first. The frequency column is nice to have in the Browse window.
Further usage and technical information
My other website: wtk-search
It has its own thread:
Enjoy, and feel free to leave any feedback here.