Sequencing kanji based on amount of associated vocabulary


#1

Okay, so here’s my problem…

I’m working on a kanji project that could really make use of WaniKani’s pre-made vocabulary lists associated with individual kanji characters. I was wondering if there was a way to sequence/categorize each kanji in WaniKani based on the number of vocal terms associated with it. I would just manually go through all 2200 characters, but I figured someone knew some rad computer tricks to do it automatically or even already have it completed.

I just need numbers––not a list of vocabulary terms per se. I want to compare and chart which kanji characters tend to be used in common words than others.

Thanks so much! This would REALLY help me. :slight_smile:


Radical Count in WaniKani
#2

I have that data, shouldn’t be a problem.

You want something like 騎 -> 4 for all kanji, right?


#3

Yes please!!

If it could be ordered highest to lowest (or vice versa), that would be even more amazing! :smiley:


#4

Edit: deleted 〜, ー

This is “appearances in WK vocab”: list of kanji

91: 人
69: 大
68: 一
63: 日
51: 不
50: 気
44: 学者
43: 生物
42: 子
40: 中
39: 年
38: 手
36: 自
35: 地
34: 出水
32: 入
31: 々会金
30: 行国事的無
29: 本理
28: 外
27: 見
26: 力心悪感
25: 文回合動
24: 二下上月分書
23: 名主時場定
22: 作体道電度意
21: 用語
20: 全明所業
19: 目方来食通間楽新発
18: 立先車字県指器
17: 山口小切社空点高面機
16: 女内色近前話実取解
15: 止今言海長家付風数屋親味殺料性変品
14: 火四何売当音直血室決歌病神員信法洗害義
13: 十三公平休交死化船対開集伝放戦着兵愛告詞観造
12: 天正土台同形光東教予服返落最期流情原果巻
11: 花足代肉後有記民番勝和要待算運別様問保急可布連在状師違独込没
10: 川石五白安知黒朝身持院転起成画然士残私約真弁辞結薬説焼得営応過優張製迷撃除訳装油弾免
9: 王木母太元引早走両南茶思失欠助医反仮受線葉良利命根想願能望軍建遠治常官警験等禁報団容暴減順嫌難議権素断価収職規割象管絶冷恐離露酷
8: 左円万父古戸世打西角米図首星向店必強役馬重読進便特好島消深野館求念格仏英式証留守園曜側敗覚達類静座乱加防倒圧経論産務判脱示値現差額限景構満故極痛壊婚快盗更傷舞逃躍剤頃浄疎
7: 工千右田半毛皮毎草曲活校住支投表相美配終酒共初老温都悲旅題選福例芸術計飯関存政察非鼻試借比混余飛制総検統置援寝輸載視抜捕模効再症眠給移程詰版背散粉雑汚閉否皇納貴降腐劇介遺刑爆跡片甲艦潜握滅魔遇貫呆敢
6: 玉犬久少市百仕号声亡歩週魚弱氏未紙究絵使談部乗界送速飲顔波洋位注追球第族階駅像暗熱銀養周参囲固折骨材夫毒典干細栄席煙確震設資増税罰坊領態律演型燃触庁質居印突退段屈腹並児券庫婦攻逆覧越編華似党密染眼暮異裏誤損豆射破獄酔遅廃縄頼鹿募香盤排唱鉱誠包床泊焦網揺紛襲潮即封歳帳彩概勘瓶噴怠慢陳尚逸憂鎮伏遍
5: 六才兄牛北他写赤去虫男多青考次丁研客苦習路鉄農仲育湯息登短謝映輪整協基技完丈列晩幸笑底識枚句訓冊改若続易災犯穴評査任済費各案勢吸革則量担祝腕述販境環候影替隠掛復創怪河従我振接障獣筆誘惑迫怒妙幼録隊修精略乾杯催積航街奇乏延照系飾浮普泥帯均貨墓採厳欲操著推源承刻測寿恥払将及維超戻掲遣刊慮緩奥称託贈御銃群埋吹駐鋭繁隣孝透偽儀至衣侵括柄荒裂慰沈沼翼辛霊湿魂菌疲零帝瞬粘哀憎扇耐隔劣邸猟偏覇鎖孤緯舗惨瓜随虐叙痴奉賓弦錯窮飽鼓盲腸忌戯譜胆浪
4: 八九丸友広礼申皿谷麦京夜科辺鳥以組末局君始負聞調横鳴労級章商童歯練宿詩疑賞頑昼昨区単猫門喜浴箱危因塩恋虚幻禅脳僧許忘財歴宙笛個妻夢敵審省挙件派際認副提宅停護崎裁準備沢乳武供展株渡響訴補輩与鉛刺占貯針郵靴健端締織貸凍処清益板僚診怖請押撮漏翌購遊幾嘆巣脈富掃序志祖興複暖桜迎液卵宝砂糖肺尊敬熟盛蒸聖薦勤臓純沿磁幕丼滞紹汁彼熊炎諾甘摘核継踏換依跳執塁患抗旬聴削葬償闘賄避致懸房充棋雇渋稲斐堀薄巡携頻戒駆敏敷犠畑瀬拠徹樹炭挑軸芝袋牧威旨柔炉距懇塔縁缶髪雷叫寸卓肌狩裸陰穏碁黙涼泡詐佐柳憩挿炊塗殴辱尽騎溶踊賢輝麻灯浸覆邪仰沸枯耕苗釈隅頂擁倫恒殊膨陥稿騰繊鯉卸紋謡顕欄疾傍惜抹粗髄碑縫凡匠拍縛漬賊膜弔胎濁鶏媒帆慨赦慶醜
3: 七夕矢冬氷糸町里弟州姉夏由買具頭了働争競陽植港庭暑鏡億季卒束晴勇築春紀岩泣険府阪梅汽祈弓善舌忙困節宇履徒率被臭罪裕尾批委条責誕策賀域呼秀狭況鮮属捜激較豚就招昇暇濃訪胃浜巨博微潔稚娘緊宗欧烈索臣寄促宴旗詳貧適騒棒既驚徳探恩績衛捨秘酸筋垂宣拡忠灰蔵諸芋縦揮紅拝吐奴舎銅講互己剣酢湖旧盟債沖献般伸奈漁崩臨抱狙還妊傾抑描緒齢宜扱仙鋼邦勧圏項譲謙顧柱兼獲殿殖褒雅拳郷撤棄克双範肝喪揚滑綱珍趣籍朗垣擦忍丘匹竜俺粒刃棚芽矛凶暦塾磨溝舟眺墨鈍斬癖誇阻俵綿架盆滴霧砕欺唇如婆恨掌幣班脇遂盾斜脅蓄鉢闇畜飢咲培悔桑紫抽刈唯壇煮謀陶俗潤衰珠妃鬱駄銘漂翻伯偶壮搬疫洞召喚濯玄脂蓮偉慈渦膚貞軒襟遭呂剰啓寛帥胞勲庶粛鯨呈悠愚荘酬累凝循旦搾尉摂穀轄猶酵烏閲凸凹羅旋款衡遮醸乙朽酌殻錠礁蔑遷侮漆紡唄煩婿蛮
2: 刀耳竹村池雨午妹羊札黄答泳漢軽昔低岸功秒倍祭泉勉緑橋皆妥松希司帰寺係荷専堂渉署種喫胸絡尻妨企宮姿看導幹俳城施鬼腰層届票逮慣豊含肥絞励徴授汗菓討悩途康睡傘憲衆江猛閣韓添雄渇符預菜融尋鑑監豪廊倉孫径救陸偵飼永賛銭漠簡窓賃歓爪粋枝縮亀為噌鍋姓幅療貿牙陣抵恵湾兆契伴併択需繰奏却壁拒鈴岐阜隆慎祉枠控茂誉衝伺措蜂蜜仁析堅枢哲弧掘斎暫潟糾岳懲斉撲誰刷筒吉朱桃謎釣姫涙硬稼澄脚呪曇賭嫁滝狂鐘井塊穂寮寧椅瞳租幽錬鍛孔猿尺塀墜畳巾扉箸虹伊瞭胴蚊隙餓憶誓悟礎尼征奨淡漫蟹鰐峰巧廷簿訂諮堤奮墳晶彰軌把挟郊燥聡肯軟祥惰秩芳茨賠須牲糧諭丹蒙壌徐披据搭胡葵艇錦杏冠哺栽悼愉栓虜之尿癒脊弊呉宰寂柴窒紳舶蝶紺伐俸峡楓槽堕萌藩奔淑傑剖憤扶硫絹卑擬甚叔崇憧禍雌閑囚泌痢匿升慕湧寡渓藻
1: 又貝央羽林雪森雲拾努令寒標課雰秋坂浅冒冗是史昆閥械厚岡藤贅肩訟往郎綺麗魅懐枕督机誌隷彫厄杉醤津伎昭賂娠奪埼俊虎蛍酎遜墟到泰琴貢摩滋梨嵐僕斗笠娯侍叱砲也翔鳩棟鍵吾菊庄帽爽芯粧崖嬢蛇穫霜貼迅鶴駒拓拘剛陛唐后淀涯堰亭塚媛肪浦郡隻隼唆颯曙准戴緋鎌傲阿拐栞茎践佳憾昌朴栃該郭弥赴那遥凛庸嘉且恭悦智洪陵靖喝龍坪享凌暁嘱椎瑞窃肖迭陪沙汰篤亜姻岬峠拙詠酪鋳吟堪屯曹睦畔拷倹某妄桟漸矯罷謹逝狐劾坑廉殉藍
0: 梓莉遼菅瑠駿璃輔哉茜諒綾蒼漣乃亮瑛


#5

I am forever in your debt. 感謝‼︎


#6

Ah, quick question. What does 「〜」indicate?


#7

Hmm nothing, I needed to filter out katakana, numbers, etc. maybe I missed some things.

This is from “〜人”: “number of people, people, people counter” and the like.


#8

Ah, I see. Thank you!! With all of this information, I think I can safely ignore that category then. :slight_smile:


#9

There is also a long vowel mark ー left at 7 count.


#10

Maybe someone is interested in the opposite as well :smile:

592: る
253: す
238: い
103: り
94: し
91: く
76: れ
73: う
64: きむえ
60: 〜
58: めけ
42: か
36: ら
35: み
34: にお
32: ま
31: つ
22: が
20: さげて
19: な
18: の
17: わちぐ
16: ん
15: ぶ
14: じ
12: びや
11: たせず
9: ご
8: べぎで
7: スーこば
6: ろもゴっ
5: ンねルと
4: メぼミ
3: フラアリはトゃをム
2: カビ1ぜだハイそバモッ
1: ふシ20ぬづペガギざゲジャよチクボタぱゆベプエェコヒ


#11

Aaand there are also a few kanji without vocab as well:

0: 梓莉遼菅瑠駿璃輔哉茜諒綾蒼漣乃亮瑛


#12

Wait, why are both 瑠 and 璃 without a word when just putting them together MAKES a word? (瑠璃)
Is the WK team planning to add more words in the near future?


#13

Nice, I regretted not making a list of these when I ran into them - I keep failing this burns, so I’ve been wanting to make my own vocab for these… Thanks for this


#14

You might try


I have just built a list from NHK Japanese Pronunciation Accent Dictionary. You might find it useful, and it might be better than the vocab list above:


#15

Queue the people whining that there are too many useless words in WK :wink:


#16

Well, people do complain about that. But again, WK is a kanji learning tool, not a vocab learning tool.
(Not that we need to have that conversation yet again)