[Userscript] Keisei 形声 Semantic-Phonetic Composition

New version 1.6.2!


  • chargrids are now split in header and compounds
  • @seanblue your fixes, and items have a it more breathing space now in the lists
  • beta: fixed “dropdowns everywhere” problem in lessons summary page

Looks much better now!

Beta changes in 1.6.3:

  • added list of tone marks sorted by number of compounds
  • new nav feature should work on more pages now, and ensure that you cannot append the same section multiple times.

I think maybe 「致」 needs a cross-reference to 「到」.

I think this is a bug. I searched for the kanji 称 and in the similar kanji section, 除 is considered as not learned (even though it’s a lvl 31 kanji and I did learn it). Is it something that I’m not understanding?

1 Like

Answer is over there:

Hmm I should probably make two categories, derived tone marks and similar looking tone marks.

Be careful of feature creep though. At a certain point, there can be too much information such that it gets overwhelming and hard to follow.

With that in mind, you might want to explain somewhere in UI (e.g. tooltip, help dialog) to explain what all the settings and options are.


Ok, so. First part tells me 巻 is a poor match with its tone mark because it’s reading is かん instead of けん.
Then second part tells me けん is one of the readings for that tone mark. Is it that 巻 itself doesn’t use けん but other kanji that use it as a component do?

I should really add the explanation for the quality levels somewhere :slight_smile:

天 means that all readings of a compound match
上 means that the main reading matches, but you can also read it in a way not covered by the tone mark, but rarely
中 means that the main reading is different (かん here), but at least one of the phonetic readings is also used as a reading for the kanji (けん), just not in first place
下 means that the phonetic readings are nowhere to be found.

Kanjipedia: けん is a non-jouyou reading:


An overview of all your meanings (including the quality levels, the colors, the bolds) would be great.

Also 闘 says it has an unknown/contested tone mark, but you have とう as the reading for the phonetic component 豆 and this kanji is also read とう. Thoughts?

We are on a similar level, we always find similar things :slight_smile:

I thought it was a composite tone mark, like 豆+寸. But the only information I found is that it looked like this: 鬭, and 斲 is either the tone mark or just adds meaning (jisho says “cut, chop, hack”, something you would do in a struggle/battle). Maybe 斲 is really somehow related to 豆, or 豆 was even chosen as a simplification because of its reading.

I use the 豆 inside to remember the reading myself, but for the DB I try to keep the tone marks on “top level” (only two parts per kanji), so the mark would have to be 豆+寸 (can’t find it on its own, and 厨 is not とう (can be ちょう, though)).

I’m curious, what’s the reason for that?

No authoritative reason, but I believe this is how it is done.

The tone marks are not solely chosen for their reading, you would chose the most simple way to represent the sound in that, and always use the same mark. After some time you would have the idea to change your writing system to be only “sound-based”. Instead, oftentimes several kanji with the same reading are used as tone marks. I think that the tone marks are still chosen for their meaning (if possible).

So the tone mark is the whole thing, even if a part inside the part shows the tone (as in 青 => 生 => せい). Also, in Japanese they sometimes changed the reading of a compound tone mark, along with compounds themselves, so the original tone mark doesn’t fit anymore …

Similar to the above question, 浸 and 侵 both read as しん but apparently not related?

They are related, 寝 as well, the problem is that the tone mark is not printable (𠬶), you need a font with lots of kanji like MingLiU to see it. Don’t know if I should include it …

Question - what does ‘tone marks’ actually mean? I can’t figure it out.

Phonetic component in certain kanji.

Should 統 be part of the 充 phonetic group as a non-match? Right now it’s not included at all.

It was part of non-match already, the problem is that when you look at a kanji that is only not something that information is not displayed at the moment. If you arrive from outhouse (充) you can see it.

However, I looked it up and several sources say 統 is a phonetic compound, I changed it to matching.

1 Like