[Userscript] Keisei 形声 Semantic-Phonetic Composition

acm2010 · January 24, 2018, 1:08am

Hmm I should probably make two categories, derived tone marks and similar looking tone marks.

seanblue · January 24, 2018, 2:47am

Be careful of feature creep though. At a certain point, there can be too much information such that it gets overwhelming and hard to follow.

With that in mind, you might want to explain somewhere in UI (e.g. tooltip, help dialog) to explain what all the settings and options are.

coobie · January 30, 2018, 1:21pm

Ok, so. First part tells me 巻 is a poor match with its tone mark because it’s reading is かん instead of けん.
Then second part tells me けん is one of the readings for that tone mark. Is it that 巻 itself doesn’t use けん but other kanji that use it as a component do?

acm2010 · January 30, 2018, 1:31pm

I should really add the explanation for the quality levels somewhere

天 means that all readings of a compound match
上 means that the main reading matches, but you can also read it in a way not covered by the tone mark, but rarely
中 means that the main reading is different (かん here), but at least one of the phonetic readings is also used as a reading for the kanji (けん), just not in first place
下 means that the phonetic readings are nowhere to be found.

Kanjipedia: けん is a non-jouyou reading:

seanblue · January 30, 2018, 9:50pm

An overview of all your meanings (including the quality levels, the colors, the bolds) would be great.

Also 闘 says it has an unknown/contested tone mark, but you have とう as the reading for the phonetic component 豆 and this kanji is also read とう. Thoughts?

acm2010 · January 30, 2018, 11:53pm

We are on a similar level, we always find similar things

I thought it was a composite tone mark, like 豆+寸. But the only information I found is that it looked like this: 鬭, and 斲 is either the tone mark or just adds meaning (jisho says “cut, chop, hack”, something you would do in a struggle/battle). Maybe 斲 is really somehow related to 豆, or 豆 was even chosen as a simplification because of its reading.

I use the 豆 inside to remember the reading myself, but for the DB I try to keep the tone marks on “top level” (only two parts per kanji), so the mark would have to be 豆+寸 (can’t find it on its own, and 厨 is not とう (can be ちょう, though)).

seanblue · January 30, 2018, 11:57pm

I’m curious, what’s the reason for that?

acm2010 · January 31, 2018, 12:08am

No authoritative reason, but I believe this is how it is done.

The tone marks are not solely chosen for their reading, you would chose the most simple way to represent the sound in that, and always use the same mark. After some time you would have the idea to change your writing system to be only “sound-based”. Instead, oftentimes several kanji with the same reading are used as tone marks. I think that the tone marks are still chosen for their meaning (if possible).

So the tone mark is the whole thing, even if a part inside the part shows the tone (as in 青 => 生 => せい). Also, in Japanese they sometimes changed the reading of a compound tone mark, along with compounds themselves, so the original tone mark doesn’t fit anymore …

coobie · February 19, 2018, 1:51pm

Similar to the above question, 浸 and 侵 both read as しん but apparently not related?

acm2010 · February 19, 2018, 2:26pm

They are related, 寝 as well, the problem is that the tone mark is not printable (𠬶), you need a font with lots of kanji like MingLiU to see it. Don’t know if I should include it …

konekush · February 26, 2018, 6:31pm

Question - what does ‘tone marks’ actually mean? I can’t figure it out.

coobie · February 26, 2018, 8:52pm

Phonetic component in certain kanji.

seanblue · February 27, 2018, 11:10pm

Should 統 be part of the 充 phonetic group as a non-match? Right now it’s not included at all.

acm2010 · February 28, 2018, 12:31am

It was part of non-match already, the problem is that when you look at a kanji that is only not something that information is not displayed at the moment. If you arrive from outhouse (充) you can see it.

However, I looked it up and several sources say 統 is a phonetic compound, I changed it to matching.

acm2010 · February 28, 2018, 12:34am

If you mean the word tone mark itself, I took it from the translation of 声符, but is probably not the best word. I will change it to phonetic component or something.

acm2010 · March 7, 2018, 3:47am

Version 1.6.4 with a few minor modifications.

Reworded the info strings a bit.
Semantic components are now also shown, for example 魔=麻+鬼 or 透=秀+辵 (*)

(*) mainly a by-product of my attempt to make a “kanji matrix”:

seanblue · March 14, 2018, 1:05am

The script says that 令 also has the phonetic component of りょう, but 領 is the only kanji you list with this reading. If only one of six kanji listed has that reading (and not even the phonetic component itself), how can it be considered a phonetic component?

acm2010 · March 14, 2018, 3:57am

The reading りょう is uncommon, but it is the Go-on and listed as “outside jouyou”, you can see it here:

http://www.kanjipedia.jp/kanji/0007237600
https://ja.wiktionary.org/wiki/令

Interestingly 領 also has out-of-list reading れい listed in wiktionary. What gets listed as reading varies a lot, even kanjidict (which list lots of strange readings generally) misses many things.

Also here 令とは (レイとは) [単語記事] - ニコニコ大百科 under 「声符」 you can see more compositions not in WK that have りょう as an option.

seanblue · March 14, 2018, 10:36am

Follow-up question then. Sometimes you include kanji not on WaniKani in the script. How do you decide which ones to include?

acm2010 · March 14, 2018, 11:17am

I started with a list of all Jouyou kanji including the revisions from 2010 (some not in WK), then added the kanji included in WK that were still missing. Recently I also added all kanji that are used as phonetic components, this includes very obscure kanji not really used today.

But as compounds only Jouyou+WK additions will show up.

Topic		Replies	Views
Wanikani Phonetic-Semantic Composition 1.0.5 [No longer supported] API And Third-Party Apps	45	12320	April 21, 2022
Need clarify the phonetic-semantic - how to use Requesting Help	11	4081	January 11, 2020
Useful Add-ons for a beginner API And Third-Party Apps	19	14844	August 27, 2024
[Userscript] WaniKani Similar Kanji API And Third-Party Apps	111	50735	February 19, 2020
[Userscript] Niai 似合い Visually Similar Kanji API And Third-Party Apps	181	39512	June 18, 2023

[Userscript] Keisei 形声 Semantic-Phonetic Composition

Related topics