Hello all,
this is my first post, よろしくね。
I have been experimenting with different dependency parsers for Japanese recently, and I was wondering if anyone else is interested. Basically, you (programmatically) split a sentence into its smallest parts and then figure out how those parts relate to each other. I think this should be helpful for learning how to parse/read sentences yourself, and it should make glossing easier.
There are several tools available for this; at the moment I’m checking out Jumanpp and KNP. Example output looks like this:
You read from top to bottom, and things on the left modify things on the right, with the predicate at the very end. Multiple things in parallel can modify a verb.
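To make the reading rule a bit more concrete, here is a rough Python sketch of the idea. It is a hand-made toy example, not real KNP output and not KNP's data format: the sentence, the `heads` list, and both helper functions are assumptions for illustration only. Each unit simply stores the index of the unit it modifies, and the predicate at the end modifies nothing.

```python
# Hand-made toy example, NOT real KNP output.
# Each unit stores the index of the unit it modifies (its head);
# -1 marks the predicate at the very end, which modifies nothing.
units = ["私は", "図書館で", "本を", "読んだ"]
heads = [3, 3, 3, -1]


def dependents(heads):
    """Map each head index to the list of unit indices that modify it."""
    result = {}
    for i, h in enumerate(heads):
        if h >= 0:
            result.setdefault(h, []).append(i)
    return result


def is_left_to_right(heads):
    """True if every unit modifies a unit to its right and the last unit is the root."""
    return heads[-1] == -1 and all(h > i for i, h in enumerate(heads[:-1]))


# Show which units modify which head; here three units modify the verb in parallel.
for head, mods in dependents(heads).items():
    print(units[head], "<-", [units[m] for m in mods])
```

In this toy sentence all three units on the left attach to the verb, which is the "multiple things in parallel" case.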
Has anyone tried to use something like this in the past?
Also, I’m currently building a program that turns KNP output into HTML.
Recent examples:
Above sentence again (NHK news easy)
A few sentences from a normal NHK news story today
In the second example, you can click the sentence headers (white boxes) to go to another sentence. You can also hover over units to show a gloss, click a unit to keep its gloss visible after the mouse leaves it, and toggle some additional internal data with the form buttons. The colors show the grammatical function.
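For anyone curious what the "hover to show a gloss" part could look like in principle, here is a minimal Python sketch. It is not the actual program and it does not read KNP output: the unit tuples, the role names, and the color table are all made up for illustration. It relies only on the standard HTML `title` attribute, which browsers show as a tooltip on hover.

```python
from html import escape

# Hypothetical units: (surface text, gloss, grammatical role). Made up for illustration.
units = [
    ("私は", "I (topic)", "topic"),
    ("本を", "book (object)", "object"),
    ("読んだ", "read (past)", "predicate"),
]

# Hypothetical mapping from grammatical role to background color.
COLORS = {"topic": "#cde", "object": "#dec", "predicate": "#edc"}


def render_unit(surface, gloss, role):
    """Render one unit as a colored span whose tooltip is the gloss."""
    color = COLORS.get(role, "#eee")  # fall back to grey for unknown roles
    return (
        f'<span class="unit" title="{escape(gloss, quote=True)}" '
        f'style="background:{color}">{escape(surface)}</span>'
    )


def render_sentence(units):
    """Render all units of one sentence inside a single paragraph."""
    return "<p>" + "".join(render_unit(*u) for u in units) + "</p>"


print(render_sentence(units))
```

Keeping a gloss visible after a click would need some JavaScript on top of this; the sketch only covers the plain hover case.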
Obviously the usability is not great, and you can’t enter your own sentences in these examples, but maybe it’s helpful. Also, I mainly use Chrome; Firefox and Safari seem to work, but no guarantees. Finally, Jumanpp and KNP are around 90-95% correct according to the people who make them, so there is a real chance that the analysis of any given sentence, and the glosses along with it, contains mistakes.
Is anyone interested in a tool like this, or would anyone find it helpful for learning Japanese?