ChristopherFritz's Study Log

And…it’s done. Well, I’m sure there’s room for tweaking this and that, but the main conversation work is done.

The best part is that many things I had to “settle” for with Wordpress because I couldn’t get it to work as I wanted, I’ve been able to get working using Jekyll.

For anyone curious, the site is at http://japanese.kurifuri.com/. It’s just a place for me to post random explanations and examples for grammar and such, mostly so I can refer back to them later if needed.

Zero testing on mobile at this time, as I never access the site anywhere other than my desktop computer.

7 Likes

I happen to be on mobile right now, so I gave it a test run. Looks pretty good to me, except for the issue that the header gets thicker and seems to cover the top part of the page?

3 Likes

Thanks for checking!

The last time I really dived into CSS, version 3 was barely a draft, so there’s a lot of functionality that I don’t know how to do, and that issue with the header is a symptom of that. For example, I’m familiar with fixed position, but reminded myself of sticky position via a web search just now.

Ideally, I wouldn’t have the header be fixed on mobile, as there’s less screen space versus a desktop monitor.

And the text looks so tiny in that screenshot.

3 Likes

The text is a bit small on the first page but fine in the examples!
Nice work!

3 Likes

Although it can still use some work, I think I maybe have done an okay job of getting my site looking slightly decent for the zero mobile users who will ever happen upon it.



Recently, I’ve been gathering up some of what I’ve written here and there on the forums and adding to my site for the zero people planning to read manga in Japanese who will happen upon these pages.

There’s still a little more to add for general “first time reading” posts.

9 Likes

I opened the website and scrolled around a bit while on my phone procrastinating getting ready to read, so you have at least 1 for each of those!

6 Likes

I’ve completed the next phase of “things no one will ever use”: manga vocabulary stats and frequency spreadsheets.

Stats include the minimum number of words you need to learn to be able to recognize a percentage of the text in a series. For example, for それでも歩は寄せてくる, the stats are:

Coverage Vocabulary Required %
75% 405 12%
80% 560 16%
85% 809 24%
90% 1,222 36%
95% 1,971 56%
96% 2,264 66%
97% 2,558 74%
98% 2,851 83%
99% 3,145 91%

Learning just 405 words (12% of the total unique words) teaches you enough to read 75% of the series.

Also included are per-volume stats:

Volume Total Words Unique Words Unique Words Used Once %
Series 29,358 3,438 1,595 46%
それでも歩は寄せてくる(1) 2,065 674 405 60%
それでも歩は寄せてくる(2) 1,899 638 339 53%
それでも歩は寄せてくる(3) 2,201 715 423 59%
それでも歩は寄せてくる(4) 2,384 804 469 57%
それでも歩は寄せてくる(5) 2,496 794 450 56%
それでも歩は寄せてくる(6) 2,277 739 464 63%
それでも歩は寄せてくる(7) 2,430 775 465 60%
それでも歩は寄せてくる(8) 2,688 850 514 60%
それでも歩は寄せてくる(9) 2,378 746 458 61%
それでも歩は寄せてくる(10) 2,300 772 465 60%
それでも歩は寄せてくる(11) 3,236 945 552 57%
それでも歩は寄せてくる(12) 3,004 873 510 57%

I didn’t even notice the slow increase of text content per volume!

Also included is a spreadsheet listing the vocabulary words (with frequency) for the series, as well as per volume (with the order of appearance as well)

Spreadsheet:

And it’s all 100% auto-generated, so if I adjust how I do the stats (such as to exclude character names and copyright pages), it’s quick and easy to update everything.

Actually, the main list of links isn’t auto-generated yet. I’d like to get that generated and have stats included in a table that can be sorted and filtered. Maybe come up with an algorithm to determine a “difficulty score” to be able to sort by difficulty.

5 Likes

Tbh, this looks really cool! I’d love to see one for the entirety of the JoJo manga so far, but I don’t have a text file of it

3 Likes

The longer a series is, I think the more interesting the stats can be.

For example, imagine if one did stats per volume, but assumed the reader is learning all words above a certain frequency threshold as they go. The first volume would have a really high number of words learned, but the number should shrink drastically and then slowly over the subsequent volumes. What would those stats really look like?

I don’t plan on trying to work out stats like that, though. I’m a bit too lazy for anything like that!

Since I don’t plan on reading JoJo, I expect Detective Conan (once I get back into reading it) to become my “longest series” that I have stats for.

2 Likes

I decided to add this into my stats, except assuming one learns 100% of the words as they go. This is entirely unrealistic but it’s still interesting to see the number of completely new words introduced in each volume in a series.

For example (specifically the final column):

Volume Total Words Unique Unique Used Once % New Unique
Series 41,865 5,367 2,286 43%
よつばと! 第01巻 2,713 958 573 60% 958
よつばと! 第02巻 1,865 778 492 63% 408
よつばと! 第03巻 2,353 901 563 62% 419
よつばと! 第04巻 3,077 1,100 646 59% 495
よつばと! 第05巻 2,738 1,012 610 60% 356
よつばと! 第06巻 2,448 912 572 63% 270
よつばと! 第07巻 3,071 1,093 643 59% 360
よつばと! 第08巻 2,835 1,019 596 57% 312
よつばと! 第09巻 2,931 1,041 614 59% 267
よつばと! 第10巻 2,381 956 586 61% 249
よつばと! 第11巻 2,286 967 623 64% 219
よつばと! 第12巻 2,788 1,135 749 66% 273
よつばと! 第13巻 3,454 1,236 764 62% 270
よつばと! 第14巻 3,327 1,149 712 62% 236
よつばと! 第15巻 3,598 1,154 685 59% 275

If I excluded words that appear only once in the whole series from that final column, those numbers would look a lot less intimidating than “I know you’re 14 volumes in, but the next volume has another 275 new words to learn in it”.

Maybe I should add columns for “new unique words that appear at least twice in the series” and “10 times in the series”. Something to consider.

Other charts:

三ツ星カラーズ
Volume Total Words Unique Unique Used Once % New Unique
Series 17,058 2,804 1,259 45%
三ツ星カラーズ Volume 1 2,365 828 478 57% 828
三ツ星カラーズ Volume 2 2,346 825 459 56% 511
三ツ星カラーズ Volume 3 2,245 737 400 54% 362
三ツ星カラーズ Volume 4 2,382 762 419 55% 312
三ツ星カラーズ Volume 5 1,753 571 303 53% 190
三ツ星カラーズ Volume 6 2,123 725 402 55% 234
三ツ星カラーズ Volume 7 1,934 608 323 53% 163
三ツ星カラーズ Volume 8 1,910 671 360 54% 204
レンタルおにいちゃん
Volume Total Words Unique Unique Used Once % New Unique
Series 7,880 1,490 682 46%
レンタルおにいちゃん 1巻 1,960 682 383 56% 682
レンタルおにいちゃん 2巻 2,096 676 373 55% 301
レンタルおにいちゃん 3巻 1,861 689 414 60% 252
レンタルおにいちゃん 4巻 1,963 737 453 61% 255
からかい上手の高木さん
Volume Total Words Unique Unique Used Once % New Unique
Series 15,266 2,205 960 44%
からかい上手の高木さん 1 2,307 775 445 56% 775
からかい上手の高木さん 2 2,171 699 385 55% 353
からかい上手の高木さん 3 2,066 683 364 53% 264
からかい上手の高木さん 4 1,791 565 316 56% 203
からかい上手の高木さん 5 1,840 594 321 54% 194
からかい上手の高木さん 6 1,670 534 294 55% 134
からかい上手の高木さん 7 1,749 592 312 53% 161
からかい上手の高木さん 8 1,672 536 283 53% 121

Those are some good numbers.

恋に恋するユカリちゃん
Volume Total Words Unique Unique Used Once % New Unique
Series 15,266 2,205 960 44%
恋に恋するユカリちゃん 1 2,056 841 520 62% 841
恋に恋するユカリちゃん 2 2,081 866 521 60% 516
恋に恋するユカリちゃん 3 2,500 1,060 675 64% 560
恋に恋するユカリちゃん 4 4,131 1,394 785 56% 719
恋に恋するユカリちゃん 5 3,074 1,205 766 64% 494

Looks like this one’s got more words going on than its Takagi counterpart. I blame all of Yukari’s internal monologues, but I’ll admit I didn’t even notice the difficulty difference while reading through the series.

ふだつきのキョーコちゃん
Volume Total Words Unique Unique Used Once % New Unique
Series 17,259 2,258 1,019 45%
ふだつきのキョーコちゃん 1 3,179 798 413 52% 798
ふだつきのキョーコちゃん 2 2,448 695 364 52% 331
ふだつきのキョーコちゃん 3 2,540 710 362 51% 273
ふだつきのキョーコちゃん 4 2,266 687 379 55% 252
ふだつきのキョーコちゃん 5 2,362 700 396 56% 219
ふだつきのキョーコちゃん 6 2,450 765 415 54% 239
ふだつきのキョーコちゃん 7 2,014 639 342 54% 146

Numbers very similar to Takagi.

それでも歩は寄せてくる
Volume Total Words Unique Unique Used Once % New Unique
Series 29,358 3,438 1,595 46%
それでも歩は寄せてくる(1) 2,065 674 405 60% 674
それでも歩は寄せてくる(2) 1,899 638 339 53% 330
それでも歩は寄せてくる(3) 2,201 715 423 59% 322
それでも歩は寄せてくる(4) 2,384 804 469 57% 322
それでも歩は寄せてくる(5) 2,496 794 450 56% 286
それでも歩は寄せてくる(6) 2,277 739 464 63% 236
それでも歩は寄せてくる(7) 2,430 775 465 60% 217
それでも歩は寄せてくる(8) 2,688 850 514 60% 237
それでも歩は寄せてくる(9) 2,378 746 458 61% 174
それでも歩は寄せてくる(10) 2,300 772 465 60% 173
それでも歩は寄せてくる(11) 3,236 945 552 57% 245
それでも歩は寄せてくる(12) 3,004 873 510 57% 222
ふらいんぐうぃっち
Volume Total Words Unique Unique Used Once % New Unique
Series 32,888 5,116 2,477 48%
ふらいんぐうぃっち(1) 2,472 946 557 59% 946
ふらいんぐうぃっち(2) 2,524 934 575 62% 531
ふらいんぐうぃっち(3) 2,506 898 523 57% 388
ふらいんぐうぃっち(4) 2,085 879 558 63% 378
ふらいんぐうぃっち(5) 2,867 1,121 703 63% 488
ふらいんぐうぃっち(6) 2,844 1,026 608 59% 368
ふらいんぐうぃっち(7) 3,757 1,340 808 60% 477
ふらいんぐうぃっち(8) 4,285 1,490 869 57% 532
ふらいんぐうぃっち(9) 2,825 1,204 791 66% 339
ふらいんぐうぃっち(10) 2,956 1,147 711 62% 302
ふらいんぐうぃっち(11) 3,767 1,330 783 59% 367

Bonus stats to motivate @MissDagger on reading more:

orange
Volume Total Words Unique Unique Used Once % New Unique
Series 20,151 2,357 1,002 43%
orange 1 3,912 1,019 531 52% 1,019
orange 2 3,621 942 510 54% 479
orange 3 2,805 711 363 51% 217
orange 4 2,593 723 370 51% 225
orange 5 4,235 900 437 49% 227
orange 6 2,985 793 400 50% 190

(I don’t expect these stats to actually motivate, but if you pretend they’re motivational, then maybe!)

9 Likes

Your stats are always interesting. It seems weird to me how many new words there are in each volume. With book books it doesn’t seem so strange, but manga doesn’t have as much text. I guess they’re always putting the characters in different situations that use more vocabulary, not to mention synonyms. ^^

More importantly, you made me chuckle on a day that’s been pretty stressful, and I appreciate that. :blush:

5 Likes

Would you believe that 相変わらず doesn’t show up in Orange until volume 4? Other words that we won’t see until volume 4 include 細かい, あいさつ, 借りる, and 転ぶ.

4 Likes

相変わらず definitely surprises me. Not that it is a word I see every other page, but probably at least once per volume of whatever I’m reading. The other ones I can understand, they are a bit more situational. But 相変わらず? I do indeed have a problem believing. :joy:

2 Likes

It’s interesting to see the blind spots one can have in one’s own progress.


I’m working on an algorithm to rate manga difficulty based on the vocabulary used in it.

I don’t know if it’ll work out, but it kept getting one thing wrong: saying Detective Conan is more difficult than Hayate the Combat Butler.

After all, I’ve read ten volumes of Detective Conan with relative ease, yet I basically brute-forced my way through the first volume of Hayate.

This is where that blind spot comes in.


2018: I painstakingly worked through my first manga volume.

2019: I started reading with the book clubs here on the forums, as well as a few easy things of my own.

2020: I expanded my reading quite a bit. Going into the final quarter of the year, I tried reading Hayate the Combat Butler but found it much too difficult for me.

2022: Early in the year, I picked up Detective Conan and completed 10 volumes.

Thus, Hayate is more difficult than Conan, right?


Oops, I forgot a year. What about 2021?

2020 and 2021: I worked on increasing my reading stamina.

There’s a whole year of improvement between reading Hayate and Conan. Not to mention vocabulary acquisition along the way.

These little things might mean something.


Today, I tried reading through a random scene from the first volume of Hayate, and I found it to be…quite readable. Actually understandable. And there were only two or three words I didn’t know rather than every other word. (Maybe it was just an easy scene?)

I may have to skim through volume one to remind myself of what’s happened, then maybe I can finally try reading volume 2.

8 Likes

I was actually thinking only yesterday that I might be ready to reread Alice 19th in Japanese. First time I did, I didn’t look up anything, just read it from start to finish and missing quite a bit of the Japanese. Having read it so many times in English that I didn’t need to understand the text to know exactly what was happening.

Although, rereading it now would be to expand my vocabulary in some new directions so when I finally get to Ouran High School Host Club, I’ll have some more of the weird vocabulary that I’m sure will show up during weird themed host club activities. (My hope is to start OHSHC after Orange finishes and then some, maybe late spring/summer next year.)

And now I’m wondering: how much easier will I be able to read it, even before look ups? I read it last in March 2020.

I wouldn’t think to try until I’ve gotten caught up with all the book clubs I want to be follow, but after that, maybe? And I’ll be curious to know if my experience will be similar to yours. :slight_smile:

3 Likes

You may have seen me mention this, but Alice 19th for you was Saint Tail for me. I knew the material well enough that I could read it without lookups (unless an unknown word kept showing up and I was curious).

This allowed me to help build my reading speed early on in my reading journey. It also helped expose me more to known grammar. (As I always say, the brain is a pattern recognition machine, and the more you read the more patterns you’re subconsciously internalizing.)

Just now I picked a random page from the series and was able to understand everything. (I only read two pages, though.) There were a couple of unknown words, such as 校庭(こうてい), but I recognized the kanji so I was able to piece together the meaning.

However, I also remembered the scene based on seeing two panels of characters standing in a classroom and reading two word balloons of the teaching saying they can go ahead and go outside to the school garden. If this was a series I’d never read/watched before, I wouldn’t have had that context to make it easier.

In other words, it’s really difficult to judge when I know the material so well!

4 Likes

Yeah, I can totally see that. I think Alice 19th would be a great target for full comprehension. Nowadays I’m being very careful with when I do that, because it takes a lot more time. Looking up grammar to make sure I understand the full nuance it brings (as much as I can), teasing out nuances of words, etc. (Obviously I don’t look up everything, but anytime I don’t feel like I have the full picture I do.)

The better I know the material, or the closer the Japanese are to +1, the more useful full comprehension reads become. Not that I can remember if there will be any tricky grammar. I do remember that when an American guy spoke, I found his Japanese much harder to understand, but I can’t remember why. xD

I wonder what 結婚しても愛してる will be like to reread when I get around to that (also so I can keep going with volume 2 and 3). That was for me, my true first manga. (I think I read/finished Chi’s sweet home first, but it didn’t have that much text and most of it was figuring out how Chi had baby and cat slurred her words.) I struggled so much with kekkon because it is chockfull of casual language. It did give me a good grounding in it, but it was hard.

I don’t expect it to be hard when I get back to it, because I’m so much better at casual language now and know far more grammar. :slight_smile:

It really is interesting how previous experience with material, the period we read them in and such things, can really shade the view we have of a work.

2 Likes

I’ve been undecided whether I’ll re-read it, but it was very difficult for me as well. I think that was my third or fourth manga read, so I was still taking in all the grammar and building up pattern recognition.

I’m getting close to picking a series to read based solely on this.

(Feel free to skip everything below. It’s just me looking over what one of my potential next-reads may be.)

My tracking shows the following as recommended reads for vocabulary being as close to +1 as possible:

Series Left Unique Overall
ひとりぼっちの○○生活 303 74.96% 92.10%
春色アストロノート 156 77.90% 90.45%
甘々と稲妻 238 74.44% 89.49%
日々蝶々 432 67.37% 88.50%
春待つ僕ら 865 60.59% 88.28%

For ひとりぼっちの○○生活, I read the first volume in (probably early) 2021. I had many kanji look-ups as there’s no furigana.

I haven’t gotten to volume two yet due to the time spent on kanji lookups. But Mokuro+Migaku 100% streamlines no-furigana kanji look-ups for me, so maybe I’ll get to it soon.

As for the numbers above, there are 303 unknown words I would need to look up, but this is spread across volumes 1 through 6. That’s an average of 50 unknown words per volume, roughly one unknown word every two or three pages. Next, factor in that there may be some characters introduced whose names I haven’t excluded from the stats, and it’s probably closer to one unknown word per two pages.

Then there’s 春色アストロノート, the story split across the Orange volumes. It’s about 135 pages of material, so that’s more than one look-up per page. (My stats above don’t factor in text density.)

The next three series I don’t know anything about. They’ve down in the 80%'s, so I’ll likely want to target learning their most frequent works before jumping into them.

When it comes to the kanji specifically (more important for series without furigana)

I also track stats for kanji.

I need to update my known kanji tracking to remove the ones I put into Anki then had become leeches.
Keeping that in mind, I get the following as recommended reads for kanji being as close to +1 as possible:

Series Left Unique Overall
日々蝶々 67 88.6% 97.76%
春色アストロノート 35 91.1% 97.06%
6月のラブレター 62 88.7% 96.70%
思い、思われ、ふり、ふられ 234 76.9% 96.66%
ひよ恋 172 81.5% 96.45%

Missing from this list is ひとりぼっちの○○生活, which has dropped down to position 19. I think if I looked up the names of all the main characters and excluded their kanji from the stats, it would rise up to a higher position.

日々蝶々 tops this list, showing there as being only 67 unknown kanji over the first three volumes. That means roughly one unknown kanji every eight pages.

Next, we have the return of 春色アストロノート. I plan to read this one after Orange, but if we have any one- or two-week breaks for holidays and/or to help people catch up (and if they don’t we kidnap their pets; sorry), I may slip reading chapters of this into the off-weeks.

I’m not familiar with 6月のラブレター and ひよ恋, but **思い、思われ、ふり、ふられ ** is the same mangaka as アオハライド.

I need to get back to learning new kanji (I accidentally closed Anki some weeks ago and instantly forgot to do my daily reviews since then…), but for now these stats have me looking to be in a comfortable position for a least a few manga series!

As for which one I’m likely to pick, ひとりぼっちの○○生活 volume two is unexpectedly (for me) a contender, but 日々蝶々 is in the top five for both lists so it’s getting closer and closer to being picked.

4 Likes

Those stats are pretty interesting, how you can have the fewest unknown words in one series, but it is much further down the list when looking at unknown kanji.

My plan for 春色アストロノート is probably to read all those chapters when I’ve caught up with the book clubs I’m behind on. Although I do find the character art of multiple chars so similar to ones in Orange, that I considered waiting to read it later, but at the same time, my Bookmeter stats are currently a sham because I’ve told it I finished Orange volume 1, but I haven’t read the chapter from 春色アストロノート. :joy:

From your stats, you have an easier way to find something that really fits the +1 parameter. While I personally have to guess or start reading to see.

I have some plans to use Koohi for specific series to ease the way into them. I wouldn’t use it for long, and certainly not until I “burned” all the words, but enough that I can have some extra help learning them (beyond immersion). I wouldn’t study the words much in advance of reading them (you can get the word lists in chronological order), 1-2 days max. And only study words that appear at least twice.

I quite look forward to doing that very targeted approach to SRS. WK plus Japanese language school plus reading has been good (and is good) to get me a broad base of vocabulary. But after I finish WK, I’m not interested in staying with such an broad approach. I want immersion/reading to be my main mode of study.

Sorry for the tangent, your posts are always so thought provoking I can’t stop myself. :slight_smile:

4 Likes

That’s one reason I didn’t jump right into another series from the mangaka of アオハライド.

Consider:

image image

Okay, so the bangs/fringe differs between the two, so there is that.

Next up:

image


(Why won’t Discourse let these two images be side-by-side like in the preview?)

The guy on the left wears an earring in one ear, but we can’t see it here to distinguish between them.

And how about:

image image

This isn’t a criticism. I like this mangaka’s character designs. And many mangaka have this same issue. Over in Sailormoon, Usagi and Minako practically have the same parents.

But it is something that makes me want to space the series out quite a bit between reading one and the next.

My system is mostly available for others if they don’t mind building up a list of their known words. But it has the big issues that 1) selection is limited and 2) there’s some manual work involved.

I’m sure in the next year, we’ll see a site such as Koohi or JPDB integrate manga into their existing system.

All it takes is for one person with a decent graphics card who doesn’t mind “sourcing” manga from somewhere for the sole purpose of extracting sentences/words into a database to run thousands of series through Mokuro, and they’ll likely attract a huge audience of manga-reading Japanese learners. (That sentence was unnecessarily long…)

I always advocate learning words in advance shortly before you read the material containing them because it might save you a look-up later and, more importantly, your brain makes a meaningful connection at the time of remembering it when seeing the word in context. (At least, based on my own personal experience.)

I don’t know if there’s any site out there where you can essentially say, “These are the words I know, this is what I’m reading, and here’s how far I am in it. What are the highest frequency words from the story that I’m likely to see the next time I read?” But it would be nice if there was.

If you want to try this with manga, the frequency lists on my site have the order as well (for the individual volume tabs, not for the combined series tab).

(Yes, I’m advertising my site now that I’ve added frequency lists =D )

3 Likes