Manga Kotoba: Manga Frequency Lists and Stats

Maybe someone added the Natively links to the database so they would match up on the next attempt :wink:

Especially surprising as it’s one I own.

There isn’t, as that’s not my recommended way of tracking known words (due to there being words you may not know), but I can give thought to an implementation (something with warnings that you may be marking unknown words and misparsed words as known). Maybe an option to “mark as known all works from this volume with a frequency of n or higher” allowing the user to include all the way down to words that appear only once.

Although the design of the site is based on my personal expected workflow, there’s a saying how a product isn’t what it’s designed for but rather how it’s used, so it’s good to lean into that (within reason).

3 Likes

magic :magic_wand: haha

my own use case probably isn’t fully aligned with how it’s all supposed to work … but the most recent post about being able to search words across your library inspired me a little :wink:

I found the issue.

If you have volumes like this:

Vol 1: Finished
Vol 2: In Progress
Vol 3: Owned

…the import is marking the series as “Owned”, so it appears in the owned library page.

I’ll see about fixing this so certain statuses get priority over others in the import.

1 Like

kewl… may have found something else as well related to importing/linking

I know this was already added

but on the import it’s still reporting not in DB.
I have tried clearing the csv and then doing a new import and same thing… somewhere this book isn’t linking correctly mayhaps?

1 Like

Manga Kotoba currently only supports links to Natively series pages, meaning single volumes cannot be linked to. It’s on my to-do list to think of how I want to implement. I might have to give it a bit more thought now that there’s a tool that can benefit from that information/linking.

1 Like

that makes sense… with natively kind of coming and going lately? not sure what’s going to happen to the site… hopefully Brandon comes back but there are quite a few things that have been broken lately and a huge backlog of books to be added.

Thinks probably generally this because I’m realizing I have Death Note and Non Non Biyori but I know those are already in MK…

anyway don’t have anymore time to play right now

I did send over one more mokuro file but realizing it’s going to take a long time to go through this list of 700 and cross check with MK to see if they are already there … so for now… will have to send what I can piecemeal… until then… gotta run

1 Like

The good news:

I’ve updated the import logic so it detects if a volume is “in_progress”, then it will set the series as “Reading”. Likewise, if a series have volumes that comprise of only “finished” and “owned” with at least one of each, it will default to “Paused” (as the reader is between volumes).

The bad news:

You can’t re-import volumes that match between the Natively file and Manga Kotoba, making it impossible to apply the new logic to your account through the importer (outside of my wiping out your statuses and letting you re-import).

I’m sure you wouldn’t mind that, but I went with,

The good news:

I did the import on my local database, then just applied it to your account on the site, so you should immediately see the corrected series statuses, with some items “Reading”, some “Paused”, some “Owned”, and some “Finished”.


If he does, this is the absolute best time for it.

I figured implementing the Natively data import into Manga Kotoba would have taken me a good week to fully implement.

But with AI tools, most of it was completed in no time. My main time was spent reviewing code for anything that looked amiss or poorly optimized, testing, prompting for changes and fixes, and deploying.

Is yours the individual volumes release or the “everything in a single volume” release?

Looks like there’s an incomptability in sort order between Natively and Manga Kotoba because Natively’s numbers (understandably) do not line up with the volume numbers after a certain point:

Book Title Series Order
のんのんびより 7 7
のんのんびより 8 8
のんのんびより 8.5 公式ガイドブック 9
のんのんびより 9 10
のんのんびより 10 11

Outside of utilizing logic to try to guess the volume number from the title, there’s no workaround for this. (Well, I do have a function to do just that. I’d just have to implement it…)

Along those lines, I have りめんばー as a “final volume” of the series, whereas Natively has it as a separate (single-volume) series. Since I haven’t read through the whole series, I can’t say which is the better decision. (BookWalker puts Remember as its own separate series.)

Or mass upload them (the form currently allows up to 20 MB per upload) and let the site’s owner deal with it.

You could optionally zip them all up and provide them via Discord (but then you don’t get in-site notifications of when they get added, in that hidden-away-in-the-dashboard notifications section). You would get updates on additions via Discord, though.

1 Like

haven’t looked yet but it’s good to be a guinea pig :smiley: worst case we wipe out it all if we have to and start fresh… probably fine w/o that though

It’s physical and I have the special edition with everything in one but I also have a sep set (which has a few vol in each but it’s not the 1 book = 1 volume)… I’d honestly have to find time to dig out the physical and I can share the info with you if you would like

any implementation too should be for the good of all not just my wonky database file :wink: I don’t know how much of my database file is not necessarily correct because books haven’t been getting approved for months… so wouldn’t stress too much…

haha I already feel a little guilty sending you down a rabbit hole from hell… It’s just some text with a comma delimiter how hard can it be muhahaha :rofl: (everything is always more complicated than it appears)

I don’t want to dump all of them on you though, that just seems downright evil LOL :smiling_face_with_horns:

I could do this but it’s 1250 volumes and if I search the *.mokuro it works out to 311 files about 120 mb… getting probably too big for discord even… and then you’d also have to deal with duplicates and sorting … seems like a pain…

and some of them I should probably hold back .... (for now)... want to stick with the regular commercial stuff first... and any others it's more BL centric so ... depending on the amount of fine art not sure how much you want/need in MK :innocent:

I will note that, for the time being, I’m mostly trying to avoid anything people would reasonably find “objectionable”. Eventually, I’d like to add options so one can block series matching specific genre/tags (with series for some tags not shown without an account and opt-in). But that’s not a priority, so the general request is to contribute only series you’d feel comfortable reading in public.

Granted, I do have vocabulary lists up for Love Hina.

No need to worry about that.

I’d say if you only wanted to get an idea of what words appear in the series, etc. mark the release that is on Manga Kotoba as owned (series and volumes).

I was thinking the same. Just need to do some implementation and testing and reviewing results and testing… Gotta home I remember later this week to give it a try.

1 Like

“you” being me or an average person haha

no worries :wink: the 311 would probably end around 150 ish… still a lot to sort through and assuming I didn’t screw up the folder organization, already found one out of place the other day

But I’m certain I have a lot of data that isn’t there that will help up the vocab database

edit----

left a present… filling in some missing sets still have quite a bit more to comb through but wasn’t able to upload multiple files at once (3 or 4) no where near any size limit so either I’m doing something wrong or tripped over another bug.

Guinea pig warning: There was an issue with an import (hopefully already fixed in a code update yesterday, as I cannot reproduce it). It caused some series to get a reading status even though there were no volumes with reading statuses. Since everything imported from the Natively file should have a volume reading status, I went ahead and wiped out the series reading statuses that had no volume statuses.

My to-do list for after work today:

Implement volume number detection on import, as Natively’s sort (like Manga Kotoba’s sort) is not intended to reflect a volume number.
Update contributed items admin interface to resolve some minor issues.
Go through all pending contributions.
Bonus Task: When a library status has multiple pages, the number of series in that status needs to reflect the total for that status, not just the total displayed on the page.

4 Likes

not caffinated yet… not sure what this means
so gotta work but later tonight I should do a fresh import and report back? or… ?

No action necessary on your part.

1 Like

leaving you a lot more presents hehe

but one at a time SHM… clicky backy clicky backy … i could zip and send but this gives me something to complain about :rofl:

edit…

yeah think I got everything that should be regular or SFW

I’m sure there are others but either haven’t read them yet or know enough so will have to add those as I come across… and of course not everything is digital either (not everything is mokuro)

I cannot tell if this is a dumb question or not but is there a way to search MK for just a particular word and have it tell me which manga it appears in directly, (w/o having to go to a list and select it?)

say I find a word that is interesting or know I’ve seen it elsewhere while I’m reading…if wanted to search which manga I ran across it before and where else it might show up in my unread?

I don’t have an interface for a general word search, but you can edit the URL:

https://manga-kotoba.com/word/WordGoesHere

Just replace “WordGoesHere” with a word and it’ll bring up that word’s entry.

1 Like

very kewl…now I need to mark a million words known ugh… though I’ll wait until most of this initial bit is done just in case need to nuke it all and start again

One option, for anyone wanting to go the route of “mark everything is known from manga I’ve read” would be:

  1. Bring up a series page for a series you’ve read.
  2. Click on the “Export Wordlist” link.
  3. Maybe enter a higher “Minimum Frequency” to focus on words that appear more commonly.
    • The output is a tab-delimited list of words and frequency counts.
  4. Copy and paste the list into a spreadsheet, then copy just the words column.
  5. Go to the “Import Words” and paste the word list in.

When you “Submit” a list of known words, anything that Manga Kotoba can match up to a single dictionary entry will be automatically marked as known, so word-by-word clicking necessary. Any words that match up to multiple dictionary entries are listed for manual confirmation, but if you’re aiming for the maximum result with minimum effort, I’d skip over those.

(For users aiming to utilize the site specifically for finding which words to focus on learning next, my recommendation remains to just manually known words now and then over time.)

1 Like

kewl I’ll try it right now … and oh i see stuff magically appearing! woo hoo

oh this only works by series? not just by vol w/in the series?

wait how do you do this? how do you choose a min freq hmmm I’m lost :face_with_spiral_eyes:

going here:

but then I’m here

settings doesn’t have anything for min freq…and if I click vocab json… I get this mess

and I panic and cry :smiling_face_with_tear:

:rofl:

never figured out the word list exporting thing… ??? :cry:

also were you able to import all the mokuro files?
the list doesn’t show up anymore so cannot tell (don’t think some of them made it - but didn’t want to go back through the giant list and start over)