Working on a SRS Site for Grammar (unlike bunpro)

leoluan · August 28, 2024, 2:12am

We are building a Japanese grammar learning application through open-ended practice and consistent memorization. Our application will use a spaced repetition system (SRS) and AI to enable our users to create sentences and receive instantaneous feedback.

User Journey: After creating an account, the user will take a placement test to determine their JLPT grammar level. After determining a JLPT level, the students will begin the learning, review, and master review workflow.
Learning Workflow: The student will first learn the new grammar points for each sublevel. The student will create one sentence given a situation for each grammar point until correct. The grammar point then gets moved into the level review pile.
Level Review Pile: According to the timing of the SRS, the student will review the available on-level grammar points in this window. The student will create one sentence given a situation for each grammar point. The student’s SRS score for the given grammar point will be adjusted accordingly depending on whether they create a grammatically correct sentence.
Master Review Pile: Once a grammar point has been progressed to Guru and finished in Level Review, it goes into the Master Review Pile. In this pile, your practices of making sentences from situations are consistent, but with a grammar bank of 3 functionally different grammar points (one of them being the correct one). You finish the grammar points until they are burned in this practice pile.
History Page: Lists past learning/review sessions, including previously inputted sentences along with associated feedback and corrected sentences. All “session objects” (sentences with feedback and corrections) will be sortable by grammar point and accessible from the practice page.
Wordbank: On the Review Pages page, appropriate vocabulary with translations for each given situation will be available. This will prevent students from mistranslating and using incorrect words when searching for vocabulary they do not know.

We’re a small team but we’re making progress on a daily basis. We post weekly updates in the discord and are looking to hear your thoughts. We don’t have a website yet, but we plan to have one up in the near future.

Wait…AI???

(Yes, we are aware of the shortcomings of AI, such as its occasional unnaturalness. However, we believe there is much to gain from using AI, such as correctly evaluating grammatical accuracy (especially for the N5-N3 level grammar points we are focusing on initially). To extract the best performance from our AI, we use highly engineered prompts that consider the context (situation) and the given grammar point being evaluated and then separate its evaluation of grammatical accuracy from its evaluation of naturalness. We are confident in the grammatical accuracy evaluation for N5-N3, but have also gotten promising results from the naturalness evaluation)

seanblue · August 28, 2024, 2:22am

So what do you have? No website or app yet, so any screenshots of the WIP you can share?

What does this mean exactly? That the AI will determine whether a sentence or phrase from the user is grammatically correct / natural? Do you have a native Japanese speaker involved to validate this process?

Also, what does “unlike bunpro” in the topic title mean since they also do grammar SRS.

leoluan · August 28, 2024, 2:30am

We do have screenshots, tho a bit outdated as I don’t have the most recent localhost.

The AI will first identify whether the phrase is grammatically correct and then separately evaluate the naturalness of the language. The SRS score is incremented based on the AI’s evaluation of grammatical accuracy. We do have native Japanese speakers involved in validating outputs, and it so far has been successful in regards to N5 in particular.

Yes, there are some text rendering error in the Grammar Point text. Sorry.

moffitt · August 28, 2024, 2:52am

Honestly, this sounds really promising.

While AI can be a bit of a taboo topic depending on its use (and I can understand having reservations here), I feel like it’d be a fantastic starting point for self learners, particularly those who don’t have a native Japanese speaker to regularly talk with and correct sentence structure/formality issues. Is AI perfect? No, but I think it could be a decent starting point. I certainly wouldn’t rely on it exclusively, but could see myself using it to start getting the hang of things.

Really keen to see how this progresses!

Edit to add:

This is good to know.

seanblue · August 28, 2024, 3:05am

I feel like with the current AI trend being LLMs, AI is often a terrible tool for self learners since they tend to make stuff up, but confidently present it as fact. So if you don’t already know something you have no way of knowing whether you can trust the output. But as I say, that’s really only a big problem with the LLMs. AI (specifically machine learning and neural nets) really come down to the training data.

Which raises the question: @leoluan can you give any insight into your training data? Type of data, size, etc.

Vanilla · August 28, 2024, 3:11am

When I heard unlike bunpro I had hopes that maybe it would be a comprehension based srs system rather than an output based one, but that doesn’t seem to be the case.

Sean blues comments about AI sums up my thoughts pretty well though so I guess I don’t have much to add. I personally didn’t like bunpro, and I can’t say I would see myself using this either (even if I was back at N5). Output and having AI evaluate it as part of an srs system seems very specific and dependent on the AI to actually be consistently right.

moffitt · August 28, 2024, 3:15am

I definitely agree with this, which is why I can understand the reservations. I’m a software developer by trade, so I’ve seen my fair share of questionable AI output haha. You’re right that it’s very problematic when you have no way of knowing if what it’s saying is correct. That being said, I’ve also seen how helpful it can be in certain scenarios so I guess I’m a bit of an optimist. AI is being honed at a terrifying speed, so while I don’t think it’ll ever be perfect, I do think it has a lot of room to grow from what we currently know.

This is key. Without some sort of way to prove the AI being used is outputting factual information (and not just a certain % of the time), then it does make the whole thing moot as much as I would love to see an idea like this be successful. But I don’t think it’s impossible.

leoluan · August 28, 2024, 3:18am

What would that entail? I am not sure I understand the difference between an output-based SRS and a comprehension-based SRS.

leoluan · August 28, 2024, 3:25am

We are using a prompted GPT4o, so the training data is currently out-of-the-box GPT4o. However, we are currently compiling training data from situations, grammar points, sentence in, and feedback combinations where the feedback is done by a native Japanese teacher. We now have 300 example sets that we will fine-tune GPT4o with.

taiyousea · August 28, 2024, 3:27am

The most basic explanation would be that output asks “what should go in this blank?” where comprehension gives grammar in context and then asks “what does this mean?”

Vanilla · August 28, 2024, 3:46am

Essentially instead of the card testing if you can produce Japanese to fit a certain sentence, you would be given a sentence in Japanese and tested over whether or not you could comprehend it

seanblue · August 28, 2024, 3:46am

So combining this with your previous statement about getting promising results for N5-N3, you’re claiming to get those results from vanilla GPT-4o? I will concede that I don’t know anything about using engineered prompts, but I find this hard to believe from my previous experience testing Chat-GPT and seeing bad grammar explanations shared by others.

Vanilla · August 28, 2024, 3:47am

Just like normal GPT but only gives sassy answers in regards to Japanese

seanblue · August 28, 2024, 3:53am

All responses include subtle reference to NSFW LNs.

Vanilla · August 28, 2024, 3:54am

My little language model can’t be this optimized

ctmf · August 28, 2024, 5:07am

From my little understanding of AI, it seems like “give me a grade for natural-ness and meaning-accuracy of this sentence” is more suited to what it could do, rather than correct/incorrect or “no, like this”. I’m not really seeing how SRS gets wedged into here naturally without a clear way of being “wrong”. (Ironic, because that’s usually how I feel about AI)

It would be more like what a human teacher would do, making you produce grammatical output and giving tips. (Which someone who disagreed or was confused could then follow up with a real person later)

Which does sound interesting to me and would be different from bunpro. The way it’s described in the post doesn’t compel me, but maybe I’d try it out (because why not)

Remun · August 28, 2024, 5:18am

I wish bunpro was more flexible with the grammar. An output based srs is so rigid and doesn’t allow you to make mistakes and it requires a certain answer.
But in the real world grammar is much more flexible and it works as long as it is comprehensible.
I quit bunpro because of this.
PS:This is just what I felt using bunpro and grammar textbooks.

leoluan · August 28, 2024, 4:34pm

Even though our SRS is output-based, we accept various answers as long as they are grammatically correct to a situation/grammar point pair. This flexibility is one of the key differences between Bunpro and us.

leoluan · August 28, 2024, 4:36pm

This is an idea we are experimenting with in Learn and On-level reviews: doing a given grammar point situation exercise and correcting your input sentence with feedback until it is correct grammatically. We will probably implement an adjusted version of this that is not too resource-intensive but allows students to make corrections in the earlier SRS stages of learning.

HA472 · August 30, 2024, 6:51pm

Just FYI: Bunpro does have a reading review type. See the “Quality of Life Updates” post below from the Bunpro Community: ✨ Quality of Life Updates - Bunpro - Bunpro Community

Reading Review-Type

For those of you who are frequent users of Cram, this new review type should be something you’re already familiar with! The feedback and love that the addition of the Reading Review Type received encouraged us to add the ‘Reading’ Review Type option to the main Review system.

This new type works by showing the user the full sentence (with answer included) and then having them self-grade themselves based upon whether or not they got it correct. This type will be quite effective for those who want as much immersion as possible, since by default it will only show you Japanese and no English. A user may find themselves being able to convert English into Japanese for their normal reviews but still have trouble with the same item in question when they are immersing in native content.

We hope that this new type can be of great aid to both the users that requested this and to anyone else that thinks this will be a good addition for their study toolbox

reading-mode-12612×1116 131 KB

reading-mode-22608×1106 176 KB

I (HA472) don’t personally use this setting, but this is what it looks like in my review settings:

Topic		Replies	Views
How do ya'll actually study grammar? Grammar	42	1837	November 21, 2024
Japanese Grammar App Japanese Language	32	4035	December 26, 2019
Finding a "right" way to study English → Japanese Speaking	33	2077	October 22, 2021
文プロ(Bunpro): New Grammar - July 31st, 2025 - Japanese Grammar and Vocab SRS Grammar	3135	245038	July 31, 2025
What is Your Recommended Way of Learning Grammar? Requesting Help	42	9633	April 12, 2021

Working on a SRS Site for Grammar (unlike bunpro)

Related topics