I explored memory models for spaced repetition in my master's thesis and later built an SRS product. This post shares my thoughts on content-aware memory models.
I believe this technical shift in how SRS models the student's memory won't just improve scheduling accuracy but, more critically, will unlock better product UX and new types of SRS.
I've been playing with something similar, but far less thought out than what you have.
I have a script for it, but am basically waiting until I can run a powerful enough LLM locally to chug through it with good results.
Basically like the knowledge tree you mention towards the end, but attempting to build a knowledge DAG by asking an LLM "does card (A) imply knowledge of card (B), or vice versa?" Then I take that DAG and use it to schedule the cards in a breadth-first ordering. So when reviewing a new deck with a lot of new cards, I'll be sure to get questions like "what was the primary cause of the Civil War?" before questions like "who was the Confederate general who fought at Bull Run?"
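A minimal sketch of that scheduling step, assuming the per-pair LLM answers have already been collapsed into a boolean `prereq(a, b)` ("should card a be learned before card b?") -- the function name and edge direction are my own framing:

```python
from collections import defaultdict, deque

def prerequisite_order(cards, prereq):
    """Order cards so prerequisites come first, level by level (BFS).

    `cards` is a list of card ids; `prereq(a, b)` stands in for the LLM
    query and returns True if a should be learned before b. An edge
    a -> b means a is a prerequisite of b.
    """
    edges = defaultdict(set)
    indegree = {c: 0 for c in cards}
    for a in cards:
        for b in cards:
            if a != b and prereq(a, b) and b not in edges[a]:
                edges[a].add(b)
                indegree[b] += 1

    # Kahn's algorithm: repeatedly emit cards with no unmet prerequisites.
    queue = deque(c for c in cards if indegree[c] == 0)
    order = []
    while queue:
        card = queue.popleft()
        order.append(card)
        for nxt in edges[card]:
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                queue.append(nxt)
    return order
```

This assumes the pairwise answers actually form a DAG; in practice the LLM will occasionally produce cycles, which need to be detected and broken before scheduling.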
What I like about your approach is that it circumvents the data problem. You don't need a dataset with review histories and flashcard content in order to train a model.
I've got a system for learning languages that does some of the things you mention. The goal is to recommend content for a user to read that combines 1) an appropriate level of difficulty with 2) usefulness for learning. The idea is to have the SRS built into the system, so you just sit and read what it gives you, and reviewing old words and learning new words (chosen by frequency) happens automatically.
Separating the recall model from the teaching model as you say opens up loads of possibilities.
Brief introduction:
1. Identify "language building blocks" for a language; this includes not just pure vocabulary, but the grammar concepts, inflected forms of words, and can even include graphemes and what-not.
2. For each building block, assign a value -- normally this is the frequency of the building block within the corpus.
3. Get a corpus of selections to study. Tag them with the language building blocks. This is similar to Math Academy's approach, but while they have hundreds of math concepts, I have tens of thousands of building blocks.
4. Use a model to estimate the current difficulty of each building block. (I'm using "difficulty" here as the inverse of "retrievability", for reasons that will be clear later.)
5. Estimate the delta in difficulty of each building block after it is viewed. Multiply this delta by the block's value to get its study value.
6. For each selection, calculate the total difficulty, average difficulty, and total study value. (This is why I use "difficulty" rather than "retrievability": it lets me calculate the total cognitive load of a selection.)
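The study-value steps above could be sketched like this. All the numbers, the block names, and the "halve the difficulty on viewing" update rule are made-up assumptions for illustration, not the actual model:

```python
# Hypothetical data: difficulty in [0, 1], value = corpus frequency.
blocks = {
    "logos":  {"difficulty": 0.2, "value": 300},  # common, well-known
    "agape":  {"difficulty": 0.7, "value": 120},
    "kardia": {"difficulty": 0.9, "value": 40},   # rare, nearly unknown
}

def difficulty_after_view(d, learning_rate=0.5):
    """Assumed update rule: viewing a block halves its difficulty."""
    return d * (1 - learning_rate)

def study_value(block):
    """Expected drop in difficulty, weighted by the block's value."""
    delta = block["difficulty"] - difficulty_after_view(block["difficulty"])
    return delta * block["value"]

def selection_stats(selection, blocks):
    """Aggregate difficulty and study value over one selection (a text)."""
    ds = [blocks[b]["difficulty"] for b in selection]
    return {
        "total_difficulty": sum(ds),             # total cognitive load
        "average_difficulty": sum(ds) / len(ds),
        "total_study_value": sum(study_value(blocks[b]) for b in selection),
    }
```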
Now the teaching algorithm has a lot of options. It can calculate a selection score that balances study value, difficulty, and repetitiveness. It can take the word with the highest study value and look for selections containing that word. Or it can take a specific selection that you want to read or listen to, find the most important word in it, and then look for other material that reinforces that word.
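A toy version of such a selection score -- the linear form, the weights, and the target difficulty are all my invention, just to show the kind of trade-off being balanced:

```python
def selection_score(total_study_value, average_difficulty,
                    target_difficulty=0.5,
                    value_weight=1.0, difficulty_weight=50.0):
    """Score a selection: reward study value, penalize being too far
    from a comfortable average difficulty. Weights are arbitrary."""
    comfort_penalty = abs(average_difficulty - target_difficulty)
    return value_weight * total_study_value - difficulty_weight * comfort_penalty
```

The teaching algorithm would then simply rank candidate selections by this score and serve the top one; a repetitiveness term could be added as another penalty on recently seen material.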
You mentioned computational complexity -- calculating all of this from scratch certainly takes a lot of work, but the key observation is that each time you study something, only a handful of values change. That makes it possible to keep everything up to date very efficiently using incremental computation [1].
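One way to picture that incremental update -- a hand-rolled sketch, not the general-purpose framework the [1] link presumably describes -- is an inverted index from building block to the selections containing it, so a difficulty change patches only the affected totals:

```python
from collections import defaultdict

class IncrementalTotals:
    """Keep per-selection total difficulty current as single blocks change."""

    def __init__(self, selections, difficulty):
        self.selections = selections        # selection id -> list of block ids
        self.difficulty = dict(difficulty)  # block id -> difficulty
        self.totals = {sid: sum(self.difficulty[b] for b in bs)
                       for sid, bs in selections.items()}
        self.index = defaultdict(list)      # block id -> selection ids
        for sid, bs in selections.items():
            for b in bs:
                self.index[b].append(sid)   # one entry per occurrence

    def update_difficulty(self, block, new_d):
        """O(#selections containing `block`), not O(#all selections)."""
        delta = new_d - self.difficulty[block]
        self.difficulty[block] = new_d
        for sid in self.index[block]:
            self.totals[sid] += delta       # applied once per occurrence
```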
I've got several active users without really having done any advertising; I'm working on revamping the UI and redesigning the website before I do a big push and start advertising. Most of the people using the site have learned Biblical Greek entirely through the system.
There are experimental ports to Korean and Japanese as well, but those (along with the Mandarin port) aren't public yet. The primary missing pieces are:
1. Content: the system relies on having large amounts of high-quality content. Finding it, tagging it, and dealing with copyright will take some time.
2. On-ramp: it works best at helping intermediate learners advance, but if you start at an intermediate level, the system doesn't yet know what you know.
Another thread I'm pursuing is exposing the algorithm via an API to other language-learning apps.