frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Content-Aware Spaced Repetition

https://www.giacomoran.com/blog/content-aware-sr/
41•ran3000•2h ago

Comments

ran3000•2h ago
I explored memory models for spaced repetition in my master's thesis and later built an SRS product. This post shares my thoughts on content-aware memory models.

I believe this technical shift in how SRS models the student's memory won't just improve scheduling accuracy but, more critically, will unlock better product UX and new types of SRS.

IncreasePosts•1h ago
I've been playing with something similar, but far less thought out than what you have.

I have a script for it, but am basically waiting until I can run a powerful enough LLM locally to chug through it with good results.

Basically like the knowledge tree you mention towards the end, but attempt to create a knowledge DAG by asking a LLM "does card (A) imply knowledge of card (B) or vice versa". Then, take that DAG and use it to schedule the cards in a breadth first ordering. So, when reviewing a new deck with a lot of new cards, I'll be sure to get questions like "what was the primary cause of the civil war", before I get questions like "who was the Confederate general who fought at bull run"

ran3000•1h ago
I'd love to see it.

What I like about your approach is that it circumvents the data problem. You don't need a dataset with review histories and flashcard content in order to train a model.

pessimizer•1h ago
> [....] Ignoring the following factors means we are leaving useful information on the table:

> 1. The review histories of related cards. Card semantics allow us to identify related cards. This enables memory models to account for the review histories of all relevant cards when estimating a specific card’s retrievability.

> 2. [...]

I've been thinking that card semantics shouldn't be analyzed at all, and just treated as a black box. You can get so much data off of just a few users of a flashcard deck that you could build your own map of the relationships between cards, just by noticing the ones that get failed or pass together over time. Just package that map with the deck and the scheduler might get a lot smarter.

That map could give you good info on which cards were redundant, too.

edit: this may be interesting to someone, but I've also been trying to flesh out a model where agents buy questions from a market, trade questions with each other, and make bets with each other about whether the user will be able to recall the question when asked. Bankrupt agents are replaced by new agents. Every incentive in the system is parameterized by the user's learning requirements.

ran3000•1h ago
Yes, that reminds me of knowledge tracing and methods like 1PL-IRT.

I think you can do both and get even better results. The main limitation is that the same flashcards must be studied by multiple students, which doesn't generally apply.

I also love the idea of the market, you could even extend it to evaluate/write high-quality flashcards.

rahimnathwani•1h ago
You mention that FSRS treats each card independently, even if they derive from the same note. I wonder whether you've tried this Anki plugin, which tries to increase the interval between reviews of 'sibling' cards: https://ankiweb.net/shared/info/759844606
ran3000•54m ago
Ah, I totally missed this, thanks for sharing it.

Since in Anki the "note" is the editing unit, that works for some cloze deletions but not for QA cards (only for double-sided QA cards). A content-aware memory model would allow you to apply "disperse siblings" to any set of cards, independently of whether they were created together in the same editing interface.

joshdavham•59m ago
I've been thinking about this for a while too as an FSRS developer [1].

In general, we can think of a spaced repetition system as being (i) Content-aware vs. Content-agnostic and (ii) Deck-aware vs. Deck-agnostic

Content-aware systems care about what you're studying (language, medecine, etc) while Content-agnostic systems don't care about what you're studying.

Deck-aware systems consider each card in the context of the rest of the cards (the "deck") while Deck-agnostic systems consider each card in pure isolation.

Currently, FSRS is both Content-agnostic as well as Deck-agnostic. This makes it extremely easy to integrate into a spaced repetition system, but this also means the model will underfit a bit.

It it interesting to note that you could in practice optimize seperate FSRS models for each deck covering different topics, which would make it Content-aware in a sense. Additionally, "fuzz" is a somewhat Deck-aware feature of the model in that it exists specifically to reduce interactions between other cards in the deck.

[1] https://github.com/open-spaced-repetition/py-fsrs

ran3000•41m ago
Being easy to integrate is an underappreciated feature of FSRS.

Using decks to draw semantic boundaries is likely overly constraining. I think we want to account for finer differences between cards. Decks are coarse and people differ in the ways they use them, some people recommend having just one global deck. Notes are too fine. We explored something in between: a note capturing an idea or concept, plus an associated set of cards. Turns out it's hard to draw idea boundaries. That's why I think it's easier to relate cards by semantic embeddings or more rigid but clearer structures, like the DAG of dependencies suggested elsewhere in this thread.

daft_pink•28m ago
After reading this, I would really like to know what other spaced repetition software there is for things like ai driven speech?

I love Anki and used it before when I needed to memorize things, but would love to know what other options on the market exist.

I built a collection of simple Python projects for beginners (CLI,GUI,Web,API)

https://github.com/Efeckc17/simple-example-projects-in-Python
2•toxi360•8m ago•1 comments

NASA to announce nuclear reactor on the moon

https://www.politico.com/news/2025/08/04/nasa-china-space-station-duffy-directives-00492172
2•standardUser•8m ago•3 comments

Learning Programming Languages Efficiently

https://www.flyingmachinestudios.com/programming/learn-programming-languages-efficiently/
1•lemonberry•8m ago•0 comments

Show HN: Tab'd – Track and share AI and clipboard operations within your IDE

https://github.com/iann0036/tabd
1•iann0036•9m ago•0 comments

Swimming in urban waterways across the world should be a right, say campaigners

https://www.theguardian.com/lifeandstyle/2025/jun/27/swimming-urban-waterways-campaign-swimmable-cities
2•edward•11m ago•0 comments

The Loop Is Back: Why HRM Is the Most Exciting AI Architecture in Years

https://medium.com/@gedanken.thesis/the-loop-is-back-why-hrm-is-the-most-exciting-ai-architecture-in-years-7b8c4414c0b3
1•Davexon•11m ago•0 comments

Efficient implementations of Dion and Muon optimizers for distributed ML

https://github.com/microsoft/dion
1•simonpure•12m ago•0 comments

The Myth of American Meritocracy [pdf]

https://www.theamericanconservative.com/pdf/The%20Myth%20of%20American%20Meritocracy-Unz.pdf
1•rd•13m ago•1 comments

What's New in MathJax v4.0

https://docs.mathjax.org/en/latest/upgrading/whats-new-4.0.html
1•xworld21•14m ago•0 comments

Markdown Monster

https://git.sr.ht/~xigoi/markdown-monster/blob/master/monster.md
1•xigoi•14m ago•0 comments

Rivian sues to sell its EVs directly in Ohio

https://techcrunch.com/2025/08/04/rivian-sues-to-sell-its-evs-directly-in-ohio/
3•impish9208•17m ago•0 comments

Session Mapping: Novel approach to context optimization

https://medium.com/@carlosdesantiago/session-mapping-lightweight-traceable-context-d286a101231e
1•CarlosD•17m ago•0 comments

MQFQ-Sticky: Fair Queueing for Serverless GPU Functions

https://arxiv.org/abs/2507.08954
1•PaulHoule•17m ago•0 comments

vp.net: VPN with hardware-enforced cryptographic privacy through Intel SGX

https://vp.net/l/en-US/technical
1•akyuu•18m ago•0 comments

You can run small HuggingFace LLMs on iPhone

https://github.com/Q2-Development/q2-edge-chat
1•Michaelgathara•19m ago•0 comments

Inside the collapse of Builder.ai: Was it even an AI company?

https://restofworld.org/2025/builderai-ai-apps-downfall/
1•runningmike•23m ago•0 comments

Show HN: Keyglide – Competitive Online Text Editing

https://keygli.de
1•townload•26m ago•0 comments

What Does One Billion Dollars Look Like?

https://whatdoesonebilliondollarslooklike.website/
2•alexrustic•30m ago•0 comments

From 2D to 3D: How AI Transforms Images into Models

https://www.amodeling.com/blog/2d-to-3d-technology
1•Jimmy6929•32m ago•0 comments

Lying in Wait: Uncovering Hidden Threats in Open Source Software

https://www.striderintel.com/lying-in-wait-report/
1•Dowwie•32m ago•0 comments

Newelle, AI "Assistant" for Gnome, Hits Version 1.0

https://www.omgubuntu.co.uk/2025/08/newelle-ai-assistant-ubuntu-linux-desktop
3•kertoip_1•33m ago•0 comments

Show HN: Discomfort – Control ComfyUI with Python

https://github.com/Distillery-Dev/Discomfort
1•felipeinfante•34m ago•0 comments

The Domestic Revolution, by Ruth Goodman

https://www.thepsmiths.com/p/review-the-domestic-revolution-by
2•baud147258•36m ago•0 comments

The rise of AI tools that write about you when you die

https://www.washingtonpost.com/technology/2025/08/03/ai-obituaries-funeral-homes/
2•bookofjoe•36m ago•1 comments

'Clash of Trades' Reality Show Aims to Boost Prestige of US Manufacturing

https://www.nytimes.com/2025/08/04/business/clash-of-trades-manufacturing.html
1•ianrahman•36m ago•0 comments

Is It FOSS?

https://isitreallyfoss.com/
2•exiguus•38m ago•0 comments

History of Mapping

https://maphappenings.com/history/
1•marklit•39m ago•0 comments

I built a decentralized message board as a blockchain experiment: 64board

https://64board.com/
1•timmoth_j•42m ago•1 comments

Sth Is Wrong

1•victor_cl•46m ago•3 comments

Palantir tops $1 billion in revenue for the first time

https://www.cnbc.com/2025/08/04/palantir-pltr-q2-earnings-2025.html
2•mgh2•47m ago•1 comments