I built LexPrep during my Master’s in Cognitive Neuroscience because preparing lexical datasets for reading experiments was taking me hours every week.
Most NLP libraries I tried were powerful but too general for psycholinguistics workflows. I needed something focused on experimental stimulus control rather than large-scale text processing.
LexPrep currently supports:
• Syllable counting • Grapheme-to-phoneme (G2P) • Orthographic neighborhood calculation • Lexical statistics for stimulus control • Multi-language support (Persian, English, Italian)
The goal is reproducible and fast stimulus preparation for cognitive and reading experiments.
I’d love feedback from the HN community. especially around API design, performance, and potential integrations.