frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Agent Orchestration Is Not the Future

https://moridinamael.github.io/agent-orchestration/
1•mordymoop•2m ago•1 comments

What is Agent context engine

https://ragflow.io/basics/what-is-agent-context-engine
1•yingfeng•4m ago•0 comments

Tempest Future Fighter Aims for "Extreme Range," Twice F-35 Payload

https://www.twz.com/air/tempest-future-fighter-aims-for-really-extreme-range-twice-f-35-payload
1•throwoutway•7m ago•0 comments

Politics and the English Language – George Orwell [Essay]

https://www.orwellfoundation.com/the-orwell-foundation/orwell/essays-and-other-works/politics-and...
1•nomilk•10m ago•0 comments

Show HN: Vho – AST-based analysis for better AI refactoring of large codebases

https://vue-hook-optimizer.vercel.app/
2•huali•11m ago•1 comments

vLLM: An Efficient Inference Engine for Large Language Models

https://www2.eecs.berkeley.edu/Pubs/TechRpts/2025/EECS-2025-192.html
1•matt_d•16m ago•0 comments

Linuxulator on FreeBSD Feels Like Magic

https://hayzam.com/blog/02-linuxulator-is-awesome/
5•arch1e•16m ago•0 comments

Ask HN: What app features actually help vocabulary stick long-term?

1•hussein-khalil•17m ago•1 comments

Ask HN: Is there a better alternative to email?

1•DinakarS•23m ago•1 comments

AI Safety ArXiv Scraper

https://theguardrail.net/
2•chiwilliams•26m ago•0 comments

Translating Cave Story into Classical Latin with Gemini

https://www.semilin.dev/blog/doukutsu-translator
2•semilin•31m ago•0 comments

Show HN: I Made a Gamma Clone with 1 Prompt

https://prompt-to-ppt.lovable.app/
1•nsemikey•31m ago•1 comments

Cool project, will you actually maintain it?

https://www.pcloadletter.dev/blog/cool-project/
2•ronbenton•31m ago•0 comments

The State of LLMs 2025: Progress, Problems, and Predictions

https://magazine.sebastianraschka.com/p/state-of-llms-2025
1•nsainsbury•36m ago•0 comments

The Intelligent Universe: AI, ET, and the Emerging Mind of the Cosmos

https://www.setileague.org/reviews/intellig.htm
1•teleforce•40m ago•0 comments

Physics of Language Models: How to Build Versatile Pretrain Playgrounds [video]

https://www.youtube.com/watch?v=x3G8knjPDbM
1•gmays•42m ago•0 comments

Ask HN: Have you been falsely accused of AI-generated content?

5•bmaupin•43m ago•2 comments

What Becomes Valuable When AI Makes Creative Work Easy

https://every.to/p/what-becomes-valuable-when-ai-makes-creative-work-easy?p=c0fe0e66aa5670c292b26...
2•herbertl•43m ago•0 comments

I built my dream terminal based task manager

https://github.com/fashton28/silo
2•fashton28•43m ago•1 comments

Scorg Marketplace – Player-to-Player Trading for Star Citizen Items

1•legitcoders•45m ago•0 comments

Show HN: I built a Music-to-Video API

https://peakmv.com
2•gautamaj•47m ago•0 comments

Researchers spot Saturn-sized planet in the "Einstein desert"

https://arstechnica.com/science/2026/01/researchers-spot-saturn-sized-planet-in-the-einstein-desert/
3•pseudolus•49m ago•0 comments

Show HN: PPTX Native AI Slides

https://www.textdeck.com/home
1•andventures•50m ago•0 comments

Self-driving cars could prevent over 1M injuries across the US by 2035

https://techxplore.com/news/2026-01-cars-million-road-injuries.html
3•geox•53m ago•0 comments

On the quantum mechanics of entropic forces

https://arxiv.org/abs/2502.17575
1•kaycebasques•56m ago•0 comments

Ask HN: Are there any antifeature-free power tools you can still buy new?

4•josephcsible•58m ago•1 comments

AI Personas and Dolls

https://stephen.bochinski.dev/blog/2026/01/02/ai-personas/
2•sbochins•1h ago•0 comments

Obesity as a Behavioral Addiction

https://www.academia.edu/2997-9196/2/3/10.20935/MHealthWellB7880
3•red369•1h ago•2 comments

1964 New York World's Fair

https://en.wikipedia.org/wiki/1964_New_York_World%27s_Fair
2•teleforce•1h ago•0 comments

Show HN: Black Box QA testing system to automate QA process

https://www.rocksmith.ai/
2•orangeAvocad0•1h ago•0 comments
Open in hackernews

17 weird facts about the Hunspell dictionary format

https://zverok.space/blog/2021-03-16-spellchecking-dictionaries.html
2•bmacho•16h ago

Comments

jll29•15h ago
[Hunspell has been very successful as the OP correctly points out, and my comments are intended to improve over the state of the art rather than badmouthing the fantastic work of its authors, two of who are friends of mine.]

Hunspell uses an ad-hoc file format and an ad-hoc method. When the original code was developed in Ocaml at the time, it evolved to where we are today (one of the developers, VT, was sharing offices with me for a few years, so I am a past "ear witness" of sorts).

There is an opportunity now to rebuild something more systematic based on the XFST formalism originally devised at Xerox Research Center Europe in Grenoble under Prof. Lauri Karttunen, Kenneth Beesley and team [1]. Especially since Mans Hulden has re-created their toolset as FOMA, a C re-implementation that has been open sourced.

The beauty of XFST and friends is that it's a formalization of regular relations, the language generated and accepted by extended finite state transducers - a form of two-way automata. The XFST formalism leads to more readable/maintainable lexicons and rules, and it can also be used to generate, not just to analyze.

[1] https://www.amazon.com/Finite-State-Morphology-Kenneth-Beesl...

[2] https://dsacl3-2018.github.io/xfst-demo/ and others (simplay search for e.g. "xfst|foma fst")

[3] Hulden, Mans (2008) https://aclanthology.org/E09-2008/ (A Python interface already exists, too: Hulden, M. et al. (2024) https://aclanthology.org/2024.acl-demos.24/ .)

There are many training resources for the XFST family of formalisms, and it is taught in computational linguistics courses around the world [2]. There is also tool support in the form of e.g. syntax coloring support for vim https://www.vim.org/scripts/script.php?script_id=3441 etc. - all this would make the set of potential contributors for a future version of the spell checker vastly larger (compared to requiring interested parties to analyze an obscure ad-hoc format). It would also open up future possibilities for new functionality in Open Office - e.g. the generation capability could be used to offer a button "pluralize word".