frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Libpostal: C library for parsing/normalizing street addresses around the world

https://github.com/openvenues/libpostal
48•nateb2022•7h ago

Comments

jandrese•5h ago
Wow, ambitious project. Anybody who has tried to verify addresses can tell you that the staggering number of different formats and conventions around the world make it and almost intractable problem. So many countries have wildly informal standards and people putting down just whatever they want because the mailman "just knows".
monero-xmr•5h ago
Maxmind is the quintessential example of what devs want to build in their heart of hearts. Low-touch sales but b2b. Almost a monopoly. Prints money for decades. Not a public company so they never increase costs to a usurious amount. Open source never quite meets the level needed
Ameo•3h ago
I used this at a previous company with quite good success.

With relatively minimal effort, I was able to spin up a little standalone container that wrapped around the service and exposed a basic API to parse a raw address string and return it as structured data.

Address parsing is definitely an extremely complex problem space with practically infinite edge cases, but libpostal does just about as well as I could expect it to.

degamad•3h ago
Ditto - I was impressed with how well it handled the weird edge cases in our data.

They've managed to create a great working implementation of a very, very small model of a very specific subset of language.

degamad•3h ago
Previously:

<https://news.ycombinator.com/item?id=18775099> Libpostal: A C library for parsing/normalizing street addresses around the world - 117 points by polm23 on Dec 29, 2018 (25 comments)

<https://news.ycombinator.com/item?id=11173920> Libpostal: international street address parsing in C trained on OpenStreetMap (mapzen.com) 74 points by riordan on Feb 25, 2016 (7 comments)

RobinL•3h ago
There are many useful applications of libpostal, and it's an impressive library, but one I would caution against is for the purpose of address matching, at least as the 'primary' approach.

The problem is the hardest to parse addresses are also often the hardest to match, making the problem somewhat circular. I wrote about this more in a recent blog on address matching: https://www.robinlinacre.com/address_matching/

kleiba•1h ago
Relevant? -> "Falsehoods programmers believe about addresses" (https://www.mjt.me.uk/posts/falsehoods-programmers-believe-a...)

Discussed on HN here: https://news.ycombinator.com/item?id=8907301

weinzierl•1h ago
In the same vein, there is also Google's excellent libphonenumber for parsing, formatting, and validating international phone numbers.

And because I had no idea before I worked on a project where we had to deal with customer data: many companies also use commercial services for address and phone number validation and normalization.

Helm local code execution via a malicious chart – CVE-2025-53547

https://github.com/helm/helm/security/advisories/GHSA-557j-xg8c-q2mm
85•irke882•3h ago•29 comments

Is the doc bot docs, or not?

https://www.robinsloan.com/lab/what-are-we-even-doing-here/
33•tobr•1h ago•6 comments

RapidRAW: A non-destructive and GPU-accelerated RAW image editor

https://github.com/CyberTimon/RapidRAW
155•l8rlump•6h ago•58 comments

7-Zip for Windows can now use more than 64 CPU threads for compression

https://www.7-zip.org/history.txt
71•doener•1d ago•11 comments

AI, Power and Sociolinguistics [pdf]

https://www.researchgate.net/profile/Ico-Maly-2/publication/385703534_AI_power_and_sociolinguistics/links/6813618cdf0e3f544f502f05/AI-power-and-sociolinguistics.pdf
23•AntonioBarthes•1h ago•2 comments

Bootstrapping a side project into a profitable seven-figure business

https://projectionlab.com/blog/we-reached-1m-arr-with-zero-funding
519•jonkuipers•1d ago•112 comments

Show HN: I rewrote an outdated React Native map clustering library

https://github.com/suwi-lanji/rn-maps-clustering
13•hadat•2h ago•1 comments

Breaking Git with a carriage return and cloning RCE

https://dgl.cx/2025/07/git-clone-submodule-cve-2025-48384
314•dgl•15h ago•111 comments

I'm Building LLM for Satellite Data EarthGPT.app

https://www.earthgpt.app/
31•sabman•1d ago•3 comments

US Court nullifies FTC requirement for click-to-cancel

https://arstechnica.com/tech-policy/2025/07/us-court-cancels-ftc-rule-that-would-have-made-canceling-subscriptions-easier/
111•gausswho•10h ago•125 comments

Supabase MCP can leak your entire SQL database

https://www.generalanalysis.com/blog/supabase-mcp-blog
710•rexpository•15h ago•363 comments

Frame of preference A history of Mac settings, 1984–2004

https://aresluna.org/frame-of-preference/
101•K7PJP•9h ago•16 comments

Smollm3: Smol, multilingual, long-context reasoner LLM

https://huggingface.co/blog/smollm3
292•kashifr•17h ago•57 comments

Bug Stories

https://500mile.email/
10•thinkingemote•2h ago•1 comments

iPod Linux – Linux for Your iPod (2017)

http://www.ipodlinux.org/
40•nickysielicki•7h ago•11 comments

SUSE launches new European digital sovereignty service to meet surging demand

https://www.zdnet.com/article/suse-launches-new-european-digital-sovereignty-support-service-to-meet-surging-demand/
34•saubeidl•2h ago•0 comments

Radium Music Editor

http://users.notam02.no/~kjetism/radium/
205•ofalkaed•15h ago•45 comments

Brut: A New Web Framework for Ruby

https://naildrivin5.com/blog/2025/07/08/brut-a-new-web-framework-for-ruby.html
175•onnnon•15h ago•57 comments

Springer Nature book on machine learning is full of made-up citations

https://retractionwatch.com/2025/06/30/springer-nature-book-on-machine-learning-is-full-of-made-up-citations/
37•ArmageddonIt•2h ago•6 comments

Libpostal: C library for parsing/normalizing street addresses around the world

https://github.com/openvenues/libpostal
48•nateb2022•7h ago•8 comments

Swahili on the Road

https://www.historytoday.com/archive/behind-times/swahili-road
28•Thevet•8h ago•3 comments

Rules of good writing (2007)

https://dilbertblog.typepad.com/the_dilbert_blog/2007/06/the_day_you_bec.html
98•santiviquez•1d ago•70 comments

Introduction to Indian English

https://www.oed.com/discover/introduction-to-indian-english/
36•sandwichsphinx•1d ago•26 comments

Surfing on a Matchbox (1999)

http://news.bbc.co.uk/2/hi/science/nature/276762.stm
26•TMWNN•2d ago•8 comments

Taking over 60k spyware user accounts with SQL injection

https://ericdaigle.ca/posts/taking-over-60k-spyware-user-accounts/
208•mtlynch•5d ago•62 comments

Show HN: OffChess – Offline chess puzzles app

https://offchess.com
327•avadhesh18•1d ago•145 comments

Dynamical origin of Theia, the last giant impactor on Earth

https://arxiv.org/abs/2507.01826
89•bikenaga•15h ago•31 comments

Xenharmlib: A music theory library that supports non-western harmonic systems

https://xenharmlib.readthedocs.io/en/latest/
67•retooth•10h ago•7 comments

New Horizons images enable first test of interstellar navigation

https://www.newscientist.com/article/2486823-new-horizons-images-enable-first-test-of-interstellar-navigation/
39•jnord•2d ago•3 comments

Where can I see Hokusai's Great Wave today?

https://greatwavetoday.com/
68•colinprince•5h ago•55 comments