frontpage.

Show HN: Name-classifier – infers attributes about a person from a name

https://github.com/douglas-larocca/name-classifier

2•defgeneric•2h ago

This was based on an old project that I resurrected, improved, and repackaged with claude code.

It's useful for estimating demographics in large datasets from limited information, i.e. just a name.

It's also fairly good at separating given name and family names across a wide variety of languages and contexts.

Contains a standalone binary, embeddable shared lib, and a python wrapper.

Examples

CLI:

user@box » ./build/name-classifier -j -c "Carlos Eduardo Fernando Salazar Montemayor" | jq .

{ "input": "Carlos Eduardo Fernando Salazar Montemayor", "script": "latin", "components": [ { "token": "Carlos", "role": "given", "index": 0, "surname_score": 0.009 }, { "token": "Eduardo", "role": "given", "index": 1, "surname_score": 0.001 }, { "token": "Fernando", "role": "given", "index": 2, "surname_score": 0.01 }, { "token": "Salazar", "role": "family", "index": 3, "surname_score": 0.998 }, { "token": "Montemayor", "role": "family", "index": 4, "surname_score": 0.975 } ], "attributes": { "gender": { "male": 0.9938, "female": 0.0062, "neutral": 0 }, "origin": { "english": 0, "french": 0, "germanic": 0, "nordic": 0, "iberian": 1, "italian": 0, "eastern_european": 0, "arabic": 0, "east_asian": 0, "south_asian": 0, "southeast_asian": 0 } }, "calibrated": true, "model_version": "embedded", "provenance": { "gender": { "lexicon": 0.598, "ngram": 0.302, "neural": 0.101 }, "origin": { "lexicon": 0, "ngram": 0, "neural": 0 } } }

Python:

from name_classifier import NameClassifier

nc = NameClassifier(args.model_dir)

nc.classify("Kateryna Olha Mykhailivna Shevchenko")

OpenClaw vs. Google – Mass Ban Wave [video]

Kash Patel's Girlfriend Seeks Fame and Fortune, Escorted by an FBI Swat Team

Apple's Rosetta 2 for Linux VM hides the CPU and kernel arch info

Could code written by humans (pre-AI) have any value in the future?

BarackObama will attack Iran in order to get re-elected

The Third Hard Problem

Built Netflix's Algorithm from Scratch

Agentic-coded Ethereum client targeting 2030 roadmap

Yes, and

Show HN: Panther - Bloomberg Terminal for prediction markets now in early access

Show HN: PyTorch/FEniCSx pipeline for elastocaloric metamaterial optimization

Maps Mania: This Is London Calling – Discover Global Radio Mapping

Show HN: ScreenBuddy – Mac screen recorder with auto-zoom on clicks

Local AI Devtool to assist setting up vibecoding env

Hazard Cascade

Show HN: Claude-plan-reviewer – Rival AI reviews Claude Code's plans

Ali Khamenei, Iran's Supreme leader, Is Dead

Custom Data Structures in E-Graphs

My iPhone Blue Up

Hey HN Creator Here

Show HN: VibeHQ Orchestrate multiple CLI agents as a real company team

Show HN: InterviewTrackr – All-in-one command center for CS job hunts

Show HN: Chatlite – simple Ollama desktop chat app under 5 MB

Vector and Semantic Search in Stoolap

Khamenei Dead

Trump Deliberated on Iran for Weeks

Show HN: Open-Plan-Annotator – Annotate Agent Plans Like a Google Doc 100% Local

The Birth of Light

Show HN: A Rust compiler with ownership checking, written in PHP

Google quantum-proofs HTTPS by squeezing 15kB of data into 700-byte space