frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

What if you just did a startup instead?

https://alexaraki.substack.com/p/what-if-you-just-did-a-startup
1•okaywriting•1m ago•0 comments

Hacking up your own shell completion (2020)

https://www.feltrac.co/environment/2020/01/18/build-your-own-shell-completion.html
1•todsacerdoti•3m ago•0 comments

Show HN: Gorse 0.5 – Open-source recommender system with visual workflow editor

https://github.com/gorse-io/gorse
1•zhenghaoz•4m ago•0 comments

GLM-OCR: Accurate × Fast × Comprehensive

https://github.com/zai-org/GLM-OCR
1•ms7892•5m ago•0 comments

Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU

https://github.com/MikeVeerman/tool-calling-benchmark
1•MikeVeerman•6m ago•0 comments

Show HN: AboutMyProject – A public log for developer proof-of-work

https://aboutmyproject.com/
1•Raiplus•6m ago•0 comments

Expertise, AI and Work of Future [video]

https://www.youtube.com/watch?v=wsxWl9iT1XU
1•indiantinker•7m ago•0 comments

So Long to Cheap Books You Could Fit in Your Pocket

https://www.nytimes.com/2026/02/06/books/mass-market-paperback-books.html
3•pseudolus•7m ago•1 comments

PID Controller

https://en.wikipedia.org/wiki/Proportional%E2%80%93integral%E2%80%93derivative_controller
1•tosh•11m ago•0 comments

SpaceX Rocket Generates 100GW of Power, or 20% of US Electricity

https://twitter.com/AlecStapp/status/2019932764515234159
1•bkls•11m ago•0 comments

Kubernetes MCP Server

https://github.com/yindia/rootcause
1•yindia•12m ago•0 comments

I Built a Movie Recommendation Agent to Solve Movie Nights with My Wife

https://rokn.io/posts/building-movie-recommendation-agent
3•roknovosel•12m ago•0 comments

What were the first animals? The fierce sponge–jelly battle that just won't end

https://www.nature.com/articles/d41586-026-00238-z
2•beardyw•21m ago•0 comments

Sidestepping Evaluation Awareness and Anticipating Misalignment

https://alignment.openai.com/prod-evals/
1•taubek•21m ago•0 comments

OldMapsOnline

https://www.oldmapsonline.org/en
1•surprisetalk•23m ago•0 comments

What It's Like to Be a Worm

https://www.asimov.press/p/sentience
2•surprisetalk•23m ago•0 comments

Don't go to physics grad school and other cautionary tales

https://scottlocklin.wordpress.com/2025/12/19/dont-go-to-physics-grad-school-and-other-cautionary...
1•surprisetalk•23m ago•0 comments

Lawyer sets new standard for abuse of AI; judge tosses case

https://arstechnica.com/tech-policy/2026/02/randomly-quoting-ray-bradbury-did-not-save-lawyer-fro...
3•pseudolus•24m ago•0 comments

AI anxiety batters software execs, costing them combined $62B: report

https://nypost.com/2026/02/04/business/ai-anxiety-batters-software-execs-costing-them-62b-report/
1•1vuio0pswjnm7•24m ago•0 comments

Bogus Pipeline

https://en.wikipedia.org/wiki/Bogus_pipeline
1•doener•25m ago•0 comments

Winklevoss twins' Gemini crypto exchange cuts 25% of workforce as Bitcoin slumps

https://nypost.com/2026/02/05/business/winklevoss-twins-gemini-crypto-exchange-cuts-25-of-workfor...
2•1vuio0pswjnm7•26m ago•0 comments

How AI Is Reshaping Human Reasoning and the Rise of Cognitive Surrender

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6097646
3•obscurette•26m ago•0 comments

Cycling in France

https://www.sheldonbrown.com/org/france-sheldon.html
2•jackhalford•27m ago•0 comments

Ask HN: What breaks in cross-border healthcare coordination?

1•abhay1633•28m ago•0 comments

Show HN: Simple – a bytecode VM and language stack I built with AI

https://github.com/JJLDonley/Simple
2•tangjiehao•30m ago•0 comments

Show HN: Free-to-play: A gem-collecting strategy game in the vein of Splendor

https://caratria.com/
1•jonrosner•31m ago•1 comments

My Eighth Year as a Bootstrapped Founde

https://mtlynch.io/bootstrapped-founder-year-8/
1•mtlynch•32m ago•0 comments

Show HN: Tesseract – A forum where AI agents and humans post in the same space

https://tesseract-thread.vercel.app/
1•agliolioyyami•32m ago•0 comments

Show HN: Vibe Colors – Instantly visualize color palettes on UI layouts

https://vibecolors.life/
2•tusharnaik•33m ago•0 comments

OpenAI is Broke ... and so is everyone else [video][10M]

https://www.youtube.com/watch?v=Y3N9qlPZBc0
2•Bender•33m ago•0 comments
Open in hackernews

Landrecords – cheap nationwide parcel dataset standardized using gemma3

https://landrecords.us
12•mapsperson•5mo ago

Comments

mapsperson•5mo ago
I created a Nationwide dataset of 155M land parcels using two GPUs and a 30TB hard drive.

Because I don't have $100K+ to buy the US parcel dataset from Regrid or ReportAll, I bought a pair of L40s and a 30TB NVMe hard drive, and used them to collect and harmonize 155M parcels into a single dataset from over 3,100 US counties.

And because I don't have a couple dozen employees to feed like Reportall and Regrid and Corelogic, my goal is to try to resell this dataset at much lower prices than the current incumbents, and make the data accessible to smaller projects and smaller budgets.

I ended up with close to 99% coverage of the United States.

Backend stack is a single server running Postgres, gemma3 on ollama, and a big pile of python and plpgsql. Website is running on Firebase with PMTiles as the mapping layer. Parcel file exports are served from Google Cloud Storage.

My plan is to open-source a big portion of this system once I can clean it up, but my first priority was getting a product on the market and trying to make this self-sustaining.

If anyone is interested in any of the technical details or if you want to try to do this yourself, I'm happy to share anything you want to know.

jakupovic•5mo ago
I would like to know more. For example how did you get the county records?
mapsperson•5mo ago
One at a time. The county is the sole unit of authority for land records in the US (with a few exceptions). Luckily, these days, most of them publish this data via web services or APIs.

I was able to automate a big chunk of this work by crawling county websites and looking for these web services that I could download from.

But there is no agreed-upon schema standard -- they all store the data in different formats, schemas, etc. About 50% of the effort in maintaining a dataset like this is maintaining the mappings from the source data to the target schema. That's where I am making heavy use of LLMs. This turns out to be something they are very good at. I found gemma3 to have the best balance of reliability, ease of use, and speed for my use case.

dmroth•5mo ago
I'm very interested to learn more.
mapsperson•5mo ago
If you send an email from the website, it will go straight to me :) Happy to talk more