frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: A centralized resource for software, tools, and services

https://favz.link/
1•Tinymind•1m ago•0 comments

Show HN: I built a simulated AI containment terminal for my sci-fi novel

https://vertex.flowlogix.ai
2•stevengreser•1m ago•0 comments

FSQ Spatial Agent and FSQ H3 Hub – an AI agent for geospatial analysis workflows

https://foursquare.com/resources/blog/ad-tech/introducing-fsq-spatial-agent-your-geospatial-ai-an...
2•jbgrt•1m ago•0 comments

A New Complexity Theory for the Quantum Age

https://www.quantamagazine.org/a-new-complexity-theory-for-the-quantum-age-20260217/
2•ibobev•1m ago•0 comments

Even in Antarctica, Insects Are Eating Microplastics

https://e360.yale.edu/digest/antarctic-midges-microplastics
2•Brajeshwar•1m ago•0 comments

What's cooking with Pystd, the experimental C++ standard library?

https://nibblestew.blogspot.com/2026/02/whats-cooking-with-pystd-experimental-c.html
2•ibobev•2m ago•0 comments

Understanding Std:Shared_mutex from C++17

https://www.cppstories.com/2026/shared_mutex/
2•ibobev•3m ago•0 comments

Deploy SurrealDB with a Docker Desktop Extension- zero server setup, built-in UI

https://www.docker.com/blog/deploy-surrealdb-docker-desktop-extension/
2•kubetools•3m ago•1 comments

Using go fix to modernize Go code

https://go.dev/blog/gofix
2•todsacerdoti•4m ago•0 comments

How to Structure Projects for AI Agents and LLMs

https://mastra.ai/blog/how-to-structure-projects-for-ai-agents-and-llms
2•calcsam•4m ago•0 comments

AI May DOOM humans After All. I may have been wrong [video]

https://www.youtube.com/watch?v=GYfgjYVEYQ0
1•EPendragon•7m ago•0 comments

Dijkstra On the foolishness of "natural language programming"

https://www.cs.utexas.edu/~EWD/transcriptions/EWD06xx/EWD667.html?hn=1
2•mparramon•7m ago•0 comments

Show HN: ChangeWord, transformative word game in 6 languages

https://changeword.org/
1•oliwary•8m ago•0 comments

CBS didn't air Rep. James Talarico interview out of fear of FCC

https://www.nbcnews.com/business/media/stephen-colbert-cbs-james-talarico-fcc-rcna259341
7•theahura•9m ago•0 comments

Storyteller Lemmy – Attack of the Bossed-Up Biomimetic Bad Bitch

https://twitter.com/LemmySmackett/status/2020194094690042009
1•miniBill•12m ago•0 comments

Benchmarking CDC Tools: Supermetal vs. Debezium vs. Flink CDC

https://www.streamingdata.tech/p/benchmarking-cdc-tools
1•sap1enz•12m ago•0 comments

GStreamer 1.28 brings AI inference to your media pipeline

https://www.collabora.com/news-and-blog/news-and-events/gstreamer-1.28,-ready-for-ai.html
1•losgehts•12m ago•0 comments

Apple Plans M5-Based Private Cloud Compute Architecture for Apple Intelligence

https://9to5mac.com/2026/02/17/apple-plans-m5-based-private-cloud-compute-architecture-for-apple-...
4•alwillis•13m ago•0 comments

Deterministic Core, Agentic Shell

https://blog.davemo.com/posts/2026-02-14-deterministic-core-agentic-shell.html
1•amateurhuman•13m ago•0 comments

CalyxOS puts privacy and security into the hands of everyday users

https://calyxos.org
1•pretext•13m ago•0 comments

ChatGPT's Translation Skills Parallel Most Human Translators

https://spectrum.ieee.org/chatgpt-translate-skills-human-comparison
2•pseudolus•14m ago•0 comments

The Broken Equilibrium

https://stackgen.com/blog/the-broken-equilibrium
1•darccio•15m ago•0 comments

Don't Trip[Wire] Yourself: Testing Error Recovery in Zig

https://mitchellh.com/writing/tripwire
1•PaulHoule•15m ago•0 comments

What evidence format helps technical diligence decisions move faster?

https://sot-navigator.com/
2•mthdadalto•15m ago•1 comments

phyz: Differentiable Physics Engine for Rust

https://phyz.dev
2•ecto•16m ago•0 comments

Why I don't think AI is a bubble

https://honnibal.dev/blog/ai-bubble
2•syllogism•16m ago•0 comments

AI Panic Hits Trucking, Transport Stocks

https://www.wsj.com/livecoverage/stock-market-today-dow-sp-500-nasdaq-02-12-2026/card/ai-panic-hi...
2•mattas•17m ago•0 comments

Paper, Scissors, Gravity

https://campedersen.com/phyz
2•ecto•17m ago•1 comments

Opinion: The Finance Industry Is a Grift. Let's Start Treating It That Way

https://www.nytimes.com/2026/02/06/opinion/capitalism-industry-financialization.html
3•cs702•17m ago•0 comments

You probably can't trust your password manager if it's compromised

https://www.theregister.com/2026/02/16/password_managers/
3•maguszin•17m ago•0 comments
Open in hackernews

How AI Finds Fuzzy Duplicates in Large Datasets

https://futuresearch.ai/semantic-deduplication/
10•nbosse•1h ago

Comments

nbosse•1h ago
We built this after too many rounds of deduplication on messy data. Each technique in the deduplication funnel solves what the previous one can't, but the real pain is orchestrating all three together at scale: chunking to avoid O(n²), batching LLM calls (accuracy degrades past ~25 items), rate limiting across embedding and completion APIs simultaneously. We packaged the pipeline into a Python SDK. Here's a 500-row CRM dataset that cost $0.74, ~100 sec to dedupe: https://everyrow.io/docs/resolve-entities-python