frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: A production-style recommender using vector retrieval and re-ranking

2•ftonato•1h ago
I’ve been exploring how recommendation systems are actually implemented in production, beyond just training models.

A common pattern I kept seeing is to split the problem into two stages:

1. Retrieve a small set of relevant candidates

2. Re-rank them using a model

Instead of doing brute-force inference across all items, I built a small prototype around this idea.

The flow looks like this:

- Store embeddings in a vector database (ChromaDB)

- Retrieve the Top-K most similar items/users based on vector similarity

- Run a TensorFlow.js model to re-rank the candidates

The goal is to reduce the search space before applying inference, which seems necessary when latency and scale matter.

What I found interesting is that once you move to this approach, a lot of the complexity shifts from the model itself to the retrieval layer:

- choosing K

- filtering candidates

- embedding quality

- latency vs recall trade-offs

Curious how others approach this in real systems:

- How do you decide on K?

- Do you rely purely on vector similarity or add heuristics?

- How do you handle re-ranking at scale?

Project: https://github.com/ftonato/recommendation-system-chromadb-tf...

Show HN: Decision Guardian – Auto-surface architectural context on PRs and CLI

1•iamalizaidi•1m ago•0 comments

FCC approves Charter Communications' $34.5B deal to buy Cox

https://www.reuters.com/world/us-agency-approves-charter-communications-345-billion-deal-buy-cox-...
2•geox•4m ago•0 comments

Shift from passive documentation to active enforcement

3•iamalizaidi•5m ago•0 comments

Show HN: Accept.md now supports SvelteKit – return Markdown from any page

https://www.accept.md/
2•hval•7m ago•0 comments

I stopped using JSON for MQTT and use Zig to develop gRPC-like communication

https://gyokhan.com/stopped-using-json-for-mqtt-and-develop-zig-protobuf-mqtt/
2•gyokhan•15m ago•0 comments

Raided by the Police – Investigating Nintendo, Sega, & Devkit Arrests [video]

https://www.youtube.com/watch?v=NU040CTdJI0
1•wicket•16m ago•0 comments

Sam Altman Says OpenAI Is Working on Pentagon Deal

https://www.wsj.com/tech/ai/openais-sam-altman-calls-for-de-escalation-in-anthropic-showdown-with...
1•DGAP•17m ago•1 comments

Autokey Wayland – fork of AutoKey a desktop automation app with Wayland support

https://github.com/dlk3/autokey-wayland
1•Curiositry•18m ago•0 comments

Software development now costs less than than the wage of a minimum wage worker

https://ghuntley.com/real/
2•cebert•21m ago•1 comments

Pentagon approves OpenAI safety red lines after dumping Anthropic

https://www.axios.com/2026/02/27/pentagon-openai-safety-red-lines-anthropic
2•galleywest200•22m ago•0 comments

Show HN: I made a website to write online math as fast as paper

https://www.scratchpad-math.com/
3•maxtobiasen•25m ago•1 comments

Is AGI a Billion-Dollar Mirage? The AI Circular Trap

https://medium.com/@opsworld.g/ais-downhill-without-brakes-is-agi-a-billion-dollar-mirage-e5b9cb6...
4•om202•25m ago•0 comments

Money Is the First AI – and We Never Noticed

https://github.com/AionSystem/AION-BRAIN/blob/main/articles/MEDIUM/THE-THRESHOLD/Money-Is-The-Fir...
2•sheldonksalmon•27m ago•1 comments

LFortran Compiles Fpm

https://lfortran.org/blog/2026/02/lfortran-compiles-fpm/
1•wtlin•28m ago•0 comments

Show HN: I seriously think this is the most effective email leadgen tool

https://easyemailfinder.com/
1•faalbane•29m ago•0 comments

Devs who code like you (from public GitHub signals)

https://mates.symploke.dev?hn
1•thomasfromcdnjs•30m ago•0 comments

The Ballad of Dario and Pete

https://twitter.com/last_one_in_/status/2027536134214189435
2•cmogni1•31m ago•0 comments

A way to be a person

https://www.lrb.co.uk/the-paper/v48/n04/paul-taylor/a-way-to-be-a-person
1•hhs•32m ago•0 comments

Modernity: We launched two weeks ago, got 400 testers, zero customers. Fixes

https://modernity.live/
1•Fabric14•32m ago•1 comments

History of Software Design

https://historyofsoftware.org/
2•vinhnx•34m ago•0 comments

How strong is New York's "illegal gambling" case against Valve's loot boxes?

https://arstechnica.com/gaming/2026/02/how-strong-is-new-yorks-illegal-gambling-case-against-valv...
2•Bender•35m ago•0 comments

C inference for Qwen3-ASR 0.6B and 1.7B transcriptions models

https://github.com/antirez/qwen-asr
1•Curiositry•35m ago•0 comments

Show HN: OpenTimelineEngine – Shared local memory for Claude Code and codex

https://github.com/JOELJOSEPHCHALAKUDY/open-timeline-engine
5•joeljoseph_•40m ago•3 comments

Carabiner Hacking for Workout [video]

https://www.youtube.com/shorts/rhqroEZCrnI
1•demute•42m ago•0 comments

New AGI Framework Different

https://hal.science/hal-05528268
1•Sai-dewa•44m ago•4 comments

Claude's Constitution (2023)

https://www.anthropic.com/news/claudes-constitution
3•ponkylives•47m ago•0 comments

The Legal Rights of Extraterrestrials

https://www.rfreitas.com/Astro/LegalRightsOfETs.htm
1•TheAceOfHearts•47m ago•0 comments

'RentAHuman' platform includes Texans who can be rented by AI agents for tasks

https://dallasinnovates.com/rentahuman-platform-includes-north-texans-who-can-be-rented-by-ai-age...
1•hhs•49m ago•1 comments

Input Remapper – easy to use tool to change the behaviour of Linux input devices

https://github.com/sezanzeb/input-remapper
1•Curiositry•52m ago•0 comments

Enclave gem: Mega useful if you're building agents on Ruby on Rails

https://github.com/rubymonolith/enclave
1•obiefernandez•53m ago•0 comments