frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

A lot of population numbers are fake

https://davidoks.blog/p/a-lot-of-population-numbers-are-fake
59•bookofjoe•1h ago•22 comments

Claude Code Daily Benchmarks for Degradation Tracking

https://marginlab.ai/trackers/claude-code/
31•qwesr123•1h ago•6 comments

Europe’s next-generation weather satellite sends back first images

https://www.esa.int/Applications/Observing_the_Earth/Meteorological_missions/meteosat_third_gener...
434•saubeidl•7h ago•59 comments

Apple to soon take up to 30% cut from all Patreon creators in iOS app

https://www.macrumors.com/2026/01/28/patreon-apple-tax/
684•pier25•18h ago•571 comments

Render Mermaid diagrams as SVGs or ASCII art

https://github.com/lukilabs/beautiful-mermaid
322•mellosouls•12h ago•48 comments

Show HN: ShapedQL – A SQL engine for multi-stage ranking and RAG

https://playground.shaped.ai
23•tullie•2d ago•9 comments

Vitamin D and Omega-3 have a larger effect on depression than antidepressants

https://blog.ncase.me/on-depression/
487•mijailt•4h ago•345 comments

Apt-bundle: brew bundle for apt

https://github.com/apt-bundle/apt-bundle
16•sadeshmukh•4d ago•5 comments

We can’t send mail farther than 500 miles (2002)

https://web.mit.edu/jemorris/humor/500-miles
494•giancarlostoro•11h ago•75 comments

Mecha Comet – Open Modular Linux Handheld Computer

https://mecha.so/comet
199•Realman78•3d ago•60 comments

Maine’s ‘Lobster Lady’ who fished for nearly a century dies aged 105

https://www.theguardian.com/us-news/2026/jan/28/maine-lobster-lady-dies-aged-105
186•NaOH•12h ago•43 comments

Tea Chemistry (1997)

https://www.researchgate.net/profile/Matthew-Harbowy/publication/216792045_Tea_Chemistry/links/09...
53•aabiji•5d ago•15 comments

Building a High-Performance Rotating Bloom Filter in Java

https://medium.com/@udaysagar.2177/building-a-high-performance-rotating-bloom-filter-in-java-a9e7...
7•udaysagar•4d ago•0 comments

Decompiling Xbox games using PDB debug info

https://i686.me/blog/csplit/
73•orange_redditor•2d ago•9 comments

Airfoil (2024)

https://ciechanow.ski/airfoil/
494•brk•1d ago•54 comments

Xmake: A cross-platform build utility based on Lua

https://xmake.io/
70•phmx•4d ago•29 comments

Trinity large: An open 400B sparse MoE model

https://www.arcee.ai/blog/trinity-large
217•linolevan•1d ago•69 comments

Tesla ending Models S and X production

https://www.cnbc.com/2026/01/28/tesla-ending-model-s-x-production.html
431•keyboardJones•16h ago•848 comments

How London became the rest of the world’s startup capital

https://www.economist.com/britain/2026/01/26/how-london-became-the-rest-of-the-worlds-startup-cap...
167•ellieh•1d ago•236 comments

Questom (YC F25) is hiring an engineer

https://www.ycombinator.com/companies/questom/jobs/UBebsyO-founding-engineer
1•ritanshu•11h ago

AI on Australian travel company website sent tourists to nonexistent hot springs

https://www.cnn.com/2026/01/28/travel/ai-tourism-nonexistent-hotsprings-intl-scli
56•breve•4h ago•20 comments

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
196•jmuncor•20h ago•100 comments

Android’s desktop interface leaks

https://9to5google.com/2026/01/27/android-desktop-leak/
266•thunderbong•1d ago•347 comments

Mousefood – Build embedded terminal UIs for microcontrollers

https://github.com/ratatui/mousefood
220•orhunp_•21h ago•46 comments

Does running wear out the bodies of professionals and amateurs alike?

https://theconversation.com/does-running-wear-out-the-bodies-of-professionals-and-amateurs-alike-...
32•PaulHoule•2h ago•26 comments

LM Studio 0.4

https://lmstudio.ai/blog/0.4.0
194•jiqiren•20h ago•107 comments

Did a celebrated researcher obscure a baby's poisoning?

https://www.newyorker.com/magazine/2026/02/02/did-a-celebrated-researcher-obscure-a-fatal-poisoning
171•littlexsparkee•1d ago•66 comments

UK Government to Create 'British FBI', Roll Out Nationwide Facial Recognition

https://www.theepochtimes.com/world/uk-government-to-create-british-fbi-roll-out-nationwide-facia...
16•hentrep•6h ago•8 comments

The tech market is fundamentally fucked up and AI is just a scapegoat

https://bayramovanar.substack.com/p/tech-market-is-fucked-up
245•Bayramovanar•2h ago•152 comments

Bf-Tree: modern read-write-optimized concurrent larger-than-memory range index

https://github.com/microsoft/bf-tree
121•SchwKatze•16h ago•19 comments
Open in hackernews

Show HN: ShapedQL – A SQL engine for multi-stage ranking and RAG

https://playground.shaped.ai
23•tullie•2d ago
Hi HN,

I’m Tullie, founder of Shaped. Previously, I was a researcher at Meta AI, worked on ranking for Instagram Reels, and was a contributor to PyTorch Lightning.

We built ShapedQL because we noticed that while retrieval (finding 1,000 items) has been commoditized by vector DBs, ranking (finding the best 10 items) is still an infrastructure problem.

To build a decent for you feed or a RAG system with long-term memory, you usually have to put together a vector DB (Pinecone/Milvus), a feature store (Redis), an inference service, and thousands of lines of Python to handle business logic and reranking.

We built an engine that consolidates this into a single SQL dialect. It compiles declarative queries into high-performance, multi-stage ranking pipelines.

HOW IT WORKS:

Instead of just SELECT , ShapedQL operates in four stages native to recommendation systems:

RETRIEVE: Fetch candidates via Hybrid Search (Keywords + Vectors) or Collaborative Filtering. FILTER: Apply hard constraints (e.g., "inventory > 0"). SCORE: Rank results using real-time models (e.g., p(click) or p(relevance)). REORDER: Apply diversity logic so your Agent/User doesn’t see 10 nearly identical results.

THE SYNTAX: Here is what a RAG query looks like. This replaces about 500 lines of standard Python/LangChain code:

SELECT item_id, description, price

FROM

  -- Retrieval: Hybrid search across multiple indexes

  search_flights("$param.user_prompt", "$param.context"),

  search_hotels("$param.user_prompt", "$param.context")
WHERE

  -- Filtering: Hard business constraints

  price <= "$param.budget" AND is_available("$param.dates")
ORDER BY

  -- Scoring: Real-time reranking (Personalization + Relevance)

  0.5 * preference_score(user, item) +

  0.3 * relevance_score(item, "$param.user_prompt")
LIMIT 20

If you don’t like SQL, you can also use our Python and Typescript SDKs. I’d love to know what you think of the syntax and the abstraction layer!

Comments

thorax•1h ago
RE: syntax For casual use, I kinda always liked the whole MATCH/AGAINST syntax for old school Innodb, though obviously things have changed a lot since those days. But it felt less like calling embedded functions and more like extending SQL’s grammar.

Regarding the rest, it seems like a reasonable approach at first tinker.

refset•49m ago
Neat examples, and I agree that extending SQL like this has real potential. Another project along very similar lines is https://github.com/ryrobes/larsql
alexpadula•16m ago
Fairly easy to extend SQLite, Postgres and MariaDB/MySQL!

Curious what relational database do you @refset use? Is the code open source? Is the engine from scratch? What general dialect does it support?

Cheers!

refset•2m ago
I work on https://github.com/xtdb/xtdb which is broadly Postgres-compatible with a few key SQL extensions (SQL:2011 bitemporal tables + immutability, first-class nested data, pipeline syntax, etc). Built on Arrow and the JVM but is otherwise mostly from scratch.

XTDB is perhaps not directly relevant to the topic at hand, but I am a firm believer that ML workflows can benefit from robust temporal modelling.

jiwidi•22m ago
Great potential! Love the idea
hrimfaxi•18m ago
If I upload my own data, who exactly is it shared with? I can't find a list of subprocessors and this line in the privacy policy is alarming:

> We’ll whenever feasible ask for your consent before using your Personal information for a purpose that isn’t covered in this Privacy Policy.

mritchie712•18m ago
this is cool, but:

> This replaces about 500 lines of standard Python

isn't really a selling point when an LLM can do it in a few seconds. I think you'd be better off pitching simpler infra and better performance (if that's true).

i.e. why should I use this instead of turbopuffer? The answer of "write a little less code" is not compelling.

pickleballcourt•15m ago
Is there a major difference between pgvector and shapedql?