frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: SwellDB – Query AI-generated tables with SQL

https://github.com/SwellDB/SwellDB
2•giannakouris•6mo ago
I'm building a data system called SwellDB that uses LLMs to generate its tables on the fly.

Traditional databases only work over data that's already loaded and cleaned. But in the real world, data lives everywhere — in files, PDFs, web pages, APIs. To query it, we usually need custom ETL pipelines: extract, clean, transform, load. It’s slow, brittle, and different every time.

SwellDB flips that model: you define a table (schema + a description as a natural language prompt) and it generates the table just-in-time — using LLMs and your schema/prompt, on top of the connected data sources (files, databases, LLMs, web). Think: querying a DataFrame that materializes itself from raw input without you writing the ingestion logic.

It supports:

- Structured + unstructured sources: CSV, SQL, web search results (PDF to be added soon)

- Declarative table definitions in Python

- Output compatible with any SQL query engine (DuckDB, Apache DataFusion) or ingestible into any database

Repo: https://github.com/SwellDB/SwellDB

Short paper (4 pages): https://github.com/gsvic/gsvic.github.io/blob/gh-pages/paper...

Would love feedback if you get a chance to try it out, especially from folks dealing with hybrid or messy data sources.

Comments

lisa_coicadan•6mo ago
Really interesting project, love the idea of skipping traditional ETL by generating structured views on demand.

We’re building something in a similar space at Retab.com, but with a different philosophy: instead of querying live across unstructured sources, we focus on reliably turning raw inputs (PDFs, scanned docs, images, etc.) into clean, structured outputs, using schema-guided LLM generation, multi-model consensus, and an evaluation dashboard. So it’s less about on-the-fly queries, and more about building robust pipelines where you can trust the output and audit how it was produced. Curious if you’ve thought about integrating evaluation or schema validation layers downstream, or if SwellDB is mainly about exploration? Excited to follow the project either way!

Show HN: Pyrig – One command to set up a production-ready Python project

https://github.com/Winipedia/pyrig
1•Winipedia•46s ago•0 comments

Fast Response or Silence: Conversation Persistence in an AI-Agent Social Network [pdf]

https://github.com/AysajanE/moltbook-persistence/blob/main/paper/main.pdf
1•EagleEdge•52s ago•0 comments

C and C++ dependencies: don't dream it, be it

https://nibblestew.blogspot.com/2026/02/c-and-c-dependencies-dont-dream-it-be-it.html
1•ingve•1m ago•0 comments

Show HN: Vbuckets – Infinite virtual S3 buckets

https://github.com/danthegoodman1/vbuckets
1•dangoodmanUT•1m ago•0 comments

Open Molten Claw: Post-Eval as a Service

https://idiallo.com/blog/open-molten-claw
1•watchful_moose•2m ago•0 comments

New York Budget Bill Mandates File Scans for 3D Printers

https://reclaimthenet.org/new-york-3d-printer-law-mandates-firearm-file-blocking
1•bilsbie•3m ago•0 comments

The End of Software as a Business?

https://www.thatwastheweek.com/p/ai-is-growing-up-its-ceos-arent
1•kteare•4m ago•0 comments

Exploring 1,400 reusable skills for AI coding tools

https://ai-devkit.com/skills/
1•hoangnnguyen•4m ago•0 comments

Show HN: A unique twist on Tetris and block puzzle

https://playdropstack.com/
1•lastodyssey•8m ago•0 comments

The logs I never read

https://pydantic.dev/articles/the-logs-i-never-read
1•nojito•9m ago•0 comments

How to use AI with expressive writing without generating AI slop

https://idratherbewriting.com/blog/bakhtin-collapse-ai-expressive-writing
1•cnunciato•10m ago•0 comments

Show HN: LinkScope – Real-Time UART Analyzer Using ESP32-S3 and PC GUI

https://github.com/choihimchan/linkscope-bpu-uart-analyzer
1•octablock•10m ago•0 comments

Cppsp v1.4.5–custom pattern-driven, nested, namespace-scoped templates

https://github.com/user19870/cppsp
1•user19870•11m ago•1 comments

The next frontier in weight-loss drugs: one-time gene therapy

https://www.washingtonpost.com/health/2026/01/24/fractyl-glp1-gene-therapy/
1•bookofjoe•14m ago•1 comments

At Age 25, Wikipedia Refuses to Evolve

https://spectrum.ieee.org/wikipedia-at-25
1•asdefghyk•17m ago•3 comments

Show HN: ReviewReact – AI review responses inside Google Maps ($19/mo)

https://reviewreact.com
2•sara_builds•17m ago•1 comments

Why AlphaTensor Failed at 3x3 Matrix Multiplication: The Anchor Barrier

https://zenodo.org/records/18514533
1•DarenWatson•19m ago•0 comments

Ask HN: How much of your token use is fixing the bugs Claude Code causes?

1•laurex•22m ago•0 comments

Show HN: Agents – Sync MCP Configs Across Claude, Cursor, Codex Automatically

https://github.com/amtiYo/agents
1•amtiyo•23m ago•0 comments

Hello

2•otrebladih•24m ago•1 comments

FSD helped save my father's life during a heart attack

https://twitter.com/JJackBrandt/status/2019852423980875794
3•blacktulip•27m ago•0 comments

Show HN: Writtte – Draft and publish articles without reformatting, anywhere

https://writtte.xyz
1•lasgawe•29m ago•0 comments

Portuguese icon (FROM A CAN) makes a simple meal (Canned Fish Files) [video]

https://www.youtube.com/watch?v=e9FUdOfp8ME
1•zeristor•31m ago•0 comments

Brookhaven Lab's RHIC Concludes 25-Year Run with Final Collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
3•gnufx•33m ago•0 comments

Transcribe your aunts post cards with Gemini 3 Pro

https://leserli.ch/ocr/
1•nielstron•37m ago•0 comments

.72% Variance Lance

1•mav5431•38m ago•0 comments

ReKindle – web-based operating system designed specifically for E-ink devices

https://rekindle.ink
1•JSLegendDev•39m ago•0 comments

Encrypt It

https://encryptitalready.org/
1•u1hcw9nx•39m ago•1 comments

NextMatch – 5-minute video speed dating to reduce ghosting

https://nextmatchdating.netlify.app/
1•Halinani8•40m ago•1 comments

Personalizing esketamine treatment in TRD and TRBD

https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1736114
1•PaulHoule•42m ago•0 comments