Show HN: Ktx – Open-source executable context layer for data agents

https://github.com/Kaelio/ktx
14•lucamrtl•3h ago
Hi HN, we’re open-sourcing ktx. It’s an executable context layer that makes agents reliable on your data stack.

We built it after going through the experience of building production-grade data agents for dozens of companies. If you’ve also tried building them, or simply tried using Claude Code or Codex on your data warehouse, you’ll know that accuracy is the #1 issue. Agents are great at generating valid SQL, but it’s not always correct SQL.

To cite a few examples of “agents gone wrong”:

- Stale column + hidden business rule: while preparing a board report, a finance analyst asks Claude Code for “ARR by customer segment”. It derives ARR from multiple tables (subscriptions, plans, accounts), then groups by accounts.industry. But Claude Code doesn’t know that the industry column was deprecated a few months earlier, or that past board reports excluded paused subscriptions from the ARR calculation.

- Join fanout: a data analyst at a retailer uses their company’s internal agent to prep a product revenue deck for a QBR. The agent joins orders to order_items, then sums orders.total_amount_cents grouped by order_items.product_id. The SQL runs fine, but each order’s revenue is repeated once per line item, an error that is easy to miss when most orders have only one item.

- Missing attribution logic: a marketing analyst asks Codex “Which campaigns drove the most revenue?” Codex joins marketing_touches to users to orders and groups by utm_campaign. But since each order can have multiple touches before purchase, the same order can be credited to first touch, last touch, every touch, or every campaign the user clicked before buying. If the agent picks a method that doesn’t match the team’s attribution logic, the team will make suboptimal decisions.
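To make the attribution example concrete, here is a minimal sketch of how the same touches produce different answers under different models. The data, the `attribute` function, and the model names are hypothetical and only illustrate the ambiguity; they are not taken from ktx.

```python
from collections import defaultdict

# Hypothetical toy data: (order_id, utm_campaign, touch_time).
touches = [
    ("o1", "spring_sale", 1),
    ("o1", "newsletter", 2),
    ("o1", "retargeting", 3),
    ("o2", "newsletter", 1),
]
order_revenue = {"o1": 90.0, "o2": 30.0}


def attribute(model):
    """Return revenue per campaign under the given attribution model."""
    by_order = defaultdict(list)
    for order_id, campaign, ts in touches:
        by_order[order_id].append((ts, campaign))

    credit = defaultdict(float)
    for order_id, order_touches in by_order.items():
        ordered = [campaign for _, campaign in sorted(order_touches)]
        revenue = order_revenue[order_id]
        if model == "first_touch":
            credit[ordered[0]] += revenue
        elif model == "last_touch":
            credit[ordered[-1]] += revenue
        elif model == "every_touch":
            # What a plain join + GROUP BY utm_campaign does:
            # every touch receives the full order revenue.
            for campaign in ordered:
                credit[campaign] += revenue
        else:
            raise ValueError(f"unknown model: {model}")
    return dict(credit)


for model in ("first_touch", "last_touch", "every_touch"):
    print(model, attribute(model))
```

Under "every_touch" the credited revenue adds up to more than the real revenue, which is the result a naive join produces without anyone having chosen it.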

To solve this, we first gave the agent more context through skills + a wiki-style knowledge base. That gives it some useful extra context, but it still relies on the agent writing the SQL without making incorrect assumptions.

The next solution we explored was implementing a classic semantic layer. That solves the executable part, but semantic layers are a pain to build and maintain since they were made for legacy BI tools. Plus, as standalone tools, they lack all the useful context from unstructured data sources like internal docs.

So we built ktx and split it into 2 parts:

1. Business context goes in Markdown wiki pages that are auto-ingested and auto-populated

2. Queryable definitions go into YAML files that define tables, row grain, joins, measures, dimensions, filters, and filter groups

That way, when an agent needs a metric, it asks ktx for a measure, dimensions, filters, and filter groups instead of writing the whole query itself. ktx’s planner chooses the join path, uses grain and relationship metadata, catches issues like join fanout and chasm joins, and compiles the warehouse SQL, while utilizing the extra unstructured knowledge it has access to.
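As a rough illustration of the join fanout problem and the kind of relationship check a planner could apply, here is a small sketch using an in-memory SQLite database. The `line_amount_cents` column, the `RELATIONSHIPS` mapping, and the `fans_out` helper are assumptions made up for this example; they do not reflect ktx's actual schema or planner.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, total_amount_cents INTEGER);
CREATE TABLE order_items (order_id INTEGER, product_id TEXT, line_amount_cents INTEGER);
INSERT INTO orders VALUES (1, 1000), (2, 3000);
INSERT INTO order_items VALUES (1, 'A', 1000), (2, 'A', 1000), (2, 'B', 2000);
""")

# Naive query: an order-grain measure summed after joining to line items.
# Order 2 has two items, so its 3000 cents is counted once per item.
naive = dict(conn.execute("""
    SELECT oi.product_id, SUM(o.total_amount_cents)
    FROM orders o JOIN order_items oi ON oi.order_id = o.order_id
    GROUP BY oi.product_id
""").fetchall())

# Safer query: sum a measure that lives at the line-item grain.
correct = dict(conn.execute("""
    SELECT product_id, SUM(line_amount_cents)
    FROM order_items
    GROUP BY product_id
""").fetchall())

# Hypothetical relationship metadata a planner could consult.
RELATIONSHIPS = {("orders", "order_items"): "one_to_many"}


def fans_out(measure_table, joined_table):
    """True if joining joined_table would repeat rows of measure_table."""
    return RELATIONSHIPS.get((measure_table, joined_table)) == "one_to_many"


print("naive:", naive)
print("correct:", correct)
print("fanout risk:", fans_out("orders", "order_items"))
```

In this toy data the true revenue is 4000 cents, while the naive query reports 7000 across products. A check like `fans_out` lets a planner refuse or rewrite the query before it runs.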

ktx is Apache 2.0. It can ingest from most warehouses (BigQuery, Snowflake, Postgres & others), modeling tools (dbt, MetricFlow, LookML), BI tools (Looker, Metabase), doc tools like Notion, and corrections from user interactions.

Install manually:

npm install -g @kaelio/ktx

ktx setup

Or give this prompt to your agent:

Run npx skills add Kaelio/ktx --skill ktx and use ktx skill to install and configure ktx

We’d especially like feedback from people who’ve tried using Claude Code, Codex, or building custom agents on analytics warehouses. Where did they fail? And what did you try to make the answers more reliable?

Comments

tarun_anand•22m ago
How does this compare with Wren 2.0, OpenVikings, etc.?
