frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: fenic – LLMs as dataframe operators, query meaning and structure

https://github.com/typedef-ai/fenic
2•cpard•1h ago
Hey friends. I'd like to share a project that's dear to me. fenic is a dataframe API with LLMs added as first-class citizens, a classic lazy dataframe API extended with new operators that are backed by LLMs.

What this gets you is the ability to work with structured and unstructured data in the same context. Most importantly, the LLMs aren't integrates as opaque UDF black boxes. They're exposed as "semantic" operators that the planner can reason about alongside the classic ones.

(There are examples and code snippets on the repo to see how everything works together)

Why build this? I'm a data infra / systems person. When LLMs showed up, what I saw was a new type of compute that changes the characteristics of the workloads we deal with. I wanted to experiment with how our current systems can absorb these new workloads and compute types, and what it would take to make the DX as seamless as possible, that's where the UDF + arbitrary prompt was feeling too problematic.

To support this properly, we had to introduce a few really cool things:

New plan operators. You don't just send prompts at an LLM. You use operators like semantic join, semantic map and reduce, and semantic filter, among others. They mix with the classic operators, and because the planner sees them as real operators rather than black boxes, it can reorder work around them.

Typed outputs. There's ergonomics to turn the output of a semantic operator straight into a typed dataframe column. A Pydantic schema for the LLM output becomes a typed struct column you can unnest, explode, and so on.

New data types like a markdown data type. Markdown became an important way to share information with LLMs, even though it started life as a way to format text for presentation. It carries structure, and being able to access that structure the way you would a struct or JSON type adds to the developer experience I mentioned.

Async UDFs. One of the more interesting shifts in workloads from the LLM explosion is the need to put heavily I/O-bound steps in your pipeline: fetching a response from an API, crawling a website, and so on. Async UDFs fill that gap, and the implementation handles the nuances for you: concurrency, retries, and the rest.

An LLM-inference-aware planner and runtime. This is one of the parts I'm most excited about, and there's a lot still to do. Today: identical prompts within a batch collapse to a single model call, so duplicates cost zero tokens; requests are dispatched concurrently under per-provider rpm/tpm limits with retries and backoff; null and empty cells skip the model entirely; and you get token and cost metrics per operator. There's also an optional persistent response cache so re-runs skip the model.

MCP as a new catalog primitive. Much like a registered view, you can register a dataframe pipeline as an MCP tool in the catalog. fenic then serves an MCP server with that pipeline as the tool's logic, executed over your data.

These are just some of what's gone into fenic while experimenting with how LLMs can become part of our compute infrastructure. There's more, and plenty more to polish on what's already there.

I've been using fenic for all sorts of things. On the small/personal end, I use it to take my podcast audio recordings and turn them into nicely structured tables of metadata I can research. On the heavier end, I use it as tooling for agents to analyze agent traces exported from Pydantic Logfire, to discover evals and turn them into reproducible artifacts in the form of dataframe pipelines.

  pip install fenic
  Repo: https://github.com/typedef-ai/fenic
  Docs: https://docs.fenic.ai
There's also a skill you can use with claude code, codex etc. to quickly get started with fenic in your favourite agentic coding environment.

I'd love to hear your thoughts, criticism, and anything else that comes to mind.

I'm here to answer questions.

Eclipsa Video: HDR That Looks Right on Every Screen

https://android-developers.googleblog.com/2026/06/eclipsa-video-hdr-review.html
1•ledoge•1m ago•0 comments

Claude Sonnet 5 System Card

https://anthropic.com/claude-sonnet-5-system-card
2•adocomplete•2m ago•0 comments

Who can take on web exploit scans? The Banhammer™ can!

https://code.backwater.systems/blog/#2026-06-29T23:40:00.000Z
1•nate-gehringer•2m ago•0 comments

The DLL that was not present in memory despite not being formally unloaded 2

https://devblogs.microsoft.com/oldnewthing/20260626-00/?p=112472
1•supermatou•2m ago•0 comments

Valve explains the Steam Machine (and that pricing) [video]

https://www.youtube.com/watch?v=Y8_e3SteAgE
1•HelloUsername•2m ago•0 comments

Ask HN: Help my web browser project be better

2•roschdal•2m ago•0 comments

AgentWire: Orchestrating many Claude Code sessions via tmux and voice-control

https://github.com/dotdevdotdev/agentwire-dev
1•AdamTSaunders•3m ago•0 comments

Johnson Thermoelectric Energy Converter

https://en.wikipedia.org/wiki/Johnson_thermoelectric_energy_converter
1•simonebrunozzi•3m ago•0 comments

Prompt Injection Is Not a Chatbot Problem: How the Attack Surface Changes

https://agentsafelabs.com/blog/prompt-injection-is-not-a-chatbot-problem-how-the-attack-surface-c...
1•waqarjaved•3m ago•0 comments

Rust Service Isn't Leaking – It Could Be the Allocator

https://pranitha.dev/posts/rust-and-memory-allocators/
1•abhirag•3m ago•0 comments

Claude Desktop is now available on Linux (in beta)

https://twitter.com/ClaudeDevs/status/2071988881717871065
1•Recursing•3m ago•1 comments

Connect any self-hosted WordPress site to your AI

https://wpvibe.ai/start/
1•luispa•5m ago•0 comments

Comparing GLM 5.2 and Opus 4.8 implementing the same methods for the same repos

https://gist.github.com/smellslikeml/36bf4939d76f0f84d113e2ddde5e6d3c
1•mynameisfunk•7m ago•0 comments

Show HN: Deconstructing Anthropic's Coding Agent Control Model

https://www.highflame.com/blog/how-anthropic-contains-its-own-coding-agents-and-get-that-coverage...
1•sharathr•7m ago•0 comments

Apple to Adopt Next-Generation Display Color Standard

https://www.trendforce.com/presscenter/news/20260629-13124.html
1•ledoge•10m ago•0 comments

Useful LLM Prompts for Editing Your Own Technical Writing

https://techstackups.com/articles/useful-llm-prompts-for-editing-your-own-technical-writing/
1•ritzaco•10m ago•0 comments

Ford rehires 'greybeards' after AI tech fails to deliver

https://www.msn.com/en-us/news/technology/ford-rehires-human-engineers-after-ai-fails-to-match-qu...
2•ilamont•12m ago•0 comments

Shipping post-quantum cryptography to Python

https://blog.trailofbits.com/2026/06/30/shipping-post-quantum-cryptography-to-python/
1•logickkk1•14m ago•0 comments

Forget Work. Passive Income Is the New American Dream

https://www.wsj.com/lifestyle/careers/passive-income-dreams-2e67ee5c
1•apparent•16m ago•0 comments

California Is Chasing Wealth That Has Feet

https://progressandpoverty.substack.com/p/california-is-chasing-wealth-that
3•slackpad•17m ago•2 comments

Millions take omega-3 fish oil for brain health but a new study found no benefit

https://www.sciencedaily.com/releases/2026/06/260622014300.htm
2•eustoria•20m ago•0 comments

Investigating Linux Graphics (2025)

https://roscidus.com/blog/blog/2025/06/24/graphics/
1•birdculture•20m ago•0 comments

Apple Just Fixed Mac Gaming and Said Nothing [video]

https://www.youtube.com/watch?v=3ZlPEsiaGiQ
2•lordleft•20m ago•0 comments

Show HN: 143.dev – we open-sourced our internal coding-agent infrastructure

5•johnjwang•21m ago•0 comments

Om

https://netwert.com/ideapad/2026/06/30/om/
2•eustoria•23m ago•0 comments

Impressions from Visiting OpenAI, Anthropic, & Cursor

https://newsletter.pragmaticengineer.com/p/impressions-from-visiting-openai
1•monkeydust•23m ago•0 comments

New Claude app strings, Fable 5 coming back only with verification

https://twitter.com/kimmonismus/status/2071868011804266828
4•garo-pro•23m ago•0 comments

Show HN: Build better UI with Hermes agents

https://www.typeui.sh/docs/guides/hermes
1•elwingo1•24m ago•0 comments

We Are All Weird

https://seths.blog/2026/06/we-are-all-weird-2/
2•eustoria•24m ago•0 comments

The first early human eggs from stem cells

https://www.conception.bio/science-and-updates/the-first-early-human-eggs-from-stem-cells
1•mattkrisiloff•26m ago•0 comments