frontpage.

Traditional databases rely on RAG and vector databases or SQL-based transformations/analytics. But will they be able to preserve per-row contextual understanding?

We’ve released Agents as part of Datatune:

https://github.com/vitalops/datatune

In a single prompt, you can define multiple tasks for data transformations, and Datatune performs the transformations on your data at a per-row level, with contextual understanding.

Example prompt:

"Extract categories from the product description and name. Keep only electronics products. Add a column called ProfitMargin = (Total Profit / Revenue) * 100"

Datatune interprets the prompt and applies the right operation (map, filter, or an LLM-powered agent pipeline) on your data using OpenAI, Azure, Ollama, or other LLMs via LiteLLM.

Key Features

- Row-level map() and filter() operations using natural language

- Agent interface for auto-generating multi-step transformations

- Built-in support for Dask DataFrames (for scalability)

- Works with multiple LLM backends (OpenAI, Azure, Ollama, etc.)

- Compatible with LiteLLM for flexibility across providers

- Auto-token batching, metadata tracking, and smart pipeline composition

Token & Cost Optimization

- Datatune gives you explicit control over which columns are sent to the LLM, reducing token usage and API cost:

- Use input_fields to send only relevant columns

- Automatically handles batching and metadata internally

- Supports setting tokens-per-minute and requests-per-minute limits

- Defaults to known model limits (e.g., GPT-3.5) if not specified

- This makes it possible to run LLM-based transformations over large datasets without incurring runaway costs.

Why I Ditched Spotify, and How I Set Up My Own Music Stack

When to Hire a Computer Performance Engineering Team

Scientists are discovering a powerful new way to prevent cancer

Data Science Weekly – Issue 615

Real-Time Solutions at the "Cosmic Edge"

Cloudflare says it should have caught mis-issued 1.1.1.1 certificates earlier

UK government trial of M365 Copilot finds no clear productivity boost

Amazon's Hardcore Culture Reset

GPT-5: The Case of the Missing Agent

The Landscape of Agentic Reinforcement Learning for LLMs

Campfire Open Sourced

Subscriptions aren't enough for SaaS companies, we're now moving towards credits

Altair 8800, the first personal computer to make it big

Open Source Physics

Medium-voltage circuit breaker unlocks electricity abundance, savings

Hungry Hungry Hippos Autoplay

Show HN: Automatically surface and implement high-potential ideas for your repo [video]

Hybrid nanotube electrodes developed for safer brain-machine interfaces

Chinese Language on the Web (2016)

A One-Page Primer On: Statistical Power

Tell HN: Think twice before activating two-factor authentication on Vivaldi

Context Engineer MCP – Fixing Context Loss in AI Coding Agents

The complete list of African unicorns today

The million dollar mystery behind Milk.com

You Can Now Download the Tesla Robotaxi App, but It Only Does One Thing

What Is the Fourier Transform?

Interrupts – The Heartbeat of a Unix Kernel

Ask HN: What do you think of this idea?

Yes, America Has a Housing Emergency – Paul Krugman

Maak: The infinitely extensible command runner and automation à la Make

Show HN: Per row context understanding for data transformations