Ask HN: How do you send large sets of data to an LLM?

2•obayesshelton•2mo ago

I am hitting limits on the amount of data I can send to Claude via the API.

I am trying to send about 5,000 comments and would love to send more, but I keep hitting the message size limit.

What are some good design patterns when sending large sets of data to an LLM?

Ideally I need to send all the data together, as it gives context to the overall prompt.

Comments

curious_curios•2mo ago

Some approaches we’ve used:

- Group the comments by theme, pass each group to the LLM to summarize/deduplicate, then pass those outputs as context.

- RAG where only relevant parts are included in the context.

- Use an LLM with a larger context window (like Gemini Pro).
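
The first approach can be sketched roughly as below. This is only an illustration under assumptions: `batch_by_budget` and `summarize_all` are hypothetical helper names, `summarize` stands in for whatever LLM call you use (it is not a real API), and the character budget is a crude proxy for a token limit. Grouping by theme is not shown; comments are simply batched in order.

```python
from typing import Callable, Iterable, List


def batch_by_budget(items: Iterable[str], max_chars: int) -> List[List[str]]:
    """Pack items into consecutive batches that stay under max_chars.

    An item longer than max_chars still gets a batch of its own.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    size = 0
    for item in items:
        cost = len(item) + 1  # +1 for the newline used when joining
        if current and size + cost > max_chars:
            batches.append(current)
            current, size = [], 0
        current.append(item)
        size += cost
    if current:
        batches.append(current)
    return batches


def summarize_all(
    comments: Iterable[str],
    summarize: Callable[[str], str],  # hypothetical: wraps your LLM request
    max_chars: int = 20_000,
) -> str:
    """Summarize batches, then summarize the summaries until one remains."""
    level = list(comments)
    if not level:
        return ""
    while True:
        batches = batch_by_budget(level, max_chars)
        level = [summarize("\n".join(batch)) for batch in batches]
        if len(level) == 1:
            return level[0]
        # If another round would not shrink the list, merge once and stop.
        if len(batch_by_budget(level, max_chars)) >= len(level):
            return summarize("\n".join(level))
```

The final summary (or the per-batch summaries) can then be sent as context alongside the actual question. The trade-off is that detail is lost at each summarization step, so this suits questions about overall themes more than questions about individual comments.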

obayesshelton•2mo ago

Yeah, I think I need to use Gemini.

Question: is it possible to have the LLM query rows in a table, and if so, how?

Surely the better approach would be to have some sort of connection to table rows?
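
One possible shape for this, sketched under assumptions: keep the comments in a database and give the model a narrow lookup function instead of the whole table. The `comments` table, its columns, and the `make_db` and `search_comments` helpers below are hypothetical; wiring the function up as a tool depends on the LLM provider and is not shown.

```python
import sqlite3
from typing import Dict, Iterable, List, Tuple


def make_db(rows: Iterable[Tuple[str, str]]) -> sqlite3.Connection:
    """Create an in-memory table of (author, body) comments for the demo."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE comments (id INTEGER PRIMARY KEY, author TEXT, body TEXT)"
    )
    conn.executemany("INSERT INTO comments (author, body) VALUES (?, ?)", rows)
    conn.commit()
    return conn


def search_comments(
    conn: sqlite3.Connection, keyword: str, limit: int = 50
) -> List[Dict[str, object]]:
    """Return up to `limit` comments whose body contains `keyword`.

    The query is parameterized, so a keyword supplied by the model
    cannot alter the SQL itself.
    """
    cur = conn.execute(
        "SELECT id, author, body FROM comments "
        "WHERE body LIKE ? ORDER BY id LIMIT ?",
        (f"%{keyword}%", limit),
    )
    return [{"id": r[0], "author": r[1], "body": r[2]} for r in cur.fetchall()]
```

The model would then ask for, say, comments mentioning a keyword, and only the matching rows are added to the context. Exposing a fixed, parameterized query like this is safer than letting the model write arbitrary SQL, at the cost of flexibility.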
