Ask HN: How do you send large sets of data to an LLM

2•obayesshelton•9mo ago
I am hitting limits on the amount of data I can send to Claude via the API.

I am trying to send about 5,000 comments, and I would love to send more, but I keep hitting the message size limit.

What are some good design patterns for sending large sets of data to an LLM?

Ideally I need to send all the data together, as it gives context to the overall prompt.

Comments

curious_curios•9mo ago
Some approaches we’ve used:

- Group the comments by theme, pass each group to the LLM to summarize/deduplicate, then pass those outputs as context.

- RAG, where only the relevant parts are included in the context.

- Use an LLM with a larger context window (like Gemini Pro).
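
A rough sketch of the first approach (summarize groups, then summarize the summaries). It is an illustration, not a tested pipeline. `summarize` is a hypothetical callable standing in for whatever LLM call you use, and `max_chars` is an assumed stand-in for the real token limit. Grouping here is a simple greedy pack by size rather than by theme.

```python
from typing import Callable, Iterable, List


def batch_by_budget(comments: Iterable[str], max_chars: int) -> List[List[str]]:
    """Greedily pack comments into batches whose combined length stays within max_chars.

    A single comment longer than max_chars still gets a batch of its own,
    so callers may want to truncate or split very long comments first.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    size = 0
    for comment in comments:
        # Close the current batch when this comment would push it over budget.
        if current and size + len(comment) > max_chars:
            batches.append(current)
            current, size = [], 0
        current.append(comment)
        size += len(comment)
    if current:
        batches.append(current)
    return batches


def map_reduce_summary(
    comments: Iterable[str],
    summarize: Callable[[str], str],  # hypothetical: wraps your LLM call
    max_chars: int = 20_000,          # assumed budget, not a real API limit
) -> str:
    """Summarize each batch, then summarize the joined partial summaries."""
    partials = [
        summarize("\n".join(batch))
        for batch in batch_by_budget(comments, max_chars)
    ]
    # Simplification: assumes the joined partial summaries fit in one call.
    return summarize("\n".join(partials))
```

The final summary (or the partial summaries themselves) can then be sent as context alongside the actual prompt, which keeps some overall picture of all the comments without sending every one verbatim.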

obayesshelton•9mo ago
Yeah, I think I need to use Gemini.

Question: how, if it's possible, could you query rows in a table?

Surely the better approach would be to have some sort of connection to the table rows?
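
One way to read "a connection to table rows" is to query the table yourself and put only the matching rows into the prompt. A minimal sketch under assumptions: the data sits in a SQLite table named `comments` with `id` and `body` columns (both names are hypothetical), and a plain keyword match is good enough for relevance.

```python
import sqlite3
from typing import List, Tuple


def fetch_relevant_rows(
    conn: sqlite3.Connection, keyword: str, limit: int = 50
) -> List[Tuple[int, str]]:
    """Return up to `limit` (id, body) rows whose body contains the keyword."""
    cursor = conn.execute(
        # Parameterized query; `comments`, `id`, `body` are assumed names.
        "SELECT id, body FROM comments WHERE body LIKE ? ORDER BY id LIMIT ?",
        (f"%{keyword}%", limit),
    )
    return cursor.fetchall()


def build_prompt(question: str, rows: List[Tuple[int, str]]) -> str:
    """Put only the retrieved rows into the prompt, tagged with their ids."""
    context = "\n".join(f"[{row_id}] {body}" for row_id, body in rows)
    return f"Context rows:\n{context}\n\nQuestion: {question}"
```

The string returned by `build_prompt` is what you would send to the model. Swapping the `LIKE` filter for an embedding-based search gives you the RAG variant mentioned above.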
