
Ask HN: How do you send large sets of data to an LLM

2•obayesshelton•1y ago
So, I am hitting limits with the amount of data I am sending to Claude via the API.

I am trying to send about 5,000 comments, and would love to send more, but I am hitting the message size limit.

What are some good design patterns when sending large sets of data to an LLM?

Ideally, I need to send all the data together, as it gives context to the overall prompt.

Comments

curious_curios•1y ago
Some approaches we’ve used:

- Group the comments by theme, pass each group to the LLM to summarize/deduplicate, then pass those outputs as context.

- RAG where only relevant parts are included in the context.

- Use an LLM with a larger context window (like Gemini Pro).
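
The first approach in the list above (summarize groups, then pass the summaries on as context) might look roughly like the sketch below. It is only an illustration under assumptions: `summarize` is a hypothetical callable standing in for whatever LLM call you use, `max_chars` is a crude stand-in for a real token budget, and batching is by size rather than by theme.

```python
from typing import Callable, List


def chunk_by_budget(items: List[str], max_chars: int) -> List[List[str]]:
    """Pack items into batches whose newline-joined length stays within max_chars.

    An item longer than max_chars still gets a batch of its own.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    size = 0
    for item in items:
        cost = len(item) + 1  # +1 for the newline separator
        if current and size + cost > max_chars:
            batches.append(current)
            current, size = [], 0
        current.append(item)
        size += cost
    if current:
        batches.append(current)
    return batches


def map_reduce_summary(
    comments: List[str],
    summarize: Callable[[str], str],  # hypothetical: wraps your LLM call
    max_chars: int = 8000,            # assumed budget, not a real API limit
) -> str:
    """Summarize batches, then summarize the summaries until they fit."""
    partials = [summarize("\n".join(b)) for b in chunk_by_budget(comments, max_chars)]
    if not partials:
        return ""
    while len(partials) > 1 and sum(len(p) + 1 for p in partials) > max_chars:
        merged = [summarize("\n".join(b)) for b in chunk_by_budget(partials, max_chars)]
        if len(merged) >= len(partials):
            break  # summaries are not shrinking; stop rather than loop forever
        partials = merged
    if len(partials) == 1:
        return partials[0]
    return summarize("\n".join(partials))
```

The trade-off is that detail is lost at every summarization step, so this fits best when the final prompt needs overall themes rather than exact wording.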

obayesshelton•1y ago
Yeah, I think I need to use Gemini.

Question: how, if possible, could you query rows in a table?

Surely the better approach would be to have some sort of connection to table rows?
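
One possible reading of "a connection to table rows" is to query the table yourself and put only the matching rows into the prompt. A minimal sketch, assuming the comments live in a SQLite table; the `comments` table, its columns, and the `build_context` helper are all hypothetical names made up for illustration:

```python
import sqlite3


def build_context(conn: sqlite3.Connection, keyword: str, limit: int = 50) -> str:
    """Fetch only the rows that mention keyword and format them as prompt context."""
    rows = conn.execute(
        "SELECT id, body FROM comments WHERE body LIKE ? ORDER BY id LIMIT ?",
        (f"%{keyword}%", limit),
    ).fetchall()
    return "\n".join(f"[{row_id}] {body}" for row_id, body in rows)


# Demo with an in-memory database and made-up rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE comments (id INTEGER PRIMARY KEY, body TEXT)")
conn.executemany(
    "INSERT INTO comments (body) VALUES (?)",
    [("Shipping was slow",), ("Great price",), ("Slow refund process",)],
)
context = build_context(conn, "slow")
print(context)
```

Keyword matching is the simplest form of retrieval; the same function could instead be exposed to the model as a tool, or swapped for an embedding search, but those are separate design choices not covered here.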
