Ask HN: How do you send large sets of data to an LLM

2•obayesshelton•1y ago
I am hitting limits on the amount of data I can send to Claude via the API.

I am trying to send about 5,000 comments, and I would love to send more, but I am hitting the message size limit.

What are some good design patterns for sending large sets of data to an LLM?

Ideally I need to send all the data together, as it gives context to the overall prompt.

Comments

curious_curios•1y ago
Some approaches we’ve used:

- Group the comments by theme, pass each group to the LLM to summarize and deduplicate, then pass those outputs as context.

- RAG, where only the relevant parts are retrieved and included in the context.

- Use an LLM with a larger context window (like Gemini Pro).
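
The first approach above (summarize groups, then pass the summaries as context) can be sketched roughly as follows. This is only an illustration under stated assumptions: `batch_by_budget` and `build_context` are hypothetical helper names, the character budget is a crude stand-in for a real token count, and `summarize` is whatever function you write to call your LLM of choice. Grouping by theme is replaced here by simple greedy batching.

```python
from typing import Callable, Iterable


def batch_by_budget(comments: Iterable[str], max_chars: int) -> list[list[str]]:
    """Greedily pack comments into batches whose joined length stays within max_chars.

    A single comment longer than max_chars still gets a batch of its own.
    """
    batches: list[list[str]] = []
    current: list[str] = []
    used = 0
    for comment in comments:
        size = len(comment) + 1  # +1 for the newline used when joining
        if current and used + size > max_chars:
            batches.append(current)
            current, used = [], 0
        current.append(comment)
        used += size
    if current:
        batches.append(current)
    return batches


def build_context(
    comments: Iterable[str],
    summarize: Callable[[str], str],
    max_chars: int = 20_000,
) -> str:
    """Summarize each batch separately, then join the partial summaries.

    `summarize` is an assumption: any callable that takes a block of text
    and returns a shorter summary (for example, a wrapper around an LLM call).
    """
    partials = [
        summarize("\n".join(batch))
        for batch in batch_by_budget(comments, max_chars)
    ]
    return "\n".join(partials)
```

The joined summaries can then be placed in the final prompt in place of the raw comments. If the summaries are still too large, the same function could be applied to them again, at the cost of losing more detail on each pass.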

obayesshelton•1y ago
Yeah, I think I need to use Gemini.

Question: is it possible to have the LLM query rows in a table, and if so, how?

Surely the better approach would be to give it some sort of connection to the table rows?
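
One simple way to approximate a "connection to table rows" is to have your own code query the table and send only the matching rows. The sketch below assumes a hypothetical SQLite table `comments(id, body)` and hypothetical helpers `fetch_relevant_comments` and `build_prompt`; a keyword `LIKE` filter stands in for whatever retrieval method you actually use.

```python
import sqlite3


def fetch_relevant_comments(
    conn: sqlite3.Connection, keyword: str, limit: int = 50
) -> list[str]:
    """Return up to `limit` comment bodies that contain `keyword`.

    SQLite's LIKE is case-insensitive for ASCII characters by default.
    """
    rows = conn.execute(
        "SELECT body FROM comments WHERE body LIKE ? ORDER BY id LIMIT ?",
        (f"%{keyword}%", limit),
    ).fetchall()
    return [row[0] for row in rows]


def build_prompt(question: str, rows: list[str]) -> str:
    """Put only the retrieved rows into the prompt, followed by the question."""
    context = "\n".join(f"- {row}" for row in rows)
    return f"Comments:\n{context}\n\nQuestion: {question}"


# Demo with an in-memory database and made-up rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE comments (id INTEGER PRIMARY KEY, body TEXT)")
conn.executemany(
    "INSERT INTO comments (body) VALUES (?)",
    [("Shipping was slow",), ("Great price",), ("Slow checkout page",)],
)
prompt = build_prompt(
    "What do people find slow?", fetch_relevant_comments(conn, "slow")
)
```

The resulting `prompt` contains only the two rows that mention "slow", so the request stays small no matter how large the table grows.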
