frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you send large sets of data to an LLM

2•obayesshelton•1y ago
So, I am hitting limits with the amount of data I am sending to Claude via the API.

I am trying to send about 5000 comments but I would love to send more but I am hitting limits of the message size.

What are some good design patterns when sending large sets of data to an LLM?

I ideally need to send all the data together at it gives context to the overall prompt.

Comments

curious_curios•1y ago
Some approaches we’ve used:

- Group the comments by theme, then pass groups to the LLM to summarize/deduplicate then pass the outputs of that as context.

- RAG where only relevant parts are included in the context.

- Use an LLM with a larger context window (like Gemini pro)

obayesshelton•1y ago
Yeah, I think I need to use Gemini

Question, how if possible could you query rows in a table?

Surely the better approach would be to have some sort of connection to table rows?

The Great Blogging Collapse: What Happened to 100 Successful Blogs?

https://danielstanica.com/posts/Great-Blogging-Collapse
1•thm•3m ago•0 comments

Design your MCP server like a UI, not an API

https://bump.sh/blog/4-rules-to-build-an-efficient-mcp-server/
1•scharrier•6m ago•0 comments

Trouble keeps finding Supermicro as server shipments attract police attention

https://www.theregister.com/legal/2026/07/02/trouble-keeps-finding-supermicro-as-strange-server-s...
1•jnord•6m ago•0 comments

"Can't wait to see what people will do with GPT-5.6 Sol"

https://twitter.com/thsottiaux/status/2072607914217320644
2•throwaway2027•7m ago•0 comments

Theoretical Bottlenecks for Scaling LLM Inference to Get Higher Token per Second

https://twitter.com/freddie_spirit/status/2072610863664501129
1•arjmandi•13m ago•1 comments

Show HN: Envcontract – Validate your .env and never commit a secret (100% local)

https://github.com/hamzamansoorch/envcontract
2•hamza_mansoor•13m ago•0 comments

New Apple compression algorithms: LZRAVEN and LZMESH

https://developer.apple.com/documentation/compression/compression_lzraven
1•a-french-anon•18m ago•1 comments

Website Keeps Me Focused

https://deepfocus.space/en
1•mike_watson•21m ago•0 comments

Poland's SGE unveils plans for UK fleet of 14 nuclear SMRs

https://www.energyvoice.com/renewables-energy-transition/nuclear/600094/polish-billionaire-solowo...
1•mpweiher•22m ago•0 comments

Show HN: Dart_agent_core – Run AI agents in Flutter apps with lifecycle hooks

https://github.com/memex-lab/dart_agent_core
1•sparkleMing•25m ago•0 comments

Andy Burnham could raise £15B – without a tax rise

https://taxpolicy.org.uk/2026/07/01/andy-burnham-tax-gap-15bn/
1•frereubu•25m ago•0 comments

Your site, your rules: new AI traffic options for all customers

https://blog.cloudflare.com/content-independence-day-ai-options/
1•frereubu•26m ago•0 comments

Apricot Computers: An underrated British brand

https://dfarq.homeip.net/apricot-computers-an-underrated-british-brand/
1•rbanffy•29m ago•0 comments

Show HN: A tool to sync env files to your Git worktrees

https://github.com/alxwrd/git-env
1•alxwrd•29m ago•0 comments

Show HN: Ghbrk – Let AI agents run Git/gh without exposing SSH keys/API tokens

https://github.com/marconae/ghbrk
2•marconae•31m ago•0 comments

Open Source AI Must Win

https://opensourceaimustwin.com
4•Gedxx•33m ago•0 comments

Building a car recognition application (pt. 1)

https://blog.wildedge.dev/posts/we-built-a-car-recognizer-in-an-afternoon
2•piotrekno1•35m ago•0 comments

Axelrod – A research tool for the Iterated Prisoner's Dilemma

https://github.com/Axelrod-Python/Axelrod
1•hamburgererror•36m ago•0 comments

Don't expect trackers to save your stolen car, experts say

https://www.bbc.com/news/articles/cp8r1798kp7o
1•mytailorisrich•37m ago•0 comments

The energy cost of web advertising

https://dl.acm.org/doi/10.1145/3774904.3792414
1•iamacyborg•38m ago•0 comments

A macOS bell that rings when your Codex CLI session needs input

https://github.com/foxtrotdev/codex-butler-bell
1•zeetyy404•39m ago•0 comments

Of Course Meta Platforms Is Going to Be a Cloud

https://www.nextplatform.com/cloud/2026/07/01/of-course-meta-platforms-is-going-to-be-a-cloud/526...
1•rbanffy•43m ago•0 comments

Floor plan area micro-eval

https://kerrickstaley.com/2026/07/01/floor-plan-area-micro-eval
1•KerrickStaley•45m ago•0 comments

Who wins the World Cup if *not football* decides?

https://dataguessr.com/world-cup-2026/
1•davidbauer•52m ago•0 comments

Built a cell from scratch for the first time

https://www.cnn.com/2026/07/01/science/synthetic-cell-research
1•vinnyglennon•53m ago•0 comments

Show HN: Get a structured knowledge base instantly from conversations

https://sofie.wiki
1•tapeo•53m ago•0 comments

Turning Roads into Power Plants

https://www.reps.energy/
1•doener•54m ago•0 comments

How much onboarding friction is acceptable in healthcare software?

https://geekyants.com/case-studies/dentify-40-percent-onboarding-time-reduction
3•Krishnaswaroop•54m ago•0 comments

Show HN: I trained a 1B LLM from scratch for $315 and open-sourced weights+data

https://huggingface.co/AIIT-Threshold/Tessera-1B
2•Aiit-threshold•56m ago•0 comments

Show HN: Azure DevOps/GitHub TUI

https://github.com/Elpulgo/azdo/releases/tag/v0.7.1-beta
1•elpulgo•56m ago•0 comments