frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you send large sets of data to an LLM

2•obayesshelton•1y ago
So, I am hitting limits with the amount of data I am sending to Claude via the API.

I am trying to send about 5000 comments but I would love to send more but I am hitting limits of the message size.

What are some good design patterns when sending large sets of data to an LLM?

I ideally need to send all the data together at it gives context to the overall prompt.

Comments

curious_curios•1y ago
Some approaches we’ve used:

- Group the comments by theme, then pass groups to the LLM to summarize/deduplicate then pass the outputs of that as context.

- RAG where only relevant parts are included in the context.

- Use an LLM with a larger context window (like Gemini pro)

obayesshelton•1y ago
Yeah, I think I need to use Gemini

Question, how if possible could you query rows in a table?

Surely the better approach would be to have some sort of connection to table rows?

Microsoft and Uber Are Running into an AI Cost Problem

https://firethering.com/microsoft-uber-ai-coding-tools-more-expensive-than-human-workers/
1•steveharing1•27s ago•0 comments

StyloBot- Open Source self hosted behavioural bot protection

https://stylobot.net
1•scottgal•2m ago•0 comments

Benchmarking Vortex File Format vs. Parquet, CSV vs. DuckDB, Polars, Datafusion

https://dataengineeringcentral.substack.com/p/benchmarking-vortex-file-format-vs
1•eigenBasis•3m ago•0 comments

Raft Consensus with a Minority of Nodes

https://padhye.org/raft-minority/
1•moarbugs•3m ago•0 comments

Delta Brain Sync · Streamlit

https://delta-brain-sync-k99vym7mbyebesrfdl84sm.streamlit.app
1•TELEFOXX•3m ago•0 comments

Solar, wind and batteries push down electricity bills for homes and business

https://reneweconomy.com.au/solar-wind-and-batteries-push-down-electricity-bills-for-homes-and-bu...
1•doener•4m ago•0 comments

EU plans to fine Google high triple-digit million euro sum, Handelsblatt reports

https://www.reuters.com/world/europe/eu-plans-fine-google-high-triple-digit-million-euro-sum-hand...
2•LelouBil•4m ago•0 comments

PHP – simple way to send HTTP headers before a script ends

https://shkspr.mobi/blog/2026/05/php-simple-way-to-send-http-headers-before-a-script-ends/
1•blenderob•4m ago•0 comments

Terjangkau: Neighborhood Explorer

https://github.com/altilunium/terjangkau
1•altilunium•4m ago•0 comments

Prompter – Compare and benchmark Ollama models side-by-side in your terminal

https://github.com/whonixnetworks/prompter
1•whonixnetworks•6m ago•0 comments

Show HN: Self-managing codebase with long-horizon agents

https://github.com/WillTaylor22/self-managing-codebase
1•wrftaylor•8m ago•1 comments

"Long-Term Support" doesn't mean what you think

https://pointieststick.com/2026/05/23/long-term-support-doesnt-mean-what-you-think/
1•birdculture•10m ago•0 comments

Five foundations for building complex Ruby on Rails apps

https://paweldabrowski.com/farewell-to-rails-way/five-foundations-for-building-complex-rails-apps
1•pdabrowski6•11m ago•0 comments

Tools and skills for humans and agents to review via Magnifica Humanitas

https://encyclical.ai/
2•willf•12m ago•0 comments

NL govt blocks DigiD takeover by solvinity

https://nos.nl/artikel/2615885-staatssecretaris-verbiedt-overname-solvinity-bedrijf-achter-digid
3•hvb2•21m ago•1 comments

Show HN: Judicex – Open-source legal AI that abstains instead of hallucinating

https://github.com/JustVugg/judicex
3•vforno•24m ago•0 comments

I bypassed AWS API Gateway auth with a trailing slash. Got $12K bounty

https://theguptalog.blogspot.com/2026/04/i-bypassed-aws-api-gateway-auth-with.html
6•tjek•24m ago•6 comments

Disappearing Polymorph

https://en.wikipedia.org/wiki/Disappearing_polymorph
1•thunderbong•28m ago•0 comments

The Cow and the Bison (and PFAS)

https://cmarmitage.substack.com/p/the-cow-and-the-bison-with-an-urgent
2•JumpCrisscross•30m ago•0 comments

Uber president says AI spending is getting 'harder to justify'

https://www.theverge.com/transportation/937116/uber-ai-investment-hard-to-justify
3•berlianta•32m ago•0 comments

'Epstein class' has become a populist battle cry in US politics

https://www.ft.com/content/e9ee464f-a43e-4088-aade-053e2c135f5d
3•JumpCrisscross•35m ago•0 comments

Find-dup-defs – find duplicated Python code at the speed of light

https://github.com/prostomarkeloff/find-dup-defs
1•notmarkeloff•36m ago•0 comments

We Are Living in Pinocchio's World

https://om.co/2026/05/25/we-are-living-in-pinocchios-world/
1•herbertl•36m ago•0 comments

Pope Leo Compares AI Threat to Biblical 'Tower of Babel'

https://www.wsj.com/world/pope-leo-ai-encyclical-c5e1af6c
1•doener•37m ago•0 comments

Juris Upatnieks, the founder of holography has died

https://www.lza.lv/en/activities/news/2556-in-memoriam-juris-upatnieks-7-may-1936-17-may-2026
1•tomaac_•38m ago•0 comments

Ask HN: Is there a need for YAML in post-LLM world?

1•throwaw12•39m ago•1 comments

New redesigned Google icons

https://www.theverge.com/tech/932417/google-gmail-docs-cal-sheets-workspace-icon-redesign
2•kailovel•40m ago•0 comments

The state of AI voice assistants is bad but there's a clear winner

https://simianwords.bearblog.dev/the-state-of-ai-voice-assistants-is-bad-but-theres-a-clear-winner/
1•simianwords•40m ago•0 comments

Show HN: Hush – A self-hostable, OpenMLS Discord alternative written in Go

https://github.com/hushhq/hush
2•MrClouds•40m ago•1 comments

Show HN: Audit your Linux VPS security in one command

https://github.com/Secure-Code-HQ/audit
1•juanisidoro•47m ago•0 comments