frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

GitHub faces a fight for its survival at Microsoft

https://www.theverge.com/tech/935250/microsoft-github-struggles-notepad
1•sbulaev•16s ago•0 comments

Who's the Best Batter? Estimating Probabilities from Unevenly Collected Data

https://winvector.github.io/WVExamples/blog/batting-estimation-stan/baseball_stats/
1•jmount•42s ago•0 comments

Why Taxing the Wealthy Is Harder Than It Looks

https://ofdollarsanddata.com/why-taxing-the-wealthy-is-harder-than-it-looks/
1•RickJWagner•1m ago•0 comments

Ted Turner Was Elon Musk Before Elon Musk Was Elon Musk

https://www.realclearmarkets.com/articles/2026/05/21/ted_turner_was_elon_musk_before_elon_musk_wa...
1•RickJWagner•2m ago•0 comments

Ask HN: What AI prompts have you found most reliable for actual work?

1•HrachShah•4m ago•0 comments

Ask HN: Is HN Blocking Mullvad VPN?

1•burger_moon•5m ago•0 comments

Megalodon: Mass GitHub Repo Backdooring via CI Workflows

https://safedep.io/megalodon-mass-github-repo-backdooring-ci-workflows/
1•Sudhanshu2310•5m ago•0 comments

NYS sues prediction platforms over gambling allegations

https://www.newsday.com/business/technology/prediction-markets-new-york-attorney-general-letitia-...
1•1vuio0pswjnm7•8m ago•0 comments

Proxmox Virtual Environment 9.2 with Dynamic Load Balancer Released

https://www.proxmox.com/en/about/company-details/press-releases/proxmox-virtual-environment-9-2
2•speckx•12m ago•0 comments

Codex for Everything Exfiltrates Connected Data

https://www.promptarmor.com/resources/codex-for-everything-exfiltrates-connected-data
2•takira•13m ago•0 comments

Inside SpaceX's IPO Plan

https://www.ft.com/content/a59be3cf-eee2-4b10-9c86-b6e4dc0dbbdb
3•1vuio0pswjnm7•13m ago•1 comments

The fastest growing political party is Cockroach Janata Party [video]

https://www.youtube.com/watch?v=uuFmKx5K9tc
1•Guestmodinfo•14m ago•0 comments

Leetcode.nvim

https://github.com/sidntrivedi/leetcode.nvim
2•sidntrivedi•15m ago•1 comments

Agents Sometimes Catastrophize

https://futuresearch.ai/blog/agents-catastrophize/
6•ddp26•16m ago•0 comments

EPA Official Agrees to Review Data Center Water Impact (AOC Shows Dirty Water)

https://news.bloomberglaw.com/environment-and-energy/epa-to-investigate-meta-data-center-link-to-...
2•zzzeek•17m ago•1 comments

DashAttention: Differentiable and Adaptable Sparse Hierarchical Attention

https://arxiv.org/abs/2605.18753
3•cmogni1•18m ago•0 comments

Test-Driving the Lance Lakehouse Format in DuckDB

https://duckdb.org/2026/05/21/test-driving-lance
2•tanelpoder•20m ago•0 comments

S3-Compatible object storage at $15/TB with free egress and CDN

https://filebase.com/blog/introducing-filebase-object-storage-with-free-egress/
4•acejam•20m ago•0 comments

Temporal is becoming Crystal Palace Football Club's front-of-shirt partner

https://temporal.io/blog/crystal-palace-partnership
2•ldite•21m ago•0 comments

SpaceX is heavily reliant on Starlink for growth and profit for IPO

https://www.cnbc.com/2026/05/21/spacex-starlink-growth-profit-nasdaq-ipo.html
2•drob518•21m ago•1 comments

SpaceX IPO reads like Hollywood fantasy version of the future

https://fortune.com/2026/05/21/spacex-ipo-musk-mars-colony-dinosaurs-space-exploration/
4•1vuio0pswjnm7•22m ago•1 comments

Apple to broadcast MLS game shot entirely on 15 iPhones

https://variety.com/2026/digital/news/apple-mls-match-shot-entirely-on-iphone-first-time-1236755744/
1•dkobia•22m ago•0 comments

White House postpones AI executive order signing ceremony

https://www.axios.com/2026/05/21/white-house-postpones-ai-eo-signing
2•anigbrowl•23m ago•0 comments

Ask HN: Failing interviews for mid-level SWE in UK, advice please

1•mjb8086•24m ago•0 comments

I created an extension for Claude that shares context on how you work

https://github.com/stubbleapp/Stubble
1•satay_chicken31•26m ago•0 comments

A multi-agent system for automating scientific discovery

https://www.nature.com/articles/s41586-026-10652-y
1•Timofeibu•27m ago•0 comments

Chewing gum restores dad's taste and smell years after Covid

https://discover.swns.com/2026/05/chewing-gum-restores-dads-taste-and-smell-years-after-covid/
12•speckx•28m ago•0 comments

Show HN: From one Claude agent to a fleet – in five small steps

1•sermakarevich•28m ago•0 comments

Sony Flamingo - The Coolest Record Player Ever Made

https://obsoletesony.substack.com/p/the-coolest-record-player-ever-made
2•reconnecting•29m ago•0 comments

A permissively licensed Vita FPGA Architecture in only 380 lines of Verilog

https://github.com/VitaSetLLC/VitaOS-Libre
1•VitaSetLLC•30m ago•0 comments
Open in hackernews

Ask HN: How do you send large sets of data to an LLM

2•obayesshelton•1y ago
So, I am hitting limits with the amount of data I am sending to Claude via the API.

I am trying to send about 5000 comments but I would love to send more but I am hitting limits of the message size.

What are some good design patterns when sending large sets of data to an LLM?

I ideally need to send all the data together at it gives context to the overall prompt.

Comments

curious_curios•1y ago
Some approaches we’ve used:

- Group the comments by theme, then pass groups to the LLM to summarize/deduplicate then pass the outputs of that as context.

- RAG where only relevant parts are included in the context.

- Use an LLM with a larger context window (like Gemini pro)

obayesshelton•1y ago
Yeah, I think I need to use Gemini

Question, how if possible could you query rows in a table?

Surely the better approach would be to have some sort of connection to table rows?