frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: Can sharded contexts scale up to long-context with global composition?

2•deazy•3h ago
I was exploring this conceptual architecture for neural networks (transformers/attention) based long-context models, its conceptual but grounded in sound existing research and architecture implementations on specialized hardware like gpu's and tpu's.

Can a we scale up independent shards of (mini) contexts, i.e Sub-global attention blocks or "sub-context experts" that can operate somewhat independently with global composition into a larger global attention as a paradigm for handling extremely long contexts. Context shared, distributed and sharded across chips, that can act as Independent shards of (mini) Contexts.

This could possibly (speculating here) make attention based context sub-quadratic. Its possible (again speculating here) google might have used something like this for having such long context windows.

Evidence points to this: Google's pioneering MoE research (Shazeer, GShard, Switch), advanced TPUs (v4/v5p/Ironwood) with massive HBM & high-bandwidth 3D Torus/OCS Inter-Chip Interconnect (ICI) enabling essential distribution (MoE experts, sequence parallelism like Ring Attention), and TPU pod VRAM capacities aligning with 10M token context needs. Google's Pathways & system optimizations further support possibility of such a distributed, concurrent model.

Share your thoughts on this if its possible, feasible or why it might not work.

'A Billion Streams and No Fans': Inside a $10M AI Music Fraud Case

https://www.wired.com/story/ai-bots-streaming-music/
1•tosh•38s ago•0 comments

Red Programming Language

https://www.red-lang.org/p/about.html
1•hotpocket777•1m ago•0 comments

Show HN: I made a zero-commission AI prompt marketplace for selling AI Prompts

https://promptstand.io
1•bmadigan•3m ago•0 comments

KumoRFM: A Foundation Model for In-Context Learning on Relational Data [pdf]

https://kumo.ai/research/kumo_relational_foundation_model.pdf
1•gk1•4m ago•0 comments

Google Glasses are back, sort of. XREAL and Google announce partnership for XR

https://www.laptopmag.com/gaming/vr/xreal-project-aura-google-io-2025
1•cokext30•4m ago•0 comments

Running Claude in a loop to write a novel

https://github.com/brumar/loop
1•handfuloflight•6m ago•0 comments

Show HN: Node.js Memory Limits Visualized

https://github.com/csabapalfi/node-memory-limits
1•csabapalfi•6m ago•0 comments

Reaching Higher: Megan Gleason '18 Climbs to International Competition

https://www.whitman.edu//whitman-stories/reaching-higher-megan-gleason-18-climbs-to-international-competition
1•mooreds•7m ago•0 comments

Google launches AI Ultra: A $3k/year 'VIP pass' to its most powerful AI tools

https://www.androidauthority.com/google-ai-ultra-plan-3559209/
1•abraham•7m ago•0 comments

From hype to harm: 78% of CISOs see AI attacks already

https://www.theregister.com/2025/05/16/cisos-report-ai-attacks/
2•Bender•7m ago•0 comments

Millions at risk after attackers steal UK legal aid data dating back 15 years

https://www.theregister.com/2025/05/19/legal_aid_agency_data_theft/
1•Bender•8m ago•0 comments

Delft unveils open-architecture quantum computer, Tuna-5

https://ioplus.nl/en/posts/delft-unveils-open-architecture-quantum-computer-tuna-5
1•donutloop•9m ago•0 comments

Show HN: GeniusPlants – AI-Powered Gardening Assistant

https://www.geniusplants.com/
1•eibrahim•10m ago•0 comments

IQM to deliver world-leading 300-qubit quantum computer to Finland

https://meetiqm.com/press-releases/iqm-to-deliver-world-leading-300-qubit-quantum-computer-to-finland/
1•donutloop•10m ago•0 comments

LastOS slaps neon paint on Linux Mint and dares you to run Photoshop

https://www.theregister.com/2025/05/19/lastos/
1•Bender•10m ago•0 comments

The Labor Market for Recent College Graduates

https://www.newyorkfed.org/research/college-labor-market#--:overview
1•NaOH•10m ago•0 comments

Helblazer811/Diffusion-Explorer: Interactive Visualizations

https://github.com/helblazer811/Diffusion-Explorer
1•diginova•11m ago•0 comments

D-Wave Announces General Availability of Advantage2 Quantum Computer

https://www.dwavequantum.com/company/newsroom/press-release/d-wave-announces-general-availability-of-advantage2-quantum-computer-its-most-advanced-and-performant-system/
1•donutloop•11m ago•0 comments

With AI Mode, Google Search Is About to Get Even Chattier

https://www.wired.com/story/google-ai-mode-search/
2•tosh•12m ago•0 comments

Gemma 3n preview: powerful, efficient, mobile-first AI

https://developers.googleblog.com/en/introducing-gemma-3n/
2•meetpateltech•12m ago•0 comments

Rbdoom-3-BFG: Doom 3 port using Nvidia's NVRHI

https://github.com/RobertBeckebans/RBDOOM-3-BFG
1•klaussilveira•12m ago•0 comments

Show HN: I Made Resend.com Cheaper

https://app.selfmailkit.com/
2•javidabd•13m ago•0 comments

FDA will limit Covid vaccines to people over 65 or high risk of serious illness

https://www.statnews.com/2025/05/20/fda-vaccine-framework-new-covid-shot-recommendations-vinay-prasad-marty-makary/
5•perihelions•13m ago•1 comments

KumoRFM: Gen-purpose model for making instant predictions over relational data

https://kumo.ai/company/news/kumo-relational-foundation-model/
2•agold97•13m ago•0 comments

Nvidia Donut: real-time rendering framework

https://github.com/NVIDIA-RTX/Donut
1•klaussilveira•13m ago•0 comments

Why 3D doesn't work and never will. Case closed. (2011)

https://web.archive.org/web/20110807170951/http://blogs.suntimes.com/ebert/2011/01/post_4.html
1•Tomte•13m ago•0 comments

Why are linked lists implemented the way they are in the Linux kernel? (2014)

https://www.quora.com/Why-are-linked-lists-implemented-the-way-they-are-in-the-Linux-kernel
1•Tomte•14m ago•0 comments

The evolution of onboard cameras in Formula One

https://www.dive-bomb.com/article/the-evolution-of-onboard-cameras-in-formula-one-part-1
1•austinallegro•15m ago•0 comments

Sockudo: High-Performance Pusher-Compatible WebSockets Built with Rust

https://sockudo.app/
1•kondro•16m ago•0 comments

The Meritocracy to Eugenics Pipeline

https://pluralistic.net/2025/05/20/big-cornflakes-energy/#caliper-pilled
4•rbanffy•16m ago•0 comments