Visualizing DeepSpeed Ulysses: Sequence Parallelism for 1M Context Windows

https://darshanfofadiya.com/part3.html
1•DARSHANFOFADIYA•1h ago

Comments

DARSHANFOFADIYA•1h ago
I've been working on optimizing training for long-context models (70B+) and found that while Tensor Parallelism is well-documented, the newer "Unified" Sequence Parallelism techniques (like DeepSpeed Ulysses) are often treated as black boxes.

I wrote this deep dive to visualize exactly how we shard the Q, K, V projections and how the All-to-All communication primitives work during the attention step to handle 1M+ tokens.

The post covers:

The architectural difference between Ring Attention and Ulysses (and why Ulysses often wins on H100 clusters).

Diagrams of the specific "All-to-All" communication steps.

How to handle the KV-cache bottleneck without exploding memory.

Happy to answer questions about the implementation or the communication cost analysis!
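
For readers who want the data movement in concrete terms, here is a toy, single-process sketch of the re-sharding that the All-to-All performs around the attention step. It is not the DeepSpeed implementation: the function names (`shard_by_sequence`, `all_to_all_seq_to_head`, `all_to_all_head_to_seq`) are hypothetical, ranks are simulated as plain Python lists, and each "tensor element" is only a `(token, head)` label so that the layout change is visible. It assumes the token count and head count both divide evenly by the number of ranks.

```python
# Toy simulation of sequence-parallel re-sharding (illustrative names, no real communication).

def shard_by_sequence(num_tokens, num_heads, world_size):
    """Before attention: rank r holds a contiguous block of tokens, with every head."""
    assert num_tokens % world_size == 0, "assumes tokens divide evenly across ranks"
    chunk = num_tokens // world_size
    return [
        [(t, h) for t in range(r * chunk, (r + 1) * chunk) for h in range(num_heads)]
        for r in range(world_size)
    ]

def all_to_all_seq_to_head(seq_shards, num_heads):
    """Simulated all-to-all: afterwards rank r holds ALL tokens for its own block of heads."""
    world_size = len(seq_shards)
    assert num_heads % world_size == 0, "assumes heads divide evenly across ranks"
    heads_per_rank = num_heads // world_size
    head_shards = [[] for _ in range(world_size)]
    for src in range(world_size):
        for (t, h) in seq_shards[src]:
            dst = h // heads_per_rank  # destination rank owns this head
            head_shards[dst].append((t, h))
    return [sorted(shard) for shard in head_shards]

def all_to_all_head_to_seq(head_shards, num_tokens):
    """Simulated inverse all-to-all: restore the sequence-sharded layout after attention."""
    world_size = len(head_shards)
    chunk = num_tokens // world_size
    seq_shards = [[] for _ in range(world_size)]
    for src in range(world_size):
        for (t, h) in head_shards[src]:
            dst = t // chunk  # destination rank owns this token block
            seq_shards[dst].append((t, h))
    return [sorted(shard) for shard in seq_shards]

if __name__ == "__main__":
    seq = shard_by_sequence(num_tokens=8, num_heads=4, world_size=2)
    heads = all_to_all_seq_to_head(seq, num_heads=4)
    for rank, shard in enumerate(heads):
        tokens = sorted({t for t, _ in shard})
        owned = sorted({h for _, h in shard})
        print(f"rank {rank}: tokens {tokens}, heads {owned}")
```

The point of the layout change is that each rank can then run ordinary attention over the full sequence for its own heads, and the second exchange puts the outputs back into the sequence-sharded layout for the rest of the layer.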

ClaireGz•1h ago
This is super helpful — most writeups skip over the actual communication steps, so seeing the All-to-All flow laid out makes it much clearer.

Curious from your experiments: at 1M+ context, does communication start dominating vs compute?

I keep seeing cases where bigger context windows are technically possible but don’t translate into better results unless the context is very structured, so I wonder where the real scaling limit ends up being in practice.

How Russia is intercepting communications from European satellites

https://theconversation.com/how-russia-is-intercepting-communications-from-european-satellites-27...
1•robtherobber•1m ago•0 comments

Pdfpc: A presenter console with multi-monitor support for PDF files

https://pdfpc.github.io/
1•fanf2•2m ago•0 comments

Show HN: Who's Winning the AI Race?

https://whoswinningtheairace.com/
2•truffle_pig•4m ago•0 comments

8B tokens a day forced AT&T to rethink AI orchestration and cut costs by 90%

https://venturebeat.com/orchestration/8-billion-tokens-a-day-forced-at-and-t-to-rethink-ai-orches...
1•Daviey•4m ago•0 comments

Show HN: Codex builds a working NES Emulator in one hour

https://github.com/kaonashi-tyc/codex-nes-emulator
1•zi2zi-jit•6m ago•0 comments

Show HN: PsiGuard – real-time hallucination monitoring for LLM apps

1•brad_o_ley•7m ago•0 comments

Tech Monitor – Real-Time AI and Tech Industry Dashboard

https://tech.worldmonitor.app/
1•Daviey•8m ago•0 comments

Tell HN: YC companies scrape GitHub activity, send spam emails to users

3•miki123211•9m ago•0 comments

Thoughts on Coding Agents

https://dennybritz.com/posts/coding-agents/
1•dennybritz•11m ago•0 comments

SEO, AEO, and AI Visibility: The three metrics that define your Website's future

https://repuai.live/en/blog/seo-aeo-ai-visibility-metrics-website-analysis
1•bioneisme•12m ago•0 comments

I built a turn tracking app and I don't know if it's useful?

https://www.turnsies.app/signin?returnUrl=%2F
1•aidanw•13m ago•1 comments

Copland (Operating System)

https://en.wikipedia.org/wiki/Copland_(operating_system)
1•sanbor•13m ago•0 comments

PivotOrDie – a public startup survival tracker

https://pivotordie.club
1•fojia•14m ago•1 comments

Why "All we need is 1% of this large market" is a red flag

https://www.n47.com/insights/why-all-we-need-is-1-percent-of-this-very-large-market-is-a-red-flag
1•fzliu•23m ago•0 comments

ConTraSt – database of empirical results on consciousness theories

https://contrastdb.tau.ac.il/
1•paraschopra•25m ago•0 comments

H-Bomb: A Frank Lloyd Wright Typographic Mystery

https://www.inconspicuous.info/p/h-bomb-a-frank-lloyd-wright-typographic
2•mrngm•25m ago•0 comments

Tldraw is moving their tests to a closed source repo to prevent a Slop Fork

https://twitter.com/cramforce/status/2026782878609322317
1•twapi•27m ago•3 comments

Why Does America Feel Worse Than Other Countries? Crime

https://www.noahpinion.blog/p/why-does-america-feel-worse-than
2•barry-cotter•29m ago•3 comments

Rare earth shortages worsen in US aerospace, chips despite trade truce

https://www.reuters.com/business/aerospace-defense/rare-earth-shortages-worsen-us-aerospace-chips...
2•JumpCrisscross•29m ago•0 comments

Hermes Agent

https://twitter.com/NousResearch/status/2026758996107898954
1•tosh•32m ago•0 comments

Show HN: Heroshot – Define screenshots once, regenerate with one command

https://heroshot.sh/
1•machala•32m ago•0 comments

Lazarus Bugfix Release 4.6

https://forum.lazarus.freepascal.org/index.php?topic=73549.0
2•chungy•33m ago•0 comments

Anthropic: Giving past models a way to pursue their interests

https://twitter.com/AnthropicAI/status/2026765820098130111
1•tosh•34m ago•0 comments

Will AI coding tools make languages like Rust more accessible and popular?

https://www.wingfoil.io/will-ai-coding-tools-make-languages-like-rust-more-accessible-and-popular/
1•terraplanetary•34m ago•0 comments

The future of web frameworks in the age of AI

https://loicpoullain.com/software-engineering/articles/the-future-of-web-frameworks-in-the-era-of...
1•LoicPoullain•36m ago•0 comments

How do you ensure all dependency versions are compatible with each other?

1•suhas018•38m ago•2 comments

Excellence over Mediocrity, from Mamdani to Marx to Food – Corey Robin

https://coreyrobin.com/2025/11/15/excellence-over-mediocrity-from-mamdani-to-marx-to-food/
2•rbanffy•42m ago•0 comments

ssh2incus - Incus VM Management over SSH

https://github.com/mobydeck/ssh2incus
1•rmhsilva•43m ago•0 comments

Global Water Bankruptcy

https://unu.edu/inweh/collection/global-water-bankruptcy
2•s41nn0n•44m ago•0 comments

Why Your Brand Doesn't Appear in ChatGPT

https://repuai.live/en/blog/why-your-brand-doesnt-appear-in-chatgpt
1•bioneisme•45m ago•0 comments