frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Libcusort – A radix sort for modern Nvidia GPUs

https://github.com/IlyaGrebnov/libcusort
1•grebnov•1h ago
I wrote a single-header CUDA radix sort library that's 1-6% faster than CUB for 32-bit keys and up to 38% faster for small types.

Why bother? CUB's radix sort is already memory-bound, which is theoretically optimal. But CUB's codebase prioritizes backward compatibility and generality across GPU generations, leaving room for targeted optimizations on modern hardware.

Benchmarks (RTX 5090, 67M elements): int32_t keys: 1.53ms vs 1.62ms (+6%) int8_t keys: 0.19ms vs 0.25ms (+32%) int32_t pairs: 3.07ms vs 3.10ms (+1%)

What's different: Implements OneSweep with decoupled lookback PDL (Programmatic Dependent Launch) overlaps histogram with first sort pass on Hopper+ Fine-grained PTX cache hints to avoid polluting L2 CUDA Graph caching drops launch overhead from ~100μs to ~5μs Auto-tuned tile sizes for 48 type combinations

The other goal is education. CUB is harder to read. I have tried to make every optimization decision explicit in the comments — not just what the code does, but why the GPU hardware forces that choice.

GitHub: https://github.com/IlyaGrebnov/libcusort

Happy to answer questions about GPU radix sort internals or the specific optimizations.

Pain and Reflection = Progress

https://federicopereiro.com/progress-formula/
1•swah•1m ago•0 comments

The open-source Ableton-style music composer for the web

https://github.com/AppsYogi-com/ComposeYogi
1•vaibhav1312•1m ago•0 comments

Audacious: Playback Control

https://github.com/madprops/playback-control
1•the_stocker•1m ago•0 comments

Interface Craft: a library for those committed to designing with uncommon care

https://www.interfacecraft.dev/
1•duck•2m ago•0 comments

What Is Cloud.microsoft?

https://support.microsoft.com/en-us/office/what-is-cloud-microsoft-7ba4c8b9-d062-4444-84a5-fca6c3...
1•microsoftedging•2m ago•0 comments

Are diffs still useful for AI-assisted code changes?

1•nuky•3m ago•1 comments

Can the American Oboe Sing Again?

https://www.nytimes.com/2026/01/14/arts/music/oboe-laubin-jim-phelan.html
2•perihelions•3m ago•0 comments

Show HN: DanceJump For YouTube – turning videos into browser rhythm game

https://chromewebstore.google.com/detail/dancejump-for-youtube/hhdeflibphdghcpblkekakmbennfcaci
1•maaydin•3m ago•0 comments

The Androgen Warp

https://factsandreason.substack.com/p/the-androgen-warp
1•paulpauper•3m ago•0 comments

Man got $2,500 whole-body MRI that found no problems – then had stroke

https://arstechnica.com/health/2026/01/man-got-2500-whole-body-mri-that-found-no-problems-then-ha...
1•_fs•4m ago•0 comments

Blogging, Writing, Musing, and Thinking

https://brianschrader.com/archive/blogging-writing-musing-and-thinking/
1•sonicrocketman•4m ago•0 comments

Verizon is down, with many users seeing 'SOS' – here's everything we know

https://www.techradar.com/news/live/verizon-outage-january-2026
2•bluedino•4m ago•1 comments

Tudor Trade War

https://www.ageofinvention.xyz/p/age-of-invention-tudor-trade-war
1•paulpauper•5m ago•0 comments

Redesigning my microkernel from the ground up

https://drewdevault.com/2026/01/12/2026-01-12-Hermes-from-the-ground-up.html
2•wicket•8m ago•0 comments

Open source AMD Linux driver to achieve parity with Windows in ray-tracing?

https://www.phoronix.com/news/RADV-10x-Fast-RT-Pipeline-Comp
1•lemome•9m ago•0 comments

Switch: robots.txt now required for Googlebot to index website

https://adactio.com/journal/22355
1•cdrnsf•9m ago•0 comments

GRPO vs. GDPO: Building Intuition for RL Reward Policies

https://huggingface.co/spaces/dylan-marimo-io/Reward-Policy-Intuition
1•dmadisetti•9m ago•1 comments

Grok will not generate bikini images: changes made to reject vulgar requests

https://www.msn.com/en-in/money/markets/elon-musk-s-grok-will-not-generate-bikini-images-xai-made...
2•randycupertino•10m ago•0 comments

Anything Down?

2•Artur-Defences•12m ago•1 comments

Thinking with Map: Reinforced Parallel Map-Augemented Agent for Geolocalization

https://amap-ml.github.io/Thinking-with-Map/
1•gmays•12m ago•0 comments

Signal creator Moxie Marlinspike wants to do for AI what he did for messaging

https://arstechnica.com/security/2026/01/signal-creator-moxie-marlinspike-wants-to-do-for-ai-what...
1•bookofjoe•13m ago•1 comments

Clarify CRM vs. HubSpot

1•isura•13m ago•1 comments

US tech giants allying with European far-right to strip back EU rules

https://www.brusselstimes.com/1916422/us-tech-giants-allying-with-european-far-right-to-strip-bac...
2•DyslexicAtheist•13m ago•0 comments

YouTube now has a way for parents to block kids from watching Shorts

https://techcrunch.com/2026/01/14/youtube-now-has-a-way-for-parents-to-block-kids-from-watching-s...
4•rbanffy•14m ago•1 comments

Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR

https://www.tavus.io/post/sparrow-1-human-level-conversational-timing-in-real-time-voice
2•code_brian•14m ago•0 comments

Nature: Training LLM's on narrow tasks can lead to broad misalignment

https://www.nature.com/articles/s41586-025-09937-5
1•trajektorie•14m ago•0 comments

Remember when you owned stuff?

https://doctorow.medium.com/https-pluralistic-net-2026-01-14-sole-and-despotic-world-turned-upsid...
2•7777777phil•15m ago•1 comments

PySimpleGUI Shutdown in January 2026

https://github.com/PySimpleGUI/PySimpleGUI
2•gmargari•16m ago•0 comments

NASA's Artemis 2 Mission Will Make Spaceflight History

https://gizmodo.com/5-ways-nasas-artemis-2-mission-will-make-spaceflight-history-2000709895
2•rbanffy•16m ago•0 comments

Nearly 17K Fans Cancel 2026 World Cup Tickets Amid Boycotts

https://www.ticketnews.com/2026/01/nearly-17000-fans-cancel-2026-world-cup-tickets-amid-boycott/
2•DyslexicAtheist•16m ago•0 comments