frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Optimizing ClickHouse for Intel's 280 core processors

https://clickhouse.com/blog/optimizing-clickhouse-intel-high-core-count-cpu
63•ashvardanian•2h ago

Comments

epistasis•1h ago
This is my favorite type of HN post, and definitely going to be a classic in the genre for me.

> Memory optimization on ultra-high core count systems differs a lot from single-threaded memory management. Memory allocators themselves become contention points, memory bandwidth is divided across more cores, and allocation patterns that work fine on small systems can create cascading performance problems at scale. It is crucial to be mindful of how much memory is allocated and how memory is used.

In bioinformatics, one of the most popular alignment algorithms is roughly bottlenecked on random RAM access (the FM-index on the BWT of the genome), so I always wonder how these algorithms are going to perform on these beasts. It's been a decade since I spent any time optimizing large system performance for it though. NUMA was already challenging enough! I wonder how many memory channels these new chips have access to.

ashvardanian•38m ago
My expectation, they will perform great! I’m now mostly benchmarking on 192 core Intel, AMD, and Arm instances on AWS, and in some workloads they come surprisingly close to GPUs even on GPU-friendly workloads, once you get the SIMD and NUMA pinning parts right.

For BioInformatics specifically, I’ve just finished benchmarking Intel SPR 16-core UMA slices against Nvidia H100, and will try to extend them soon: https://github.com/ashvardanian/StringWa.rs

pixelpoet•1h ago
This post looks like excellent low-level optimisation writing just in the first sections, and (I know this is kinda petty, but...) my heart absolutely sings at their use of my preferred C++ coding convention where & (ref) neither belongs to the type nor the variable name!
nivertech•21m ago
I think it belongs to type, but since they use “auto” it looks standalone and can be confused with the “&” operator. I personally always used * and & as a prefix of the variable name, not as a suffix in the type name, except when used to specify types in templates.
bee_rider•54m ago
288 cores is an absurd number of cores.

Do these things have AVX512? It looks like some of the Sierra Forest chips do have AVX512 with 2xFMA…

That’s pretty wide. Wonder if they should put that thing on a card and sell it as a GPU (a totally original idea that has never been tried, sure…).

ashvardanian•51m ago
Sadly, no! On the bright side, they support new AVX2 VNNI extensions, that help with low precision integer dot products for Vector Search!

SimSIMD (inside USearch (inside ClickHouse)) already has those SIMD kernels, but I don’t yet have the hardware to benchmark :(

yvdriess•31m ago
Something that could help is to use llvm-mca or similar to get an idea of the potential speedup.
Sesse__•24m ago
A basic block simulator like llvm-mca is unlikely to give useful information here, as memory access is going to play a significant part in the overall performance.
pclmulqdq•14m ago
AVX-512 is on the P-cores only (along with AMX now). The E-cores only support 256-bit vectors.

If you're doing a lot of loading and storing, these E-core chips are probably going to outperform the chips with huge cores because they will be idling a lot. For CPU-bound tasks, the P-cores will win hands down.

jiehong•33m ago
Great work!

I like duckdb, but clickhouse seems more focused on large scale performance.

I just thought that the article is written from the point of view of a single person, but has multiple authors, which is a bit weird. Did I misunderstood something?

hobo_in_library•31m ago
Not sure what happened here, but it's not uncommon for a post to have one primary author and then multiple reviewers/supporters also credited
sdairs•15m ago
Yep that's pretty much the case here!
sdairs•11m ago
ClickHouse works in-process and on the CLI just like DuckDB, but also scales to hundreds of nodes - so it's really not limited to just large scale. Handling those smaller cases with a great experience is still a big focus for us

WASM 3.0 Completed

https://webassembly.org/news/2025-09-17-wasm-3.0/
417•todsacerdoti•2h ago•159 comments

Apple Photos app corrupts images

https://tenderlovemaking.com/2025/09/17/apple-photos-app-corrupts-images/
860•pattyj•9h ago•321 comments

Optimizing ClickHouse for Intel's 280 core processors

https://clickhouse.com/blog/optimizing-clickhouse-intel-high-core-count-cpu
66•ashvardanian•2h ago•13 comments

Gluon: a GPU programming language based on the same compiler stack as Triton

https://github.com/triton-lang/triton/blob/main/python/tutorials/gluon/01-intro.py
20•matt_d•1h ago•4 comments

Tinycolor supply chain attack post-mortem

https://sigh.dev/posts/ctrl-tinycolor-post-mortem/
97•STRiDEX•3h ago•44 comments

DeepMind and OpenAI win gold at ICPC

https://codeforces.com/blog/entry/146536
79•notemap•2h ago•60 comments

YouTube addresses lower view counts which seem to be caused by ad blockers

https://9to5google.com/2025/09/16/youtube-lower-view-counts-ad-blockers/
194•iamflimflam1•6h ago•391 comments

Launch HN: RunRL (YC X25) – Reinforcement learning as a service

https://runrl.com
37•ag8•4h ago•11 comments

Noise cancelling a fan

https://chillphysicsenjoyer.substack.com/p/noise-cancelling-a-fan
31•crescit_eundo•1d ago•28 comments

Drought in Iraq reveals tombs created 2,300 years ago

https://www.smithsonianmag.com/smart-news/severe-droughts-in-iraq-reveals-dozens-of-ancient-tombs...
54•pseudolus•3h ago•5 comments

Ton Roosendaal to step down as Blender chairman and CEO

https://www.cgchannel.com/2025/09/ton-roosendaal-to-step-down-as-blender-chairman-and-ceo/
107•cma•4h ago•14 comments

The Asus gaming laptop ACPI firmware bug

https://github.com/Zephkek/Asus-ROG-Aml-Deep-Dive
377•signa11•16h ago•160 comments

Famous cognitive psychology experiments that failed to replicate

https://buttondown.com/aethermug/archive/aether-mug-famous-cognitive-psychology/
72•PaulHoule•1h ago•32 comments

Event Horizon Labs (YC W24) Is Hiring

https://www.ycombinator.com/companies/event-horizon-labs/jobs/U6oyyKZ-founding-engineer-at-event-...
1•ocolegro•3h ago

Anthropic irks White House with limits on models’ use

https://www.semafor.com/article/09/17/2025/anthropic-irks-white-house-with-limits-on-models-uswhi...
176•mindingnever•2h ago•88 comments

DeepSeek writes less secure code for groups China disfavors?

https://www.washingtonpost.com/technology/2025/09/16/deepseek-ai-security/
172•otterley•3h ago•102 comments

U.S. investors, Trump close in on TikTok deal with China

https://www.wsj.com/tech/details-emerge-on-u-s-china-tiktok-deal-594e009f
298•Mgtyalx•1d ago•295 comments

Ask HN: What's a good 3D Printer for sub $1000?

94•lucideng•2d ago•102 comments

Depression reduces capacity to learn to actively avoid aversive events

https://www.eneuro.org/content/12/9/ENEURO.0034-25.2025
117•PaulHoule•3h ago•31 comments

Infinite Mac: Resource Fork Roundtripping

https://blog.persistent.info/2025/09/infinite-mac-resource-forks.html
8•tobr•1d ago•0 comments

Tau² benchmark: How a prompt rewrite boosted GPT-5-mini by 22%

https://quesma.com/blog/tau2-benchmark-improving-results-smaller-models/
149•blndrt•7h ago•45 comments

Alibaba's new AI chip: Key specifications comparable to H20

https://news.futunn.com/en/post/62202518/alibaba-s-new-ai-chip-unveiled-key-specifications-compar...
226•dworks•11h ago•244 comments

UUIDv47: Store UUIDv7 in DB, emit UUIDv4 outside (SipHash-masked timestamp)

https://github.com/stateless-me/uuidv47
113•aabbdev•6h ago•62 comments

How to motivate yourself to do a thing you don't want to do

https://ashleyjanssen.com/how-to-motivate-yourself-to-do-a-thing-you-dont-want-to-do/
190•mooreds•5h ago•161 comments

Determination of the fifth Busy Beaver value

https://arxiv.org/abs/2509.12337
227•marvinborner•10h ago•95 comments

Procedural Island Generation (III)

https://brashandplucky.com/2025/09/17/procedural-island-generation-iii.html
95•ibobev•8h ago•17 comments

Microsoft Python Driver for SQL Server

https://github.com/microsoft/mssql-python
63•kermatt•5h ago•24 comments

PureVPN IPv6 Leak

https://anagogistis.com/posts/purevpn-ipv6-leak/
154•todsacerdoti•10h ago•69 comments

Just for fun: animating a mosaic of 90s GIFs

https://alexplescan.com/posts/2025/09/15/gifs/
22•Bogdanp•1d ago•3 comments

Slow social media

https://herman.bearblog.dev/slow-social-media/
140•rishikeshs•18h ago•124 comments