frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Sirius: A GPU-native SQL engine

https://github.com/sirius-db/sirius
54•qianli_cs•6h ago

Comments

cpard•2h ago
It’s great to see substrait getting more seriously used!

It has been supported by engines like duckdb but the main serious use case of it I’m aware of is from Apache gluten where it is used to add Velox as the execution engine of Spark.

It’s an ambitious project and certainly has limitations but more projects like this are needed to push it forward.

b0a04gl•2h ago
query plans are tightly coupled to the engine that emits them : cost models, memory layout, parallelism strategy, codegen behavior all vary. substrait enables structural portability, but the actual execution efficiency depends on engine specific rewrites. a plan optimized in duckdb might underperform in sirius unless it's reshaped. how this handled for now?
tucnak•2h ago
Reminds me of PG-Strom[1] which is a Postgres extension for GPU-bound index access methods (most notably BRIN, select GIS functions) and the like; it relies on proprietary NVIDIA GPUDirect tech for peer-to-peer PCIe transactions between the GPU and NVMe devices. I'm not sure whether amdgpu kernel driver has this capability in the first place, and last I checked (~6 mo. ago) ROCm didn't have this in software.

However, I wonder whether the GPU's are a good fit for this to begin with.

Counterpoint: Xilinx side of the AMD shop has developed Alveo-series accelerators which used to be pretty basic SmartNIC platforms, but have since evolved to include A LOT more programmable logic and compute IP. You may have heard about these in video encoding applications, HFT, Blockchain stuff, what-have-you. A lot of it has to with AI stuff, see Versal[2]. Raw compute figures are often cited as "underwhelming," and it's unfortunate that so many pundits are mistaking the forest for the trees here. I don't think the AI tiles in these devices are really meant for end-to-end LLM inference, even though memory bandwidth in the high-end devices allows it.

The sauce is compute-in-network over fabrics.

Similarly to how PG-Strom would feed the GPU with relational data from disk, or network directly, many AI teams on the datacenter side are now experimenting with data movement, & intermediate computations (think K/V cache management) over 100/200/800+G fabrics. IMHO, compute-in-network is the MapReduce of this decade. Obviously, there's demand for it in the AI space, but a lot of it lends nicely to the more general-purpose applications, like databases. If you're into experimental networking like that, Corundum[3] by Alex Forencich is a great, perhaps the best, open source NIC design for up to 100G line rate. Some of the cards it supports also expose direct-attach NVMe's over MCIO for latency, and typically have as many as two, or four SFP28 ports for bandwidth.

This is a bit naive way to think about it, but it would have to do!

Postgres is not typically considered to "scale well," but oftentimes this is a statement about its tablespaces more than anything; it has foreign data[4] API, which is how you extend Postgres as single point-of-consumption, foregoing some transactional guarantees in the process. This is how pg_analytics[5] brings DuckDB to Postgres, or how Steampipe[6] similarly exposes many Cloud and SaaS applications. Depending on where you stand on this, the so-called alternative SQL engines may seem like moving in the wrong direction. Shrug.

[1] https://heterodb.github.io/pg-strom/

[2] https://xilinx.github.io/AVED/latest/AVED%2BOverview.html

[3] https://github.com/corundum/corundum

[4] https://wiki.postgresql.org/wiki/Foreign_data_wrappers

[5] https://github.com/paradedb/pg_analytics

[6] https://hub.steampipe.io/#plugins

bob1029•1h ago
> However, I wonder whether the GPU's are a good fit for this to begin with.

I think the GPU could be a great fit for OLAP, but when it comes to the nasty OLTP use cases the CPU will absolutely dominate.

Strictly serialized transaction processing facilities demand extremely low latency compute to achieve meaningful throughput. When the behavior of transaction B depends on transaction A being fully resolved, there are no magic tricks you can play anymore.

Consider that talking to L1 is at least 1,000x faster than talking to the GPU. Unless you can get a shitload of work done with each CPU-GPU message (and it is usually the case that you can), this penalty is horrifyingly crippling.

tucnak•47m ago
I think, TrueTime would constitute a "trick," insofar ordering is concerned?

> Consider that talking to L1 is at least 1,000x faster than talking to the GPU.

This is largely true for "traditional" architectures, but s/GPU/TPU and s/L1/CMEM and suddenly this is no big deal anymore. I'd like Googlers to correct me here, but it seems well in line with classic MapReduce, and probably something that they're doing a lot outside of LLM inference... ads?

JavaScript Trademark Update

https://deno.com/blog/deno-v-oracle4
239•thebeardisred•2h ago•62 comments

MCP: An (Accidentally) Universal Plugin System

https://worksonmymachine.substack.com/p/mcp-an-accidentally-universal-plugin
378•Stwerner•6h ago•170 comments

AI fakes duel over impeachment of Vice-President in Phillipines

https://factcheck.afp.com/doc.afp.com.63ZF9CP
36•anigbrowl•2h ago•20 comments

Life of an inference request (vLLM V1): How LLMs are served efficiently at scale

https://www.ubicloud.com/blog/life-of-an-inference-request-vllm-v1
30•samaysharma•2h ago•0 comments

2025 ARRL Field Day

https://www.arrl.org/field-day
46•rookderby•2h ago•7 comments

BusyBeaver(6) Is Quite Large

https://scottaaronson.blog/?p=8972
139•bdr•4h ago•102 comments

We ran a Unix-like OS Xv6 on our home-built CPU with a home-built C compiler

https://fuel.edby.coffee/posts/how-we-ported-xv6-os-to-a-home-built-cpu-with-a-home-built-c-compiler/
189•AlexeyBrin•8h ago•14 comments

Addictions Are Being Engineered

https://masonyarbrough.substack.com/p/engineered-addictions
245•echollama•6h ago•148 comments

Show HN: Vet – A tool for safely running remote shell scripts

https://getvet.sh
16•a10r•1h ago•5 comments

Unheard works by Erik Satie to premiere 100 years after his death

https://www.theguardian.com/music/2025/jun/26/unheard-works-by-erik-satie-to-premiere-100-years-after-his-death
157•gripewater•10h ago•34 comments

NovaCustom – Framework Laptop alternative focusing on privacy

https://novacustom.com/
15•CHEF-KOCH•3h ago•16 comments

Sirius: A GPU-native SQL engine

https://github.com/sirius-db/sirius
54•qianli_cs•6h ago•5 comments

Memory Safe Languages: Reducing Vulnerabilities in Modern Software Development [pdf]

https://media.defense.gov/2025/Jun/23/2003742198/-1/-1/0/CSI_MEMORY_SAFE_LANGUAGES_REDUCING_VULNERABILITIES_IN_MODERN_SOFTWARE_DEVELOPMENT.PDF
14•todsacerdoti•3h ago•0 comments

Show HN: I'm an airline pilot – I built interactive graphs/globes of my flights

https://jameshard.ing/pilot
1397•jamesharding•1d ago•189 comments

Parsing JSON in Forty Lines of Awk

https://akr.am/blog/posts/parsing-json-in-forty-lines-of-awk
57•thefilmore•5h ago•15 comments

Engineer creates ad block for the real world with augmented reality glasses

https://www.tomshardware.com/maker-stem/engineer-creates-ad-block-for-the-real-world-with-augmented-reality-glasses-no-more-products-or-branding-in-your-everyday-life
198•LorenDB•6d ago•119 comments

Use Plain Text Email

https://useplaintext.email/
74•cyrc•3h ago•51 comments

ZeQLplus: Terminal SQLite Database Browser

https://github.com/ZetloStudio/ZeQLplus
40•amadeuspagel•8h ago•9 comments

Evaluating Long-Context Question and Answer Systems

https://eugeneyan.com/writing/qa-evals/
4•swyx•3d ago•0 comments

LLMs Bring New Nature of Abstraction

https://martinfowler.com/articles/2025-nature-abstraction.html
38•hasheddan•3d ago•37 comments

Verifiably Correct Lifting of Position-Independent x86-64 Binaries (2024)

https://dl.acm.org/doi/10.1145/3658644.3690244
18•etiams•3d ago•4 comments

Lossless LLM 3x Throughput Increase by LMCache

https://github.com/LMCache/LMCache
124•lihanc111•4d ago•34 comments

Lago (Open-Source Usage Based Billing) is hiring for ten roles

https://www.ycombinator.com/companies/lago/jobs
1•AnhTho_FR•9h ago

History of Cycling Maps

https://cyclemaps.blogspot.com/
70•altilunium•11h ago•9 comments

JWST reveals its first direct image discovery of an exoplanet

https://www.smithsonianmag.com/smart-news/james-webb-space-telescope-reveals-its-first-direct-image-discovery-of-an-exoplanet-180986886/
316•divbzero•1d ago•138 comments

After successfully entering Earth's atmosphere, a European spacecraft is lost

https://arstechnica.com/space/2025/06/a-european-spacecraft-company-flies-its-vehicle-then-loses-it-after-reentry/
47•rbanffy•3d ago•20 comments

Sinaloa cartel used phone data and surveillance cameras to find FBI informants

https://www.reuters.com/world/americas/sinaloa-cartel-hacked-phones-surveillance-cameras-find-fbi-informants-doj-says-2025-06-27/
25•_tk_•2h ago•8 comments

No One Is in Charge at the US Copyright Office

https://www.wired.com/story/us-copyright-office-chaos-doge/
82•rntn•3h ago•44 comments

US Justice Department settles antitrust case for HPE's $14B takeover of Juniper

https://www.reuters.com/business/us-doj-settles-antitrust-case-hpes-14-billion-takeover-juniper-2025-06-28/
26•awat•3h ago•4 comments

Why the moon shimmers with shiny glass beads

https://phys.org/news/2025-06-moon-shimmers-shiny-glass-beads.html
8•PaulHoule•3d ago•2 comments