frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Sail: a Rust-Based Spark Replacement

https://lakesail.com/blog/sail-0-3/
5•chenxi9649•6h ago

Comments

chenxi9649•6h ago
Hey HN! We're excited to share Sail 0.3, our open-source distributed computing framework that serves as a drop-in replacement for Apache Spark.

Sail is a Rust-native execution engine that speaks the Spark Connect protocol. Your existing Spark SQL and DataFrame code runs unchanged, but executes on average 4x faster while using 94% less infrastructure spend.

Here's how we are achieving that performance:

1. No JVM overhead: Rust's zero-cost abstractions and deterministic memory management eliminate GC pauses

2. Columnar processing: Apache Arrow format enables SIMD instructions to process multiple records per CPU cycle

3. Zero-copy data transfer: Python UDFs run in-process with shared memory buffers (no serialization)

4. Lightweight workers: Processes start in seconds

What's new in v0.3:

- This release is a major milestone - we now support both Spark 3.5 and 4.0, including the new lightweight pyspark-client. The framework automatically detects your Spark version and adjusts its behavior accordingly, so one Sail binary works across versions.

Why this matters:

- Spark revolutionized big data 15 years ago, but its JVM foundation struggles with modern workloads. As teams process more real-time data and AI workloads, they're hitting walls with latency, cloud costs, and operational complexity. Sail is trying to solve all of these problems while not requiring you to rewrite everything that you already did with Spark.

- We're working toward unifying batch, streaming, and AI workloads in a single framework. Imagine running your ETL, real-time analytics, and model training on the same infrastructure with predictable performance. The project is open source (Apache 2.0) and we'd love your feedback! We have a growing community on Slack where early adopters are already running production workloads.

GitHub: https://github.com/lakehq/sail

Docs: https://lakesail.com

Our internal benchmarks(for the 4x and 94% number): https://docs.lakesail.com/sail/latest/introduction/benchmark...

Slack: https://lakesail.com/slack

Happy to answer any questions about the architecture, benchmarks, or migration path!

Libpostal: C library for parsing/normalizing street addresses around the world

https://github.com/openvenues/libpostal
1•nateb2022•29s ago•0 comments

Hamas used sexual violence as part of 'genocidal strategy', Israeli experts say

https://www.bbc.com/news/articles/c1mz8gxzg82o
1•mhga•4m ago•1 comments

New eclipsing variable star discovered in the Pegasus constellation

https://phys.org/news/2025-07-grigoriev-eclipsing-variable-star-pegasus.html
1•wglb•9m ago•1 comments

Tesla Robotaxi is not open to the public, but is open to influencers from Korea

https://twitter.com/Tslachan/status/1942688913644216688
2•TheAlchemist•11m ago•1 comments

Voracious honey bees threaten the food supply of native pollinators

https://phys.org/news/2025-07-voracious-honey-bees-threaten-food.html
1•wglb•12m ago•1 comments

UCD: Unlearning in LLMs via Contrastive Decoding

https://arxiv.org/abs/2506.12097
1•PaulHoule•12m ago•0 comments

Grok's system prompt has been updated

https://github.com/xai-org/grok-prompts/commit/c5de4a14feb50b0e5b3e8554f9c8aae8c97b56b4
1•flunhat•13m ago•0 comments

Alligator Alcatraz detainees allege inhumane conditions at detention center

https://www.cbsnews.com/miami/news/alligator-alcatraz-detainees-allege-inhumane-conditions-at-immigration-detention-center/
1•zzzeek•14m ago•0 comments

Signal and Messaging Layer Security

https://us.artechhouse.com/Signal-and-Messaging-Layer-Security-P2439.aspx
1•teleforce•15m ago•0 comments

Pixeltable

https://www.pixeltable.com/
1•handfuloflight•20m ago•0 comments

Keep secrets and configmaps syncronized across clusters and namespaces

https://github.com/powerhome/keess
1•mooreds•20m ago•0 comments

How the Great Flood of Software Will Reinvent Infrastructure

https://www.heavybit.com/press/heavybit-reinventing-enterprise-infrastructure
1•mooreds•21m ago•0 comments

A billionaire, an AI supercomputer, toxic emissions and a Memphis community

https://tennesseelookout.com/2025/07/07/a-billionaire-an-ai-supercomputer-toxic-emissions-and-a-memphis-community-that-did-nothing-wrong/
2•greenie_beans•22m ago•0 comments

Bandcamp is moving from PayPal to Stripe

https://get.bandcamp.help/hc/en-us/articles/23020652077079-Setting-up-your-PayPal-account
1•_DeadFred_•22m ago•1 comments

Telegram CEO Gives His View on Elon Musk, Sam Altman, and Mark Zuckerberg

https://www.businessinsider.com/telegram-ceo-pavel-durov-elon-musk-sam-altman-mark-zuckerberg-2025-6
1•andsoitis•24m ago•0 comments

Hippocratic License

https://firstdonoharm.dev/version/3/0/cl-eco-media-my-tal-xuar.txt
1•todsacerdoti•30m ago•0 comments

Altman: "I don't like smart glasses."

https://www.theverge.com/news/701845/sam-altman-i-dont-like-smart-glasses
4•Bluestein•32m ago•5 comments

Watchfiles: Simple, modern and fast file watching for Python, written in Rust

https://github.com/samuelcolvin/watchfiles
1•Labo333•34m ago•0 comments

AGI may be impossible to define, and that's a multibillion-dollar problem

https://arstechnica.com/civis/threads/agi-may-be-impossible-to-define-and-that%E2%80%99s-a-multibillion-dollar-problem.1508232/page-4
4•Bluestein•36m ago•0 comments

Six traits that make someone cool, according to science

https://www.the-independent.com/life-style/cool-millennial-gen-z-gen-x-b2783984.html
4•Bluestein•39m ago•1 comments

Fantoccini: Programmatically interact with web pages through WebDriver in Rust

https://github.com/jonhoo/fantoccini
1•nateb2022•41m ago•0 comments

Chemical Process Produces Critical Battery Metals with No Waste

https://spectrum.ieee.org/nmc-battery-aspiring-materials
1•defrost•41m ago•0 comments

Grok Praises Hitler No One

https://gizmodo.com/round-them-up-grok-praises-hitler-as-elon-musks-ai-tool-goes-full-nazi-2000626156
6•Bogdanp•42m ago•0 comments

Cooling the London Underground: The Never-Ending Quest [video]

https://www.youtube.com/watch?v=5Yw-Pp_RW08
2•JojoFatsani•42m ago•0 comments

The AI Remote Has Arrived: Let Go of the Knob

https://rahulpandita.me/blog/2025-05-27-AI-Remote
1•azhenley•43m ago•0 comments

Microsoft Patch Tuesday, July 2025 Edition

https://krebsonsecurity.com/2025/07/microsoft-patch-tuesday-july-2025-edition/
1•todsacerdoti•45m ago•0 comments

Experience AI Ethics Like Never Before – Meet SimulateAI

https://mindbomber.github.io/SimulateAI/
1•CitizenOfEarth•46m ago•1 comments

Elon Musk's AI chatbot is suddenly posting antisemitic tropes

https://www.cnn.com/2025/07/08/tech/grok-ai-antisemitism
11•bhouston•46m ago•5 comments

The Dinah Project [pdf]

https://ynet-pic1.yit.co.il/picserver6/wcm_upload_files/2025/07/08/SyIwkmqHee/The_Dinah_Project_full_report_A4_pages_web.pdf
1•YZF•47m ago•1 comments

OLMo 2 - a family of fully-open language models

https://allenai.org/olmo
18•oldfuture•47m ago•0 comments