
Show HN: Term – Rust-based data validation with OpenTelemetry

https://github.com/withterm/term
2•ericpsimon•19h ago
Hi HN, I'm Eric and I'm a recovering data engineer. Recently I've worked on the data platforms for multiple YC-backed start-ups, Kable (YC W22) and Finch (YC S20).

Every data team I've worked with struggles with data quality validation. Current solutions like Apache Deequ require spinning up entire Spark clusters just to check if your data meets basic quality constraints.

When I found Apache DataFusion, it was love at first sight - it provides the ergonomics of Apache Spark without the JVM or the cluster overhead. That is what led me to build Term.

Term is a Rust library that provides Deequ-style data validation using Apache DataFusion. You can run comprehensive data quality checks anywhere - from your laptop to CI/CD pipelines - without any JVM or cluster setup. On a 1M row dataset with 20 constraints, Term completes validation in 0.21 seconds (vs 3.2 seconds without optimization) by intelligently batching operations into just 2 scans instead of 20.
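To make the batching idea concrete, here is a minimal sketch of how several checks can be folded into one aggregate query, so the table is read once rather than once per check. This is not Term's actual API or implementation: the `Constraint` enum, `build_query`, and `passes` are hypothetical names invented for this illustration, and the generated SQL is just one plausible shape.

```rust
/// Hypothetical constraint type for illustration only; not Term's API.
#[derive(Debug, Clone)]
pub enum Constraint {
    /// Column must contain no NULLs.
    Complete(String),
    /// Column must contain no duplicate non-NULL values.
    Unique(String),
    /// Column minimum must be at least the given bound.
    MinAtLeast(String, f64),
}

impl Constraint {
    /// Aggregate expression whose single numeric result decides the check.
    fn aggregate(&self) -> String {
        match self {
            Constraint::Complete(c) => {
                format!("COUNT(*) - COUNT({0}) AS {0}_nulls", c)
            }
            Constraint::Unique(c) => {
                format!("COUNT({0}) - COUNT(DISTINCT {0}) AS {0}_dupes", c)
            }
            Constraint::MinAtLeast(c, _) => format!("MIN({0}) AS {0}_min", c),
        }
    }

    /// Interpret the aggregate's result for this constraint.
    pub fn passes(&self, value: f64) -> bool {
        match self {
            // Zero NULLs / zero duplicates means the check holds.
            Constraint::Complete(_) | Constraint::Unique(_) => value == 0.0,
            Constraint::MinAtLeast(_, bound) => value >= *bound,
        }
    }
}

/// Fold every constraint into a single SELECT with one FROM clause,
/// instead of issuing one query per constraint.
pub fn build_query(table: &str, constraints: &[Constraint]) -> String {
    let exprs: Vec<String> = constraints.iter().map(Constraint::aggregate).collect();
    format!("SELECT {} FROM {}", exprs.join(", "), table)
}
```

Calling `build_query("orders", &constraints)` with three constraints yields a single statement with three aggregate columns. Each column can then be passed to `passes` to get a per-constraint verdict.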

The technical approach: Term leverages DataFusion's columnar processing engine to efficiently validate data in Arrow format. Validation rules compile directly to DataFusion's physical plans, and Rust's zero-cost abstractions mean the overhead is minimal. You get 100 MB/s single-core throughput, which often outperforms distributed solutions for datasets under 100 GB.

Term supports the validation patterns you'd expect - completeness checks, uniqueness validation, statistical analysis (mean, correlation, standard deviation), pattern matching, and custom SQL expressions. It also has built-in OpenTelemetry integration for production observability. The entire setup takes less than 5 minutes - just `cargo add term-guard` and you're validating data.
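For readers who haven't used Deequ-style checks, here is a rough sketch of what a completeness check and two of the statistical checks (mean, standard deviation) compute over a nullable numeric column. It uses plain Rust and Welford's online algorithm rather than Term or DataFusion; `ColumnStats` and `summarize` are hypothetical names for this example, and the standard deviation here is the population form, which may differ from what Term reports.

```rust
/// Running summary of one nullable numeric column, built in a single pass.
/// Illustrative only; not Term's API.
#[derive(Debug, Default)]
pub struct ColumnStats {
    rows: u64,
    non_null: u64,
    mean: f64,
    /// Sum of squared deviations from the running mean (Welford's M2).
    m2: f64,
}

impl ColumnStats {
    /// Fold one cell into the summary; `None` stands for a NULL.
    pub fn observe(&mut self, value: Option<f64>) {
        self.rows += 1;
        if let Some(x) = value {
            self.non_null += 1;
            let delta = x - self.mean;
            self.mean += delta / self.non_null as f64;
            self.m2 += delta * (x - self.mean);
        }
    }

    /// Fraction of rows that are non-NULL (1.0 for an empty column).
    pub fn completeness(&self) -> f64 {
        if self.rows == 0 {
            1.0
        } else {
            self.non_null as f64 / self.rows as f64
        }
    }

    /// Mean of the non-NULL values, if there are any.
    pub fn mean(&self) -> Option<f64> {
        if self.non_null == 0 { None } else { Some(self.mean) }
    }

    /// Population standard deviation of the non-NULL values, if there are any.
    pub fn std_dev(&self) -> Option<f64> {
        if self.non_null == 0 {
            None
        } else {
            Some((self.m2 / self.non_null as f64).sqrt())
        }
    }
}

/// Summarize a whole column in one pass.
pub fn summarize(column: &[Option<f64>]) -> ColumnStats {
    let mut stats = ColumnStats::default();
    for cell in column {
        stats.observe(*cell);
    }
    stats
}
```

A validation rule is then just a threshold on one of these numbers, for example "completeness is at least 0.95" or "mean lies between two bounds".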

GitHub: https://github.com/withterm/term

I built this because I was tired of seeing teams skip data validation entirely rather than deal with Spark infrastructure. With Term, you can add validation to any Rust data pipeline with minimal overhead and zero operational complexity.

Coming next: Python/Node.js bindings, streaming support, and database connectivity. I'm particularly excited about making this accessible beyond the Rust ecosystem.

I'd love feedback on:

- The validation API - does it cover your use cases?

- Performance on your real-world datasets

- What validation patterns you need that aren't supported yet

- Ideas for the Python/Node.js API design

Happy to dive into technical details about DataFusion integration, performance optimizations, or anything else!