frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Digital Independence Day

https://di.day/
1•pabs3•2m ago•0 comments

What a bot hacking attempt looks like: SQL injections galore

https://old.reddit.com/r/vibecoding/comments/1qz3a7y/what_a_bot_hacking_attempt_looks_like_i_set_up/
1•cryptoz•3m ago•0 comments

Show HN: FlashMesh – An encrypted file mesh across Google Drive and Dropbox

https://flashmesh.netlify.app
1•Elevanix•4m ago•0 comments

Show HN: AgentLens – Open-source observability and audit trail for AI agents

https://github.com/amitpaz1/agentlens
1•amit_paz•5m ago•0 comments

Show HN: ShipClaw – Deploy OpenClaw to the Cloud in One Click

https://shipclaw.app
1•sunpy•7m ago•0 comments

Unlock the Power of Real-Time Google Trends Visit: Www.daily-Trending.org

https://daily-trending.org
1•azamsayeedit•9m ago•1 comments

Explanation of British Class System

https://www.youtube.com/watch?v=Ob1zWfnXI70
1•lifeisstillgood•10m ago•0 comments

Show HN: Jwtpeek – minimal, user-friendly JWT inspector in Go

https://github.com/alesr/jwtpeek
1•alesrdev•13m ago•0 comments

Willow – Protocols for an uncertain future [video]

https://fosdem.org/2026/schedule/event/CVGZAV-willow/
1•todsacerdoti•15m ago•0 comments

Feedback on a client-side, privacy-first PDF editor I built

https://pdffreeeditor.com/
1•Maaz-Sohail•19m ago•0 comments

Clay Christensen's Milkshake Marketing (2011)

https://www.library.hbs.edu/working-knowledge/clay-christensens-milkshake-marketing
2•vismit2000•25m ago•0 comments

Show HN: WeaveMind – AI Workflows with human-in-the-loop

https://weavemind.ai
6•quentin101010•31m ago•1 comments

Show HN: Seedream 5.0: free AI image generator that claims strong text rendering

https://seedream5ai.org
1•dallen97•33m ago•0 comments

A contributor trust management system based on explicit vouches

https://github.com/mitchellh/vouch
2•admp•35m ago•1 comments

Show HN: Analyzing 9 years of HN side projects that reached $500/month

2•haileyzhou•35m ago•0 comments

The Floating Dock for Developers

https://snap-dock.co
2•OsamaJaber•36m ago•0 comments

Arcan Explained – A browser for different webs

https://arcan-fe.com/2026/01/26/arcan-explained-a-browser-for-different-webs/
2•walterbell•37m ago•0 comments

We are not scared of AI, we are scared of irrelevance

https://adlrocha.substack.com/p/adlrocha-we-are-not-scared-of-ai
1•adlrocha•38m ago•0 comments

Quartz Crystals

https://www.pa3fwm.nl/technotes/tn13a.html
1•gtsnexp•41m ago•0 comments

Show HN: I built a free dictionary API to avoid API keys

https://github.com/suvankar-mitra/free-dictionary-rest-api
2•suvankar_m•43m ago•0 comments

Show HN: Kybera – Agentic Smart Wallet with AI Osint and Reputation Tracking

https://kybera.xyz
2•xipz•45m ago•0 comments

Show HN: brew changelog – find upstream changelogs for Homebrew packages

https://github.com/pavel-voronin/homebrew-changelog
1•kolpaque•48m ago•0 comments

Any chess position with 8 pieces on board and one pair of pawns has been solved

https://mastodon.online/@lichess/116029914921844500
2•baruchel•50m ago•1 comments

LLMs as Language Compilers: Lessons from Fortran for the Future of Coding

https://cyber-omelette.com/posts/the-abstraction-rises.html
2•birdculture•52m ago•0 comments

Projecting high-dimensional tensor/matrix/vect GPT–>ML

https://github.com/tambetvali/LaegnaAIHDvisualization
1•tvali•53m ago•1 comments

Show HN: Free Bank Statement Analyzer to Find Spending Leaks and Save Money

https://www.whereismymoneygo.com/
2•raleobob•56m ago•1 comments

Our Stolen Light

https://ayushgundawar.me/posts/html/our_stolen_light.html
2•gundawar•57m ago•0 comments

Matchlock: Linux-based sandboxing for AI agents

https://github.com/jingkaihe/matchlock
2•jingkai_he•1h ago•0 comments

Show HN: A2A Protocol – Infrastructure for an Agent-to-Agent Economy

2•swimmingkiim•1h ago•1 comments

Drinking More Water Can Boost Your Energy

https://www.verywellhealth.com/can-drinking-water-boost-energy-11891522
1•wjb3•1h ago•0 comments
Open in hackernews

Show HN: Term – Rust-based data validation with OpenTelemetry

https://github.com/withterm/term
3•ericpsimon•6mo ago
Hi HN, I'm Eric and I'm a recovering data engineer. Recently I have worked on the data platforms for multiple YC backed start-ups Kable (YC W22) and Finch (YC S20).

Every data team I've worked with struggles with data quality validation. Current solutions like Apache Deequ require spinning up entire Spark clusters just to check if your data meets basic quality constraints.

When I found Apache DataFusion, it was love at first sight - it provided the ergonomics of Apache Spark, without the overhead, JVM, etc. That is what led me to build Term. It is able to take advantage of the ergonomics of Spark without the overhead.

Term is a Rust library that provides Deequ-style data validation using Apache DataFusion. You can run comprehensive data quality checks anywhere - from your laptop to CI/CD pipelines - without any JVM or cluster setup. On a 1M row dataset with 20 constraints, Term completes validation in 0.21 seconds (vs 3.2 seconds without optimization) by intelligently batching operations into just 2 scans instead of 20.

The technical approach: Term leverages DataFusion's columnar processing engine to efficiently validate data in Arrow format. Validation rules compile directly to DataFusion's physical plans, and Rust's zero-cost abstractions mean the overhead is minimal. You get 100MB/s single-core throughput, which often outperforms distributed solutions for datasets under 100GB.

Term supports all the validation patterns you'd expect - completeness checks, uniqueness validation, statistical analysis (mean, correlation, standard deviation), pattern matching, custom SQL expressions, and built-in OpenTelemetry integration for production observability. The entire setup takes less than 5 minutes - just `cargo add term-guard` and you're validating data.

GitHub: https://github.com/withterm/term

I built this because I was tired of seeing teams skip data validation entirely rather than deal with Spark infrastructure. With Term, you can add validation to any Rust data pipeline with minimal overhead and zero operational complexity.

Coming next: Python/Node.js bindings, streaming support, and database connectivity. I'm particularly excited about making this accessible beyond the Rust ecosystem.

I'd love feedback on:

- The validation API - does it cover your use cases?

- Performance on your real-world datasets

- What validation patterns you need that aren't supported yet

- Ideas for the Python/Node.js API design

Happy to dive into technical details about DataFusion integration, performance optimizations, or anything else!