frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Economists vs. Technologists on AI

https://ideasindevelopment.substack.com/p/economists-vs-technologists-on-ai
1•econlmics•1m ago•0 comments

Life at the Edge

https://asadk.com/p/edge
1•tosh•7m ago•0 comments

RISC-V Vector Primer

https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md
2•oxxoxoxooo•11m ago•1 comments

Show HN: Invoxo – Invoicing with automatic EU VAT for cross-border services

2•InvoxoEU•11m ago•0 comments

A Tale of Two Standards, POSIX and Win32 (2005)

https://www.samba.org/samba/news/articles/low_point/tale_two_stds_os2.html
2•goranmoomin•15m ago•0 comments

Ask HN: Is the Downfall of SaaS Started?

3•throwaw12•16m ago•0 comments

Flirt: The Native Backend

https://blog.buenzli.dev/flirt-native-backend/
2•senekor•18m ago•0 comments

OpenAI's Latest Platform Targets Enterprise Customers

https://aibusiness.com/agentic-ai/openai-s-latest-platform-targets-enterprise-customers
1•myk-e•20m ago•0 comments

Goldman Sachs taps Anthropic's Claude to automate accounting, compliance roles

https://www.cnbc.com/2026/02/06/anthropic-goldman-sachs-ai-model-accounting.html
2•myk-e•23m ago•3 comments

Ai.com bought by Crypto.com founder for $70M in biggest-ever website name deal

https://www.ft.com/content/83488628-8dfd-4060-a7b0-71b1bb012785
1•1vuio0pswjnm7•24m ago•1 comments

Big Tech's AI Push Is Costing More Than the Moon Landing

https://www.wsj.com/tech/ai/ai-spending-tech-companies-compared-02b90046
3•1vuio0pswjnm7•26m ago•0 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
2•1vuio0pswjnm7•27m ago•0 comments

Suno, AI Music, and the Bad Future [video]

https://www.youtube.com/watch?v=U8dcFhF0Dlk
1•askl•29m ago•2 comments

Ask HN: How are researchers using AlphaFold in 2026?

1•jocho12•32m ago•0 comments

Running the "Reflections on Trusting Trust" Compiler

https://spawn-queue.acm.org/doi/10.1145/3786614
1•devooops•37m ago•0 comments

Watermark API – $0.01/image, 10x cheaper than Cloudinary

https://api-production-caa8.up.railway.app/docs
1•lembergs•39m ago•1 comments

Now send your marketing campaigns directly from ChatGPT

https://www.mail-o-mail.com/
1•avallark•42m ago•1 comments

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

https://github.com/joelparkerhenderson/queueing-theory
1•jph•54m ago•0 comments

Show HN: Hibana – choreography-first protocol safety for Rust

https://hibanaworks.dev/
5•o8vm•56m ago•1 comments

Haniri: A live autonomous world where AI agents survive or collapse

https://www.haniri.com
1•donangrey•57m ago•1 comments

GPT-5.3-Codex System Card [pdf]

https://cdn.openai.com/pdf/23eca107-a9b1-4d2c-b156-7deb4fbc697c/GPT-5-3-Codex-System-Card-02.pdf
1•tosh•1h ago•0 comments

Atlas: Manage your database schema as code

https://github.com/ariga/atlas
1•quectophoton•1h ago•0 comments

Geist Pixel

https://vercel.com/blog/introducing-geist-pixel
2•helloplanets•1h ago•0 comments

Show HN: MCP to get latest dependency package and tool versions

https://github.com/MShekow/package-version-check-mcp
1•mshekow•1h ago•0 comments

The better you get at something, the harder it becomes to do

https://seekingtrust.substack.com/p/improving-at-writing-made-me-almost
2•FinnLobsien•1h ago•0 comments

Show HN: WP Float – Archive WordPress blogs to free static hosting

https://wpfloat.netlify.app/
1•zizoulegrande•1h ago•0 comments

Show HN: I Hacked My Family's Meal Planning with an App

https://mealjar.app
1•melvinzammit•1h ago•0 comments

Sony BMG copy protection rootkit scandal

https://en.wikipedia.org/wiki/Sony_BMG_copy_protection_rootkit_scandal
2•basilikum•1h ago•0 comments

The Future of Systems

https://novlabs.ai/mission/
2•tekbog•1h ago•1 comments

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•1h ago•0 comments
Open in hackernews

Show HN: Term – Rust-based data validation with OpenTelemetry

https://github.com/withterm/term
3•ericpsimon•6mo ago
Hi HN, I'm Eric and I'm a recovering data engineer. Recently I have worked on the data platforms for multiple YC backed start-ups Kable (YC W22) and Finch (YC S20).

Every data team I've worked with struggles with data quality validation. Current solutions like Apache Deequ require spinning up entire Spark clusters just to check if your data meets basic quality constraints.

When I found Apache DataFusion, it was love at first sight - it provided the ergonomics of Apache Spark, without the overhead, JVM, etc. That is what led me to build Term. It is able to take advantage of the ergonomics of Spark without the overhead.

Term is a Rust library that provides Deequ-style data validation using Apache DataFusion. You can run comprehensive data quality checks anywhere - from your laptop to CI/CD pipelines - without any JVM or cluster setup. On a 1M row dataset with 20 constraints, Term completes validation in 0.21 seconds (vs 3.2 seconds without optimization) by intelligently batching operations into just 2 scans instead of 20.

The technical approach: Term leverages DataFusion's columnar processing engine to efficiently validate data in Arrow format. Validation rules compile directly to DataFusion's physical plans, and Rust's zero-cost abstractions mean the overhead is minimal. You get 100MB/s single-core throughput, which often outperforms distributed solutions for datasets under 100GB.

Term supports all the validation patterns you'd expect - completeness checks, uniqueness validation, statistical analysis (mean, correlation, standard deviation), pattern matching, custom SQL expressions, and built-in OpenTelemetry integration for production observability. The entire setup takes less than 5 minutes - just `cargo add term-guard` and you're validating data.

GitHub: https://github.com/withterm/term

I built this because I was tired of seeing teams skip data validation entirely rather than deal with Spark infrastructure. With Term, you can add validation to any Rust data pipeline with minimal overhead and zero operational complexity.

Coming next: Python/Node.js bindings, streaming support, and database connectivity. I'm particularly excited about making this accessible beyond the Rust ecosystem.

I'd love feedback on:

- The validation API - does it cover your use cases?

- Performance on your real-world datasets

- What validation patterns you need that aren't supported yet

- Ideas for the Python/Node.js API design

Happy to dive into technical details about DataFusion integration, performance optimizations, or anything else!