frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•3mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•3mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•3mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•3mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•3mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•3mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•3mo ago
Does Polars have a Java library?
mousomashakel•3mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•3mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•3mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•3mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•3mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•3mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•3mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: JavaScript-free (X)HTML Includes

https://github.com/Evidlo/xsl-website
54•Evidlo•6h ago•16 comments

Nitro: A tiny but flexible init system and process supervisor

https://git.vuxu.org/nitro/about/
142•todsacerdoti•6h ago•46 comments

The First Media over QUIC CDN: Cloudflare

https://moq.dev/blog/first-cdn/
154•kixelated•7h ago•75 comments

Top Secret: Automatically filter sensitive information

https://thoughtbot.com/blog/top-secret
54•thunderbong•20h ago•3 comments

Computer Fraud Laws Used to Prosecute Leaking Air Crash Footage to CNN

https://www.techdirt.com/2025/08/22/investigators-used-terrible-computer-fraud-laws-to-ensure-people-were-punished-for-leaking-air-crash-footage-to-cnn/
61•BallsInIt•1h ago•5 comments

Glyn: Type-safe PubSub and Registry for Gleam actors with distributed clustering

https://github.com/mbuhot/glyn
24•TheWiggles•3h ago•2 comments

70% of Japan smartphone games bypass in-app payment to avoid US tech giants

https://english.kyodonews.net/articles/-/59689
52•anigbrowl•1h ago•8 comments

FFmpeg 8.0

https://ffmpeg.org/index.html#pr8.0
710•gyan•10h ago•169 comments

Mail Carriers Pause US Deliveries as Tariff Shift Sows Confusion

https://www.bloomberg.com/news/articles/2025-08-21/global-mail-services-halt-us-deliveries-ahead-of-de-minimis-end
58•voxadam•2h ago•35 comments

Transcribe music in abc with syntax highlighting

https://fugue-state.io/app?project=24024aab-22f1-43cc-abef-c1647cc59597
9•jonzudell•3h ago•2 comments

Shader Academy: Learn computer graphics by solving challenges

https://shaderacademy.com/
7•pykello•2d ago•0 comments

Launch HN: BlankBio (YC S25) - Making RNA Programmable

37•antichronology•8h ago•24 comments

U.S. government takes 10% stake in Intel

https://www.cnbc.com/2025/08/22/intel-goverment-equity-stake.html
416•givemeethekeys•4h ago•420 comments

Leaving Gmail for Mailbox.org

https://giuliomagnifico.blog/post/2025-08-18-leaving-gmail/
172•giuliomagnifico•8h ago•204 comments

What Happened to Egghead Software

https://dfarq.homeip.net/what-happened-to-egghead-software/
22•zdw•3d ago•4 comments

Why the Internet Is Turning to Shit

https://www.currentaffairs.org/news/why-the-internet-is-turning-to-shit
37•Improvement•2h ago•20 comments

LabPlot: Free, open source and cross-platform Data Visualization and Analysis

https://labplot.org/
194•turrini•16h ago•36 comments

Writing Micro Compiler in OCaml (2014)

http://troydm.github.io/blog/2014/03/29/writing-micro-compiler-in-ocaml/
15•notagoodidea•3d ago•0 comments

The issue of anti-cheat on Linux (2024)

https://tulach.cc/the-issue-of-anti-cheat-on-linux/
85•todsacerdoti•1d ago•156 comments

Show HN: Clyp – Clipboard Manager for Linux

https://github.com/murat-cileli/clyp
68•timeoperator•9h ago•38 comments

Closing the Nix gap: From environments to packaged applications for rust

https://devenv.sh/blog/2025/08/22/closing-the-nix-gap-from-environments-to-packaged-applications-for-rust/
44•domenkozar•9h ago•17 comments

Bluesky Goes Dark in Mississippi over Age Verification Law

https://www.wired.com/story/bluesky-goes-dark-in-mississippi-age-verification/
26•BallsInIt•2h ago•9 comments

South Korea deploys hologram police officer to fight crime – and it's working

https://www.scmp.com/week-asia/lifestyle-culture/article/3322654/south-korea-deploys-hologram-police-officer-fight-crime-and-its-working
15•amichail•1h ago•3 comments

Should the web platform adopt XSLT 3.0?

https://github.com/whatwg/html/issues/11578
91•protomolecool•7h ago•61 comments

Waymo granted permit to begin testing in New York City

https://www.cnbc.com/2025/08/22/waymo-permit-new-york-city-nyc-rides.html
496•achristmascarl•8h ago•453 comments

Our Response to Mississippi's Age Assurance Law

https://bsky.social/about/blog/08-22-2025-mississippi-hb1126
123•Kye•5h ago•89 comments

Launch HN: Inconvo (YC S23) – AI agents for customer-facing analytics

33•ogham•12h ago•21 comments

What about using rel="share-url" to expose sharing intents?

https://shkspr.mobi/blog/2025/08/what-about-using-relshare-url-to-expose-sharing-intents/
77•edent•13h ago•34 comments

Everything is correlated (2014–23)

https://gwern.net/everything
238•gmays•23h ago•107 comments

Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing

https://arxiv.org/abs/2508.12631
112•omarsar•10h ago•22 comments