Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•10mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support
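To make the statistics in the feature list concrete, here is a minimal plain-Java sketch of what mean, standard deviation, and percentile over one numeric column can look like when built on parallel streams. It does not use Fahmatrix and is not its API: the `ColumnStats` class, its method signatures, the choice of sample (n - 1) standard deviation, and the linear-interpolation percentile are all assumptions made for illustration.

```java
import java.util.Arrays;

/** Hypothetical helper, NOT part of Fahmatrix: summary statistics over one numeric column. */
final class ColumnStats {
    private ColumnStats() {}

    /** Arithmetic mean, computed with a parallel primitive stream. NaN for an empty column. */
    static double mean(double[] values) {
        return Arrays.stream(values).parallel().average().orElse(Double.NaN);
    }

    /** Sample standard deviation (n - 1 denominator). NaN for fewer than two values. */
    static double stdDev(double[] values) {
        if (values.length < 2) {
            return Double.NaN;
        }
        double m = mean(values);
        double sumSq = Arrays.stream(values)
                .parallel()
                .map(v -> (v - m) * (v - m))
                .sum();
        return Math.sqrt(sumSq / (values.length - 1));
    }

    /** Percentile for p in [0, 100], interpolating linearly between the two closest ranks. */
    static double percentile(double[] values, double p) {
        if (!(p >= 0.0 && p <= 100.0)) {
            throw new IllegalArgumentException("p must be in [0, 100]: " + p);
        }
        if (values.length == 0) {
            return Double.NaN;
        }
        double[] sorted = values.clone();   // leave the caller's array untouched
        Arrays.parallelSort(sorted);
        double rank = p / 100.0 * (sorted.length - 1);
        int lo = (int) Math.floor(rank);
        int hi = (int) Math.ceil(rank);
        return sorted[lo] + (rank - lo) * (sorted[hi] - sorted[lo]);
    }

    public static void main(String[] args) {
        double[] column = {2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0};
        System.out.println("mean   = " + mean(column));
        System.out.println("stdDev = " + stdDev(column));
        System.out.println("median = " + percentile(column, 50));
    }
}
```

For small columns the parallel stream is likely slower than a sequential one because of the fork/join overhead, so a real library would probably switch strategy based on column size.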

Docs: https://moustafa-nasr.github.io/Fahmatrix/

GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•10mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•10mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•10mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is to just use the DuckDB Java API. I haven't seen anything better in performance or efficiency. Also, a SQL query is often easier to write.

theanonymousone•10mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•10mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•10mo ago
Does Polars have a Java library?
mousomashakel•10mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•10mo ago
Sure. So maybe another comment would be to make it (particularly the Series class) as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...
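As a rough illustration of the Streams suggestion above, a Series-like type could expose its values as a `DoubleStream` and accept one back. This is only a sketch under stated assumptions: `SeriesSketch`, its constructor, and the `stream()` and `of(...)` methods are hypothetical names, and Fahmatrix's actual Series class may look quite different.

```java
import java.util.Arrays;
import java.util.stream.DoubleStream;

/** Hypothetical Series-like wrapper; NOT Fahmatrix's real Series class. */
final class SeriesSketch {
    private final String name;
    private final double[] values;

    SeriesSketch(String name, double... values) {
        this.name = name;
        this.values = values.clone();   // defensive copy keeps the series immutable
    }

    String name() {
        return name;
    }

    /** Exposes the column as a primitive stream so the standard Streams API applies. */
    DoubleStream stream() {
        return Arrays.stream(values);
    }

    /** Builds a new series from any DoubleStream, the inverse of stream(). */
    static SeriesSketch of(String name, DoubleStream source) {
        return new SeriesSketch(name, source.toArray());
    }

    public static void main(String[] args) {
        SeriesSketch tempsC = new SeriesSketch("temp_c", 18.5, 21.0, 19.5, 25.0);

        // Filter and transform with plain Streams, then wrap the result again.
        SeriesSketch warmF = SeriesSketch.of(
                "warm_f",
                tempsC.stream().filter(c -> c > 19.0).map(c -> c * 9 / 5 + 32));

        System.out.println(warmF.name() + " = " + Arrays.toString(warmF.stream().toArray()));
    }
}
```

Because `stream().toArray()` yields a plain `double[]`, the same bridge would also cover array-based statistics utilities without the series type depending on them directly.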

mousomashakel•10mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•10mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•10mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•10mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•10mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.
