frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•7mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•7mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•7mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•7mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•7mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•7mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•7mo ago
Does Polars have a Java library?
mousomashakel•7mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•7mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•7mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•7mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•7mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•7mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•7mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: Offline tiles and routing and geocoding in one Docker Compose stack

https://www.corviont.com/
79•packet_mover•10h ago•23 comments

Show HN: Comet MCP – Give Claude Code a browser that can click

https://github.com/hanzili/comet-mcp
4•hanzili•1h ago•0 comments

Show HN: Auxide- a Real-Time Audio Graph Library for Rust

https://github.com/Michael-A-Kuykendall/auxide
2•MKuykendall•1h ago•0 comments

Show HN: I built an international calling platform for the past 6 months

https://voklit.com
19•ahmgeek•5h ago•13 comments

Show HN: Til.re – The URL is your timer, no signup required

https://til.re
5•iamlin•2h ago•1 comments

Show HN: uvx ptn, scan a QR, get a terminal in your phone

https://github.com/lyehe/porterminal
90•yxl448•21h ago•34 comments

Show HN: Website that plays the lottery every second

https://lotteryeverysecond.lffl.me/
235•Loeffelmann•1d ago•131 comments

Show HN: Vibe Coding a static site on a $25 Walmart Phone

https://stetsonblake.com/%2425+Walmart+Phone+for+Hackers
66•stets•6h ago•61 comments

Show HN: FP-pack – Functional pipelines in TypeScript without monads

https://github.com/superlucky84/fp-pack
14•superlucky84•11h ago•16 comments

Show HN: Walkie-Talkie Toy over WiFi/Tailscale on Raspberry Pi

https://github.com/spieglt/talkis
5•spieglt•5h ago•0 comments

Show HN: ZELF – A modular ELF64 packer with 22 vintage and modern codecs

https://github.com/seb3773/zelf
2•seb3773•6h ago•0 comments

Show HN: A New Year gift for Python devs–My self-healing project's DNA analyzer

https://github.com/misa76868-lang/ultra-meta-seed
3•ultra-meta•6h ago•1 comments

Show HN: Turbo – Python Web Framework

https://github.com/lucianmarin/turbo
3•lcnmrn•8h ago•0 comments

Show HN: Use Claude Code to Query 600 GB Indexes over Hacker News, ArXiv, etc.

https://exopriors.com/scry
387•Xyra•3d ago•140 comments

Show HN: Missing puzzle in binary analysis: finding num constants in .so/.bin

3•DenisDolya•9h ago•0 comments

Show HN: OpenWorkers – Self-hosted Cloudflare workers in Rust

https://openworkers.com/introducing-openworkers
495•max_lt•2d ago•152 comments

Show HN: The ASCII Side of the Moon

https://aleyan.com/projects/ascii-side-of-the-moon/?lat=32.72&lon=-117.16&date=2025-12-29T14%3A24Z
22•aleyan•5d ago•1 comments

Show HN: Go-Highway – Portable SIMD for Go

https://github.com/ajroetker/go-highway
22•kingcauchy•1d ago•1 comments

Show HN: I built a clipboard tool to strip/keep specific formatting like Italics

https://custompaste.com
35•EvaWorld9•1d ago•14 comments

Show HN: Jetbase – A Python database migration tool (Alembic alternative)

https://github.com/jetbase-hq/jetbase
3•jaz1•11h ago•2 comments

Show HN: Enroll, a tool to reverse-engineer servers into Ansible config mgmt

https://enroll.sh
260•_mig5•3d ago•50 comments

Show HN: CCC – Control Claude Code Sessions Remotely via Telegram

https://github.com/kidandcat/ccc
3•kidandcat•11h ago•2 comments

Show HN: Wario Synth – Turn any song into Game Boy version

https://www.wario.style
71•birdmania•2d ago•17 comments

Show HN: I used AI to recreate a $4000 piece of audio hardware as a plugin

16•johnwheeler•1d ago•7 comments

Show HN: BusterMQ, Thread-per-core NATS server in Zig with io_uring

https://bustermq.sh/
133•jbaptiste•3d ago•69 comments

Show HN: Dealta – A game-theoretic decentralized trading protocol

https://github.com/orgs/Dealta-Foundation/repositories
54•kalenvale•1d ago•34 comments

Show HN: HackLens – A fast and clean Android/iOS Hacker News reader

4•drop_dev•14h ago•0 comments

Show HN: 22 GB of Hacker News in SQLite

https://hackerbook.dosaygo.com
719•keepamovin•4d ago•217 comments

Show HN: Neural Net Flies Navigate through a maze

https://claude.ai/public/artifacts/4c5465b6-c384-4c9f-8a92-a4f840367ec2
13•logicallee•6d ago•5 comments

Show HN: Tasker – An open-source desktop agent for browser and OS automation

https://automatewithtasker.com/
33•schnetzlerjoe•2d ago•29 comments