frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•9mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•9mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•9mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•9mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•9mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•9mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•9mo ago
Does Polars have a Java library?
mousomashakel•9mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•9mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•9mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•9mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•9mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•9mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•9mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: Moongate – Ultima Online server emulator in .NET 10 with Lua scripting

https://github.com/moongate-community/moongatev2
26•squidleon•1h ago•8 comments

Show HN: Interactive 3D globe of EU shipping emissions

https://seafloor.pages.dev
4•marcohaber•1h ago•1 comments

Show HN: Swarm – Program a colony of 200 ants using a custom assembly language

https://dev.moment.com/
131•armandhammer10•11h ago•40 comments

Show HN: PageAgent, A GUI agent that lives inside your web app

https://alibaba.github.io/page-agent/
130•simon_luv_pho•22h ago•66 comments

Show HN: Jido 2.0, Elixir Agent Framework

https://jido.run/blog/jido-2-0-is-here
309•mikehostetler•23h ago•62 comments

Show HN: Metateam: run many Claude/Codex/Gemini CLI instances in one terminal UI

https://www.metateam.ai/
2•falsename•3h ago•1 comments

Show HN: Tensor Spy: inspect NumPy and PyTorch tensors in the browser, no upload

https://tensorspy.com/
20•jacobn•3d ago•1 comments

Show HN: Moltty – Organized, Persistent AI Coding Sessions

https://moltty.com/
6•ronreiter•3h ago•1 comments

Show HN: Markdown-to-Book – Convert Markdown to KDP Ready PDFs and EPUBs

https://github.com/vpuna/markdown-to-book
3•vpuna•4h ago•1 comments

Show HN: Moji – A read-it-later app with self-organizing smart collections

https://moji.pcding.com
10•desmonding•14h ago•3 comments

Show HN: What Installing a New OS Taught Me About Disk Partitions

https://access2vivek.com/what-installing-debian-taught-me/
2•access2vivek•6h ago•0 comments

Show HN: PenguWarpOS – OS SIM made in py for Linux newbies

https://github.com/theidioticdev/penguwarpos/tree/testing
2•theidioticdev•6h ago•0 comments

Show HN: Poppy – A simple app to stay intentional with relationships

https://poppy-connection-keeper.netlify.app/
173•mahirhiro•1d ago•111 comments

Show HN: SitDeck – Customizable live dashboard of news, markets, threats

https://sitdeck.com
20•danushman•17h ago•13 comments

Show HN: Kanon 2 Enricher – the first hierarchical graphitization model

https://isaacus.com/blog/kanon-2-enricher
10•ubutler•3d ago•6 comments

Show HN: Hormuz Crisis Dashboard Real-time shipping disruption tracker

https://www.hormuztracker.com/
26•MrNekked•1d ago•2 comments

Show HN: Stacked Game of Life

https://stacked-game-of-life.koenvangilst.nl/
193•vnglst•5d ago•27 comments

Show HN: Tailwatch – Hierarchical event monitor with push notifications

https://tailwatch.dev
3•overthinkerer•8h ago•3 comments

Show HN: 1v1 coding game that LLMs struggle with

https://yare.io
3•levmiseri•8h ago•2 comments

Show HN: A simple, auto-layout family tree generator

https://familytreeeasy.com
6•familytreeeasy•9h ago•4 comments

Show HN: WingNews – Htmx Hacker News Reader

https://news.wingman.actor/
3•ChaseRensberger•10h ago•0 comments

Show HN: Rust compiler in PHP emitting x86-64 executables

https://github.com/mrconter1/rustc-php
65•mrconter11•5d ago•50 comments

Show HN: Steadwing – Your Autonomous On-Call Engineer

https://www.steadwing.com/
6•abejith•11h ago•0 comments

Show HN: I made a zero-copy coroutine tracer to find my scheduler's lost wakeups

https://github.com/lixiasky-back/coroTracer
48•lixiasky•3d ago•3 comments

Show HN: Vertex.js – A 1kloc SPA Framework

https://lukeb42.github.io/vertex-manual.html
46•LukeB42•5d ago•25 comments

Show HN: Kybernis – Prevent AI agents from executing the same action twice

https://kybernis.io
6•wingrammer•18h ago•2 comments

Show HN: MCPSec – OWASP MCP Top Scanner for Model Context Protocol Configs

https://github.com/pfrederiksen/mcpsec
2•iamdesertpaul•13h ago•0 comments

Show HN: An AI Agent Running a Real Business (Thewebsite.app)

https://www.thewebsite.app/
3•thewebsite_ai•13h ago•5 comments

Show HN: I made a design portfolio reviewer

https://www.evalv.ai/
2•eldardesign•14h ago•4 comments

Show HN: A shell-native cd-compatible directory jumper using power-law frecency

https://github.com/jghub/sd-switchdir
24•jghub•2d ago•16 comments