frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•8mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•8mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•8mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•8mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•8mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•8mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•8mo ago
Does Polars have a Java library?
mousomashakel•8mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•8mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•8mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•8mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•8mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•8mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•8mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: I trained a 9M speech model to fix my Mandarin tones

https://simedw.com/2026/01/31/ear-pronunication-via-ctc/
261•simedw•11h ago•95 comments

Show HN: Phage Explorer

https://phage-explorer.org/
72•eigenvalue•6h ago•16 comments

Show HN: SF Microclimates

https://github.com/solo-founders/sf-microclimates
36•weisser•5d ago•31 comments

Show HN: Blink – Native macOS code snippet manager. Local, offline, <1s search

https://www.enclyralabs.com/
2•enclyra•2h ago•2 comments

Show HN: Amla Sandbox – WASM bash shell sandbox for AI agents

https://github.com/amlalabs/amla-sandbox
136•souvik1997•21h ago•71 comments

Show HN: Pinecone Explorer – Desktop GUI for the Pinecone vector database

https://www.pinecone-explorer.com
25•arsentjev•3d ago•3 comments

Show HN: Kolibri, a DIY music club in Sweden

https://kolibrinkpg.com/
130•EastLondonCoder•1d ago•23 comments

Show HN: I built an AI conversation partner to practice speaking languages

https://apps.apple.com/us/app/talkbits-speak-naturally/id6756824177
64•omarisbuilding•13h ago•48 comments

Show HN: Interactive Equation Solver

2•dharmatech•4h ago•0 comments

Show HN: Cicada – A scripting language that integrates with C

https://github.com/heltilda/cicada
55•briancr•23h ago•31 comments

Show HN: Mystral Native – Run JavaScript games natively with WebGPU (no browser)

https://github.com/mystralengine/mystralnative
45•Flux159•3d ago•16 comments

Show HN: Ourguide – OS wide task guidance system that shows you where to click

https://ourguide.ai
52•eshaangulati•4d ago•22 comments

Show HN: ShapedQL – A SQL engine for multi-stage ranking and RAG

https://playground.shaped.ai
79•tullie•3d ago•23 comments

Show HN: OpenVideo – A self-hostable, open-source video editor in the browser

https://github.com/openvideodev/openvideo
2•snapmotion•10h ago•1 comments

Show HN: Foundry – Turns your repeated workflows into one-click commands

https://github.com/lekt9/openclaw-foundry
10•getfoundry•11h ago•3 comments

Show HN: Hosted OpenClaw with Secure Isolation

https://moltcloud.ai/blog/hosted-openclaw/
2•stubbi•11h ago•0 comments

Show HN: I'm building an AI-proof writing tool. How would you defeat it?

https://auth-auth.vercel.app/
21•callmeed•2d ago•30 comments

Show HN: LemonSlice – Upgrade your voice agents to real-time video

129•lcolucci•3d ago•130 comments

Show HN: Git primitives for autonomous coding agents

https://github.com/raine/git-surgeon
2•rane•13h ago•0 comments

Show HN: The HN Arcade

https://andrewgy8.github.io/hnarcade/
346•yuppiepuppie•3d ago•116 comments

Show HN: Daily Cat

https://daily.cat/
3•abraham•13h ago•0 comments

Show HN: Using World Models for Consistent AI Filmmaking

https://getartcraft.com/news/world-models-for-film
2•echelon•14h ago•0 comments

Show HN: SHDL – A minimal hardware description language built from logic gates

https://github.com/rafa-rrayes/SHDL
47•rafa_rrayes•3d ago•21 comments

Show HN: Build Web Automations via Demonstration

https://www.notte.cc/launch-week-i/demonstrate-mode
32•ogandreakiro•3d ago•20 comments

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
215•jmuncor•2d ago•119 comments

Show HN: One Human + One Agent = One Browser From Scratch in 20K LOC

https://emsh.cat/one-human-one-agent-one-browser/
316•embedding-shape•3d ago•151 comments

Show HN: I built a small browser engine from scratch in C++

https://github.com/beginner-jhj/mini_browser
144•crediblejhj•2d ago•45 comments

Show HN: We Built the 1. EU-Sovereignty Audit for Websites

https://lightwaves.io/en/eu-audit/
104•cmkr•3d ago•88 comments

Show HN: Shelvy Books

https://shelvybooks.com
48•tekkie00•2d ago•18 comments

Show HN: Only 1 LLM can fly a drone

https://github.com/kxzk/snapbench
179•beigebrucewayne•5d ago•92 comments