Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•6mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support
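To make the statistics in the feature list concrete, here is a minimal plain-Java sketch of mean, sample standard deviation, and percentile over a numeric column. It is not Fahmatrix's implementation or API: the class `ColumnStats` and its method signatures are hypothetical, chosen only to mirror the names in the post. It assumes sample standard deviation (dividing by n - 1) and linear interpolation for percentiles, which are the pandas defaults.

```java
import java.util.Arrays;

// Hypothetical helper for illustration; NOT part of Fahmatrix.
final class ColumnStats {
    private ColumnStats() {}

    // Arithmetic mean; NaN for an empty column.
    static double mean(double[] values) {
        return Arrays.stream(values).average().orElse(Double.NaN);
    }

    // Sample standard deviation (divides by n - 1); NaN when fewer than two values.
    static double stdDev(double[] values) {
        if (values.length < 2) {
            return Double.NaN;
        }
        double m = mean(values);
        double sumSq = Arrays.stream(values).map(v -> (v - m) * (v - m)).sum();
        return Math.sqrt(sumSq / (values.length - 1));
    }

    // Percentile with linear interpolation between the two closest ranks; p must be in [0, 100].
    static double percentile(double[] values, double p) {
        if (p < 0.0 || p > 100.0) {
            throw new IllegalArgumentException("p must be between 0 and 100");
        }
        if (values.length == 0) {
            return Double.NaN;
        }
        double[] sorted = values.clone(); // leave the caller's array untouched
        Arrays.sort(sorted);
        double rank = (p / 100.0) * (sorted.length - 1);
        int lo = (int) Math.floor(rank);
        int hi = (int) Math.ceil(rank);
        return sorted[lo] + (rank - lo) * (sorted[hi] - sorted[lo]);
    }

    public static void main(String[] args) {
        double[] column = {2, 4, 4, 4, 5, 5, 7, 9};
        System.out.println("mean   = " + mean(column));
        System.out.println("stdDev = " + stdDev(column));
        System.out.println("median = " + percentile(column, 50));
    }
}
```

Adding `.parallel()` to the streams is one way to spread the work across cores; whether that pays off depends on the column size.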

Docs: https://moustafa-nasr.github.io/Fahmatrix/

GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•6mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•6mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•6mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is to just use the DuckDB Java API. I haven't seen anything better in performance/efficiency. Also, a SQL query is often easier to write.

theanonymousone•6mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•6mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•6mo ago
Does Polars have a Java library?
mousomashakel•6mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•6mo ago
Sure. So maybe another comment would be to make it (particularly the Series class) as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...
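One possible reading of "compatible with Java Streams" is sketched below. The `NumericSeries` class is hypothetical and is not Fahmatrix's actual `Series`; it only shows the shape of the idea: expose the column as a standard `DoubleStream`, and allow a new series to be built back from any stream.

```java
import java.util.DoubleSummaryStatistics;
import java.util.stream.DoubleStream;

// Hypothetical class for illustration; NOT Fahmatrix's Series.
final class NumericSeries {
    private final String name;
    private final double[] values;

    NumericSeries(String name, double... values) {
        this.name = name;
        this.values = values.clone(); // defensive copy keeps the series immutable
    }

    String name() {
        return name;
    }

    // Expose the column as a standard DoubleStream so filter/map/reduce work directly.
    DoubleStream stream() {
        return DoubleStream.of(values);
    }

    // Collect any DoubleStream back into a new series.
    static NumericSeries from(String name, DoubleStream stream) {
        return new NumericSeries(name, stream.toArray());
    }

    // Plain double[] copy, for libraries that work on arrays.
    double[] toArray() {
        return values.clone();
    }

    public static void main(String[] args) {
        NumericSeries prices = new NumericSeries("price", 3.0, Double.NaN, 5.0, 10.0);

        // Drop missing values with ordinary stream operations.
        NumericSeries clean =
                NumericSeries.from("price", prices.stream().filter(v -> !Double.isNaN(v)));

        // The JDK's built-in summary covers count, min, max, sum and average.
        DoubleSummaryStatistics stats = clean.stream().summaryStatistics();
        System.out.println(clean.name() + ": " + stats);
    }
}
```

The `toArray()` accessor returns a plain `double[]`, which is the input that array-based numeric libraries generally expect, so interop of the kind mentioned above would not need an adapter layer.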

mousomashakel•6mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•6mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•6mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•6mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•6mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.
