frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•10mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•10mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•10mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•10mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•10mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•10mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•10mo ago
Does Polars have a Java library?
mousomashakel•10mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•10mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•10mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•10mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•10mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•10mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•10mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: Turbolite – a SQLite VFS serving sub-250ms cold JOIN queries from S3

https://github.com/russellromney/turbolite
53•russellthehippo•1h ago•14 comments

Show HN: Orloj – agent infrastructure as code (YAML and GitOps)

https://github.com/OrlojHQ/orloj
15•An0n_Jon•15h ago•9 comments

Show HN: Layerleak – Like Trufflehog, but for Docker Hub

https://github.com/Brumbelow/layerleak
5•brumbelow•1h ago•5 comments

Show HN: Burn Room – End-to-End Encrypted Ephemeral SSH Chat

https://burnroom.chat
2•joematrix•1h ago•0 comments

Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR

https://github.com/jonwiggins/optio
71•jawiggins•1d ago•55 comments

Show HN: A plain-text cognitive architecture for Claude Code

https://lab.puga.com.br/cog/
136•marciopuga•21h ago•45 comments

Show HN: Vizier – A physical design advisor for DuckDB

4•habedi0•4h ago•0 comments

Show HN: Micro – apps without ads, algorithms or tracking

https://micro.mu
6•asim•4h ago•6 comments

Show HN: NerdFlair, a Claude Code QoL Plugin

https://github.com/jcraigk/nerdflair
2•block_dagger•5h ago•1 comments

Show HN: Wit – Stops merge conflicts when multiple AI agents edit the same repo

https://github.com/amaar-mc/wit
6•amaarc•5h ago•2 comments

Show HN: I took back Video.js after 16 years and we rewrote it to be 88% smaller

https://videojs.org/blog/videojs-v10-beta-hello-world-again
632•Heff•2d ago•138 comments

Show HN: SentinelGate – Access control for AI agents (open-source MCP proxy)

https://github.com/Sentinel-Gate/Sentinelgate
6•andreadev•6h ago•0 comments

Show HN: Yoink – Spotify to lossless with full metadata, self-hostable, ad-free

https://yoinkify.com
48•chasefrazier•1d ago•33 comments

Show HN: Full graphical desktop running on a 128MB VPS Alpine+XRDP+WindowMaker

https://tierhive.com/blog/tierhive-howto/alpine-minimal-remote-desktop-on-a-128mb-vps
6•backtogeek•7h ago•3 comments

Show HN: Mantyx – A platform to orchestrate, manage, and share your agents

https://mantyx.io/
6•grillorafael•12h ago•0 comments

Show HN: Breathe-Memory – Associative memory injection for LLMs (not RAG)

https://github.com/tkenaz/breathe-memory
5•mvyshnyvetska•7h ago•1 comments

Show HN: AI Roundtable – Let 200 models debate your question

https://opper.ai/ai-roundtable/
109•felix089•2d ago•84 comments

Show HN: ProofShot – Give AI coding agents eyes to verify the UI they build

https://github.com/AmElmo/proofshot
154•jberthom•2d ago•96 comments

Show HN: Paseo – Open-source coding agent interface (desktop, mobile, CLI)

https://github.com/getpaseo/paseo
10•boudra•7h ago•0 comments

Show HN: DuckDB community extension for prefiltered HNSW using ACORN-1

https://github.com/cigrainger/duckdb-hnsw-acorn
89•cigrainger•1d ago•7 comments

Show HN: Cloneify – AI assistant that runs your business from WhatsApp/Slack

https://cloneify.ai
3•ad-tech•8h ago•1 comments

Show HN: Pgsemantic – Point at your Postgres DB, get vector search instantly

https://github.com/varmabudharaju/pgsemantic
14•varmabudharaju•1d ago•1 comments

Show HN: Email.md – Markdown to responsive, email-safe HTML

https://www.emailmd.dev/
371•dancablam•2d ago•94 comments

Show HN: Cq – Stack Overflow for AI coding agents

https://blog.mozilla.ai/cq-stack-overflow-for-agents/
221•peteski22•3d ago•99 comments

Show HN: Gemini can now natively embed video, so I built sub-second video search

https://github.com/ssrajadh/sentrysearch
426•sohamrj•2d ago•108 comments

Show HN: Robust LLM extractor for websites in TypeScript

https://github.com/lightfeed/extractor
63•andrew_zhong•16h ago•43 comments

Show HN: Alexandria, free open source news aggregation and classification suite

https://github.com/hephaistos-io/alexandria
5•RicDan•9h ago•0 comments

Show HN: Gridland: make terminal apps that also run in the browser

https://www.gridland.io/
104•rothific•2d ago•13 comments

Show HN: Automate your workflow in plain English

https://www.operator23.com/
11•Mrakermo•23h ago•7 comments

Show HN: Hooky – A lightweight HTTP webhook server written in Go

https://github.com/virtuallytd/hooky
2•virtuallytd•11h ago•1 comments