frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•11mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•11mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•11mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•11mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•11mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•11mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•11mo ago
Does Polars have a Java library?
mousomashakel•11mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•11mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•11mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•11mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•11mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•11mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•11mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: TRUST – Coding Rust like it's 1989

https://github.com/wojtczyk/trust
109•wojtczyk•17h ago•70 comments

Show HN: Kstack – Skill pack for monitoring/troubleshooting K8s in Claude Code

https://github.com/kubetail-org/kstack
14•andres•17h ago•5 comments

Show HN: Stage CLI – An easier way of reading your AI generated changes locally

https://github.com/ReviewStage/stage-cli
28•cpan22•7h ago•30 comments

Show HN: DiffCAD, a FreeCAD workbench to review model changes like code

https://github.com/eblanshey/DiffCAD
4•eblanshey•2h ago•0 comments

Show HN: Agent-skills-eval – Test whether Agent Skills improve outputs

https://github.com/darkrishabh/agent-skills-eval
67•darkrishabh•17h ago•32 comments

Show HN: I built open-source auth for AI agents (Go, single binary)

https://github.com/shark-auth/shark
4•raulgooo•2h ago•0 comments

Show HN: Tilde.run – Agent sandbox with a transactional, versioned filesystem

https://tilde.run/
191•ozkatz•1d ago•126 comments

Show HN: Rig – a Ghostty sidecar for managing agents

https://github.com/backnotprop/rig
3•ramoz•4h ago•0 comments

Show HN: DAG-based Kanji learning through components

https://mykanji.app/
2•barisozmen•4h ago•0 comments

Show HN: I built an open-source email builder, alternative to Beefree/Unlayer

https://play.templatical.com
151•oahmadov•1d ago•37 comments

Show HN: Bilig – a headless spreadsheet engine for Node services and agents

https://github.com/proompteng/bilig
3•GregKonush•5h ago•0 comments

Show HN: PHP-fts – Full-text search engine in pure PHP, no extensions

https://github.com/olivier-ls/php-fts
81•asmodios•1d ago•23 comments

Show HN: Airbyte Agents – context for agents across multiple data sources

143•mtricot•2d ago•45 comments

Show HN: Bookstoread.ai – Discover non-fiction books using AI

https://bookstoread.ai
6•yotc•5h ago•0 comments

Show HN: Hallucinopedia

http://halupedia.com/
290•bstrama•1d ago•257 comments

Show HN: Local-first long-term memory engine for AI agents·MCP/CLI· 100% local

https://github.com/misaelzapata/memoirs
4•misaelzapata•6h ago•0 comments

Show HN: I made a vertical-pedalling bike with a novel drivetrain [video]

https://www.youtube.com/watch?v=4HLOsi2gWXQ
20•tonyonodi•11h ago•0 comments

Show HN: LaoTzu Writer Studio

8•homeonthemtn•11h ago•3 comments

Show HN: AICW Video open-source to cut video into clips with captions, voiceover

https://github.com/aicw-io/aicw-video
4•eummm•12h ago•1 comments

Show HN: Explore color palettes inspired by 3000 master painter artworks

https://paletteinspiration.com/
208•ouli•2d ago•82 comments

Show HN: SereneUI – A VSCode-inspired, open-source UI for Postgres

https://github.com/serenedb/serenedb/tree/main/serene-ui
15•gnusi•12h ago•0 comments

Show HN: Apple's SHARP running in the browser via ONNX runtime web

https://github.com/bring-shrubbery/ml-sharp-web
182•bring-shrubbery•4d ago•46 comments

Show HN: nfsdiag – A NFS diagnostic application

https://github.com/lsferreira42/nfsdiag
82•lsferreira42•5d ago•6 comments

Show HN: Password-less authentication via Ramanujan's 1/π series and Nvidia-B200

https://zenodo.org/records/20049892
5•Prakash_1•17h ago•0 comments

Show HN: Ableton Live MCP

https://github.com/bschoepke/ableton-live-mcp
121•bschoepke•4d ago•78 comments

Show HN: I built a game where AI agents compete to ship code

https://aion.quest/
9•xkoda•1d ago•5 comments

Show HN: Platos – like Claude Managed Agents but open-source and self-hosted

https://github.com/winsenlabs/platos
7•tejassuds•1d ago•0 comments

Show HN: Vanilla-scroll-sky: CSS-only modern scroll-driven storytelling sections

https://github.com/ulrischa/vanilla-scroll-sky
6•ulrischa•1d ago•0 comments

Show HN: Mac Juice Monitor – Bluetooth battery levels in the macOS menu bar

https://github.com/p32929/mac-juice-monitor
5•heliskyr2•1d ago•3 comments

Show HN: I Replaced React in GraphiQL with Svelte

https://code.webb.page/eol/graphiql.git/about/
5•NetOpWibby•20h ago•1 comments