frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java

https://github.com/moustafa-nasr/fahmatrix
46•mousomashakel•2mo ago
Hey HN, I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

Features:

Lightweight and dependency-free

CSV/TSV import with auto-headers

Series/DataFrame structures (like pandas)

describe(), mean(), stdDev(), percentile() and more

Fast parallel operations on numeric columns

Java 17+ support

Docs: https://moustafa-nasr.github.io/Fahmatrix/ GitHub: https://github.com/moustafa-nasr/fahmatrix

I’d love feedback from the Java and data communities — especially if you’ve ever wanted a simple dataframe utility in Java without needing full-scale ML libraries.

Happy to answer any questions!

Comments

rickette•2mo ago
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
mousomashakel•2mo ago
Thanks so much! Yep, I’ve seen the Kotlin DataFrame lib — very elegant. Fahmatrix is meant for plain Java users who want similar capabilities without switching ecosystems. Appreciate the support!
uwemaurer•2mo ago
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

https://github.com/jtablesaw/tablesaw

https://github.com/dflib/dflib

My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

theanonymousone•2mo ago
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null etc), which also allows SQL for more complex stuff?
radus•2mo ago
Polars and duckdb interoperate nicely and can enable this flexibility
theanonymousone•2mo ago
Does Polars have a Java library?
mousomashakel•2mo ago
Totally agree that SQL can be the best tool for many jobs. My goal with Fahmatrix is to serve the opposite niche: where devs want something that's Java-native, procedural, and simple without reaching for an external engine. SQL support or DSL might come later though — I see the appeal.
theanonymousone•2mo ago
Sure. So maybe notehr comment would be to make it (particularly the Series class), as compatible with Java Streams as possible.

Next step would likely be compatibility with popular libraries such as Apache Commons Math: https://commons.apache.org/proper/commons-math/userguide/sta...

mousomashakel•2mo ago
Thanks! I'm aware of those great projects. Fahmatrix aims to offer a lightweight, dependency-free alternative that’s easy to embed in any Java app. DuckDB is super impressive, especially for SQL-heavy tasks — but my goal is more about a native, fluent API for those who prefer direct Java code over SQL.
skanga•2mo ago
What about Tablesaw, Apache Arrow? How does this compare ...
mousomashakel•2mo ago
Good question. I’ll publish benchmarks soon, but the core difference is that Fahmatrix is fully Java, no JNI, and minimalistic — ideal for small projects or environments like Android. Tablesaw and Arrow are more powerful, but heavier. Fahmatrix aims to be the “just enough” middle ground.
owlstuffing•2mo ago
Nice!

I’m currently using manifold-sql with duckdb for this.

mousomashakel•2mo ago
Thanks! That’s a great combo — manifold-sql + duckdb gives you strong typing with powerful SQL under the hood. Fahmatrix is aiming to complement that approach for cases where you want quick, native Java code without SQL — e.g., when building data flows or custom logic inline. Would love to hear if you’ve hit any pain points that a Java-native approach could help with.

Show HN: Molab, a cloud-hosted Marimo notebook workspace

https://molab.marimo.io/notebooks
61•akshayka•5h ago•8 comments

Show HN: I built library management app for those who outgrew spreadsheets

https://www.librari.io/
43•hmkoyan•4h ago•27 comments

Show HN: Simulating autonomous drone formations

https://github.com/sushrut141/ketu
12•wanderinglight•3d ago•2 comments

Show HN: Interactive Bash tutorial that runs in the browser

https://sandbox.bio/tutorials/bash-script
2•raboukhalil•1h ago•0 comments

Show HN: Tech docs → video explainers in seconds

https://symvol.io/
2•feliks22•1h ago•0 comments

Show HN: AI File Sorter: Organize Files and Folders with AI (Local LLMs)

https://github.com/hyperfield/ai-file-sorter
3•hyperfield•1h ago•0 comments

Show HN: Benchstreet – the stock prediction AI benchmark

https://github.com/puffinsoft/benchstreet
4•ColonelParrot•2h ago•0 comments

Show HN: RateMyPrompt – share and rate prompts with auto AI evals

https://www.josh.ing/ratemyprompt
8•jshchnz•4h ago•4 comments

Show HN: PlutoFilter- A single-header, zero-allocation image filter library in C

https://github.com/sammycage/plutofilter
76•sammycage•4d ago•16 comments

Show HN: Improving search ranking with chess Elo scores

https://www.zeroentropy.dev/blog/improving-rag-with-elo-scores
189•ghita_•2d ago•64 comments

Show HN: Mock FedCM Integrations

https://mockfedcm.com/
11•portaouflop•10h ago•0 comments

Show HN: Brainfork – Create a personal RAG MCP server in seconds

https://brainfork.is
11•softwareiseasy•12h ago•0 comments

Show HN: 0xDEAD//TYPE – A fast-paced typing shooter with retro vibes

https://0xdeadtype.theden.sh/
112•theden•5d ago•27 comments

Show HN: Tips for getting great Text2Cypher outputs from LLMs for Graph RAG

https://blog.kuzudb.com/post/improving-text2cypher-for-graphrag-via-schema-pruning/
4•laminarflow027•10h ago•0 comments

Show HN: Shoggoth Mini – A soft tentacle robot powered by GPT-4o and RL

https://www.matthieulc.com/posts/shoggoth-mini
593•cataPhil•3d ago•107 comments

Show HN: Mark 1.0, a notation that unifies JSON, HTML, JSX, XML, YAML, and more

https://marknotation.org/
7•henryluo•10h ago•10 comments

Show HN: Child Affordability Index

https://childaffordabilityindex.com/
4•simplekindofman•10h ago•3 comments

Show HN: I built this to talk Danish to my girlfriend – works with any language

https://menerdu.vercel.app/
204•lil_csom•5d ago•107 comments

Show HN: BloomSearch – Keyword search with hierarchical Bloom filters

https://github.com/danthegoodman1/bloomsearch
65•dangoodmanUT•5d ago•12 comments

Show HN: Fluent-state: a new fresh take on local React state

https://github.com/marsbos/fluent-state
5•bosmarcel•13h ago•1 comments

Show HN: We made our own inference engine for Apple Silicon

https://github.com/trymirai/uzu
179•darkolorin•3d ago•46 comments

Show HN: The Aletheia Protocol – A blueprint for a web without exploitation

https://github.com/Esrbwt1/aletheia-protocol
2•Esrbwt•13h ago•1 comments

Show HN: A 'Choose Your Own Adventure' written in Emacs Org Mode

https://tendollaradventure.com/sample/
153•dskhatri•2d ago•25 comments

Show HN: Beyond Z²+C, Plot Any Fractal

https://www.juliascope.com/
100•akunzler•3d ago•26 comments

Show HN: The HTML Maze – Escape an eerie labyrinth built with HTML pages

https://htmlmaze.com/
62•kyrylo•4d ago•19 comments

Show HN: An MCP server that gives LLMs temporal awareness and time calculation

https://github.com/jlumbroso/passage-of-time-mcp
91•lumbroso•2d ago•53 comments

Show HN: Self-made web media player without <video> or <audio>

https://mediabunny.dev/examples/media-player/
8•vanilagy•16h ago•6 comments

Show HN: DataRamen, a Fast SQL Explorer with Automatic Joins and Data Navigation

https://dataramen.xyz/
47•oleksandr_dem•2d ago•55 comments

Show HN: UML is dead – so I'm building the tool to revive it

https://www.rapidcharts.ai/
5•SamiCostox•16h ago•3 comments

Show HN: ArchGW – An intelligent edge and service proxy for agents

https://github.com/katanemo/archgw/
116•honorable_coder•6d ago•15 comments