frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Buckaroo – Data table UI for Notebooks

https://github.com/paddymul/buckaroo
105•paddy_m•8mo ago
Buckaroo is my open source project. It is a dataframe viewer that has the basic features we expect in a modern table - scroll, search, sort. In addition there are summary stats, and histograms available. Buckaroo support Pandas and Polars dataframes and works on Jupter, Marimo, VSCode and Google Colab notebooks. All of this is extensible. I think of Buckaroo as a framework for building table UIs, and an initial data exploration app built on top of that framework. AG-Grid is used for the core table display and it has been customized with a declarative layer so you don't have to pass JS functions around for customizations. On the python side there is a framework for adding summary stats (with a small DAG for dependencies). There is also an entire Low Code UI for point and click selection of common commands (drop column). The lowcode UI also generates a python function that accomplishes the same tasks. This is built on top of JLisp - a small lisp interpreter that reads JSON flavored lisp.

Auto Cleaning looks at columns and heuristically suggests common cleaning operations. The operations are added to the lowcode UI where they can be edited. Multiple cleaning strategies can be applied and the best fit retained. Autocleaning without a UI and multiple strategies is very opaque. Since this runs heuristically (not with an LLM), it’s fast and data stays local.

I'm eager to hear feedback from data scientists and other users of dataframes/notebooks.

Comments

ZeroCool2u•8mo ago
This looks really cool. I will say my default solution for this, and the default across my org, is Data Wrangler in VS Code[1]. My only wish list item is if the low code solution wrote polars instead of pandas. Any thoughts on how hard that might be to accomplish?

1: https://marketplace.visualstudio.com/items?itemName=ms-tools...

paddy_m•8mo ago
Thank you.

The Buckaroo lowcode UI is capable of working with Polars, but I don't currently have any commands plumbed in. I will work on that.

I'm aware of Data Wrangler and they did nice work, but it's closed source and from what I can tell non-extensible. What features do you like in Data Wrangler, what do you wish it did differently?

paddy_m•8mo ago
I made a Marimo WASM example that you can play with in your browser [1]

I need to make some updates to the polars functionality, I just completed some extensive refactorings of the Lowcode UI focussed on pandas, time to clean that up for polars too.

Also the python codegen for polars is non-idiomatic with multiple re-assignments to a dataframe, vs one big select block. I have some ideas for how to fix that, but they'll take time.

https://marimo.io/p/@paddy-mullen/notebook-sctuj8

RyanHamilton•8mo ago
Congratulations on launching. Buckaroo looks great.
franky47•8mo ago
But does it work across data tables with 8 dimensions?
trsohmers•8mo ago
Only with the oscillation overthruster flag enabled.
hodder•8mo ago
Looks cool to me. I often just end out exporting and opening in excel to do this
epistasis•8mo ago
This is really great, I'm looking forward to playing with it.

Currently I use a mix of quak (preferred) and itable (if starting fom a colab notebook). It will be interesting to compare for my use cases, which most consist of checking for the distribution of data in a new file, or verifying that a transform I did resulted in the right sort of stuff.

mathisd•8mo ago
How does it compare to Data Wrangler ? I like Data Wrangler because it let us open up in a separate VS Code window.

BOHR Chain's "AI Protocol" $2M raise: technical architecture seems non-existent

1•Kangaroo_•3m ago•0 comments

Manager Is a System. They Need an API

https://reluctantleadership.substack.com/p/your-manager-is-a-system
1•oxygenfoxx•8m ago•0 comments

Gary Marcus on the Problems Facing AI and LLM Scaling [video]

https://www.youtube.com/watch?v=aI7XknJJC5Q
1•7777777phil•10m ago•0 comments

RISC-V and Post-Quantum Cryptography

https://fprox.substack.com/p/risc-v-and-post-quantum-cryptography
1•hasheddan•10m ago•0 comments

Skillware

https://github.com/ARPAHLS/skillware
1•rosspeili•11m ago•0 comments

Ask HN: What non-fiction do you read?

1•yanis_t•11m ago•0 comments

Fair is Better than Sensational:Man is to Doctor as Woman is to Doctor (2019)

https://arxiv.org/abs/1905.09866
1•bhickey•12m ago•0 comments

Scientific Insolvency in GPQA and HLE: A forensic audit reveals 58% error rate

https://zenodo.org/records/18293568
1•jopsammy•14m ago•1 comments

Running Claude Code dangerously (safely)

https://blog.emilburzo.com/2026/01/running-claude-code-dangerously-safely/
1•emilburzo•17m ago•3 comments

Calculate your reach on X/Twitter

https://allscreenshots.com/tools/x-algorithm-calculator
1•erikpau•20m ago•1 comments

Show HN: TakaTime – Self-Hosted WakaTime Alternative (Go and MongoDB)

https://github.com/Rtarun3606k/TakaTime
1•Rtarun3606k•21m ago•0 comments

GDPR as a blueprint for risk-aware architecture

https://medium.com/@antonbm/gdpr-as-a-blueprint-for-risk-aware-architecture-d8f811d1ec1a
1•antonmb•25m ago•0 comments

Show HN: AI Clothes Changer – virtual try-on with pose control

https://girlgenai.com
1•jokera•25m ago•0 comments

Local models to support home network infrastructure?

1•DrAwdeOccarim•26m ago•0 comments

AGI basic building block in your terminal

https://github.com/bokan/claude-skill-self-improvement
1•bbokan•27m ago•0 comments

Are published ANN-Benchmarks DBMS results trustworthy?

https://blog.ydb.tech/are-published-ann-benchmarks-dbms-results-trustworthy-f2573eca4e07
1•AlexClickHouse•29m ago•0 comments

Sorting Algortihms Visualized [video]

https://www.youtube.com/shorts/FI-9z00yvnE
1•dnnsthnnr•29m ago•1 comments

Show HN: Governed AI Portfolio–admission control for agentic sys in production

1•lexseasson•31m ago•0 comments

Photic Sneeze Reflex

https://en.wikipedia.org/wiki/Photic_sneeze_reflex
1•thunderbong•32m ago•0 comments

I got 3 parallel agents to change 149 files with 17 errors instead of 500

1•mvgnus•32m ago•0 comments

Net Zero: a multi-trillion-pound catastrophe

https://www.spiked-online.com/2026/01/19/net-zero-a-multi-trillion-pound-catastrophe/
2•mpweiher•33m ago•0 comments

PDFTextor – Fast GUI tool to extract text from single or multiple PDFs

https://gum.new/gum/cmk3n0dst002504ky9ulpdf2u
1•Dev_Master•33m ago•1 comments

The Lost Art of Structure Packing

http://www.catb.org/esr/structure-packing/
2•tosh•33m ago•0 comments

Ardalambion – Of the Tongues of Arda, the Invented World of JRR Tolkien

https://www.ardalambion.org/
2•saberhagen•33m ago•0 comments

News Espressif Introduces ESP32-E22, First Wi-Fi 6E Connectivity Co-Processor

https://www.espressif.com/en/news/ESP32_E22_Announcement
1•hasheddan•36m ago•0 comments

Small Kafka: Tansu and SQLite on a free t3.micro

https://blog.tansu.io/articles/broker-aws-free-tier
1•rmoff•37m ago•0 comments

I Forked Google Flatbuffers

https://digitalarsenal.github.io/flatbuffers/
1•tjkoury•38m ago•1 comments

Sony and Tcl Sign Memorandum of Understanding for Strategic Partnership

https://www.sony.co.jp/en/news-release/202601/26-0120E/
1•ksec•44m ago•0 comments

Show HN: Async HTTP handler plugin for the AWS SDK for Ruby, built on async-HTTP

https://github.com/thomaswitt/aws-sdk-http-async
1•thomas_witt•45m ago•0 comments

Show HN: Pikchr.pl – Make Pikchr diagrams using Prolog

https://github.com/exlee/pikchr.pl
2•xlii•45m ago•0 comments