Show HN: Buckaroo – Data table UI for Notebooks

https://github.com/paddymul/buckaroo
105•paddy_m•7mo ago
Buckaroo is my open source project. It is a dataframe viewer that has the basic features we expect in a modern table: scroll, search, and sort. In addition, summary stats and histograms are available. Buckaroo supports Pandas and Polars dataframes and works in Jupyter, Marimo, VSCode and Google Colab notebooks. All of this is extensible. I think of Buckaroo as a framework for building table UIs, and an initial data exploration app built on top of that framework. AG-Grid is used for the core table display and it has been customized with a declarative layer so you don't have to pass JS functions around for customizations. On the Python side there is a framework for adding summary stats (with a small DAG for dependencies). There is also an entire Low Code UI for point-and-click selection of common commands (such as drop column). The Low Code UI also generates a Python function that accomplishes the same tasks. This is built on top of JLisp, a small Lisp interpreter that reads JSON-flavored Lisp.
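To make the idea of a JSON-flavored Lisp concrete, here is a minimal sketch of such an interpreter. It is not Buckaroo's implementation: the command names (`dropcol`, `fillna`), the `{"symbol": ...}` encoding, and the dict-of-lists stand-in for a dataframe are all assumptions chosen for illustration.

```python
import json

# Hypothetical commands: each takes a table (dict of column name -> list of
# values) plus arguments, and returns a new table without mutating the input.
def drop_col(table, col):
    return {name: values for name, values in table.items() if name != col}

def fill_na(table, col, value):
    filled = [value if v is None else v for v in table[col]]
    return {**table, col: filled}

COMMANDS = {"dropcol": drop_col, "fillna": fill_na}

def evaluate(expr, env):
    """Evaluate one JSON-encoded s-expression against an environment."""
    if isinstance(expr, dict) and "symbol" in expr:
        return env[expr["symbol"]]  # variable lookup, e.g. {"symbol": "df"}
    if isinstance(expr, list) and expr:
        head, *args = expr
        if isinstance(head, dict) and head.get("symbol") in COMMANDS:
            func = COMMANDS[head["symbol"]]
            return func(*(evaluate(arg, env) for arg in args))
        return [evaluate(item, env) for item in expr]  # plain list literal
    return expr  # numbers, strings, None, empty lists

def run_program(program_json, table):
    """Apply a JSON list of commands in order, threading the table through."""
    for command in json.loads(program_json):
        table = evaluate(command, {"df": table})
    return table

program = """[
  [{"symbol": "dropcol"}, {"symbol": "df"}, "b"],
  [{"symbol": "fillna"},  {"symbol": "df"}, "a", 0]
]"""
table = {"a": [1, None, 3], "b": ["x", "y", "z"]}
result = run_program(program, table)
print(result)
```

Because the program is plain JSON, a point-and-click UI can build and edit it as data, and the same list of commands can be walked a second time to emit Python source instead of executing it.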

Auto Cleaning looks at columns and heuristically suggests common cleaning operations. The operations are added to the lowcode UI where they can be edited. Multiple cleaning strategies can be applied and the best fit retained. Autocleaning without a UI and multiple strategies is very opaque. Since this runs heuristically (not with an LLM), it’s fast and data stays local.
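As a rough illustration of heuristic cleaning with several competing strategies, the sketch below scores each strategy by the fraction of values it can parse and keeps the best one. The strategy names, the 0.8 threshold, and the scoring rule are assumptions for this example, not Buckaroo's actual heuristics.

```python
def parse_fraction(values, parser):
    """Fraction of non-null values that `parser` accepts without error."""
    present = [v for v in values if v is not None]
    if not present:
        return 0.0
    accepted = 0
    for value in present:
        try:
            parser(value)
            accepted += 1
        except (TypeError, ValueError):
            pass
    return accepted / len(present)

# Hypothetical strategies: name -> parser used to test each value.
STRATEGIES = {"to_int": int, "to_float": float}

def suggest_cleaning(values, threshold=0.8):
    """Return the best-scoring strategy name, or None if nothing fits well."""
    scores = {name: parse_fraction(values, parser)
              for name, parser in STRATEGIES.items()}
    best = max(scores, key=scores.get)  # ties go to the earlier strategy
    return best if scores[best] >= threshold else None

print(suggest_cleaning(["1", "2", "oops", "4", "5"]))
print(suggest_cleaning(["1.5", "2", "n/a", "4.25", "5", "6"]))
print(suggest_cleaning(["red", "green", "3"]))
```

A suggestion produced this way is only a proposal. Surfacing it as an editable command, as the post describes, is what lets a user see and override the choice.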

I'm eager to hear feedback from data scientists and other users of dataframes/notebooks.

Comments

ZeroCool2u•7mo ago
This looks really cool. I will say my default solution for this, and the default across my org, is Data Wrangler in VS Code[1]. My only wish list item is if the low code solution wrote polars instead of pandas. Any thoughts on how hard that might be to accomplish?

1: https://marketplace.visualstudio.com/items?itemName=ms-tools...

paddy_m•7mo ago
Thank you.

The Buckaroo lowcode UI is capable of working with Polars, but I don't currently have any commands plumbed in. I will work on that.

I'm aware of Data Wrangler and they did nice work, but it's closed source and from what I can tell non-extensible. What features do you like in Data Wrangler, what do you wish it did differently?

paddy_m•7mo ago
I made a Marimo WASM example that you can play with in your browser [1]

I need to make some updates to the Polars functionality. I just completed some extensive refactorings of the Lowcode UI focused on pandas, and it is time to clean that up for Polars too.

Also the python codegen for polars is non-idiomatic with multiple re-assignments to a dataframe, vs one big select block. I have some ideas for how to fix that, but they'll take time.
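The difference between the two codegen styles can be sketched with a toy generator that emits Polars source as strings. The operation tuples and function names here are hypothetical and only illustrate the contrast. The emitted code uses the Polars calls `drop`, `with_columns`, `select`, and `fill_null`, but it is generated, not executed, in this example.

```python
def gen_reassign(ops):
    """Emit one reassignment per operation (the style described as non-idiomatic)."""
    lines = []
    for op, col, *args in ops:
        if op == "drop":
            lines.append(f'df = df.drop("{col}")')
        elif op == "fill_null":
            lines.append(
                f'df = df.with_columns(pl.col("{col}").fill_null({args[0]!r}))'
            )
    return "\n".join(lines)

def gen_select(columns, ops):
    """Fold the same operations into a single select expression.

    Assumes at least one column survives the drops.
    """
    exprs = {col: f'pl.col("{col}")' for col in columns}
    for op, col, *args in ops:
        if op == "drop":
            exprs.pop(col, None)  # dropped columns are simply not selected
        elif op == "fill_null" and col in exprs:
            exprs[col] += f".fill_null({args[0]!r})"
    body = ",\n".join(f"    {expr}" for expr in exprs.values())
    return f"df = df.select(\n{body},\n)"

ops = [("drop", "b"), ("fill_null", "a", 0)]
reassign_src = gen_reassign(ops)
select_src = gen_select(["a", "b", "c"], ops)
print(reassign_src)
print(select_src)
```

The select form needs the full column list up front, which is one reason it takes more work to generate than a sequence of independent reassignments.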

1: https://marimo.io/p/@paddy-mullen/notebook-sctuj8

RyanHamilton•7mo ago
Congratulations on launching. Buckaroo looks great.
franky47•7mo ago
But does it work across data tables with 8 dimensions?
trsohmers•7mo ago
Only with the oscillation overthruster flag enabled.
hodder•7mo ago
Looks cool to me. I often just end up exporting and opening in Excel to do this.
epistasis•7mo ago
This is really great, I'm looking forward to playing with it.

Currently I use a mix of quak (preferred) and itable (if starting from a Colab notebook). It will be interesting to compare for my use cases, which mostly consist of checking the distribution of data in a new file, or verifying that a transform I did resulted in the right sort of stuff.

mathisd•7mo ago
How does it compare to Data Wrangler? I like Data Wrangler because it lets us open the data in a separate VS Code window.
