(author here) I originally started this project to solve one of my main daily issues: having to quickly check data sparse in a bunch of datasets in different file formats (.xpt, .sas7bdat, .csv, .tsv, Excel, etc) and several folders.
I wanted it to
- be performant: some files can have 50k rows or more, so an HTML table could not be the solution
- be easy to configure: web-app, no installation or configuration, so my colleagues could use it too
- (later) have a way to quickly explore data via queries: what's better than DuckDB for this?
VISUALIZE
bill_length_mm AS x
, bill_depth_mm AS y
, species AS color
FROM penguins_clean
DRAW point
SCALE x DOMAIN 30 60
SCALE y ZERO false
SCALE color TO viridis
;
VISUALIZE
bill_length_mm AS x
, bill_depth_mm AS y
, species AS color
FROM penguins_clean
DRAW point
FACET BY species COLS
SCALE x ZERO false
SCALE y ZERO false
;
caerbannogwhite•1h ago
I wanted it to - be performant: some files can have 50k rows or more, so an HTML table could not be the solution - be easy to configure: web-app, no installation or configuration, so my colleagues could use it too - (later) have a way to quickly explore data via queries: what's better than DuckDB for this?
Then, a few weeks ago, I read Thomas Lin Pedersen / Posit's alpha release here on HN (https://news.ycombinator.com/item?id=47833558), and I thought it could be nice to have charts in Bedevere. So I asked Claude to add the GGSQL grammar to my DuckDB extension (the-stats-duck - separate repo: https://github.com/caerbannogwhite/the-stats-duck).
Here are some other things you can try:
VISUALIZE bill_length_mm AS x , bill_depth_mm AS y , species AS color FROM penguins_clean DRAW point SCALE x DOMAIN 30 60 SCALE y ZERO false SCALE color TO viridis ;
VISUALIZE bill_length_mm AS x , bill_depth_mm AS y , species AS color FROM penguins_clean DRAW point FACET BY species COLS SCALE x ZERO false SCALE y ZERO false ;
Feedback is very welcome!