frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Misata – synthetic data engine using LLM and Vectorized NumPy

https://github.com/rasinmuhammed/misata
11•rasinmuhammed•3d ago
Hey HN, I’m the author.

I built Misata because existing tools (Faker, Mimesis) are great for random rows but terrible for relational or temporal integrity. I needed to generate data for a dashboard where "Timesheets" must happen after "Project Start Date," and I wanted to define these rules via natural language.

How it works: LLM Layer: Uses Groq/Llama-3.3 to parse a "story" into a JSON schema constraint config.

Simulation Layer: Uses Vectorized NumPy (no loops) to generate data. It builds a DAG of tables to ensure parent rows exist before child rows (referential integrity).

Performance: Generates ~250k rows/sec on my M1 Air.

It’s early alpha. The "Graph Reverse Engineering" (describe a chart -> get data) is experimental but working for simple curves.

pip install misata

I’d love feedback on the simulator.py architecture—I’m currently keeping data in-memory (Pandas) which hits a ceiling at ~10M rows. Thinking of moving to DuckDB for out-of-core generation next. Thoughts?

Show HN: TinyPDF – 3kb pdf library (70x smaller than jsPDF)

https://github.com/Lulzx/tinypdf
107•lulzx•1d ago•15 comments

Show HN: Misata – synthetic data engine using LLM and Vectorized NumPy

https://github.com/rasinmuhammed/misata
11•rasinmuhammed•3d ago•0 comments

Show HN: I Made Loom for Mobile

https://demoscope.app
57•admtal•8h ago•35 comments

Show HN: Stepped Actions – distributed workflow orchestration for Rails

https://github.com/envirobly/stepped
79•klevo•5d ago•11 comments

Show HN: MCPShark Viewer (VS Code/Cursor extension)- view MCP traffic in-editor

22•mywork-dev•2d ago•0 comments

Show HN: Linggen – A local-first memory layer for your AI (Cursor, Zed, Claude)

https://github.com/linggen/linggen
25•linggen•7h ago•10 comments

Show HN: Stop AI scrapers from hammering your self-hosted blog (using porn)

https://github.com/vivienhenz24/fuzzy-canary
341•misterchocolat•3d ago•260 comments

Show HN: Stickerbox, a kid-safe, AI-powered voice to sticker printer

https://stickerbox.com/
39•spydertennis•5h ago•40 comments

Show HN: I Built an Image Captioning Tool Using Llama.cpp

https://github.com/paradox460/imagecaption
2•paradox460•3h ago•0 comments

Show HN: Music player for big local collections with mpd support

https://github.com/olegantonyan/mpz
4•oleg_antonyan•4h ago•0 comments

Show HN: Learning a Language Using Only Words You Know

https://simedw.com/2025/12/15/langseed/
77•simedw•4d ago•27 comments

Show HN: Picknplace.js, an alternative to drag-and-drop

https://jgthms.com/picknplace.js/
420•bbx•3d ago•143 comments

Show HN: I open-sourced my Go and Next B2B SaaS Starter (deploy anywhere, MIT)

https://github.com/moasq/production-saas-starter
75•moh_quz•13h ago•35 comments

Show HN: I implemented generics in my programming language

https://axe-docs.pages.dev/features/generics/
36•death_eternal•4d ago•19 comments

Show HN: Composify – Open-Source Visual Editor / Server-Driven UI for React

https://github.com/composify-js/composify
66•injung•1d ago•5 comments

Show HN: UK Butchers Meat Price Tracker

https://offer-spider.onrender.com
15•wolfer•6d ago•7 comments

Show HN: I built a fast RSS reader in Zig

https://github.com/superstarryeyes/hys
90•superstarryeyes•3d ago•32 comments

Show HN: Zynk, a Fast, P2P Encrypted File Transfers and Messaging Across Devices

9•justmarc•9h ago•2 comments

Show HN: Bithoven – A high-level, imperative language for Bitcoin Smart Contract

https://github.com/ChrisCho-H/bithoven
26•hyunhum•4d ago•8 comments

Show HN: High-Performance Wavelet Matrix for Python, Implemented in Rust

https://pypi.org/project/wavelet-matrix/
91•math-hiyoko•2d ago•10 comments

Show HN: Sqlit – A lazygit-style TUI for SQL databases

https://github.com/Maxteabag/sqlit
183•MaxTeabag•4d ago•40 comments

Show HN: A local-first memory store for LLM agents (SQLite)

https://github.com/CaviraOSS/OpenMemory
48•nullure•5d ago•16 comments

Show HN: Paper2Any – Open tool to generate editable PPTs from research papers

https://github.com/OpenDCAI/DataFlow-Agent
12•Mey0320•1d ago•2 comments

Show HN: Spice Cayenne – SQL acceleration built on Vortex

https://spice.ai/blog/introducing-spice-cayenne-data-accelerator
27•lukekim•1d ago•3 comments

Show HN: We built a small app with my wife to track promises we do

https://lovechecks.app/
2•warkanlock•12h ago•0 comments

Show HN: Daily Set Puzzle – I rebuilt it after setgame.com's SSL cert expired

https://www.anniehu.com/set/
9•anniegracehu•22h ago•2 comments

Show HN: Agents.db – an AGENTS.md alternative for LLM agent context

https://github.com/krazyjakee/AGENTS.db
2•krazyjakee•15h ago•0 comments

Show HN: Nano AI - Infinite Canvas AI Image Editor

https://nanoai.love/editor
3•akseli_ukkonen•16h ago•0 comments

Show HN: Xsql – SQL schema conversion via an intermediate representation (Rust)

https://github.com/Dawaman43/xsql
3•dawitworku•16h ago•0 comments

Show HN: Interactive Common Lisp: An Enhanced REPL

https://github.com/atgreen/icl
101•atgreen•6d ago•7 comments