Show HN: Misata – synthetic data engine using LLM and Vectorized NumPy

https://github.com/rasinmuhammed/misata
1•rasinmuhammed•2h ago
Hey HN, I’m the author.

I built Misata because existing tools (Faker, Mimesis) are great for random rows but terrible for relational or temporal integrity. I needed to generate data for a dashboard where "Timesheets" must happen after "Project Start Date," and I wanted to define these rules via natural language.

How it works:

LLM Layer: Uses Groq/Llama-3.3 to parse a "story" into a JSON schema constraint config.

Simulation Layer: Uses Vectorized NumPy (no loops) to generate data. It builds a DAG of tables to ensure parent rows exist before child rows (referential integrity).

Performance: Generates ~250k rows/sec on my M1 Air.
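To make the simulation layer concrete, here is a minimal sketch of the idea, not Misata's actual code. The `config` shape, the table names, and the `generation_order` and `generate` helpers are all assumptions made for illustration. It orders tables with a topological sort so parents come first, then fills the child table with vectorized NumPy so every timesheet points at an existing project and falls after that project's start date.

```python
from graphlib import TopologicalSorter

import numpy as np

# Hypothetical config of the kind an LLM layer might emit.
# The real Misata schema may look quite different.
config = {
    "projects": {"rows": 1_000, "parents": []},
    "timesheets": {"rows": 50_000, "parents": ["projects"]},
}


def generation_order(cfg):
    """Return table names so that every parent precedes its children."""
    graph = {name: set(spec["parents"]) for name, spec in cfg.items()}
    return list(TopologicalSorter(graph).static_order())


def generate(cfg, seed=0):
    """Generate both tables as dicts of NumPy arrays (illustrative only)."""
    rng = np.random.default_rng(seed)
    tables = {}
    for name in generation_order(cfg):
        n = cfg[name]["rows"]
        if name == "projects":
            # Random start dates within 2024, one per project.
            days = rng.integers(0, 365, n).astype("timedelta64[D]")
            tables[name] = {
                "id": np.arange(n),
                "start_date": np.datetime64("2024-01-01") + days,
            }
        elif name == "timesheets":
            parent = tables["projects"]
            # Sample a parent row index per child row (referential integrity).
            fk = rng.integers(0, len(parent["id"]), n)
            # A strictly positive offset keeps each timesheet after its
            # project's start date (temporal integrity).
            offset = rng.integers(1, 181, n).astype("timedelta64[D]")
            tables[name] = {
                "id": np.arange(n),
                "project_id": parent["id"][fk],
                "date": parent["start_date"][fk] + offset,
            }
    return tables


tables = generate(config)
```

The per-table branches are hard-coded here to keep the sketch short; a real engine would presumably derive them from the constraint config.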

It’s early alpha. The "Graph Reverse Engineering" (describe a chart -> get data) is experimental but working for simple curves.

pip install misata

I’d love feedback on the simulator.py architecture. I’m currently keeping data in memory (Pandas), which hits a ceiling at ~10M rows. I’m thinking of moving to DuckDB for out-of-core generation next. Thoughts?
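One way to picture the out-of-core direction, independent of the storage engine, is to generate fixed-size chunks and flush each one to disk before producing the next. The sketch below is an assumption-laden illustration: it uses a plain CSV file rather than DuckDB, and `write_in_chunks`, the column names, and the chunk size are hypothetical. Peak memory is bounded by `chunk_rows` rather than by the total row count.

```python
import csv
import os
import tempfile

import numpy as np


def write_in_chunks(path, total_rows, chunk_rows=100_000, seed=0):
    """Generate rows chunk by chunk and append them to a CSV file.

    Only one chunk is held in memory at a time. Returns rows written.
    """
    rng = np.random.default_rng(seed)
    written = 0
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["id", "hours"])
        while written < total_rows:
            n = min(chunk_rows, total_rows - written)
            # Vectorized generation for this chunk only.
            ids = np.arange(written, written + n)
            hours = rng.uniform(0.5, 8.0, n).round(2)
            writer.writerows(zip(ids.tolist(), hours.tolist()))
            written += n
    return written


out_path = os.path.join(tempfile.mkdtemp(), "timesheets.csv")
rows_written = write_in_chunks(out_path, total_rows=250, chunk_rows=100)
```

The same loop shape would apply if each chunk were appended to a database table or a columnar file instead of a CSV. Child tables need extra care, since sampling foreign keys requires the parent keys (or at least their range) to stay available across chunks.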

Unstract: Open-source platform to ship document extraction APIs in minutes

https://github.com/Zipstack/unstract
1•naren87•33s ago•0 comments

What distinguishes great software engineers? (2019) [pdf]

https://faculty.washington.edu/ajko/papers/Li2019WhatDistinguishesEngineers.pdf
1•damethos•44s ago•0 comments

Postgres CDC in ClickHouse, A year in review

https://clickhouse.com/blog/postgres-cdc-year-in-review-2025
1•saisrirampur•1m ago•0 comments

SpacetimeDB Launched a Referral Program

https://spacetimedb.com/blog/all-new-spacetimedb-pricing
1•aleasoni•1m ago•0 comments

Lightning: Real-time editing for tiled map data

https://felt.com/blog/lightning-tiles
1•hinting•3m ago•0 comments

Ask HN: How Much Has Office Politics Affected Your Career?

1•karakoram•6m ago•0 comments

Show HN: Pothole Detection System (YOLOv8 – FastAPI – Docker – React Native)

https://github.com/PeterHdd/pothole-detection-yolo
1•peterhddcoding•6m ago•0 comments

Mozilla Names New CEO, Firefox to Evolve into a "Modern AI Browser"

https://www.phoronix.com/news/Mozilla-New-CEO-AI
1•sva_•8m ago•2 comments

Ask HN: Is Claude Code good enough already?

2•calflegal•8m ago•0 comments

The Roots

https://thinkhuman.com/the-roots/
1•jamesgill•9m ago•0 comments

Digital Gardening

https://www.contraption.co/digital-gardening/
1•philip1209•9m ago•0 comments

Serving bots some holiday cheer in my /.env

https://aero.zip/.env
2•pypt•14m ago•0 comments

CoreWeave's Staggering Fall from Market Grace Highlights AI Bubble Fears

https://www.wsj.com/tech/ai/coreweave-stock-market-ai-bubble-a3c8c321
3•ewoodrich•16m ago•0 comments

Canada's former Ambassador to Venezuela: Trump's plan to dominate [audio]

https://www.canadaland.com/podcast/161-trumps-plan-to-dominate-the-americas-canada-included/
1•thomassmith65•16m ago•0 comments

File d'attente – file-based job queue

https://git.sr.ht/~marcc/filed
1•todsacerdoti•16m ago•0 comments

Not-Such-Better-Living Through Chemistry (2023)

https://www.science.org/content/blog-post/not-such-better-living-through-chemistry
1•Tomte•17m ago•0 comments

Show HN: Rconvolve – Fast audio convolution and IR extraction built with Rust

https://rconvolve.pages.dev/
1•alex-russo•17m ago•0 comments

Your Mission Is an API

https://theuncredentialed.substack.com/p/mission-as-an-api
2•uncred•18m ago•1 comments

The Theory Underlying Concept Maps and How to Construct and Use Them (2008)

https://cmap.ihmc.us/docs/theory-of-concept-maps.php
1•Tomte•18m ago•0 comments

MiMo-V2-Flash: High-Efficiency Inference, Code and Agent Foundation Model

https://platform.xiaomimimo.com/#/docs/news/news20251216
1•gainsurier•19m ago•0 comments

Show HN: PgEdge Anonymizer – for replacing PII in test databases from prod

https://github.com/pgEdge/pgedge-anonymizer
1•pgedge_postgres•20m ago•0 comments

Open Source AI tool that sets up cloud infra from code

1•jvcor13•21m ago•0 comments

Prompt caching: 10x cheaper LLM tokens, but how?

https://ngrok.com/blog/prompt-caching/
1•samwho•22m ago•0 comments

Show HN: Kafkatop 2.0 – top for Kafka – rewritten in Go with partition analytics

https://github.com/sivann/kafkatop
1•sivann•23m ago•0 comments

Show HN: Live AI Evaluation to Detect Hallucinations in Real Time

https://ragmetrics.ai/live-ai-evaluation
1•olivierc_RM•23m ago•0 comments

My internet money fixation: Psychoanalysis

https://text-incubation.com/my-internet-money-fixation-psychoanalysis
1•krrishd•23m ago•0 comments

Vendor Lock in Nightmares

https://eliocapella.com/blog/vendor-lock-in-nightmares/
1•eliocs•26m ago•0 comments

Cloudflare Radar 2025 Year in Review

https://radar.cloudflare.com/year-in-review/2025#internet-traffic-growth
3•kjhughes•26m ago•0 comments

Hyperspectral Academy

https://www.pixxel.space/academy/topic/hyperspectral-academy-learning
1•marklit•27m ago•0 comments

The Science and Strategy Behind Wyoming's Snow Fences [video]

https://www.youtube.com/watch?v=dL1_9jMKjO0
4•ljoshua•32m ago•0 comments