frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Lumina – Open-source observability for LLM applications

https://github.com/use-lumina/Lumina
3•iggycodexs•1h ago
Hey HN! I built Lumina – an open-source observability platform for AI/LLM applications. Self-host it in 5 minutes with Docker Compose, all features included.

The Problem:

I've been building LLM apps for the past year, and I kept running into the same issues: - LLM responses would randomly change after prompt tweaks, breaking things - Costs would spike unexpectedly (turns out a bug was hitting GPT-4 instead of 3.5) - No easy way to compare "before vs after" when testing prompt changes - Existing tools were either too expensive or missing features in free tiers

What I Built:

Lumina is OpenTelemetry-native, meaning: - Works with your existing OTEL stack (Datadog, Grafana, etc.) - No vendor lock-in – standard trace format - Integrates in 3 lines of code

Key features: - Cost & quality monitoring – Automatic alerts when costs spike or responses degrade - Replay testing – Capture production traces, replay them after changes, see diffs - Semantic comparison – Not just string matching – uses Claude to judge if responses are "better" or "worse" - Self-hosted tier – 50k traces/day, 7-day retention, ALL features included (alerts, replay, semantic scoring)

How it works:

Start Lumina

git clone https://github.com/use-lumina/Lumina cd Lumina/infra/docker docker-compose up -d

// Add to your app (no API key needed for self-hosted!)

import { Lumina } from '@uselumina/sdk';

const lumina = new Lumina({ endpoint: 'http://localhost:8080/v1/traces', });

// Wrap your LLM call const response = await lumina.traceLLM( async () => await openai.chat.completions.create({...}), { provider: 'openai', model: 'gpt-4', prompt: '...' } );

That's it. Every LLM call is now tracked with cost, latency, tokens, and quality scores.

What makes it different:

1. Free self-hosted with limits that work – 50k traces/day and 7-day retention (resets daily at midnight UTC). All features included: alerts, replay testing, semantic scoring. Perfect for most development and small production workloads. Need more? Upgrade to managed cloud.

2. OpenTelemetry-native – Not another proprietary format. Use standard OTEL exporters, works with existing infra. Can send traces to both Lumina AND Datadog simultaneously.

3. Replay testing – The killer feature. Capture 100 production traces, change your prompt, replay them all, get a semantic diff report. Like snapshot testing for LLMs.

4. Fast – Built with Bun, Postgres, Redis, NATS. Sub-500ms from trace to alert. Handles 10k+ traces/min on a single machine.

What I'm looking for:

- Feedback on the approach (is OTEL the right foundation?) - Bug reports (tested on Mac/Linux/WSL2, but I'm sure there are issues) - Ideas for what features matter most (alerts? replay? cost tracking?) - Help with the semantic scorer (currently uses Claude, want to make it pluggable)

Why open source:

I want this to be the standard for LLM observability. That only works if it's: - Free to use and modify (Apache 2.0) - Easy to self-host (Docker Compose, no cloud dependencies) - Open to contributions (good first issues tagged)

The business model is managed hosting for teams who don't want to run infrastructure. But the core product is and always will be free.

Try it: - GitHub: https://github.com/use-lumina/Lumina - Demo video: [YouTube link] - Docs: https://docs.uselumina.io - Quick start: 5 minutes from `git clone` to dashboard

I'd love to hear what you think! Especially interested in: - What observability problems you're hitting with LLMs - Missing features that would make this useful for you - Any similar tools you're using (and what they do better)

Thanks for reading!

Conditional Privilege Escalation Synology DSM 7.3.2

https://thecontractor.io/synology-dsm-7-3-2/
1•splintersio•1m ago•0 comments

Sodebo Ultim 3 Smashes Jules Verne Trophy Record

https://www.sail-world.com/news/293210/Sodebo-Ultim-3-smashes-Jules-Verne-Trophy-Record
1•tonfa•2m ago•0 comments

Pity the Rich

https://www.gojiberries.io/pity-the-rich/
1•neehao•5m ago•0 comments

Glowing Polyhedrons – LED filament 3D objects using graph theory

https://cpldcpu.github.io/2026/01/24/glowing-polyhedrons/
1•cpldcpu•8m ago•0 comments

Hexapod Simulator

https://hexapod-simulator.onrender.com/
1•ustad•11m ago•0 comments

Shot Heard Round the World America Rock [video]

https://www.youtube.com/watch?v=Y6ikO6LMxF4
1•rolph•14m ago•0 comments

You'll Be Back – Hamilton Animatic [video]

https://www.youtube.com/watch?v=-P_1RYVTjcA
1•rolph•16m ago•0 comments

I hate forms and I must data entry

https://skeda.app/blog/i-hate-forms-and-yet-i-must-data-entry
1•Dansvidania•18m ago•0 comments

Sony Data Discman

https://huguesjohnson.com/random/sony-ebook/
1•naves•19m ago•0 comments

Deutsche Telekom is violating Net Neutrality

https://netzbremse.de/en/
3•tietjens•20m ago•0 comments

"Destination Space" (1959 movie)

https://archive.org/details/destination-space-1959-colorized
2•Animats•21m ago•1 comments

Riemann Mapping Theorem

https://en.wikipedia.org/wiki/Riemann_mapping_theorem
2•ogogmad•22m ago•1 comments

Anchor Interpolated Morph (Aim)

https://nerdy.dev/anchor-interpolated-morphing
1•SouravInsights•24m ago•0 comments

Why Developing for MS SharePoint Is a Horrible, Terrible, and Painful Experience

https://jordansrowles.medium.com/why-developing-for-microsoft-sharepoint-is-a-horrible-terrible-a...
1•thibautg•33m ago•0 comments

Like digging 'your own grave': The translators grappling with losing work to AI

https://www.cnn.com/2026/01/23/tech/translation-language-jobs-ai-automation-intl
7•myk-e•34m ago•0 comments

Introduction to PostgreSQL Indexes

https://dlt.github.io/blog/posts/introduction-to-postgresql-indexes/
2•dlt•36m ago•0 comments

Hands-On with Two Apple Network Server Prototype ROMs

http://oldvcr.blogspot.com/2026/01/hands-on-with-two-apple-network-server.html
2•todsacerdoti•36m ago•0 comments

White whale spotted off Crescent Head coast in 'unusual' summer sighting

https://www.abc.net.au/news/2026-01-14/rare-white-whale-spotted-off-new-south-wales-mid-north-coa...
2•Tomte•39m ago•0 comments

Show HN: Sightline – Shodan-style search for real-world infra using OSM Data

https://github.com/ni5arga/sightline
2•ni5arga•41m ago•0 comments

Bing is not as bad as you think

https://blog.nilch.org/blog/bing-is-not-as-bad-as-you-think-a-search-engine-quality-study/
2•UnmappedStack•44m ago•0 comments

Oops, Someone Is Wrong on the Internet (2023)

https://medium.com/luminasticity/oops-someone-is-wrong-on-the-internet-2068c7dbf072
1•bryanrasmussen•48m ago•0 comments

GLM4.7-Flash the new Local LLM king at 30B A3B and OpenCode?

https://grigio.org/glm4-7-flash-the-new-local-llm-king-at-30b-a3b/
1•grigio•49m ago•0 comments

Validation: CRM for orgs with internal competition

1•Yaswanth21•50m ago•0 comments

Burhan(TruthCert)fail-closed verification LLM outputs(measure false-ship rate)

https://github.com/mahmood726-cyber/Burhan
1•mahmood726•50m ago•2 comments

Exploring radio emissions from confirmed exoplanets using SKA (2024)

https://www.frontiersin.org/journals/astronomy-and-space-sciences/articles/10.3389/fspas.2024.141...
1•zeristor•52m ago•0 comments

Agent Context Graph

https://github.com/markjspivey-xwisee/agent-context-graph
1•markjspivey•56m ago•1 comments

Show HN: Bootstrapping a compiler from raw machine code

https://github.com/nineties/hex-to-compiler
1•nineties•58m ago•0 comments

Show HN: AutoShorts – Local, GPU-accelerated AI video pipeline for creators

https://github.com/divyaprakash0426/autoshorts
1•divyaprakash•1h ago•1 comments

Target 1: Baseten

https://www.silares.com/targets/target-1-baseten
2•todsacerdoti•1h ago•0 comments

A multi-entry CFG design conundrum

https://bernsteinbear.com/blog/multiple-entry/
2•fbuilesv•1h ago•0 comments