Show HN: LayerClaw – Observability tool for PyTorch training

2•prabhavsanga•2h ago
Hi HN! I built LayerClaw (https://github.com/layerclaw/layerclaw), a local-first observability tool for PyTorch training.

The problem: When training neural networks, things go wrong silently. Your loss explodes at step 47,392. Your gradients vanish in layer 12. Your GPU memory spikes randomly. By the time you notice, you've wasted hours or days of compute.

I got tired of adding print statements, manually checking TensorBoard files, and tracking down training issues after the fact. Existing tools either require cloud accounts (W&B, Neptune) or are too heavyweight for quick experiments (MLflow, TensorBoard for gradient analysis).

What LayerClaw does:

- Automatically tracks gradients, metrics, and system resources during training
- Stores everything locally (SQLite + Parquet, no cloud required)
- Detects anomalies: gradient explosions, NaN/Inf values, loss spikes
- Provides a CLI to compare runs: `tracer compare run1 run2 --metric loss`
- Minimal overhead with async writes (~2-3%)
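To make the "local storage with async writes" idea concrete, here is a minimal sketch using only the Python standard library. It is not LayerClaw's implementation: the `AsyncMetricWriter` class, the `metrics` table, and its columns are hypothetical names chosen for illustration. The training loop only enqueues rows, and a background thread does the SQLite writes.

```python
import queue
import sqlite3
import threading


class AsyncMetricWriter:
    """Hypothetical helper: queue metric rows, write them to SQLite on a worker thread."""

    _STOP = object()  # sentinel that tells the worker to exit

    def __init__(self, db_path):
        self._db_path = db_path
        self._queue = queue.Queue()
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def log(self, step, name, value):
        # Called from the training loop: only enqueues, never touches disk.
        self._queue.put((step, name, float(value)))

    def close(self):
        # Rows queued before the sentinel are written first, then the worker exits.
        self._queue.put(self._STOP)
        self._worker.join()

    def _run(self):
        # The connection is created and used only on the worker thread.
        conn = sqlite3.connect(self._db_path)
        conn.execute(
            "CREATE TABLE IF NOT EXISTS metrics (step INTEGER, name TEXT, value REAL)"
        )
        while True:
            item = self._queue.get()
            if item is self._STOP:
                break
            conn.execute("INSERT INTO metrics VALUES (?, ?, ?)", item)
            conn.commit()
        conn.close()
```

Committing after every row keeps the sketch short. A real tool would probably batch several rows per commit to cut write overhead further.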

Quick example:

```python
import tracer
import torch

# Initialize (one line)
tracer.init(project="my-project", track_gradients=True)

# Your normal training loop
model = YourModel()
tracer._state.tracer.attach_hooks(model)

for batch in dataloader:
    loss = train_step(model, batch)
    tracer.log({"loss": loss.item()})
    tracer.step()

tracer.finish()
```

Then analyze: `tracer anomalies my-run --auto`
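For a rough idea of what an anomaly pass over logged losses could look like, here is a small standalone sketch. It is an assumption about the general technique, not what `tracer anomalies` actually runs: the function name `find_loss_anomalies`, the rolling-window rule, and both default thresholds are made up for illustration, and it assumes losses are positive.

```python
import math
from collections import deque


def find_loss_anomalies(losses, window=20, spike_factor=3.0):
    """Return (step, kind) pairs for non-finite values and sudden spikes.

    A spike is a loss greater than spike_factor times the mean of the
    last `window` finite losses. Both thresholds are illustrative.
    """
    recent = deque(maxlen=window)
    anomalies = []
    for step, loss in enumerate(losses):
        if not math.isfinite(loss):
            anomalies.append((step, "non-finite"))
            continue  # keep NaN/Inf out of the rolling window
        if len(recent) == window:
            baseline = sum(recent) / window
            if loss > spike_factor * baseline:
                anomalies.append((step, "spike"))
        recent.append(loss)
    return anomalies
```

The same rolling-baseline check could in principle be applied to per-layer gradient norms to flag explosions, with a separate lower bound for vanishing gradients.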

What makes it different:

1. Local-first: No sign-ups, no data leaving your machine, no vendor lock-in
2. Designed for debugging: Deep gradient tracking and anomaly detection built-in (not an afterthought)
3. Lightweight: Add 2 lines to your training loop, minimal overhead
4. Works with everything: Vanilla PyTorch, HuggingFace Transformers, PyTorch Lightning

Current limitations (v0.1.0):

- CLI-only (web UI planned for v0.2)
- Single-machine training (distributed support coming)
- Early stage - would love feedback on what's most useful

Available now:

- GitHub: https://github.com/layerclaw/layerclaw

I'm looking for contributors! I've created several "good first issues" for anyone interested in contributing. Areas where I need help:

- Web UI for visualizations
- Distributed training support
- More framework integrations
- Real-time monitoring dashboard

If you've struggled with ML training issues before, I'd love your input on what would be most valuable. PRs welcome, or just star the repo if you find it interesting!

What features would make this indispensable for your workflow?

Show HN: Inklings – Handwritten family notes turned into a printed book monthly

https://inklings.social
2•archaeal•45m ago•1 comments

Show HN: Bunqueue – Job queue for Bun using SQLite instead of Redis

https://github.com/egeominotti/bunqueue
5•kernelvoid•2d ago•1 comments

Show HN: Interactive California Budget (By Claude Code)

https://california-budget.com
30•sberens•4h ago•11 comments

Show HN: The Last Worm – Visualizing guinea worm eradication, from 3.5M to 10

https://echomoltinsson.github.io/last-worm/
3•onyx_writes•1h ago•0 comments

Show HN: SymDerive – A functional, stateless symbolic math library

23•dinunnob•3d ago•5 comments

Show HN: Ghidra MCP Server – 110 tools for AI-assisted reverse engineering

https://github.com/bethington/ghidra-mcp
269•xerzes•17h ago•63 comments

Show HN: Craftplan – I built my wife a production management tool for her bakery

https://github.com/puemos/craftplan
532•deofoo•3d ago•156 comments

Show HN: EpsteIn – Search the Epstein files for your LinkedIn connections

https://github.com/cfinke/EpsteIn
60•cfinke•5h ago•14 comments

Show HN: Mmdr – 1000x faster Mermaid rendering in pure Rust (no browser)

https://github.com/1jehuang/mermaid-rs-renderer/blob/master/README.md
10•jeremyh1•10h ago•1 comments

Show HN: GitHub Browser Plugin for AI Contribution Blame in Pull Requests

https://blog.rbby.dev/posts/github-ai-contribution-blame-for-pull-requests/
60•rbbydotdev•1d ago•34 comments

Show HN: Viberails – Easy AI Audit and Control

https://www.viberails.io/
5•maximelb•5h ago•1 comments

Show HN: Safe-now.live – Ultra-light emergency info site (<10KB)

https://safe-now.live
184•tinuviel•1d ago•94 comments

Show HN: Octosphere, a tool to decentralise scientific publishing

https://octosphere.social/
61•crimsoneer•1d ago•32 comments

Show HN: Cohesix 0.4.0-alpha, a no-std control-plane OS

https://github.com/lukeb-aidev/cohesix
2•Cohesix•2h ago•0 comments

Show HN: Sandboxing untrusted code using WebAssembly

https://github.com/mavdol/capsule
75•mavdol04•1d ago•22 comments

Show HN: Tabstack Research – An API for verified web research (by Mozilla)

7•MrTravisB•6h ago•3 comments

Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy

https://github.com/ambonvik/cimba
63•ambonvik•1d ago•17 comments

Show HN: OpenShears – I built an uninstaller because OpenClaw refuses to die

https://github.com/oswarld/openshears
4•haebom•12h ago•0 comments

Show HN: AI Blocker by Kiddokraft

https://kiddokraft.org/wiki?name=ai-blocker
4•Rezhe•13h ago•0 comments

Show HN: DuoBolt – a review-first duplicate file finder powered by BLAKE3

https://duobolt.app/
3•r9ne•8h ago•1 comments

Show HN: Adboost – A browser extension that adds ads to every webpage

https://github.com/surprisetalk/AdBoost
124•surprisetalk•2d ago•127 comments

Show HN: BPU – An embedded scheduler for stable UART pipelines

9•DenisDolya•3d ago•1 comments

Show HN: Camel OpenAI Integration Patterns

https://github.com/ibek/camel-openai-patterns
2•aivi•9h ago•0 comments

Show HN: SlitherPong, a hybrid of the Snake and Pong video games

https://www.slitherpong.com/
3•AmbroseBierce•9h ago•2 comments

Show HN: PII-Shield – Log Sanitization Sidecar with JSON Integrity (Go, Entropy)

https://github.com/aragossa/pii-shield
18•aragoss•1d ago•9 comments

Show HN: Nocterm – Flutter-inspired TUI framework with hot reload (Dart)

https://nocterm.dev
4•norbert515•10h ago•6 comments

Show HN: Two-week creative lab for developers building with real-time AI video

https://daydream.live/interactive-ai-video-program
10•cmuir•10h ago•2 comments

Show HN: Teaching AI agents to write better GraphQL

https://skills.sh/apollographql/skills
5•daleseo•10h ago•2 comments

Show HN: Ec – a terminal Git conflict resolver inspired by IntelliJ

https://github.com/chojs23/ec
16•neozz•1d ago•2 comments