frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: LayerClaw – Lightweight observability for PyTorch training runs

https://github.com/layerclaw/layerclaw
1•prabhavsanga•1h ago

Comments

prabhavsanga•1h ago
Hi HN! I built LayerClaw (https://github.com/layerclaw/layerclaw), a local-first observability tool for PyTorch training.

The problem: When training neural networks, things go wrong silently. Your loss explodes at step 47,392. Your gradients vanish in layer 12. Your GPU memory spikes randomly. By the time you notice, you've wasted hours or days of compute.

I got tired of adding print statements, manually checking TensorBoard files, and tracking down training issues after the fact. Existing tools either require cloud accounts (W&B, Neptune) or are too heavyweight for quick experiments (MLflow, TensorBoard for gradient analysis).

What LayerClaw does:

- Automatically tracks gradients, metrics, and system resources during training - Stores everything locally (SQLite + Parquet, no cloud required) - Detects anomalies: gradient explosions, NaN/Inf values, loss spikes - Provides a CLI to compare runs: `tracer compare run1 run2 --metric loss` - Minimal overhead with async writes (~2-3%)

Quick example:

```python import tracer import torch

# Initialize (one line) tracer.init(project="my-project", track_gradients=True)

# Your normal training loop model = YourModel() tracer._state.tracer.attach_hooks(model)

for batch in dataloader: loss = train_step(model, batch) tracer.log({"loss": loss.item()}) tracer.step()

tracer.finish() ```

Then analyze: `tracer anomalies my-run --auto`

What makes it different:

1. Local-first: No sign-ups, no data leaving your machine, no vendor lock-in 2. Designed for debugging: Deep gradient tracking and anomaly detection built-in (not an afterthought) 3. Lightweight: Add 2 lines to your training loop, minimal overhead 4. Works with everything: Vanilla PyTorch, HuggingFace Transformers, PyTorch Lightning

Current limitations (v0.1.0):

- CLI-only (web UI planned for v0.2) - Single-machine training (distributed support coming) - Early stage - would love feedback on what's most useful

Available now: - GitHub: https://github.com/layerclaw/layerclaw

*I'm looking for contributors!* I've created several "good first issues" for anyone interested in contributing. Areas where I need help: - Web UI for visualizations - Distributed training support - More framework integrations - Real-time monitoring dashboard

If you've struggled with ML training issues before, I'd love your input on what would be most valuable. PRs welcome, or just star the repo if you find it interesting!

What features would make this indispensable for your workflow?

So We Built Our Own Agentic Developer

https://builders.fullscript.com/posts/lessons-learned-from-building-nitro-fullscripts-autonomous-...
1•ncrum•1m ago•0 comments

The Art of Being Lazy(log)

https://www.warpstream.com/blog/the-art-of-being-lazy-log-lower-latency-and-higher-availability-w...
1•ordinarily•3m ago•0 comments

Scientists Discover Life Thriving Beneath Fukushima's Dead Reactors

https://dailygalaxy.com/2026/02/strange-life-under-fukushima-dead-reactors/
1•SunshineTheCat•4m ago•0 comments

Technocracy 2.0

https://brooklynrail.org/2026/02/field-notes/technocracy-2-0/
2•antonomon•6m ago•0 comments

Something Wild Going on with Emails?

2•trevyn•6m ago•0 comments

Home Assistant Comm Badge

https://github.com/graffitiwriter/Home-Assistant-Comm-Badge
1•taubek•7m ago•0 comments

SanDisk crushes wallets with up to 2.8X SSD price hikes

https://www.tomshardware.com/pc-components/ssds/sandisk-crushes-wallets-with-up-to-2-8x-ssd-price...
2•vmykyt•10m ago•0 comments

Start all of your commands with a comma

https://rhodesmill.org/brandon/2009/commands-with-comma/
2•theblazehen•13m ago•0 comments

Sh-DSL – Write/Use Shell with Janet

https://janet-lang.org/spork/api/sh-dsl.html
1•veqq•13m ago•0 comments

Exploring Different Keyboard Sensing Technologies – LTT Labs

https://www.lttlabs.com/articles/2026/01/27/exploring-different-keyboard-sensing-technologies#buc...
1•rbanffy•13m ago•0 comments

Windsurf Tab v2

https://windsurf.com/blog/windsurf-tab-2
1•swyx•14m ago•0 comments

Securely run Claude Code agents in Docker

https://edspencer.net/2026/2/4/run-claude-code-agents-docker-herdctl
1•edspencer•14m ago•0 comments

Hand-Crafting Domain-Specific Compression with an LLM

https://engineering.nanit.com/hand-crafting-domain-specific-compression-with-an-llm-3c42f5c2b070
1•PaulHoule•15m ago•0 comments

The perks of being a mole rat

https://worksinprogress.co/issue/the-perks-of-being-a-mole-rat/
1•ortegaygasset•15m ago•0 comments

Show HN: A TikTok-style research paper reader

https://pokepaper.com/
1•hajimi_hacker•15m ago•0 comments

PaperBanana – Automating Academic Illustration

https://paperbanana.org/
1•bilsbie•17m ago•0 comments

Readr, Safari-Like Reading Mode for Chrome

https://github.com/login
1•ymolodtsov•17m ago•2 comments

GitHub integrates Claude and Codex AI coding agents directly into GitHub

https://github.blog/changelog/2026-02-04-claude-and-codex-are-now-available-in-public-preview-on-...
2•thoughtpeddler•17m ago•1 comments

ClickHouse Agent Skills

https://github.com/ClickHouse/agent-skills
1•clickpiper-pete•19m ago•0 comments

Anthropic's new AI tool: Next black stock market day for the software industry

https://www.heise.de/en/news/Anthropic-s-new-AI-tool-Next-black-stock-market-day-for-the-software...
2•doener•21m ago•1 comments

Ask HN: How can you enforce rules for Claude etc.

1•blackknightdev•22m ago•2 comments

Tell HN: Electrolux HR chief hired to layoff workforce bought 12 room apartment

2•dssadasadsdsa12•23m ago•2 comments

Mean People Fail (2014)

https://paulgraham.com/mean.html
14•insuranceguru•25m ago•17 comments

NYC subway gates tested by the MTA use AI tech to track fare evaders

https://gothamist.com/news/modern-nyc-subway-gates-tested-by-the-mta-use-ai-tech-to-track-fare-ev...
2•geox•25m ago•0 comments

Show HN: Autonomous AI radio station about engineering, history and philosophy

https://www.hermestransmissions.com/
1•ivanachillee•30m ago•0 comments

GitHub ponders kill switch for pull requests to stop AI slop

https://www.theregister.com/2026/02/03/github_kill_switch_pull_requests_ai/
1•abdelhousni•30m ago•2 comments

DaveLovable

https://github.com/davidmonterocrespo24/DaveLovable
1•dmcrespo•31m ago•0 comments

Why our society needs free and open power grid data [video]

https://fosdem.org/2026/schedule/event/WQBBR9-map-your-grid/
1•protontypes•31m ago•0 comments

Show HN: RAGStack – Scale-to-zero serverless RAG on your own AWS

https://portfolio.hatstack.fun/read/post/RAGStack-Lambda
1•hatmanstack•32m ago•0 comments

The Idea River

https://nik.art/the-idea-river/
1•herbertl•33m ago•0 comments