frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: picomap - a data management tool for machine learning

https://github.com/rdilip/picomap
1•r2d•6h ago

Comments

r2d•6h ago
Hi HN! I built a tiny (<200 LOC) utility to make dataset management for machine learning easy.

I train small-ish machine learning models (<500M parameters) for protein generation, where the datasets are much less standardized than ImageNet or The Pile. Since we train on cloud compute a lot, we're constantly moving data on and off + making permanent changes to the dataset and the dataset elements themselves are all different sizes (instead of images being 256 x 256, different proteins are different lengths). picomap is a slightly spruced up version of some code I wrote last year that stores all your data in a single memory-mapped file. This makes dataset management simple, just push your dataset to your cloud compute. I find that for my usecases, this helps me keep the GPUs happy and fed and I normally don't even bother with the standard PyTorch Dataset + DataLoader. Feedback welcome!

(very inspired by mmap_ninja + the nanogpt data management)

Merriam-Webster banks on "actual intelligence" over artificial intelligence

https://www.marketplace.org/story/2025/10/29/artificial-intelligence-versus-actual-intelligence-a...
1•Geekette•20s ago•0 comments

Ask HN: Does Anyone Use DGX Spark for High Performance Scientific Computing?

1•dwasher•31s ago•0 comments

No One Is Coming to Save You. It's Both Freeing and Scary

https://philliphhughes.substack.com/p/no-one-is-coming-to-save-you-its
1•phughes1980•4m ago•0 comments

After a 13-year journey to the screen, there's an urgency to Nuremberg's release

https://www.theglobeandmail.com/culture/film-and-tv/film/article-nuremberg-release-russell-crowe-...
1•petethomas•7m ago•0 comments

New study links melatonin and heart failure, but experts say don't panic yet

https://www.washingtonpost.com/health/2025/11/03/melatonin-heart-failure-sleep-aid/
1•XzetaU8•7m ago•1 comments

Show HN: A Golang Telegram AI Bot- make your TG bot more smarter

https://github.com/yincongcyincong/MuseBot
2•yincong0822•9m ago•4 comments

NetBSD 11 prepares for launch with 57 supported platforms

https://www.theregister.com/2025/08/05/netbsd_11_is_near/
2•TMWNN•14m ago•0 comments

Skiddly Voice AI for Shopify Stores – Abandoned Cart Recovery

1•gravitybrain•17m ago•0 comments

Schaltwerk – The IDE Without Editor

https://github.com/2mawi2/schaltwerk
2•ttobi•17m ago•0 comments

Once Australia's second priciest city, Melbourne has become more affordable

https://www.theguardian.com/australia-news/2025/oct/25/once-australias-second-priciest-city-melbo...
5•PaulHoule•19m ago•0 comments

MongoDB cloud accepted an email with .con for 7 years before locking my account

1•colus001•20m ago•1 comments

Vibe Check: Claude Skills Need a 'Share' Button

https://every.to/vibe-check/vibe-check-claude-skills-need-a-share-button
1•gmays•22m ago•0 comments

Tell HN: X is opening any tweet link in a webview whether you press it or not

11•stillatit•23m ago•1 comments

Find Bitcoin-Friendly Coffee Shops – We Need Submissions

https://bitcoinlatte.com/
3•cranberryturkey•31m ago•0 comments

Pain Points of OCaml

https://quamserena.com/2025-11-03/pain-points-of-ocaml
1•quamserena•34m ago•0 comments

Ask HN: How do you satisfy your intellectual curiosity?

1•nanfinitum•49m ago•0 comments

Influence of Klein fields on human blood and vital parameters [pdf]

https://www.slowjuice.de/wp-content/uploads/2025/03/Die-Naturheilkunde-06_2024-Einfluss-Kleinsche...
1•sharpshadow•54m ago•0 comments

Some Australian states are set to get a free electricity period every day

https://www.abc.net.au/news/2025-11-04/solar-sharer-free-energy-three-hours-outlier-states/105968998
1•defrost•55m ago•1 comments

Cara Pembayaran Tunaiku

1•Wawanjoko•55m ago•0 comments

The Noise and the Signal

https://russmiles.substack.com/p/the-noise-and-the-signal
1•saikatsg•56m ago•0 comments

State Ofthe Art Novel InFlow 1Gearturbine/Reaction 2Imploturbocompressor/Impulse

1•monterrey•58m ago•0 comments

It Can Apply and Positive in Favor the Newton III Law on an Engine System Device

1•monterrey•1h ago•0 comments

Australia to offer three hours free solar per day to millions

https://www.reuters.com/business/energy/australia-offer-three-hours-free-solar-per-day-millions-2...
1•Physkal•1h ago•0 comments

Labs for Broke – EKS for Pennies

https://georgedeblog.com/blog/labs-for-broke/
2•prognostikos•1h ago•0 comments

New Cancer Therapy Trains Immune System to Attack and Destroy Resistant Cancers

https://ecency.com/hive-172582/@kur8/new-cancer-therapy-universal-vaccine
1•signa11•1h ago•0 comments

Google Quantum AI revived a decades-old concept known as quantum money

https://arxiv.org/abs/2510.06212
1•salkahfi•1h ago•0 comments

The Work of AI, Ourselves

https://oliverbatemandoesthework.substack.com/p/the-work-of-ai-ourselves
1•13years•1h ago•0 comments

Moon Duchin on the 'Mathematical Quagmire' of Gerrymandering

https://www.nytimes.com/2025/11/03/science/duchin-math-elections-gerrymandering.html
1•mmooss•1h ago•0 comments

Why AC is cheap, but AC repair is a luxury

https://a16z.substack.com/p/why-ac-is-cheap-but-ac-repair-is
62•walterbell•1h ago•28 comments

Small language models review. SLMs on incremental intelligence

https://agentherbie.com/articles/slms-incremental-intelligence
1•vicpara•1h ago•0 comments